Name zUMKDmd8JS5nsSi4apGgGQJmABFKDmABFKDm4ySLDm9ZRKDmCwx3Fo_0
Workunit 223078370
Created 17 May 2024, 21:17:51 UTC
Sent 18 May 2024, 2:19:59 UTC
Report deadline 26 May 2024, 2:19:59 UTC
Received 18 May 2024, 11:42:13 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 10695183
Run time 48 min 14 sec
CPU time 6 hours 24 min 42 sec
Validate state Valid
Credit 495.64
Device peak FLOPS 51.55 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 2.73 GB
Peak swap size 3.34 GB
Peak disk usage 1.32 GB

Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
02:45:56 (180142): wrapper (7.7.26015): starting
02:45:56 (180142): wrapper: running run_atlas (--nthreads 12)
[2024-05-18 02:45:56] Arguments: --nthreads 12
[2024-05-18 02:45:56] Threads: 12
[2024-05-18 02:45:56] Checking for CVMFS
[2024-05-18 02:45:56] Probing /cvmfs/atlas.cern.ch... OK
[2024-05-18 02:45:56] Probing /cvmfs/atlas-condb.cern.ch... OK
[2024-05-18 02:45:56] Running cvmfs_config stat atlas.cern.ch
[2024-05-18 02:45:56] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2024-05-18 02:45:56] 2.9.2.0 179502 184754 170380 132874 0 262 44958084 45568000 9374 130560 0 1238175564 99.804 700644524 18667 http://cvmfs-s1bnl.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://206.12.9.163:3129 1
[2024-05-18 02:45:56] CVMFS is ok
[2024-05-18 02:45:56] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2024-05-18 02:45:56] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2024-05-18 02:45:56] Further information can be found at the LHC@home message board.
[2024-05-18 02:45:56] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2024-05-18 02:45:56] Checking for apptainer binary...
[2024-05-18 02:45:56] Using apptainer found in PATH at /usr/bin/apptainer
[2024-05-18 02:45:56] Running /usr/bin/apptainer --version
[2024-05-18 02:45:56] apptainer version 1.1.8-1.el7
[2024-05-18 02:45:56] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2024-05-18 02:45:57] INFO: /etc/singularity/ exists; cleanup by system administrator is not complete (see https://apptainer.org/docs/admin/latest/singularity_migration.html) WARNING: Environment variable TMPDIR already has value [/home/boinc/slots/0/.apptainertmp], will not forward new value [/home/boinc] from parent process environment wns0169.triumf.lcg
[2024-05-18 02:45:57] apptainer works
[2024-05-18 02:45:57] Set ATHENA_PROC_NUMBER=12
[2024-05-18 02:45:57] Set ATHENA_CORE_NUMBER=12
[2024-05-18 02:45:57] Starting ATLAS job with PandaID=6208359345
[2024-05-18 02:45:57] Running command: /usr/bin/apptainer exec -B /cvmfs,/home/boinc/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2024-05-18 03:34:06]  *** The last 200 lines of the pilot log: ***
[2024-05-18 03:34:06]  ddmendpoint=NDGF-T1_DATADISK
[2024-05-18 03:34:06]  direct_access_lan=False
[2024-05-18 03:34:06]  direct_access_wan=False
[2024-05-18 03:34:06]  domain=
[2024-05-18 03:34:06]  filesize=359606
[2024-05-18 03:34:06]  filetype=log
[2024-05-18 03:34:06]  guid=6ef06b84-8571-4fd7-a37b-b068ba0e31b7
[2024-05-18 03:34:06]  inputddms=['NDGF-T1_DATADISK', 'CERN-PROD_DATADISK']
[2024-05-18 03:34:06]  is_tar=False
[2024-05-18 03:34:06]  lfn=log.38776104._014226.job.log.tgz.1
[2024-05-18 03:34:06]  mtime=0
[2024-05-18 03:34:06]  protocol_id=None
[2024-05-18 03:34:06]  protocols=[{'endpoint': 'davs://dav.ndgf.org:443', 'flavour': 'WEBDAV', 'id': 331, 'path': '/atlas/disk/atlasdatadisk/rucio/'}]
[2024-05-18 03:34:06]  replicas=None
[2024-05-18 03:34:06]  scope=mc23_13p6TeV
[2024-05-18 03:34:06]  status=None
[2024-05-18 03:34:06]  status_code=0
[2024-05-18 03:34:06]  storage_token=
[2024-05-18 03:34:06]  surl=/home/boinc/slots/0/PanDA_Pilot-6208359345/log.38776104._014226.job.log.tgz.1
[2024-05-18 03:34:06]  turl=davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/mc23_13p6TeV/b4/e5/log.38776104._014226.job.log.tgz.1
[2024-05-18 03:34:06]  workdir=None
[2024-05-18 03:34:06] ]
[2024-05-18 03:34:06] 2024-05-18 10:33:36,214 | INFO     | transferring file log.38776104._014226.job.log.tgz.1 from /home/boinc/slots/0/PanDA_Pilot-6208359345/log.38776104._014226.job.log.tgz.1 to /home/boinc/slots/0/log.
[2024-05-18 03:34:06] 2024-05-18 10:33:36,214 | INFO     | executing command: /usr/bin/env mv /home/boinc/slots/0/PanDA_Pilot-6208359345/log.38776104._014226.job.log.tgz.1 /home/boinc/slots/0/log.38776104._014226.job.log.t
[2024-05-18 03:34:06] 2024-05-18 10:33:36,269 | INFO     | adding to output.list: log.38776104._014226.job.log.tgz.1 davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/mc23_13p6TeV/b4/e5/log.38776104._014226.job.log.tg
[2024-05-18 03:34:06] 2024-05-18 10:33:36,270 | INFO     | summary of transferred files:
[2024-05-18 03:34:06] 2024-05-18 10:33:36,271 | INFO     |  -- lfn=log.38776104._014226.job.log.tgz.1, status_code=0, status=transferred
[2024-05-18 03:34:06] 2024-05-18 10:33:36,271 | INFO     | stage-out finished correctly
[2024-05-18 03:34:06] 2024-05-18 10:33:36,499 | INFO     | finished stage-out for finished payload, adding job to finished_jobs queue
[2024-05-18 03:34:06] 2024-05-18 10:33:39,000 | INFO     | job 6208359345 has state=finished
[2024-05-18 03:34:06] 2024-05-18 10:33:39,001 | INFO     | preparing for final server update for job 6208359345 in state='finished'
[2024-05-18 03:34:06] 2024-05-18 10:33:39,001 | INFO     | this job has now completed (state=finished)
[2024-05-18 03:34:06] 2024-05-18 10:33:39,002 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2024-05-18 03:34:06] 2024-05-18 10:33:39,002 | INFO     | job 6208359345 has finished - writing final server update
[2024-05-18 03:34:06] 2024-05-18 10:33:39,002 | INFO     | total number of processed events: 400 (read)
[2024-05-18 03:34:06] 2024-05-18 10:33:39,006 | INFO     | executing command: lscpu
[2024-05-18 03:34:06] 2024-05-18 10:33:39,053 | INFO     | found 48 cores (24 cores per socket, 2 sockets)
[2024-05-18 03:34:06] 2024-05-18 10:33:39,054 | INFO     | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo
[2024-05-18 03:34:06] 2024-05-18 10:33:39,137 | INFO     | executing command: export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase;source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh --quiet;lsetup
[2024-05-18 03:34:06] 2024-05-18 10:33:44,864 | INFO     | CPU arch script returned: x86-64-v4
[2024-05-18 03:34:06] 2024-05-18 10:33:44,865 | INFO     | using path: /home/boinc/slots/0/PanDA_Pilot-6208359345/memory_monitor_summary.json (trf name=prmon)
[2024-05-18 03:34:06] 2024-05-18 10:33:44,871 | INFO     | extracted standard info from prmon json
[2024-05-18 03:34:06] 2024-05-18 10:33:44,872 | INFO     | extracted standard memory fields from prmon json
[2024-05-18 03:34:06] 2024-05-18 10:33:44,872 | WARNING  | GPU info not found in prmon json
[2024-05-18 03:34:06] 2024-05-18 10:33:44,873 | WARNING  | format EVNTtoHITS has no such key: dbData
[2024-05-18 03:34:06] 2024-05-18 10:33:44,873 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2024-05-18 03:34:06] 2024-05-18 10:33:44,875 | INFO     | fitting pss+swap vs Time
[2024-05-18 03:34:06] 2024-05-18 10:33:44,876 | INFO     | model: linear, x: [1716025913.0, 1716025974.0, 1716026035.0, 1716026096.0, 1716026157.0, 1716026218.0, 1716026279.0, 1716026340.0, 1716026401.0, 1716026462.0, 1716
[2024-05-18 03:34:06] 2024-05-18 10:33:44,876 | INFO     | sum of square deviations: 18381740.0
[2024-05-18 03:34:06] 2024-05-18 10:33:44,876 | INFO     | sum of deviations: 10356063574.0
[2024-05-18 03:34:06] 2024-05-18 10:33:44,876 | INFO     | mean x: 1716027072.0
[2024-05-18 03:34:06] 2024-05-18 10:33:44,876 | INFO     | mean y: 2505262.282051282
[2024-05-18 03:34:06] 2024-05-18 10:33:44,877 | INFO     | -- intersect: -966787659996.1251
[2024-05-18 03:34:06] 2024-05-18 10:33:44,877 | INFO     | intersect: -966787659996.1251
[2024-05-18 03:34:06] 2024-05-18 10:33:44,877 | INFO     | chi2: 1.8959004214572324
[2024-05-18 03:34:06] 2024-05-18 10:33:44,877 | INFO     | -- intersect: -966787659996.1251
[2024-05-18 03:34:06] 2024-05-18 10:33:44,877 | INFO     | current memory leak: 563.39 B/s (using 39 data points, chi2=1.90)
[2024-05-18 03:34:06] 2024-05-18 10:33:44,878 | INFO     | ..............................
[2024-05-18 03:34:06] 2024-05-18 10:33:44,878 | INFO     | . Timing measurements:
[2024-05-18 03:34:06] 2024-05-18 10:33:44,878 | INFO     | . get job = 0 s
[2024-05-18 03:34:06] 2024-05-18 10:33:44,878 | INFO     | . initial setup = 2 s
[2024-05-18 03:34:06] 2024-05-18 10:33:44,878 | INFO     | . payload setup = 7 s
[2024-05-18 03:34:06] 2024-05-18 10:33:44,878 | INFO     | . stage-in = 0 s
[2024-05-18 03:34:06] 2024-05-18 10:33:44,878 | INFO     | . payload execution = 2812 s
[2024-05-18 03:34:06] 2024-05-18 10:33:44,879 | INFO     | . stage-out = 0 s
[2024-05-18 03:34:06] 2024-05-18 10:33:44,879 | INFO     | . log creation = 0 s
[2024-05-18 03:34:06] 2024-05-18 10:33:44,879 | INFO     | ..............................
[2024-05-18 03:34:06] 2024-05-18 10:33:44,999 | WARNING  | process 191232 is no longer using CPU - aborting
[2024-05-18 03:34:06] 2024-05-18 10:33:44,999 | INFO     | aborting job monitoring since job object (job id=6208359345) has expired
[2024-05-18 03:34:06] 2024-05-18 10:33:45,033 | INFO     | 
[2024-05-18 03:34:06] 2024-05-18 10:33:45,033 | INFO     | job summary report
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | --------------------------------------------------
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | PanDA job id: 6208359345
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | task id: 38776104
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | errors: (none)
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | status: LOG_TRANSFER = DONE 
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | pilot state: finished 
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | transexitcode: 0
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | exeerrorcode: 0
[2024-05-18 03:34:06] 2024-05-18 10:33:45,034 | INFO     | exeerrordiag: 
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | exitcode: 0
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | exitmsg: OK
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | cpuconsumptiontime: 23154 s
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | nevents: 400
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | neventsw: 0
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | pid: 191232
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | pgrp: 191232
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | corecount: 12
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | event service: False
[2024-05-18 03:34:06] 2024-05-18 10:33:45,035 | INFO     | sizes: {0: 2392029, 1: 2392228, 8: 2392434, 11: 2392462, 2824: 2417617, 2825: 2417616, 2826: 2426810, 2834: 2426924}
[2024-05-18 03:34:06] 2024-05-18 10:33:45,036 | INFO     | --------------------------------------------------
[2024-05-18 03:34:06] 2024-05-18 10:33:45,036 | INFO     | 
[2024-05-18 03:34:06] 2024-05-18 10:33:45,036 | INFO     | executing command: ls -lF /home/boinc/slots/0
[2024-05-18 03:34:06] 2024-05-18 10:33:45,081 | INFO     | queue jobs had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,082 | INFO     | queue payloads had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,082 | INFO     | queue data_in had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,082 | INFO     | queue data_out had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,082 | INFO     | queue current_data_in had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,082 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,082 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,082 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,083 | INFO     | queue completed_jobids has 1 job(s)
[2024-05-18 03:34:06] 2024-05-18 10:33:45,084 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,084 | INFO     | queue messages had 0 job(s) [purged]
[2024-05-18 03:34:06] 2024-05-18 10:33:45,084 | INFO     | job 6208359345 has completed (purged errors)
[2024-05-18 03:34:06] 2024-05-18 10:33:45,084 | INFO     | overall cleanup function is called
[2024-05-18 03:34:06] 2024-05-18 10:33:46,093 | INFO     | --- collectZombieJob: --- 10, [191232]
[2024-05-18 03:34:06] 2024-05-18 10:33:46,093 | INFO     | zombie collector waiting for pid 191232
[2024-05-18 03:34:06] 2024-05-18 10:33:46,093 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2024-05-18 03:34:06] 2024-05-18 10:33:47,099 | INFO     | collected zombie processes
[2024-05-18 03:34:06] 2024-05-18 10:33:47,099 | INFO     | will now attempt to kill all subprocesses of pid=191232
[2024-05-18 03:34:06] 2024-05-18 10:33:47,240 | INFO     | process IDs to be killed: [191232] (in reverse order)
[2024-05-18 03:34:06] 2024-05-18 10:33:47,359 | WARNING  | found no corresponding commands to process id(s)
[2024-05-18 03:34:06] 2024-05-18 10:33:47,360 | INFO     | Do not look for orphan processes in BOINC jobs
[2024-05-18 03:34:06] 2024-05-18 10:33:47,368 | INFO     | did not find any defunct processes belonging to 191232
[2024-05-18 03:34:06] 2024-05-18 10:33:47,375 | INFO     | did not find any defunct processes belonging to 191232
[2024-05-18 03:34:06] 2024-05-18 10:33:47,375 | INFO     | ready for new job
[2024-05-18 03:34:06] 2024-05-18 10:33:47,375 | INFO     | pilot has finished with previous job - re-establishing logging
[2024-05-18 03:34:06] 2024-05-18 10:33:47,377 | INFO     | *************************************
[2024-05-18 03:34:06] 2024-05-18 10:33:47,377 | INFO     | ***  PanDA Pilot version 3.7.5.4  ***
[2024-05-18 03:34:06] 2024-05-18 10:33:47,377 | INFO     | *************************************
[2024-05-18 03:34:06] 2024-05-18 10:33:47,377 | INFO     | 
[2024-05-18 03:34:06] 2024-05-18 10:33:47,380 | INFO     | architecture information:
[2024-05-18 03:34:06] 2024-05-18 10:33:47,380 | INFO     | executing command: cat /etc/os-release
[2024-05-18 03:34:06] 2024-05-18 10:33:47,424 | INFO     | cat /etc/os-release:
[2024-05-18 03:34:06] NAME="CentOS Linux"
[2024-05-18 03:34:06] VERSION="7 (Core)"
[2024-05-18 03:34:06] ID="centos"
[2024-05-18 03:34:06] ID_LIKE="rhel fedora"
[2024-05-18 03:34:06] VERSION_ID="7"
[2024-05-18 03:34:06] PRETTY_NAME="CentOS Linux 7 (Core)"
[2024-05-18 03:34:06] ANSI_COLOR="0;31"
[2024-05-18 03:34:06] CPE_NAME="cpe:/o:centos:centos:7"
[2024-05-18 03:34:06] HOME_URL="https://www.centos.org/"
[2024-05-18 03:34:06] BUG_REPORT_URL="https://bugs.centos.org/"
[2024-05-18 03:34:06] 
[2024-05-18 03:34:06] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2024-05-18 03:34:06] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2024-05-18 03:34:06] REDHAT_SUPPORT_PRODUCT="centos"
[2024-05-18 03:34:06] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2024-05-18 03:34:06] 
[2024-05-18 03:34:06] 2024-05-18 10:33:47,424 | INFO     | *************************************
[2024-05-18 03:34:06] 2024-05-18 10:33:47,928 | INFO     | executing command: df -mP /home/boinc/slots/0
[2024-05-18 03:34:06] 2024-05-18 10:33:47,972 | INFO     | sufficient remaining disk space (2663724875776 B)
[2024-05-18 03:34:06] 2024-05-18 10:33:47,972 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2024-05-18 03:34:06] 2024-05-18 10:33:47,972 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2024-05-18 03:34:06] 2024-05-18 10:33:47,973 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2024-05-18 03:34:06] 2024-05-18 10:33:48,319 | INFO     | all payload control threads have been joined
[2024-05-18 03:34:06] 2024-05-18 10:33:48,322 | INFO     | found 0 job(s) in 20 queues
[2024-05-18 03:34:06] 2024-05-18 10:33:48,322 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2024-05-18 03:34:06] 2024-05-18 10:33:48,322 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2024-05-18 03:34:06] 2024-05-18 10:33:48,509 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2024-05-18 03:34:06] 2024-05-18 10:33:48,509 | INFO     | aborting loop
[2024-05-18 03:34:06] 2024-05-18 10:33:48,978 | INFO     | [job] retrieve thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:48,992 | INFO     | all data control threads have been joined
[2024-05-18 03:34:06] 2024-05-18 10:33:49,021 | INFO     | all job control threads have been joined
[2024-05-18 03:34:06] 2024-05-18 10:33:49,068 | INFO     | [job] validate thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,153 | INFO     | [data] copytool_in thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,228 | INFO     | [payload] validate_post thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,325 | INFO     | [payload] control thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,361 | INFO     | [job] create_data_payload thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,369 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2024-05-18 03:34:06] 2024-05-18 10:33:49,515 | INFO     | [job] job monitor thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,579 | INFO     | [payload] execute_payloads thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,603 | INFO     | [payload] validate_pre thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:49,983 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2024-05-18 03:34:06] 2024-05-18 10:33:49,998 | INFO     | [data] control thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:50,027 | INFO     | [job] control thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:50,512 | INFO     | [payload] failed_post thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:50,989 | INFO     | [job] queue monitor thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:51,374 | INFO     | [data] copytool_out thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:51,980 | INFO     | [data] queue_monitor thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:33:58,484 | INFO     | job.realtimelogging is not enabled
[2024-05-18 03:34:06] 2024-05-18 10:33:59,490 | INFO     | [payload] run_realtimelog thread has finished
[2024-05-18 03:34:06] 2024-05-18 10:34:00,913 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 139744350156608)>', '<ExcThread(monitor, started 139743924836096)>']
[2024-05-18 03:34:06] 2024-05-18 10:34:01,383 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2024-05-18 03:34:06] 2024-05-18 10:34:01,383 | INFO     | [monitor] control thread has ended
[2024-05-18 03:34:06] 2024-05-18 10:34:05,938 | INFO     | all workflow threads have been joined
[2024-05-18 03:34:06] 2024-05-18 10:34:05,939 | INFO     | end of generic workflow (traces error code: 0)
[2024-05-18 03:34:06] 2024-05-18 10:34:05,939 | INFO     | traces error code: 0
[2024-05-18 03:34:06] 2024-05-18 10:34:05,940 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2024-05-18 03:34:06] 2024-05-18 10:34:06,037 [wrapper] ==== pilot stdout END ====
[2024-05-18 03:34:06] 2024-05-18 10:34:06,048 [wrapper] ==== wrapper stdout RESUME ====
[2024-05-18 03:34:06] 2024-05-18 10:34:06,060 [wrapper] pilotpid: 184453
[2024-05-18 03:34:06] 2024-05-18 10:34:06,070 [wrapper] Pilot exit status: 0
[2024-05-18 03:34:06] 2024-05-18 10:34:06,109 [wrapper] pandaids: 6208359345
[2024-05-18 03:34:06] 2024-05-18 10:34:06,176 [wrapper] cleanup: SIGTERM to supervisor_pilot  31777 184454
[2024-05-18 03:34:06] 2024-05-18 10:34:06,188 [wrapper] Test setup, not cleaning
[2024-05-18 03:34:06] 2024-05-18 10:34:06,199 [wrapper] ==== wrapper stdout END ====
[2024-05-18 03:34:06] 2024-05-18 10:34:06,210 [wrapper] ==== wrapper stderr END ====
[2024-05-18 03:34:06] 2024-05-18 10:34:06,229 [wrapper] apfmon messages muted
[2024-05-18 03:34:06]  *** Error codes and diagnostics ***
[2024-05-18 03:34:06]     "exeErrorCode": 0,
[2024-05-18 03:34:06]     "exeErrorDiag": "",
[2024-05-18 03:34:06]     "pilotErrorCode": 0,
[2024-05-18 03:34:06]     "pilotErrorDiag": "",
[2024-05-18 03:34:06]  *** Listing of results directory ***
[2024-05-18 03:34:06] total 816232
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc    467845 May 17 13:55 pilot3.tar.gz
[2024-05-18 03:34:06] -rwx------ 1 boinc boinc     32251 May 17 14:17 runpilot2-wrapper.sh
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc      5633 May 17 14:17 queuedata.json
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc       100 May 18 02:45 wrapper_26015_x86_64-pc-linux-gnu
[2024-05-18 03:34:06] -rwxr-xr-x 1 boinc boinc      7986 May 18 02:45 run_atlas
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc       105 May 18 02:45 job.xml
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc      6652 May 18 02:45 init_data.xml
[2024-05-18 03:34:06] -rw-r--r-- 2 boinc boinc 599142178 May 18 02:45 EVNT.38776100._000179.pool.root.1
[2024-05-18 03:34:06] drwxrwx--x 2 boinc boinc      4096 May 18 02:45 shared
[2024-05-18 03:34:06] -rw-r--r-- 2 boinc boinc    480009 May 18 02:45 input.tar.gz
[2024-05-18 03:34:06] -rw-r--r-- 2 boinc boinc     17537 May 18 02:45 start_atlas.sh
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc         0 May 18 02:45 boinc_lockfile
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc      2642 May 18 02:45 pandaJob.out
[2024-05-18 03:34:06] -rw------- 1 boinc boinc       424 May 18 02:45 setup.sh.local
[2024-05-18 03:34:06] -rw------- 1 boinc boinc   1019073 May 18 02:46 agis_schedconf.cvmfs.json
[2024-05-18 03:34:06] -rw------- 1 boinc boinc         0 May 18 02:46 agis_ddmendpoints.agis.ALL.json
[2024-05-18 03:34:06] -rw------- 1 boinc boinc   1324701 May 18 02:46 cric_ddmendpoints.json
[2024-05-18 03:34:06] drwx------ 4 boinc boinc      4096 May 18 02:46 pilot3
[2024-05-18 03:34:06] -rw------- 1 boinc boinc 231818251 May 18 03:33 HITS.38776104._014226.pool.root.1
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc       530 May 18 03:33 boinc_task_state.xml
[2024-05-18 03:34:06] -rw------- 1 boinc boinc        95 May 18 03:33 pilot_heartbeat.json
[2024-05-18 03:34:06] -rw------- 1 boinc boinc      1053 May 18 03:33 memory_monitor_summary.json
[2024-05-18 03:34:06] -rw------- 1 boinc boinc    359606 May 18 03:33 log.38776104._014226.job.log.tgz.1
[2024-05-18 03:34:06] -rw------- 1 boinc boinc      7770 May 18 03:33 heartbeat.json
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc      8192 May 18 03:33 boinc_mmap_file
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc        27 May 18 03:33 wrapper_checkpoint.txt
[2024-05-18 03:34:06] -rw------- 1 boinc boinc      4282 May 18 03:34 pilotlog.txt
[2024-05-18 03:34:06] -rw------- 1 boinc boinc    303000 May 18 03:34 log.38776104._014226.job.log.1
[2024-05-18 03:34:06] -rw------- 1 boinc boinc       357 May 18 03:34 output.list
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc       620 May 18 03:34 runtime_log
[2024-05-18 03:34:06] -rw------- 1 boinc boinc    686080 May 18 03:34 result.tar.gz
[2024-05-18 03:34:06] -rw------- 1 boinc boinc       655 May 18 03:34 zUMKDmd8JS5nsSi4apGgGQJmABFKDmABFKDm4ySLDm9ZRKDmCwx3Fo.diag
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc     11286 May 18 03:34 runtime_log.err
[2024-05-18 03:34:06] -rw-r--r-- 1 boinc boinc     21265 May 18 03:34 stderr.txt
[2024-05-18 03:34:06] HITS file was successfully produced:
[2024-05-18 03:34:06] -rw------- 1 boinc boinc 231818251 May 18 03:33 shared/HITS.pool.root.1
[2024-05-18 03:34:06]  *** Contents of shared directory: ***
[2024-05-18 03:34:06] total 812660
[2024-05-18 03:34:06] -rw-r--r-- 2 boinc boinc 599142178 May 18 02:45 ATLAS.root_0
[2024-05-18 03:34:06] -rw-r--r-- 2 boinc boinc    480009 May 18 02:45 input.tar.gz
[2024-05-18 03:34:06] -rw-r--r-- 2 boinc boinc     17537 May 18 02:45 start_atlas.sh
[2024-05-18 03:34:06] -rw------- 1 boinc boinc 231818251 May 18 03:33 HITS.pool.root.1
[2024-05-18 03:34:06] -rw------- 1 boinc boinc    686080 May 18 03:34 result.tar.gz
03:34:08 (180142): run_atlas exited; CPU time 23030.080253
03:34:08 (180142): called boinc_finish(0)

</stderr_txt>
]]>


©2024 CERN