Name 0RZKDmvZfO9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDm1v6LDmOpnnbn_1
Workunit 240055256
Created 22 Mar 2026, 23:52:53 UTC
Sent 23 Mar 2026, 0:02:48 UTC
Report deadline 31 Mar 2026, 0:02:48 UTC
Received 23 Mar 2026, 4:23:45 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 11059789
Run time 2 hours 15 min 53 sec
CPU time 16 hours 42 min 56 sec
Priority 28
Validate state Valid
Credit 150.99
Device peak FLOPS 8.00 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 1.91 GB
Peak swap size 3.03 GB
Peak disk usage 705.72 MB

Stderr output

<core_client_version>8.2.8</core_client_version>
<![CDATA[
<stderr_txt>
03:07:19 (3628): wrapper (7.7.26015): starting
03:07:19 (3628): wrapper: running run_atlas (--nthreads 8)
[2026-03-23 03:07:19] Arguments: --nthreads 8
[2026-03-23 03:07:19] Threads: 8
[2026-03-23 03:07:19] Checking for CVMFS
[2026-03-23 03:07:19] Probing /cvmfs/atlas.cern.ch... OK
[2026-03-23 03:07:19] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-03-23 03:07:19] Running cvmfs_config stat atlas.cern.ch
[2026-03-23 03:07:19] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-03-23 03:07:19] 2.11.0.0 5757 26853 62392 157596 0 188 3966863 4096001 5458 130560 0 3516095 99.711 1713543 268 http://cvmfs-stratum-one.cern.ch/cvmfs/atlas.cern.ch http://141.99.253.245:3128 1
[2026-03-23 03:07:19] CVMFS is ok
[2026-03-23 03:07:19] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-23 03:07:19] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-03-23 03:07:19] Further information can be found at the LHC@home message board.
[2026-03-23 03:07:19] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-23 03:07:19] Checking for apptainer binary...
[2026-03-23 03:07:19] /usr/bin/which: no apptainer in (/opt/ohpc/pub/libs/singularity/3.7.1/bin:/cm/shared/apps/slurm/current/sbin:/cm/shared/apps/slurm/current/bin:/home/mf493845/.local/bin:/home/mf493845/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin)
[2026-03-23 03:07:19] apptainer is not installed, using version from CVMFS
[2026-03-23 03:07:19] Checking apptainer works with /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-23 03:07:20] WARNING: Environment variable TMPDIR already has value [/tmp/job_11373040/atlas/pilot/slots/0/.apptainertmp], will not forward new value [/tmp] from parent process environment hpc-node053
[2026-03-23 03:07:20] apptainer works
[2026-03-23 03:07:20] Set ATHENA_PROC_NUMBER=8
[2026-03-23 03:07:20] Set ATHENA_CORE_NUMBER=8
[2026-03-23 03:07:20] Starting ATLAS job with PandaID=7062349768
[2026-03-23 03:07:20] Running command: /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs,/tmp/job_11373040/atlas/pilot/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2026-03-23 05:23:08]  *** The last 200 lines of the pilot log: ***
[2026-03-23 05:23:08] 2026-03-23 04:22:36,666 | INFO     | using path: /tmp/job_11373040/atlas/pilot/slots/0/PanDA_Pilot-7062349768/memory_monitor_summary.json (trf name=prmon)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,666 | INFO     | extracted standard info from prmon json
[2026-03-23 05:23:08] 2026-03-23 04:22:36,666 | INFO     | extracted standard memory fields from prmon json
[2026-03-23 05:23:08] 2026-03-23 04:22:36,666 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-23 05:23:08] 2026-03-23 04:22:36,667 | WARNING  | format EVNTtoHITS has no such key: dbData
[2026-03-23 05:23:08] 2026-03-23 04:22:36,667 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2026-03-23 05:23:08] 2026-03-23 04:22:36,668 | INFO     | fitting pss+swap vs Time
[2026-03-23 05:23:08] 2026-03-23 04:22:36,668 | INFO     | sum of square deviations: 729472282.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,668 | INFO     | sum of deviations: 26431093332.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,668 | INFO     | mean x: 1774235690.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,668 | INFO     | mean y: 2508871.2330827066
[2026-03-23 05:23:08] 2026-03-23 04:22:36,668 | INFO     | intersect: -64283674815.94286
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | chi2: 1.1497840556808578
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | sum of square deviations: 650252192.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | sum of deviations: 24775457867.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | mean x: 1774235537.5
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | mean y: 2505682.609375
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | intersect: -67598185781.82686
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | chi2: 1.1506106091395047
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | current chi2=1.1506106091395047 (change=-0.07188771270249114 %)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | right removable region: 127
[2026-03-23 05:23:08] 2026-03-23 04:22:36,669 | INFO     | sum of square deviations: 650252192.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | sum of deviations: 9413985918.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | mean x: 1774235842.5
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | mean y: 2541090.75
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | intersect: -25683848654.639534
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | chi2: 0.017184562799923198
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | current chi2=0.017184562799923198 (change=98.50540954059872 %)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | sum of square deviations: 576985702.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | sum of deviations: 7205560706.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,670 | INFO     | mean x: 1774235995.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | mean y: 2545619.4308943087
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | intersect: -22154615510.261734
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | chi2: 0.00261269954967546
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | current chi2=0.00261269954967546 (change=84.79624078834792 %)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | sum of square deviations: 509440249.5
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | sum of deviations: 6357443729.000001
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | mean x: 1774236147.5
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | mean y: 2547522.254237288
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | intersect: -22138628956.753033
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | chi2: 0.002475142976231411
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | current chi2=0.002475142976231411 (change=5.26492123677734 %)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,671 | INFO     | left removable region: 30
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | sum of square deviations: 282974608.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | sum of deviations: 4079429655.9999995
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | mean x: 1774236422.0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | mean y: 2549293.6288659796
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | intersect: -25575267485.826237
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | chi2: 0.0020027575546496907
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | current memory leak: 14.42 B/s (using 97 data points, chi2=0.00)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | could have reported an average CPU frequency of 2988 MHz (8 samples)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | ..............................
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | . Timing measurements:
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | . get job = 0 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | . initial setup = 2 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,672 | INFO     | . payload setup = 3 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,673 | INFO     | . stage-in = 0 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,673 | INFO     | . payload execution = 8087 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,673 | INFO     | . stage-out = 0 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,673 | INFO     | . log creation = 0 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,673 | INFO     | ..............................
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | 
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | job summary report
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | --------------------------------------------------
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | PanDA job id: 7062349768
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | task id: 49176206
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | errors: (none)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | status: LOG_TRANSFER = DONE 
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | pilot state: finished 
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | transexitcode: 0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | exeerrorcode: 0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | exeerrordiag: 
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | exitcode: 0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,784 | INFO     | exitmsg: OK
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | cpuconsumptiontime: 60007 s
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | nevents: 400
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | neventsw: 0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | pid: 14505
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | pgrp: 14505
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | corecount: 8
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | event service: False
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | sizes: {0: 2280356, 11: 2280356, 8094: 2308273, 8095: 2317357, 8098: 2317495}
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | --------------------------------------------------
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | 
[2026-03-23 05:23:08] 2026-03-23 04:22:36,785 | INFO     | executing command: ls -lF /tmp/job_11373040/atlas/pilot/slots/0
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue jobs had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue payloads had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue data_in had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue data_out had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue current_data_in had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,794 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue completed_jobids has 1 job(s)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | queue messages had 0 job(s) [purged]
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | job 7062349768 has completed (purged errors)
[2026-03-23 05:23:08] 2026-03-23 04:22:36,795 | INFO     | overall cleanup function is called
[2026-03-23 05:23:08] 2026-03-23 04:22:37,801 | INFO     | --- collectZombieJob: --- 10, [14505]
[2026-03-23 05:23:08] 2026-03-23 04:22:37,801 | INFO     | zombie collector waiting for pid 14505
[2026-03-23 05:23:08] 2026-03-23 04:22:37,801 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-03-23 05:23:08] 2026-03-23 04:22:37,801 | INFO     | collected zombie processes
[2026-03-23 05:23:08] 2026-03-23 04:22:37,801 | INFO     | will attempt to kill all subprocesses of pid=14505
[2026-03-23 05:23:08] 2026-03-23 04:22:37,805 | WARNING  | process 14505 can no longer be monitored (due to stat problems) - aborting
[2026-03-23 05:23:08] 2026-03-23 04:22:38,049 | INFO     | using path: /tmp/job_11373040/atlas/pilot/slots/0/memory_monitor_summary.json (trf name=prmon)
[2026-03-23 05:23:08] 2026-03-23 04:22:38,051 | INFO     | process IDs to be killed: [14505] (in reverse order)
[2026-03-23 05:23:08] 2026-03-23 04:22:38,080 | WARNING  | found no corresponding commands to process id(s)
[2026-03-23 05:23:08] 2026-03-23 04:22:38,080 | INFO     | Do not look for orphan processes in BOINC jobs
[2026-03-23 05:23:08] 2026-03-23 04:22:38,085 | INFO     | did not find any defunct processes belonging to 14505
[2026-03-23 05:23:08] 2026-03-23 04:22:38,089 | INFO     | did not find any defunct processes belonging to 14505
[2026-03-23 05:23:08] 2026-03-23 04:22:38,089 | INFO     | ready for new job
[2026-03-23 05:23:08] 2026-03-23 04:22:38,089 | INFO     | pilot has finished with previous job - re-establishing logging
[2026-03-23 05:23:08] 2026-03-23 04:22:38,090 | INFO     | **************************************
[2026-03-23 05:23:08] 2026-03-23 04:22:38,090 | INFO     | ***  PanDA Pilot version 3.11.5.1  ***
[2026-03-23 05:23:08] 2026-03-23 04:22:38,090 | INFO     | **************************************
[2026-03-23 05:23:08] 2026-03-23 04:22:38,090 | INFO     | 
[2026-03-23 05:23:08] 2026-03-23 04:22:38,104 | INFO     | architecture information:
[2026-03-23 05:23:08] 2026-03-23 04:22:38,104 | INFO     | executing command: cat /etc/os-release
[2026-03-23 05:23:08] 2026-03-23 04:22:38,111 | INFO     | cat /etc/os-release:
[2026-03-23 05:23:08] NAME="CentOS Linux"
[2026-03-23 05:23:08] VERSION="7 (Core)"
[2026-03-23 05:23:08] ID="centos"
[2026-03-23 05:23:08] ID_LIKE="rhel fedora"
[2026-03-23 05:23:08] VERSION_ID="7"
[2026-03-23 05:23:08] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-03-23 05:23:08] ANSI_COLOR="0;31"
[2026-03-23 05:23:08] CPE_NAME="cpe:/o:centos:centos:7"
[2026-03-23 05:23:08] HOME_URL="https://www.centos.org/"
[2026-03-23 05:23:08] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-03-23 05:23:08] 
[2026-03-23 05:23:08] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-03-23 05:23:08] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-03-23 05:23:08] REDHAT_SUPPORT_PRODUCT="centos"
[2026-03-23 05:23:08] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-03-23 05:23:08] 
[2026-03-23 05:23:08] 2026-03-23 04:22:38,111 | INFO     | **************************************
[2026-03-23 05:23:08] 2026-03-23 04:22:38,150 | INFO     | number of running child processes to parent process 14505: 1
[2026-03-23 05:23:08] 2026-03-23 04:22:38,150 | INFO     | maximum number of monitored processes: 6
[2026-03-23 05:23:08] 2026-03-23 04:22:38,150 | INFO     | aborting job monitoring since job object (job id=7062349768) has expired
[2026-03-23 05:23:08] 2026-03-23 04:22:38,613 | INFO     | executing command: df -mP /tmp/job_11373040/atlas/pilot/slots/0
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | INFO     | sufficient remaining disk space (226971615232 B)
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | INFO     | current server update state: UPDATING_FINAL
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | INFO     | update_server=False
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2026-03-23 05:23:08] 2026-03-23 04:22:38,620 | INFO     | will abort loop
[2026-03-23 05:23:08] 2026-03-23 04:22:38,705 | INFO     | all data control threads have been joined
[2026-03-23 05:23:08] 2026-03-23 04:22:38,964 | INFO     | all job control threads have been joined
[2026-03-23 05:23:08] 2026-03-23 04:22:39,096 | INFO     | all payload control threads have been joined
[2026-03-23 05:23:08] 2026-03-23 04:22:39,244 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2026-03-23 05:23:08] 2026-03-23 04:22:39,625 | INFO     | [job] retrieve thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:39,625 | INFO     | [job] queue monitor thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:39,626 | INFO     | [job] job monitor thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:39,676 | INFO     | [job] create_data_payload thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:39,710 | INFO     | [data] control thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:39,802 | INFO     | [data] copytool_in thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:39,865 | INFO     | [payload] validate_post thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:39,910 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-03-23 05:23:08] 2026-03-23 04:22:39,969 | INFO     | [job] control thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:40,037 | INFO     | [monitor] cgroup control has ended
[2026-03-23 05:23:08] 2026-03-23 04:22:40,052 | INFO     | [payload] run_realtimelog thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:40,102 | INFO     | [payload] control thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:40,424 | INFO     | [payload] failed_post thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:40,917 | INFO     | [payload] validate_pre thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:40,921 | INFO     | [payload] execute_payloads thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:40,971 | INFO     | [job] validate thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:41,249 | INFO     | [data] copytool_out thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:43,915 | INFO     | [data] queue_monitor thread has finished
[2026-03-23 05:23:08] 2026-03-23 04:22:44,487 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 23456247945024)>', '<ExcThread(monitor, started 23456065292032)>']
[2026-03-23 05:23:08] 2026-03-23 04:22:49,512 | INFO     | all workflow threads have been joined
[2026-03-23 05:23:08] 2026-03-23 04:22:49,512 | INFO     | end of generic workflow (traces error code: 0)
[2026-03-23 05:23:08] 2026-03-23 04:22:49,513 | INFO     | traces error code: 0
[2026-03-23 05:23:08] 2026-03-23 04:22:49,513 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2026-03-23 05:23:08] 2026-03-23 04:23:08,459 | INFO     | PID=8612 has CPU usage=3.1% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i PR -
[2026-03-23 05:23:08] 2026-03-23 04:23:08,459 | INFO     | found 0 job(s) in 20 queues
[2026-03-23 05:23:08] 2026-03-23 04:23:08,459 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2026-03-23 05:23:08] 2026-03-23 04:23:08,459 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2026-03-23 05:23:08] 2026-03-23 04:23:08,459 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2026-03-23 05:23:08] 2026-03-23 04:23:08,459 | INFO     | [monitor] control thread has ended
[2026-03-23 05:23:08] 2026-03-23 04:23:08,513 [wrapper] ==== pilot stdout END ====
[2026-03-23 05:23:08] 2026-03-23 04:23:08,515 [wrapper] ==== wrapper stdout RESUME ====
[2026-03-23 05:23:08] 2026-03-23 04:23:08,516 [wrapper] pilotpid: 8612
[2026-03-23 05:23:08] 2026-03-23 04:23:08,517 [wrapper] Pilot exit status: 0
[2026-03-23 05:23:08] 2026-03-23 04:23:08,521 [wrapper] pandaids: 7062349768
[2026-03-23 05:23:08] 2026-03-23 04:23:08,540 [wrapper] cleanup supervisor_pilot  4285 8613
[2026-03-23 05:23:08] 2026-03-23 04:23:08,541 [wrapper] Test setup, not cleaning
[2026-03-23 05:23:08] 2026-03-23 04:23:08,543 [wrapper] apfmon messages muted
[2026-03-23 05:23:08] 2026-03-23 04:23:08,544 [wrapper] ==== wrapper stdout END ====
[2026-03-23 05:23:08] 2026-03-23 04:23:08,545 [wrapper] ==== wrapper stderr END ====
[2026-03-23 05:23:08]  *** Error codes and diagnostics ***
[2026-03-23 05:23:08]     "exeErrorCode": 0,
[2026-03-23 05:23:08]     "exeErrorDiag": "",
[2026-03-23 05:23:08]     "pilotErrorCode": 0,
[2026-03-23 05:23:08]     "pilotErrorDiag": "",
[2026-03-23 05:23:08]  *** Listing of results directory ***
[2026-03-23 05:23:08] total 482380
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user    585013 Mar 22 22:31 pilot3.tar.gz
[2026-03-23 05:23:08] -rwx------ 1 mf493845 unix-user     36322 Mar 22 22:31 runpilot2-wrapper.sh
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user      5111 Mar 22 22:33 queuedata.json
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user       100 Mar 23 02:05 wrapper_26015_x86_64-pc-linux-gnu
[2026-03-23 05:23:08] -rwxr-xr-x 1 mf493845 unix-user      7986 Mar 23 02:05 run_atlas
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user       105 Mar 23 02:05 job.xml
[2026-03-23 05:23:08] -rw-r--r-- 2 mf493845 unix-user     15845 Mar 23 03:07 start_atlas.sh
[2026-03-23 05:23:08] drwxrwx--x 2 mf493845 unix-user       100 Mar 23 03:07 shared
[2026-03-23 05:23:08] -rw-r--r-- 2 mf493845 unix-user    597545 Mar 23 03:07 input.tar.gz
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user      6390 Mar 23 03:07 init_data.xml
[2026-03-23 05:23:08] -rw-r--r-- 2 mf493845 unix-user 243404175 Mar 23 03:07 EVNT.49133006._000003.pool.root.1
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user         0 Mar 23 03:07 boinc_setup_complete
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user      8192 Mar 23 03:07 boinc_mmap_file
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user         0 Mar 23 03:07 boinc_lockfile
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user      2597 Mar 23 03:07 pandaJob.out
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user   1010663 Mar 23 03:07 agis_schedconf.cvmfs.json
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user   1511579 Mar 23 03:07 agis_ddmendpoints.agis.ALL.json
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user       419 Mar 23 03:07 workernode_map.json
[2026-03-23 05:23:08] drwx------ 5 mf493845 unix-user       440 Mar 23 03:07 pilot3
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user 243521858 Mar 23 05:22 HITS.49176206._000892.pool.root.1
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user        95 Mar 23 05:22 pilot_heartbeat.json
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user      1020 Mar 23 05:22 memory_monitor_summary.json
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user    381771 Mar 23 05:22 log.49176206._000892.job.log.tgz.1
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user      6311 Mar 23 05:22 heartbeat.json
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user       744 Mar 23 05:23 pilotlog.txt
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user   1179739 Mar 23 05:23 log.49176206._000892.job.log.1
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user       357 Mar 23 05:23 output.list
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user       620 Mar 23 05:23 runtime_log
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user   1576960 Mar 23 05:23 result.tar.gz
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user      9219 Mar 23 05:23 runtime_log.err
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user       643 Mar 23 05:23 0RZKDmvZfO9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDm1v6LDmOpnnbn.diag
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user     21571 Mar 23 05:23 stderr.txt
[2026-03-23 05:23:08] HITS file was successfully produced:
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user 243521858 Mar 23 05:22 shared/HITS.pool.root.1
[2026-03-23 05:23:08]  *** Contents of shared directory: ***
[2026-03-23 05:23:08] total 477656
[2026-03-23 05:23:08] -rw-r--r-- 2 mf493845 unix-user     15845 Mar 23 03:07 start_atlas.sh
[2026-03-23 05:23:08] -rw-r--r-- 2 mf493845 unix-user    597545 Mar 23 03:07 input.tar.gz
[2026-03-23 05:23:08] -rw-r--r-- 2 mf493845 unix-user 243404175 Mar 23 03:07 ATLAS.root_0
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user 243521858 Mar 23 05:22 HITS.pool.root.1
[2026-03-23 05:23:08] -rw------- 1 mf493845 unix-user   1576960 Mar 23 05:23 result.tar.gz
05:23:10 (3628): run_atlas exited; CPU time 60130.797630
05:23:10 (3628): called boinc_finish(0)

</stderr_txt>
]]>


©2026 CERN