Name VgaMDmNkBH9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmvjCLDmuGy2nm_2
Workunit 239488119
Created 10 Mar 2026, 10:47:26 UTC
Sent 10 Mar 2026, 11:38:01 UTC
Report deadline 18 Mar 2026, 11:38:01 UTC
Received 10 Mar 2026, 16:22:33 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 10825546
Run time 2 hours 52 min 47 sec
CPU time 11 hours 12 min 59 sec
Priority 28
Validate state Valid
Credit 557.21
Device peak FLOPS 31.69 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 1.89 GB
Peak swap size 2.81 GB
Peak disk usage 528.24 MB

Stderr output

<core_client_version>8.2.8</core_client_version>
<![CDATA[
<stderr_txt>
13:22:25 (1400449): wrapper (7.7.26015): starting
13:22:25 (1400449): wrapper: running run_atlas (--nthreads 4)
[2026-03-10 13:22:25] Arguments: --nthreads 4
[2026-03-10 13:22:25] Threads: 4
[2026-03-10 13:22:25] Checking for CVMFS
[2026-03-10 13:22:25] Probing /cvmfs/atlas.cern.ch... OK
[2026-03-10 13:22:25] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-03-10 13:22:25] Running cvmfs_config stat atlas.cern.ch
[2026-03-10 13:22:25] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-03-10 13:22:25] 2.11.0.0 1271569 110 61412 157116 1 187 3612218 4096001 3935 130560 0 385883 99.994 6974 2043 http://s1bnl-cvmfs.openhtc.io/cvmfs/atlas.cern.ch DIRECT 1
[2026-03-10 13:22:25] CVMFS is ok
[2026-03-10 13:22:25] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-10 13:22:25] Small home clusters do not require a local http proxy but it is suggested if
[2026-03-10 13:22:25] more than 10 cores throughout the same LAN segment are regularly running ATLAS like tasks.
[2026-03-10 13:22:25] Further information can be found at the LHC@home message board.
[2026-03-10 13:22:25] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-10 13:22:25] Checking for apptainer binary...
[2026-03-10 13:22:25] Using apptainer found in PATH at /usr/bin/apptainer
[2026-03-10 13:22:25] Running /usr/bin/apptainer --version
[2026-03-10 13:22:25] apptainer version 1.4.5
[2026-03-10 13:22:25] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-10 13:22:25] x32-linux2
[2026-03-10 13:22:25] apptainer works
[2026-03-10 13:22:25] Set ATHENA_PROC_NUMBER=4
[2026-03-10 13:22:25] Set ATHENA_CORE_NUMBER=4
[2026-03-10 13:22:25] Starting ATLAS job with PandaID=7035851354
[2026-03-10 13:22:25] Running command: /usr/bin/apptainer exec -B /cvmfs,/var/lib/boinc-client/slots/2 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2026-03-10 16:15:09]  *** The last 200 lines of the pilot log: ***
[2026-03-10 16:15:09] 2026-03-10 20:14:36,741 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,407 | INFO     | job 7035851354 has state=finished
[2026-03-10 16:15:09] 2026-03-10 20:14:37,407 | INFO     | preparing for final server update for job 7035851354 in state='finished'
[2026-03-10 16:15:09] 2026-03-10 20:14:37,407 | INFO     | reading metadata from: /var/lib/boinc-client/slots/2/PanDA_Pilot-7035851354/jobReport.json
[2026-03-10 16:15:09] 2026-03-10 20:14:37,409 | INFO     | added worker_node to metadata from /var/lib/boinc-client/slots/2/workernode_map.json
[2026-03-10 16:15:09] 2026-03-10 20:14:37,409 | INFO     | this job has now completed (state=finished)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,409 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,409 | INFO     | log transfer has been attempted: DONE
[2026-03-10 16:15:09] 2026-03-10 20:14:37,409 | INFO     | job 7035851354 has finished - writing final server update
[2026-03-10 16:15:09] 2026-03-10 20:14:37,410 | INFO     | total number of processed events: 400 (read)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,414 | INFO     | using path: /var/lib/boinc-client/slots/2/PanDA_Pilot-7035851354/memory_monitor_summary.json (trf name=prmon)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,414 | INFO     | extracted standard info from prmon json
[2026-03-10 16:15:09] 2026-03-10 20:14:37,414 | INFO     | extracted standard memory fields from prmon json
[2026-03-10 16:15:09] 2026-03-10 20:14:37,414 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-10 16:15:09] 2026-03-10 20:14:37,414 | WARNING  | format EVNTtoHITS has no such key: dbData
[2026-03-10 16:15:09] 2026-03-10 20:14:37,415 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2026-03-10 16:15:09] 2026-03-10 20:14:37,416 | INFO     | fitting pss+swap vs Time
[2026-03-10 16:15:09] 2026-03-10 20:14:37,417 | INFO     | sum of square deviations: 1496660620.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,417 | INFO     | sum of deviations: 8655972956.000002
[2026-03-10 16:15:09] 2026-03-10 20:14:37,417 | INFO     | mean x: 1773168494.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,417 | INFO     | mean y: 2293691.023668639
[2026-03-10 16:15:09] 2026-03-10 20:14:37,418 | INFO     | intersect: -10252869253.328573
[2026-03-10 16:15:09] 2026-03-10 20:14:37,418 | INFO     | chi2: 1.0941509695284952
[2026-03-10 16:15:09] 2026-03-10 20:14:37,418 | INFO     | sum of square deviations: 1367709365.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | sum of deviations: 8430686566.500002
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | mean x: 1773168341.5
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | mean y: 2293424.5182926827
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | intersect: -10927679638.016916
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | chi2: 1.0948997986732343
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | current chi2=1.0948997986732343 (change=-0.06843928905549067 %)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | right removable region: 163
[2026-03-10 16:15:09] 2026-03-10 20:14:37,419 | INFO     | sum of square deviations: 1367709365.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,420 | INFO     | sum of deviations: -6489582722.000001
[2026-03-10 16:15:09] 2026-03-10 20:14:37,420 | INFO     | mean x: 1773168646.5
[2026-03-10 16:15:09] 2026-03-10 20:14:37,420 | INFO     | mean y: 2311233.6585365855
[2026-03-10 16:15:09] 2026-03-10 20:14:37,420 | INFO     | intersect: 8415739485.294826
[2026-03-10 16:15:09] 2026-03-10 20:14:37,420 | INFO     | chi2: 0.007425489580993172
[2026-03-10 16:15:09] 2026-03-10 20:14:37,420 | INFO     | current chi2=0.007425489580993172 (change=99.32134689016516 %)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,421 | INFO     | sum of square deviations: 1246386160.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,421 | INFO     | sum of deviations: -5994617254.000001
[2026-03-10 16:15:09] 2026-03-10 20:14:37,421 | INFO     | mean x: 1773168799.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,421 | INFO     | mean y: 2310608.037735849
[2026-03-10 16:15:09] 2026-03-10 20:14:37,421 | INFO     | intersect: 8530540957.402222
[2026-03-10 16:15:09] 2026-03-10 20:14:37,422 | INFO     | chi2: 0.007375780911837914
[2026-03-10 16:15:09] 2026-03-10 20:14:37,422 | INFO     | current chi2=0.007375780911837914 (change=0.669432885374938 %)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,422 | INFO     | left removable region: 20
[2026-03-10 16:15:09] 2026-03-10 20:14:37,422 | INFO     | sum of square deviations: 906703512.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,422 | INFO     | sum of deviations: -4158522988.0000005
[2026-03-10 16:15:09] 2026-03-10 20:14:37,422 | INFO     | mean x: 1773168921.0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | mean y: 2308530.93006993
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | intersect: 8134805672.493647
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | chi2: 0.006776946658178511
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | current memory leak: -4.59 B/s (using 143 data points, chi2=0.01)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | could have reported an average CPU frequency of 3823 MHz (9 samples)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | ..............................
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | . Timing measurements:
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | . get job = 0 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | . initial setup = 3 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | . payload setup = 2 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | . stage-in = 0 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | . payload execution = 10300 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,423 | INFO     | . stage-out = 0 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,424 | INFO     | . log creation = 0 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,424 | INFO     | ..............................
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | 
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | job summary report
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | --------------------------------------------------
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | PanDA job id: 7035851354
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | task id: 48844454
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | errors: (none)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | status: LOG_TRANSFER = DONE 
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | pilot state: finished 
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | transexitcode: 0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,487 | INFO     | exeerrorcode: 0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | exeerrordiag: 
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | exitcode: 0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | exitmsg: OK
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | cpuconsumptiontime: 40062 s
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | nevents: 400
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | neventsw: 0
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | pid: 1413337
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | pgrp: 1413337
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | corecount: 4
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | event service: False
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | sizes: {0: 2279911, 1: 2280372, 11: 2280372, 10307: 2309224, 10308: 2318236, 10311: 2318292, 10313: 2318678}
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | --------------------------------------------------
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | 
[2026-03-10 16:15:09] 2026-03-10 20:14:37,488 | INFO     | executing command: ls -lF /var/lib/boinc-client/slots/2
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue jobs had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue payloads had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue data_in had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue data_out had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue current_data_in had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,504 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue completed_jobids has 1 job(s)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | queue messages had 0 job(s) [purged]
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | job 7035851354 has completed (purged errors)
[2026-03-10 16:15:09] 2026-03-10 20:14:37,505 | INFO     | overall cleanup function is called
[2026-03-10 16:15:09] 2026-03-10 20:14:38,512 | INFO     | --- collectZombieJob: --- 10, [1413337]
[2026-03-10 16:15:09] 2026-03-10 20:14:38,512 | INFO     | zombie collector waiting for pid 1413337
[2026-03-10 16:15:09] 2026-03-10 20:14:38,512 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-03-10 16:15:09] 2026-03-10 20:14:38,512 | INFO     | collected zombie processes
[2026-03-10 16:15:09] 2026-03-10 20:14:38,512 | INFO     | will attempt to kill all subprocesses of pid=1413337
[2026-03-10 16:15:09] 2026-03-10 20:14:38,653 | INFO     | process IDs to be killed: [1413337] (in reverse order)
[2026-03-10 16:15:09] 2026-03-10 20:14:38,708 | WARNING  | found no corresponding commands to process id(s)
[2026-03-10 16:15:09] 2026-03-10 20:14:38,708 | INFO     | Do not look for orphan processes in BOINC jobs
[2026-03-10 16:15:09] 2026-03-10 20:14:38,712 | INFO     | did not find any defunct processes belonging to 1413337
[2026-03-10 16:15:09] 2026-03-10 20:14:38,716 | INFO     | did not find any defunct processes belonging to 1413337
[2026-03-10 16:15:09] 2026-03-10 20:14:38,716 | INFO     | ready for new job
[2026-03-10 16:15:09] 2026-03-10 20:14:38,716 | INFO     | pilot has finished with previous job - re-establishing logging
[2026-03-10 16:15:09] 2026-03-10 20:14:38,717 | INFO     | **************************************
[2026-03-10 16:15:09] 2026-03-10 20:14:38,717 | INFO     | ***  PanDA Pilot version 3.11.4.1  ***
[2026-03-10 16:15:09] 2026-03-10 20:14:38,717 | INFO     | **************************************
[2026-03-10 16:15:09] 2026-03-10 20:14:38,717 | INFO     | 
[2026-03-10 16:15:09] 2026-03-10 20:14:38,719 | INFO     | architecture information:
[2026-03-10 16:15:09] 2026-03-10 20:14:38,719 | INFO     | executing command: cat /etc/os-release
[2026-03-10 16:15:09] 2026-03-10 20:14:38,730 | INFO     | cat /etc/os-release:
[2026-03-10 16:15:09] NAME="CentOS Linux"
[2026-03-10 16:15:09] VERSION="7 (Core)"
[2026-03-10 16:15:09] ID="centos"
[2026-03-10 16:15:09] ID_LIKE="rhel fedora"
[2026-03-10 16:15:09] VERSION_ID="7"
[2026-03-10 16:15:09] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-03-10 16:15:09] ANSI_COLOR="0;31"
[2026-03-10 16:15:09] CPE_NAME="cpe:/o:centos:centos:7"
[2026-03-10 16:15:09] HOME_URL="https://www.centos.org/"
[2026-03-10 16:15:09] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-03-10 16:15:09] 
[2026-03-10 16:15:09] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-03-10 16:15:09] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-03-10 16:15:09] REDHAT_SUPPORT_PRODUCT="centos"
[2026-03-10 16:15:09] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-03-10 16:15:09] 
[2026-03-10 16:15:09] 2026-03-10 20:14:38,731 | INFO     | **************************************
[2026-03-10 16:15:09] 2026-03-10 20:14:39,234 | INFO     | executing command: df -mP /var/lib/boinc-client/slots/2
[2026-03-10 16:15:09] 2026-03-10 20:14:39,245 | INFO     | sufficient remaining disk space (400839147520 B)
[2026-03-10 16:15:09] 2026-03-10 20:14:39,246 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2026-03-10 16:15:09] 2026-03-10 20:14:39,246 | INFO     | current server update state: UPDATING_FINAL
[2026-03-10 16:15:09] 2026-03-10 20:14:39,246 | INFO     | update_server=False
[2026-03-10 16:15:09] 2026-03-10 20:14:39,246 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-03-10 16:15:09] 2026-03-10 20:14:39,246 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2026-03-10 16:15:09] 2026-03-10 20:14:39,671 | INFO     | all data control threads have been joined
[2026-03-10 16:15:09] 2026-03-10 20:14:39,730 | INFO     | all payload control threads have been joined
[2026-03-10 16:15:09] 2026-03-10 20:14:40,117 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2026-03-10 16:15:09] 2026-03-10 20:14:40,128 | INFO     | all job control threads have been joined
[2026-03-10 16:15:09] 2026-03-10 20:14:40,249 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2026-03-10 16:15:09] 2026-03-10 20:14:40,249 | INFO     | aborting loop
[2026-03-10 16:15:09] 2026-03-10 20:14:40,251 | INFO     | [job] queue monitor thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:40,251 | INFO     | [job] retrieve thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:40,593 | INFO     | [payload] validate_post thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:40,676 | INFO     | [data] control thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:40,734 | INFO     | [payload] control thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:40,835 | INFO     | [job] validate thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,106 | INFO     | [payload] validate_pre thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,120 | INFO     | [payload] execute_payloads thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,133 | INFO     | [job] control thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,179 | INFO     | [job] create_data_payload thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,252 | INFO     | [data] copytool_in thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,254 | INFO     | [job] job monitor thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,472 | INFO     | [payload] failed_post thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:41,555 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-03-10 16:15:09] 2026-03-10 20:14:42,122 | INFO     | [data] copytool_out thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:45,560 | INFO     | [data] queue_monitor thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:49,553 | INFO     | job.realtimelogging is not enabled
[2026-03-10 16:15:09] 2026-03-10 20:14:50,252 | INFO     | [monitor] cgroup control has ended
[2026-03-10 16:15:09] 2026-03-10 20:14:50,558 | INFO     | [payload] run_realtimelog thread has finished
[2026-03-10 16:15:09] 2026-03-10 20:14:50,716 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 127406387205952)>', '<ExcThread(monitor, started 127405758863104)>']
[2026-03-10 16:15:09] 2026-03-10 20:14:55,741 | INFO     | all workflow threads have been joined
[2026-03-10 16:15:09] 2026-03-10 20:14:55,742 | INFO     | end of generic workflow (traces error code: 0)
[2026-03-10 16:15:09] 2026-03-10 20:14:55,742 | INFO     | traces error code: 0
[2026-03-10 16:15:09] 2026-03-10 20:14:55,742 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2026-03-10 16:15:09] 2026-03-10 20:15:09,503 | INFO     | PID=1408606 has CPU usage=2.9% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P
[2026-03-10 16:15:09] 2026-03-10 20:15:09,504 | INFO     | .. there are 8 such processes running
[2026-03-10 16:15:09] 2026-03-10 20:15:09,504 | INFO     | found 0 job(s) in 20 queues
[2026-03-10 16:15:09] 2026-03-10 20:15:09,504 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2026-03-10 16:15:09] 2026-03-10 20:15:09,504 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2026-03-10 16:15:09] 2026-03-10 20:15:09,504 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2026-03-10 16:15:09] 2026-03-10 20:15:09,504 | INFO     | [monitor] control thread has ended
[2026-03-10 16:15:09] 2026-03-10 20:15:09,573 [wrapper] ==== pilot stdout END ====
[2026-03-10 16:15:09] 2026-03-10 20:15:09,576 [wrapper] ==== wrapper stdout RESUME ====
[2026-03-10 16:15:09] 2026-03-10 20:15:09,578 [wrapper] pilotpid: 1408606
[2026-03-10 16:15:09] 2026-03-10 20:15:09,580 [wrapper] Pilot exit status: 0
[2026-03-10 16:15:09] 2026-03-10 20:15:09,587 [wrapper] pandaids: 7035851354
[2026-03-10 16:15:09] 2026-03-10 20:15:09,620 [wrapper] cleanup supervisor_pilot 1579679 1408607
[2026-03-10 16:15:09] 2026-03-10 20:15:09,622 [wrapper] Test setup, not cleaning
[2026-03-10 16:15:09] 2026-03-10 20:15:09,625 [wrapper] apfmon messages muted
[2026-03-10 16:15:09] 2026-03-10 20:15:09,627 [wrapper] ==== wrapper stdout END ====
[2026-03-10 16:15:09] 2026-03-10 20:15:09,629 [wrapper] ==== wrapper stderr END ====
[2026-03-10 16:15:09]  *** Error codes and diagnostics ***
[2026-03-10 16:15:09]     "exeErrorCode": 0,
[2026-03-10 16:15:09]     "exeErrorDiag": "",
[2026-03-10 16:15:09]     "pilotErrorCode": 0,
[2026-03-10 16:15:09]     "pilotErrorDiag": "",
[2026-03-10 16:15:09]  *** Listing of results directory ***
[2026-03-10 16:15:09] total 343240
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc    584627 Mar  2 01:15 pilot3.tar.gz
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc      5111 Mar  2 01:15 queuedata.json
[2026-03-10 16:15:09] -rwx------ 1 boinc boinc     36322 Mar  2 01:15 runpilot2-wrapper.sh
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc       100 Mar 10 13:22 wrapper_26015_x86_64-pc-linux-gnu
[2026-03-10 16:15:09] -rwxr-xr-x 1 boinc boinc      7986 Mar 10 13:22 run_atlas
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc       105 Mar 10 13:22 job.xml
[2026-03-10 16:15:09] -rw-r--r-- 2 boinc boinc    597981 Mar 10 13:22 input.tar.gz
[2026-03-10 16:15:09] -rw-r--r-- 2 boinc boinc 200099818 Mar 10 13:22 EVNT.48453336._000001.pool.root.1
[2026-03-10 16:15:09] -rw-r--r-- 2 boinc boinc     15845 Mar 10 13:22 start_atlas.sh
[2026-03-10 16:15:09] drwxrwx--x 2 boinc boinc      4096 Mar 10 13:22 shared
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc      6477 Mar 10 13:22 init_data.xml
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc         0 Mar 10 13:22 boinc_setup_complete
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc         0 Mar 10 13:22 boinc_lockfile
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc      2647 Mar 10 13:22 pandaJob.out
[2026-03-10 16:15:09] -rw------- 1 boinc boinc   1006729 Mar 10 13:22 agis_schedconf.cvmfs.json
[2026-03-10 16:15:09] -rw------- 1 boinc boinc   1511578 Mar 10 13:22 agis_ddmendpoints.agis.ALL.json
[2026-03-10 16:15:09] -rw------- 1 boinc boinc       424 Mar 10 13:22 workernode_map.json
[2026-03-10 16:15:09] drwx------ 5 boinc boinc      4096 Mar 10 13:22 pilot3
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc       530 Mar 10 16:14 boinc_task_state.xml
[2026-03-10 16:15:09] -rw------- 1 boinc boinc 143772181 Mar 10 16:14 HITS.48844454._000009.pool.root.1
[2026-03-10 16:15:09] -rw------- 1 boinc boinc      1026 Mar 10 16:14 memory_monitor_summary.json
[2026-03-10 16:15:09] -rw------- 1 boinc boinc        95 Mar 10 16:14 pilot_heartbeat.json
[2026-03-10 16:15:09] -rw------- 1 boinc boinc    340473 Mar 10 16:14 log.48844454._000009.job.log.tgz.1
[2026-03-10 16:15:09] -rw------- 1 boinc boinc      6283 Mar 10 16:14 heartbeat.json
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc        28 Mar 10 16:15 wrapper_checkpoint.txt
[2026-03-10 16:15:09] -rw------- 1 boinc boinc       822 Mar 10 16:15 pilotlog.txt
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc      8192 Mar 10 16:15 boinc_mmap_file
[2026-03-10 16:15:09] -rw------- 1 boinc boinc   1492203 Mar 10 16:15 log.48844454._000009.job.log.1
[2026-03-10 16:15:09] -rw------- 1 boinc boinc       357 Mar 10 16:15 output.list
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc       620 Mar 10 16:15 runtime_log
[2026-03-10 16:15:09] -rw------- 1 boinc boinc   1853440 Mar 10 16:15 result.tar.gz
[2026-03-10 16:15:09] -rw------- 1 boinc boinc       646 Mar 10 16:15 VgaMDmNkBH9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmvjCLDmuGy2nm.diag
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc      8892 Mar 10 16:15 runtime_log.err
[2026-03-10 16:15:09] -rw-r--r-- 1 boinc boinc     21359 Mar 10 16:15 stderr.txt
[2026-03-10 16:15:09] HITS file was successfully produced:
[2026-03-10 16:15:09] -rw------- 1 boinc boinc 143772181 Mar 10 16:14 shared/HITS.pool.root.1
[2026-03-10 16:15:09]  *** Contents of shared directory: ***
[2026-03-10 16:15:09] total 338236
[2026-03-10 16:15:09] -rw-r--r-- 2 boinc boinc    597981 Mar 10 13:22 input.tar.gz
[2026-03-10 16:15:09] -rw-r--r-- 2 boinc boinc 200099818 Mar 10 13:22 ATLAS.root_0
[2026-03-10 16:15:09] -rw-r--r-- 2 boinc boinc     15845 Mar 10 13:22 start_atlas.sh
[2026-03-10 16:15:09] -rw------- 1 boinc boinc 143772181 Mar 10 16:14 HITS.pool.root.1
[2026-03-10 16:15:09] -rw------- 1 boinc boinc   1853440 Mar 10 16:15 result.tar.gz
16:15:10 (1400449): run_atlas exited; CPU time 40213.568844
16:15:10 (1400449): called boinc_finish(0)

</stderr_txt>
]]>


©2026 CERN