Name 5SZNDmqVzP8n9Rq4apOajLDm4fhM0noT9bVof3QYDmoCJMDmRqigEo_2
Workunit 235557646
Created 14 Oct 2025, 8:29:29 UTC
Sent 14 Oct 2025, 8:29:31 UTC
Report deadline 22 Oct 2025, 8:29:31 UTC
Received 14 Oct 2025, 9:20:06 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 10874972
Run time 46 min 19 sec
CPU time 5 hours 50 min 45 sec
Validate state Valid
Credit 266.46
Device peak FLOPS 35.67 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 2.72 GB
Peak swap size 31.65 GB
Peak disk usage 586.97 MB

Stderr output

<core_client_version>8.1.0</core_client_version>
<![CDATA[
<stderr_txt>
04:29:49 (3387827): wrapper (7.7.26015): starting
04:29:49 (3387827): wrapper: running run_atlas (--nthreads 11)
[2025-10-14 04:29:49] Arguments: --nthreads 11
[2025-10-14 04:29:49] Threads: 11
[2025-10-14 04:29:49] Checking for CVMFS
[2025-10-14 04:29:49] Probing /cvmfs/atlas.cern.ch... OK
[2025-10-14 04:29:49] Probing /cvmfs/atlas-condb.cern.ch... OK
[2025-10-14 04:29:49] Running cvmfs_config stat atlas.cern.ch
[2025-10-14 04:29:50] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2025-10-14 04:29:50] 2.13.2.0 3090366 1076 78500 151690 1 114 31775738 32503808 35433 16776704 0 8321362 99.757 5393975 30430 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.231.239:6081 1
[2025-10-14 04:29:50] CVMFS is ok
[2025-10-14 04:29:50] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2025-10-14 04:29:50] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2025-10-14 04:29:50] Further information can be found at the LHC@home message board.
[2025-10-14 04:29:50] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2025-10-14 04:29:50] Checking for apptainer binary...
[2025-10-14 04:29:50] Using apptainer found in PATH at /usr/bin/apptainer
[2025-10-14 04:29:50] Running /usr/bin/apptainer --version
[2025-10-14 04:29:50] apptainer version 1.4.3-1.el9
[2025-10-14 04:29:50] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2025-10-14 04:29:50] c8-7-2.aglt2.org
[2025-10-14 04:29:50] apptainer works
[2025-10-14 04:29:50] Set ATHENA_PROC_NUMBER=11
[2025-10-14 04:29:50] Set ATHENA_CORE_NUMBER=11
[2025-10-14 04:29:50] Starting ATLAS job with PandaID=6832673502
[2025-10-14 04:29:50] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2025-10-14 05:16:23]  *** The last 200 lines of the pilot log: ***
[2025-10-14 05:16:23]  workdir=None
[2025-10-14 05:16:23] ]
[2025-10-14 05:16:23] 2025-10-14 09:15:16,523 | INFO     | transferring file log.46376905._111490.job.log.tgz.1 from /tmp/boinchome/slots/0/PanDA_Pilot-6832673502/log.46376905._111490.job.log.tgz.1 to /tmp/boinchome/slots/
[2025-10-14 05:16:23] 2025-10-14 09:15:16,524 | INFO     | executing command: /usr/bin/env mv /tmp/boinchome/slots/0/PanDA_Pilot-6832673502/log.46376905._111490.job.log.tgz.1 /tmp/boinchome/slots/0/log.46376905._111490.job
[2025-10-14 05:16:23] 2025-10-14 09:15:16,555 | INFO     | adding to output.list: log.46376905._111490.job.log.tgz.1 davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/mc23_13p6TeV/65/26/log.46376905._111490.job.log.tg
[2025-10-14 05:16:23] 2025-10-14 09:15:16,556 | INFO     | alt stage-out settings: ['pl', 'write_lan', 'w', 'default'], allow_altstageout=False, remain_files=0, has_altstorage=True
[2025-10-14 05:16:23] 2025-10-14 09:15:16,556 | INFO     | summary of transferred files:
[2025-10-14 05:16:23] 2025-10-14 09:15:16,556 | INFO     |  -- lfn=log.46376905._111490.job.log.tgz.1, status_code=0, status=transferred
[2025-10-14 05:16:23] 2025-10-14 09:15:16,556 | INFO     | stage-out finished correctly
[2025-10-14 05:16:23] 2025-10-14 09:15:17,975 | INFO     | finished stage-out for finished payload, adding job to finished_jobs queue
[2025-10-14 05:16:23] 2025-10-14 09:15:18,068 | INFO     | time since job start (2706s) is within the limit (172800.0s)
[2025-10-14 05:16:23] 2025-10-14 09:15:20,075 | INFO     | time since job start (2708s) is within the limit (172800.0s)
[2025-10-14 05:16:23] 2025-10-14 09:15:20,155 | WARNING  | process 3405396 can no longer be monitored (due to stat problems) - aborting
[2025-10-14 05:16:23] 2025-10-14 09:15:20,155 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-6832673502/memory_monitor_summary.json (trf name=prmon)
[2025-10-14 05:16:23] 2025-10-14 09:15:20,662 | INFO     | number of running child processes to parent process 3405396: 1
[2025-10-14 05:16:23] 2025-10-14 09:15:20,662 | INFO     | maximum number of monitored processes: 6
[2025-10-14 05:16:23] 2025-10-14 09:15:20,874 | INFO     | job 6832673502 has state=finished
[2025-10-14 05:16:23] 2025-10-14 09:15:20,874 | INFO     | preparing for final server update for job 6832673502 in state='finished'
[2025-10-14 05:16:23] 2025-10-14 09:15:20,874 | INFO     | this job has now completed (state=finished)
[2025-10-14 05:16:23] 2025-10-14 09:15:20,874 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2025-10-14 05:16:23] 2025-10-14 09:15:20,874 | INFO     | log transfer has been attempted: DONE
[2025-10-14 05:16:23] 2025-10-14 09:15:20,875 | INFO     | job 6832673502 has finished - writing final server update
[2025-10-14 05:16:23] 2025-10-14 09:15:20,875 | INFO     | total number of processed events: 400 (read)
[2025-10-14 05:16:23] 2025-10-14 09:15:20,887 | INFO     | executing command: lscpu
[2025-10-14 05:16:23] 2025-10-14 09:15:21,041 | INFO     | executing command: export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase;source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh --quiet;lsetup
[2025-10-14 05:16:23] 2025-10-14 09:15:22,082 | INFO     | time since job start (2710s) is within the limit (172800.0s)
[2025-10-14 05:16:23] 2025-10-14 09:15:23,167 | INFO     | monitor loop #186: job 0:6832673502 is in state 'finished'
[2025-10-14 05:16:23] 2025-10-14 09:15:23,169 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2025-10-14 05:16:23] 2025-10-14 09:15:24,089 | INFO     | time since job start (2712s) is within the limit (172800.0s)
[2025-10-14 05:16:23] 2025-10-14 09:15:25,673 | INFO     | monitor loop #187: job 0:6832673502 is in state 'finished'
[2025-10-14 05:16:23] 2025-10-14 09:15:25,674 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2025-10-14 05:16:23] 2025-10-14 09:15:26,096 | INFO     | time since job start (2714s) is within the limit (172800.0s)
[2025-10-14 05:16:23] 2025-10-14 09:15:27,878 | INFO     | PID=3394070 has CPU usage=14.2% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i 
[2025-10-14 05:16:23] 2025-10-14 09:15:27,878 | INFO     | .. there are 36 such processes running
[2025-10-14 05:16:23] 2025-10-14 09:15:28,192 | INFO     | monitor loop #188: job 0:6832673502 is in state 'finished'
[2025-10-14 05:16:23] 2025-10-14 09:15:28,192 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2025-10-14 05:16:23] 2025-10-14 09:15:28,691 | INFO     | CPU arch script returned: x86-64-v4
[2025-10-14 05:16:23] 2025-10-14 09:15:28,698 | INFO     | found 64 cores (32 cores per socket, 2 sockets) HT, CPU MHz: 3775.6389296875004
[2025-10-14 05:16:23] 2025-10-14 09:15:28,710 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-6832673502/memory_monitor_summary.json (trf name=prmon)
[2025-10-14 05:16:23] 2025-10-14 09:15:28,711 | INFO     | extracted standard info from prmon json
[2025-10-14 05:16:23] 2025-10-14 09:15:28,711 | INFO     | extracted standard memory fields from prmon json
[2025-10-14 05:16:23] 2025-10-14 09:15:28,711 | WARNING  | GPU info not found in prmon json: 'gpu'
[2025-10-14 05:16:23] 2025-10-14 09:15:28,715 | WARNING  | format EVNTtoHITS has no such key: dbData
[2025-10-14 05:16:23] 2025-10-14 09:15:28,715 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | fitting pss+swap vs Time
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | sum of square deviations: 15695178.0
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | sum of deviations: 385084947.99999994
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | mean x: 1760432053.0
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | mean y: 3077724.189189189
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | -- intersect: -43189543957.8993
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | intersect: -43189543957.8993
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | chi2: 0.0002302853984829374
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | -- intersect: -43189543957.8993
[2025-10-14 05:16:23] 2025-10-14 09:15:28,716 | INFO     | current memory leak: 24.54 B/s (using 37 data points, chi2=0.00)
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | could have reported an average CPU frequency of 3778 MHz (5 samples)
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | ..............................
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . Timing measurements:
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . get job = 0 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . initial setup = 2 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . payload setup = 10 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . stage-in = 0 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . payload execution = 2669 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . stage-out = 0 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | . log creation = 0 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,717 | INFO     | ..............................
[2025-10-14 05:16:23] 2025-10-14 09:15:28,772 | INFO     | 
[2025-10-14 05:16:23] 2025-10-14 09:15:28,774 | INFO     | job summary report
[2025-10-14 05:16:23] 2025-10-14 09:15:28,774 | INFO     | --------------------------------------------------
[2025-10-14 05:16:23] 2025-10-14 09:15:28,774 | INFO     | PanDA job id: 6832673502
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | task id: 46376905
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | errors: (none)
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | status: LOG_TRANSFER = DONE 
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | pilot state: finished 
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | transexitcode: 0
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | exeerrorcode: 0
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | exeerrordiag: 
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | exitcode: 0
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | exitmsg: OK
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | cpuconsumptiontime: 20850 s
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | nevents: 400
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | neventsw: 0
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | pid: 3405396
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | pgrp: 3405396
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | corecount: 11
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | event service: False
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | sizes: {0: 2348820, 6: 2349082, 11: 2349110, 2683: 2374403, 2684: 2374402, 2686: 2383456, 2687: 2383640, 2698: 2383866}
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | --------------------------------------------------
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | 
[2025-10-14 05:16:23] 2025-10-14 09:15:28,775 | INFO     | executing command: ls -lF /tmp/boinchome/slots/0
[2025-10-14 05:16:23] 2025-10-14 09:15:28,810 | INFO     | queue jobs had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue payloads had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue data_in had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue data_out had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue current_data_in had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,811 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,812 | INFO     | queue completed_jobids has 1 job(s)
[2025-10-14 05:16:23] 2025-10-14 09:15:28,812 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,812 | INFO     | queue messages had 0 job(s) [purged]
[2025-10-14 05:16:23] 2025-10-14 09:15:28,812 | INFO     | job 6832673502 has completed (purged errors)
[2025-10-14 05:16:23] 2025-10-14 09:15:28,812 | INFO     | overall cleanup function is called
[2025-10-14 05:16:23] 2025-10-14 09:15:29,822 | INFO     | --- collectZombieJob: --- 10, [3405396]
[2025-10-14 05:16:23] 2025-10-14 09:15:29,822 | INFO     | zombie collector waiting for pid 3405396
[2025-10-14 05:16:23] 2025-10-14 09:15:29,822 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2025-10-14 05:16:23] 2025-10-14 09:15:29,823 | INFO     | collected zombie processes
[2025-10-14 05:16:23] 2025-10-14 09:15:29,823 | INFO     | will attempt to kill all subprocesses of pid=3405396
[2025-10-14 05:16:23] 2025-10-14 09:15:30,291 | INFO     | process IDs to be killed: [3405396] (in reverse order)
[2025-10-14 05:16:23] 2025-10-14 09:15:30,481 | WARNING  | found no corresponding commands to process id(s)
[2025-10-14 05:16:23] 2025-10-14 09:15:30,481 | INFO     | Do not look for orphan processes in BOINC jobs
[2025-10-14 05:16:23] 2025-10-14 09:15:30,507 | INFO     | did not find any defunct processes belonging to 3405396
[2025-10-14 05:16:23] 2025-10-14 09:15:30,531 | INFO     | did not find any defunct processes belonging to 3405396
[2025-10-14 05:16:23] 2025-10-14 09:15:30,531 | INFO     | ready for new job
[2025-10-14 05:16:23] 2025-10-14 09:15:30,531 | INFO     | pilot has finished with previous job - re-establishing logging
[2025-10-14 05:16:23] 2025-10-14 09:15:30,532 | INFO     | ***************************************
[2025-10-14 05:16:23] 2025-10-14 09:15:30,532 | INFO     | ***  PanDA Pilot version 3.10.5.57  ***
[2025-10-14 05:16:23] 2025-10-14 09:15:30,532 | INFO     | ***************************************
[2025-10-14 05:16:23] 2025-10-14 09:15:30,532 | INFO     | 
[2025-10-14 05:16:23] 2025-10-14 09:15:30,549 | INFO     | architecture information:
[2025-10-14 05:16:23] 2025-10-14 09:15:30,550 | INFO     | executing command: cat /etc/os-release
[2025-10-14 05:16:23] 2025-10-14 09:15:30,582 | INFO     | cat /etc/os-release:
[2025-10-14 05:16:23] NAME="CentOS Linux"
[2025-10-14 05:16:23] VERSION="7 (Core)"
[2025-10-14 05:16:23] ID="centos"
[2025-10-14 05:16:23] ID_LIKE="rhel fedora"
[2025-10-14 05:16:23] VERSION_ID="7"
[2025-10-14 05:16:23] PRETTY_NAME="CentOS Linux 7 (Core)"
[2025-10-14 05:16:23] ANSI_COLOR="0;31"
[2025-10-14 05:16:23] CPE_NAME="cpe:/o:centos:centos:7"
[2025-10-14 05:16:23] HOME_URL="https://www.centos.org/"
[2025-10-14 05:16:23] BUG_REPORT_URL="https://bugs.centos.org/"
[2025-10-14 05:16:23] 
[2025-10-14 05:16:23] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2025-10-14 05:16:23] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2025-10-14 05:16:23] REDHAT_SUPPORT_PRODUCT="centos"
[2025-10-14 05:16:23] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2025-10-14 05:16:23] 
[2025-10-14 05:16:23] 2025-10-14 09:15:30,582 | INFO     | ***************************************
[2025-10-14 05:16:23] 2025-10-14 09:15:31,085 | INFO     | executing command: df -mP /tmp/boinchome/slots/0
[2025-10-14 05:16:23] 2025-10-14 09:15:31,465 | INFO     | sufficient remaining disk space (96684998656 B)
[2025-10-14 05:16:23] 2025-10-14 09:15:31,465 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2025-10-14 05:16:23] 2025-10-14 09:15:31,465 | INFO     | current server update state: UPDATING_FINAL
[2025-10-14 05:16:23] 2025-10-14 09:15:31,466 | INFO     | update_server=False
[2025-10-14 05:16:23] 2025-10-14 09:15:31,466 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2025-10-14 05:16:23] 2025-10-14 09:15:31,713 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2025-10-14 05:16:23] 2025-10-14 09:15:31,714 | INFO     | aborting loop
[2025-10-14 05:16:23] 2025-10-14 09:15:31,928 | INFO     | all data control threads have been joined
[2025-10-14 05:16:23] 2025-10-14 09:15:32,157 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2025-10-14 05:16:23] 2025-10-14 09:15:32,383 | INFO     | all payload control threads have been joined
[2025-10-14 05:16:23] 2025-10-14 09:15:32,465 | INFO     | all job control threads have been joined
[2025-10-14 05:16:23] 2025-10-14 09:15:32,471 | INFO     | [job] retrieve thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:32,722 | INFO     | [job] job monitor thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:32,934 | INFO     | [data] control thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,063 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2025-10-14 05:16:23] 2025-10-14 09:15:33,097 | INFO     | [job] create_data_payload thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,147 | INFO     | [payload] validate_post thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,154 | INFO     | [job] validate thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,321 | INFO     | [payload] failed_post thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,390 | INFO     | [payload] control thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,471 | INFO     | [job] control thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,518 | INFO     | [payload] execute_payloads thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,724 | INFO     | [data] copytool_in thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,772 | INFO     | [payload] validate_pre thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:33,786 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2025-10-14 05:16:23] 2025-10-14 09:15:33,859 | INFO     | [payload] run_realtimelog thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:34,165 | INFO     | [data] copytool_out thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:34,792 | INFO     | [job] queue monitor thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:15:37,076 | INFO     | [data] queue_monitor thread has finished
[2025-10-14 05:16:23] 2025-10-14 09:16:00,732 | INFO     | PID=3394070 has CPU usage=3.7% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P
[2025-10-14 05:16:23] 2025-10-14 09:16:00,733 | INFO     | .. there are 36 such processes running
[2025-10-14 05:16:23] 2025-10-14 09:16:00,733 | INFO     | found 0 job(s) in 20 queues
[2025-10-14 05:16:23] 2025-10-14 09:16:00,733 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2025-10-14 05:16:23] 2025-10-14 09:16:00,734 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2025-10-14 05:16:23] 2025-10-14 09:16:16,967 | INFO     | [monitor] cgroup control has ended
[2025-10-14 05:16:23] 2025-10-14 09:16:17,993 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140632046786368)>', '<ExcThread(monitor, started 140631674427136)>']
[2025-10-14 05:16:23] 2025-10-14 09:16:18,833 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2025-10-14 05:16:23] 2025-10-14 09:16:18,833 | INFO     | [monitor] control thread has ended
[2025-10-14 05:16:23] 2025-10-14 09:16:23,019 | INFO     | all workflow threads have been joined
[2025-10-14 05:16:23] 2025-10-14 09:16:23,020 | INFO     | end of generic workflow (traces error code: 0)
[2025-10-14 05:16:23] 2025-10-14 09:16:23,020 | INFO     | traces error code: 0
[2025-10-14 05:16:23] 2025-10-14 09:16:23,020 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2025-10-14 05:16:23] 2025-10-14 09:16:23,321 [wrapper] ==== pilot stdout END ====
[2025-10-14 05:16:23] 2025-10-14 09:16:23,324 [wrapper] ==== wrapper stdout RESUME ====
[2025-10-14 05:16:23] 2025-10-14 09:16:23,329 [wrapper] pilotpid: 3394070
[2025-10-14 05:16:23] 2025-10-14 09:16:23,332 [wrapper] Pilot exit status: 0
[2025-10-14 05:16:23] 2025-10-14 09:16:23,362 [wrapper] pandaids: 6832673502
[2025-10-14 05:16:23] 2025-10-14 09:16:23,544 [wrapper] cleanup supervisor_pilot 3734461 3394071
[2025-10-14 05:16:23] 2025-10-14 09:16:23,546 [wrapper] Test setup, not cleaning
[2025-10-14 05:16:23] 2025-10-14 09:16:23,551 [wrapper] apfmon messages muted
[2025-10-14 05:16:23] 2025-10-14 09:16:23,555 [wrapper] ==== wrapper stdout END ====
[2025-10-14 05:16:23] 2025-10-14 09:16:23,560 [wrapper] ==== wrapper stderr END ====
[2025-10-14 05:16:23]  *** Error codes and diagnostics ***
[2025-10-14 05:16:23]     "exeErrorCode": 0,
[2025-10-14 05:16:23]     "exeErrorDiag": "",
[2025-10-14 05:16:23]     "pilotErrorCode": 0,
[2025-10-14 05:16:23]     "pilotErrorDiag": "",
[2025-10-14 05:16:23]  *** Listing of results directory ***
[2025-10-14 05:16:23] total 389956
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas    552446 Oct  6 08:07 pilot3.tar.gz
[2025-10-14 05:16:23] -rwx------. 1 boincer umatlas     36292 Oct  6 08:28 runpilot2-wrapper.sh
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas      5112 Oct  6 08:28 queuedata.json
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas       100 Oct 14 04:29 wrapper_26015_x86_64-pc-linux-gnu
[2025-10-14 05:16:23] -rwxr-xr-x. 1 boincer umatlas      7986 Oct 14 04:29 run_atlas
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas       105 Oct 14 04:29 job.xml
[2025-10-14 05:16:23] -rw-r--r--. 2 boincer umatlas 212961024 Oct 14 04:29 EVNT.46376901._004133.pool.root.1
[2025-10-14 05:16:23] -rw-r--r--. 2 boincer umatlas    565604 Oct 14 04:29 input.tar.gz
[2025-10-14 05:16:23] -rw-r--r--. 2 boincer umatlas     15120 Oct 14 04:29 start_atlas.sh
[2025-10-14 05:16:23] drwxrwx--x. 2 boincer umatlas      4096 Oct 14 04:29 shared
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas         0 Oct 14 04:29 boinc_lockfile
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas      2521 Oct 14 04:29 pandaJob.out
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas    972799 Oct 14 04:30 agis_schedconf.cvmfs.json
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas   1560580 Oct 14 04:30 agis_ddmendpoints.agis.ALL.json
[2025-10-14 05:16:23] drwx------. 4 boincer umatlas      4096 Oct 14 04:30 pilot3
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas      7159 Oct 14 05:11 init_data.xml
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas 180951226 Oct 14 05:14 HITS.46376905._111490.pool.root.1
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas       530 Oct 14 05:14 boinc_task_state.xml
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas      1043 Oct 14 05:15 memory_monitor_summary.json
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas    351807 Oct 14 05:15 log.46376905._111490.job.log.tgz.1
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas      7795 Oct 14 05:15 heartbeat.json
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas        95 Oct 14 05:16 pilot_heartbeat.json
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas        27 Oct 14 05:16 wrapper_checkpoint.txt
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas      8192 Oct 14 05:16 boinc_mmap_file
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas      4824 Oct 14 05:16 pilotlog.txt
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas    406350 Oct 14 05:16 log.46376905._111490.job.log.1
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas       357 Oct 14 05:16 output.list
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas       620 Oct 14 05:16 runtime_log
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas    778240 Oct 14 05:16 result.tar.gz
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas      8695 Oct 14 05:16 runtime_log.err
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas       651 Oct 14 05:16 5SZNDmqVzP8n9Rq4apOajLDm4fhM0noT9bVof3QYDmoCJMDmRqigEo.diag
[2025-10-14 05:16:23] -rw-r--r--. 1 boincer umatlas     22397 Oct 14 05:16 stderr.txt
[2025-10-14 05:16:23] HITS file was successfully produced:
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas 180951226 Oct 14 05:14 shared/HITS.pool.root.1
[2025-10-14 05:16:23]  *** Contents of shared directory: ***
[2025-10-14 05:16:23] total 386020
[2025-10-14 05:16:23] -rw-r--r--. 2 boincer umatlas 212961024 Oct 14 04:29 ATLAS.root_0
[2025-10-14 05:16:23] -rw-r--r--. 2 boincer umatlas    565604 Oct 14 04:29 input.tar.gz
[2025-10-14 05:16:23] -rw-r--r--. 2 boincer umatlas     15120 Oct 14 04:29 start_atlas.sh
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas 180951226 Oct 14 05:14 HITS.pool.root.1
[2025-10-14 05:16:23] -rw-------. 1 boincer umatlas    778240 Oct 14 05:16 result.tar.gz
05:16:25 (3387827): run_atlas exited; CPU time 20901.393893
05:16:25 (3387827): called boinc_finish(0)

</stderr_txt>
]]>


©2025 CERN