Job instance 434226689

Name	xgHLDmurhQ9nsSi4ap6QjLDmwznN0nGgGQJmrXlLDmC7HKDmHyaT2n_1
Workunit	240165289
Created	29 Mar 2026, 2:17:13 UTC
Sent	29 Mar 2026, 2:27:57 UTC
Report deadline	6 Apr 2026, 2:27:57 UTC
Received	29 Mar 2026, 18:37:59 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	10752018
Run time	2 hours 43 min 13 sec
CPU time	10 hours 27 min 32 sec
Priority	28
Validate state	Valid
Credit	617.19
Device peak FLOPS	37.97 GFLOPS
Application version	ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu
Peak working set size	2.51 GB
Peak swap size	2.85 GB
Peak disk usage	707.66 MB
Stderr output

<core_client_version>8.2.2</core_client_version>
<![CDATA[
<stderr_txt>
16:53:10 (2341836): wrapper (7.7.26015): starting
16:53:10 (2341836): wrapper: running run_atlas (--nthreads 4)
[2026-03-29 16:53:10] Arguments: --nthreads 4
[2026-03-29 16:53:10] Threads: 4
[2026-03-29 16:53:10] Checking for CVMFS
[2026-03-29 16:53:10] Probing /cvmfs/atlas.cern.ch... OK
[2026-03-29 16:53:11] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-03-29 16:53:11] Running cvmfs_config stat atlas.cern.ch
[2026-03-29 16:53:11] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-03-29 16:53:11] 2.9.2.0 2056898 2300 84844 157845 1 50 29196112 36864001 0 130560 0 1001714 99.984 86206 14693 http://cvmfs-stratum-one.cern.ch:8000/cvmfs/atlas.cern.ch http://130.183.36.13:3128 1
[2026-03-29 16:53:11] CVMFS is ok
[2026-03-29 16:53:11] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-29 16:53:11] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-03-29 16:53:11] Further information can be found at the LHC@home message board.
[2026-03-29 16:53:11] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-29 16:53:11] Checking for apptainer binary...
[2026-03-29 16:53:11] Using apptainer found in PATH at /usr/bin/apptainer
[2026-03-29 16:53:11] Running /usr/bin/apptainer --version
[2026-03-29 16:53:11] apptainer version 1.4.1-1.1
[2026-03-29 16:53:11] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-29 16:53:11] thA317a
[2026-03-29 16:53:11] apptainer works
[2026-03-29 16:53:11] Set ATHENA_PROC_NUMBER=4
[2026-03-29 16:53:11] Set ATHENA_CORE_NUMBER=4
[2026-03-29 16:53:11] Starting ATLAS job with PandaID=7071984732
[2026-03-29 16:53:11] Running command: /usr/bin/apptainer exec -B /cvmfs,/local/data/boinc/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2026-03-29 19:36:41]  *** The last 200 lines of the pilot log: ***
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue completed_jobids has 1 job(s)
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | queue messages had 0 job(s) [purged]
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | job 7071984732 has completed (purged errors)
[2026-03-29 19:36:41] 2026-03-29 17:35:53,522 | INFO     | overall cleanup function is called
[2026-03-29 19:36:41] 2026-03-29 17:35:54,529 | INFO     | --- collectZombieJob: --- 10, [2350453]
[2026-03-29 19:36:41] 2026-03-29 17:35:54,529 | INFO     | zombie collector waiting for pid 2350453
[2026-03-29 19:36:41] 2026-03-29 17:35:54,529 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-03-29 19:36:41] 2026-03-29 17:35:54,529 | INFO     | collected zombie processes
[2026-03-29 19:36:41] 2026-03-29 17:35:54,529 | INFO     | will attempt to kill all subprocesses of pid=2350453
[2026-03-29 19:36:41] 2026-03-29 17:35:54,596 | INFO     | process IDs to be killed: [2350453] (in reverse order)
[2026-03-29 19:36:41] 2026-03-29 17:35:54,630 | WARNING  | found no corresponding commands to process id(s)
[2026-03-29 19:36:41] 2026-03-29 17:35:54,630 | INFO     | Do not look for orphan processes in BOINC jobs
[2026-03-29 19:36:41] 2026-03-29 17:35:54,632 | INFO     | did not find any defunct processes belonging to 2350453
[2026-03-29 19:36:41] 2026-03-29 17:35:54,634 | INFO     | did not find any defunct processes belonging to 2350453
[2026-03-29 19:36:41] 2026-03-29 17:35:54,634 | INFO     | ready for new job
[2026-03-29 19:36:41] 2026-03-29 17:35:54,634 | INFO     | pilot has finished with previous job - re-establishing logging
[2026-03-29 19:36:41] 2026-03-29 17:35:54,635 | INFO     | **************************************
[2026-03-29 19:36:41] 2026-03-29 17:35:54,635 | INFO     | ***  PanDA Pilot version 3.11.5.1  ***
[2026-03-29 19:36:41] 2026-03-29 17:35:54,635 | INFO     | **************************************
[2026-03-29 19:36:41] 2026-03-29 17:35:54,635 | INFO     | 
[2026-03-29 19:36:41] 2026-03-29 17:35:54,635 | INFO     | architecture information:
[2026-03-29 19:36:41] 2026-03-29 17:35:54,635 | INFO     | executing command: cat /etc/os-release
[2026-03-29 19:36:41] 2026-03-29 17:35:54,646 | INFO     | cat /etc/os-release:
[2026-03-29 19:36:41] NAME="CentOS Linux"
[2026-03-29 19:36:41] VERSION="7 (Core)"
[2026-03-29 19:36:41] ID="centos"
[2026-03-29 19:36:41] ID_LIKE="rhel fedora"
[2026-03-29 19:36:41] VERSION_ID="7"
[2026-03-29 19:36:41] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-03-29 19:36:41] ANSI_COLOR="0;31"
[2026-03-29 19:36:41] CPE_NAME="cpe:/o:centos:centos:7"
[2026-03-29 19:36:41] HOME_URL="https://www.centos.org/"
[2026-03-29 19:36:41] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-03-29 19:36:41] 
[2026-03-29 19:36:41] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-03-29 19:36:41] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-03-29 19:36:41] REDHAT_SUPPORT_PRODUCT="centos"
[2026-03-29 19:36:41] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-03-29 19:36:41] 
[2026-03-29 19:36:41] 2026-03-29 17:35:54,646 | INFO     | **************************************
[2026-03-29 19:36:41] 2026-03-29 17:35:55,149 | INFO     | executing command: df -mP /local/data/boinc/slots/0
[2026-03-29 19:36:41] 2026-03-29 17:35:55,162 | INFO     | sufficient remaining disk space (229544820736 B)
[2026-03-29 19:36:41] 2026-03-29 17:35:55,162 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2026-03-29 19:36:41] 2026-03-29 17:35:55,162 | INFO     | current server update state: UPDATING_FINAL
[2026-03-29 19:36:41] 2026-03-29 17:35:55,162 | INFO     | update_server=False
[2026-03-29 19:36:41] 2026-03-29 17:35:55,162 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-03-29 19:36:41] 2026-03-29 17:35:55,163 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2026-03-29 19:36:41] 2026-03-29 17:35:55,163 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2026-03-29 19:36:41] 2026-03-29 17:35:55,163 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-03-29 19:36:41] 2026-03-29 17:35:55,543 | INFO     | all payload control threads have been joined
[2026-03-29 19:36:41] 2026-03-29 17:35:55,583 | INFO     | all job control threads have been joined
[2026-03-29 19:36:41] 2026-03-29 17:35:55,641 | WARNING  | process 2350453 can no longer be monitored (due to stat problems) - aborting
[2026-03-29 19:36:41] 2026-03-29 17:35:55,688 | INFO     | using path: /local/data/boinc/slots/0/memory_monitor_summary.json (trf name=prmon)
[2026-03-29 19:36:41] 2026-03-29 17:35:55,744 | INFO     | all data control threads have been joined
[2026-03-29 19:36:41] 2026-03-29 17:35:55,748 | INFO     | number of running child processes to parent process 2350453: 1
[2026-03-29 19:36:41] 2026-03-29 17:35:55,748 | INFO     | maximum number of monitored processes: 6
[2026-03-29 19:36:41] 2026-03-29 17:35:55,748 | INFO     | aborting job monitoring since job object (job id=7071984732) has expired
[2026-03-29 19:36:41] 2026-03-29 17:35:55,748 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2026-03-29 19:36:41] 2026-03-29 17:35:55,748 | INFO     | will abort loop
[2026-03-29 19:36:41] 2026-03-29 17:35:56,168 | INFO     | [job] retrieve thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,168 | INFO     | [job] queue monitor thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,259 | INFO     | [payload] validate_post thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,398 | INFO     | [payload] run_realtimelog thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,549 | INFO     | [payload] control thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,589 | INFO     | [job] control thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,617 | INFO     | [payload] failed_post thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,749 | INFO     | [data] control thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:56,754 | INFO     | [job] job monitor thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:57,099 | INFO     | [job] create_data_payload thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:57,100 | INFO     | [job] validate thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:57,169 | INFO     | [data] copytool_out thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:57,193 | INFO     | [payload] validate_pre thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:57,344 | INFO     | [data] copytool_in thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:57,436 | INFO     | [payload] execute_payloads thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:35:59,169 | INFO     | [data] queue_monitor thread has finished
[2026-03-29 19:36:41] 2026-03-29 17:36:24,880 | INFO     | PID=2345949 has CPU usage=2.1% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P
[2026-03-29 19:36:41] 2026-03-29 17:36:24,880 | INFO     | found 0 job(s) in 20 queues
[2026-03-29 19:36:41] 2026-03-29 17:36:24,880 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2026-03-29 19:36:41] 2026-03-29 17:36:24,880 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2026-03-29 19:36:41] 2026-03-29 17:36:34,414 | INFO     | [monitor] cgroup control has ended
[2026-03-29 19:36:41] 2026-03-29 17:36:36,395 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140183718532928)>', '<ExcThread(monitor, started 140183076849408)>']
[2026-03-29 19:36:41] 2026-03-29 17:36:36,944 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2026-03-29 19:36:41] 2026-03-29 17:36:36,945 | INFO     | [monitor] control thread has ended
[2026-03-29 19:36:41] 2026-03-29 17:36:41,421 | INFO     | all workflow threads have been joined
[2026-03-29 19:36:41] 2026-03-29 17:36:41,421 | INFO     | end of generic workflow (traces error code: 0)
[2026-03-29 19:36:41] 2026-03-29 17:36:41,421 | INFO     | traces error code: 0
[2026-03-29 19:36:41] 2026-03-29 17:36:41,421 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2026-03-29 19:36:41] 2026-03-29 17:36:41,471 [wrapper] ==== pilot stdout END ====
[2026-03-29 19:36:41] 2026-03-29 17:36:41,473 [wrapper] ==== wrapper stdout RESUME ====
[2026-03-29 19:36:41] 2026-03-29 17:36:41,476 [wrapper] pilotpid: 2345949
[2026-03-29 19:36:41] 2026-03-29 17:36:41,479 [wrapper] Pilot exit status: 0
[2026-03-29 19:36:41] 2026-03-29 17:36:41,487 [wrapper] pandaids: 7071984732
[2026-03-29 19:36:41] 2026-03-29 17:36:41,509 [wrapper] cleanup supervisor_pilot 2357411 2345950
[2026-03-29 19:36:41] 2026-03-29 17:36:41,512 [wrapper] Test setup, not cleaning
[2026-03-29 19:36:41] 2026-03-29 17:36:41,515 [wrapper] apfmon messages muted
[2026-03-29 19:36:41] 2026-03-29 17:36:41,517 [wrapper] ==== wrapper stdout END ====
[2026-03-29 19:36:41] 2026-03-29 17:36:41,520 [wrapper] ==== wrapper stderr END ====
[2026-03-29 19:36:41]  *** Error codes and diagnostics ***
[2026-03-29 19:36:41]     "exeErrorCode": 0,
[2026-03-29 19:36:41]     "exeErrorDiag": "",
[2026-03-29 19:36:41]     "pilotErrorCode": 0,
[2026-03-29 19:36:41]     "pilotErrorDiag": "",
[2026-03-29 19:36:41]  *** Listing of results directory ***
[2026-03-29 19:36:41] total 488700
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc    585013 Mar 28 13:19 pilot3.tar.gz
[2026-03-29 19:36:41] -rwx------ 1 boinc boinc     36322 Mar 28 13:22 runpilot2-wrapper.sh
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc      5118 Mar 28 13:24 queuedata.json
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc       100 Mar 29 16:53 wrapper_26015_x86_64-pc-linux-gnu
[2026-03-29 19:36:41] -rwxr-xr-x 1 boinc boinc      7986 Mar 29 16:53 run_atlas
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc       105 Mar 29 16:53 job.xml
[2026-03-29 19:36:41] -rw-r--r-- 2 boinc boinc 242234655 Mar 29 16:53 EVNT.49137320._000011.pool.root.1
[2026-03-29 19:36:41] -rw-r--r-- 2 boinc boinc    597554 Mar 29 16:53 input.tar.gz
[2026-03-29 19:36:41] -rw-r--r-- 2 boinc boinc     15845 Mar 29 16:53 start_atlas.sh
[2026-03-29 19:36:41] drwxrwx--x 2 boinc boinc      4096 Mar 29 16:53 shared
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc         0 Mar 29 16:53 boinc_setup_complete
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc      6439 Mar 29 16:53 init_data.xml
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc         0 Mar 29 16:53 boinc_lockfile
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc      2610 Mar 29 16:53 pandaJob.out
[2026-03-29 19:36:41] -rw------- 1 boinc boinc   1003389 Mar 29 16:53 agis_schedconf.cvmfs.json
[2026-03-29 19:36:41] -rw------- 1 boinc boinc   1511579 Mar 29 16:53 agis_ddmendpoints.agis.ALL.json
[2026-03-29 19:36:41] -rw------- 1 boinc boinc       417 Mar 29 16:53 workernode_map.json
[2026-03-29 19:36:41] drwx------ 5 boinc boinc      4096 Mar 29 16:53 pilot3
[2026-03-29 19:36:41] -rw------- 1 boinc boinc 250799936 Mar 29 19:35 HITS.49182524._001702.pool.root.1
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc       529 Mar 29 19:35 boinc_task_state.xml
[2026-03-29 19:36:41] -rw------- 1 boinc boinc      1017 Mar 29 19:35 memory_monitor_summary.json
[2026-03-29 19:36:41] -rw------- 1 boinc boinc    349845 Mar 29 19:35 log.49182524._001702.job.log.tgz.1
[2026-03-29 19:36:41] -rw------- 1 boinc boinc      6284 Mar 29 19:35 heartbeat.json
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc        27 Mar 29 19:36 wrapper_checkpoint.txt
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc      8192 Mar 29 19:36 boinc_mmap_file
[2026-03-29 19:36:41] -rw------- 1 boinc boinc        95 Mar 29 19:36 pilot_heartbeat.json
[2026-03-29 19:36:41] -rw------- 1 boinc boinc      5272 Mar 29 19:36 pilotlog.txt
[2026-03-29 19:36:41] -rw------- 1 boinc boinc   1383024 Mar 29 19:36 log.49182524._001702.job.log.1
[2026-03-29 19:36:41] -rw------- 1 boinc boinc       357 Mar 29 19:36 output.list
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc       620 Mar 29 19:36 runtime_log
[2026-03-29 19:36:41] -rw------- 1 boinc boinc   1751040 Mar 29 19:36 result.tar.gz
[2026-03-29 19:36:41] -rw------- 1 boinc boinc       659 Mar 29 19:36 xgHLDmurhQ9nsSi4ap6QjLDmwznN0nGgGQJmrXlLDmC7HKDmHyaT2n.diag
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc      8901 Mar 29 19:36 runtime_log.err
[2026-03-29 19:36:41] -rw-r--r-- 1 boinc boinc     12772 Mar 29 19:36 stderr.txt
[2026-03-29 19:36:41] HITS file was successfully produced:
[2026-03-29 19:36:41] -rw------- 1 boinc boinc 250799936 Mar 29 19:35 shared/HITS.pool.root.1
[2026-03-29 19:36:41]  *** Contents of shared directory: ***
[2026-03-29 19:36:41] total 483804
[2026-03-29 19:36:41] -rw-r--r-- 2 boinc boinc 242234655 Mar 29 16:53 ATLAS.root_0
[2026-03-29 19:36:41] -rw-r--r-- 2 boinc boinc    597554 Mar 29 16:53 input.tar.gz
[2026-03-29 19:36:41] -rw-r--r-- 2 boinc boinc     15845 Mar 29 16:53 start_atlas.sh
[2026-03-29 19:36:41] -rw------- 1 boinc boinc 250799936 Mar 29 19:35 HITS.pool.root.1
[2026-03-29 19:36:41] -rw------- 1 boinc boinc   1751040 Mar 29 19:36 result.tar.gz
19:36:43 (2341836): run_atlas exited; CPU time 37601.040847
19:36:43 (2341836): called boinc_finish(0)

</stderr_txt>
]]>