| Name | oGHMDm4yaL9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmHwsLDmvnpFqn_0 |
| Workunit | 239910148 |
| Created | 14 Mar 2026, 10:12:22 UTC |
| Sent | 14 Mar 2026, 16:05:12 UTC |
| Report deadline | 22 Mar 2026, 16:05:12 UTC |
| Received | 15 Mar 2026, 12:58:39 UTC |
| Server state | Over |
| Outcome | Success |
| Client state | Done |
| Exit status | 0 (0x00000000) |
| Computer ID | 10879054 |
| Run time | 1 hours 20 min 1 sec |
| CPU time | 3 hours 43 min 18 sec |
| Priority | 28 |
| Validate state | Valid |
| Credit | 247.17 |
| Device peak FLOPS | 28.26 GFLOPS |
| Application version | ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu |
| Peak working set size | 2.48 GB |
| Peak swap size | 2.79 GB |
| Peak disk usage | 1.00 GB |
<core_client_version>8.2.2</core_client_version> <![CDATA[ <stderr_txt> 11:37:13 (3798884): wrapper (7.7.26015): starting 11:37:13 (3798884): wrapper: running run_atlas (--nthreads 3) [2026-03-15 11:37:13] Arguments: --nthreads 3 [2026-03-15 11:37:13] Threads: 3 [2026-03-15 11:37:13] Checking for CVMFS [2026-03-15 11:37:13] Probing /cvmfs/atlas.cern.ch... OK [2026-03-15 11:37:14] Probing /cvmfs/atlas-condb.cern.ch... OK [2026-03-15 11:37:14] Running cvmfs_config stat atlas.cern.ch [2026-03-15 11:37:14] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE [2026-03-15 11:37:14] 2.9.2.0 64072 25491 89408 157310 1 50 29757477 36864000 0 130560 0 10021241 99.982 958317 10766 http://cvmfs-stratum-one.cern.ch:8000/cvmfs/atlas.cern.ch http://130.183.36.13:3128 1 [2026-03-15 11:37:14] CVMFS is ok [2026-03-15 11:37:14] Efficiency of ATLAS tasks can be improved by the following measure(s): [2026-03-15 11:37:14] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io. [2026-03-15 11:37:14] Further information can be found at the LHC@home message board. [2026-03-15 11:37:14] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 [2026-03-15 11:37:14] Checking for apptainer binary... [2026-03-15 11:37:14] Using apptainer found in PATH at /usr/bin/apptainer [2026-03-15 11:37:14] Running /usr/bin/apptainer --version [2026-03-15 11:37:14] apptainer version 1.4.1-1.1 [2026-03-15 11:37:14] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname [2026-03-15 11:37:14] thA383 [2026-03-15 11:37:14] apptainer works [2026-03-15 11:37:14] Set ATHENA_PROC_NUMBER=3 [2026-03-15 11:37:14] Set ATHENA_CORE_NUMBER=3 [2026-03-15 11:37:14] Starting ATLAS job with PandaID=7046600873 [2026-03-15 11:37:14] Running command: /usr/bin/apptainer exec -B /cvmfs,/local/data/boinc/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh [2026-03-15 12:57:51] *** The last 200 lines of the pilot log: *** [2026-03-15 12:57:51] 2026-03-15 11:57:20,093 | INFO | overall cleanup function is called [2026-03-15 12:57:51] 2026-03-15 11:57:21,096 | INFO | --- collectZombieJob: --- 10, [3807355] [2026-03-15 12:57:51] 2026-03-15 11:57:21,096 | INFO | zombie collector waiting for pid 3807355 [2026-03-15 12:57:51] 2026-03-15 11:57:21,097 | INFO | harmless exception when collecting zombies: [Errno 10] No child processes [2026-03-15 12:57:51] 2026-03-15 11:57:21,097 | INFO | collected zombie processes [2026-03-15 12:57:51] 2026-03-15 11:57:21,097 | INFO | will attempt to kill all subprocesses of pid=3807355 [2026-03-15 12:57:51] 2026-03-15 11:57:21,149 | INFO | process IDs to be killed: [3807355] (in reverse order) [2026-03-15 12:57:51] 2026-03-15 11:57:21,178 | WARNING | found no corresponding commands to process id(s) [2026-03-15 12:57:51] 2026-03-15 11:57:21,178 | INFO | Do not look for orphan processes in BOINC jobs [2026-03-15 12:57:51] 2026-03-15 11:57:21,180 | INFO | did not find any defunct processes belonging to 3807355 [2026-03-15 12:57:51] 2026-03-15 11:57:21,181 | INFO | did not find any defunct processes belonging to 3807355 [2026-03-15 12:57:51] 2026-03-15 11:57:21,181 | INFO | ready for new job [2026-03-15 12:57:51] 2026-03-15 11:57:21,181 | INFO | pilot has finished with previous job - re-establishing logging [2026-03-15 12:57:51] 2026-03-15 11:57:21,182 | INFO | ************************************** [2026-03-15 12:57:51] 2026-03-15 11:57:21,182 | INFO | *** PanDA Pilot version 3.11.5.1 *** [2026-03-15 12:57:51] 2026-03-15 11:57:21,182 | INFO | ************************************** [2026-03-15 12:57:51] 2026-03-15 11:57:21,182 | INFO | [2026-03-15 12:57:51] 2026-03-15 11:57:21,182 | INFO | architecture information: [2026-03-15 12:57:51] 2026-03-15 11:57:21,182 | INFO | executing command: cat /etc/os-release [2026-03-15 12:57:51] 2026-03-15 11:57:21,192 | INFO | cat /etc/os-release: [2026-03-15 12:57:51] NAME="CentOS Linux" [2026-03-15 12:57:51] VERSION="7 (Core)" [2026-03-15 12:57:51] ID="centos" [2026-03-15 12:57:51] ID_LIKE="rhel fedora" [2026-03-15 12:57:51] VERSION_ID="7" [2026-03-15 12:57:51] PRETTY_NAME="CentOS Linux 7 (Core)" [2026-03-15 12:57:51] ANSI_COLOR="0;31" [2026-03-15 12:57:51] CPE_NAME="cpe:/o:centos:centos:7" [2026-03-15 12:57:51] HOME_URL="https://www.centos.org/" [2026-03-15 12:57:51] BUG_REPORT_URL="https://bugs.centos.org/" [2026-03-15 12:57:51] [2026-03-15 12:57:51] CENTOS_MANTISBT_PROJECT="CentOS-7" [2026-03-15 12:57:51] CENTOS_MANTISBT_PROJECT_VERSION="7" [2026-03-15 12:57:51] REDHAT_SUPPORT_PRODUCT="centos" [2026-03-15 12:57:51] REDHAT_SUPPORT_PRODUCT_VERSION="7" [2026-03-15 12:57:51] [2026-03-15 12:57:51] 2026-03-15 11:57:21,192 | INFO | ************************************** [2026-03-15 12:57:51] 2026-03-15 11:57:21,687 | WARNING | process 3807355 can no longer be monitored (due to stat problems) - aborting [2026-03-15 12:57:51] 2026-03-15 11:57:21,695 | INFO | executing command: df -mP /local/data/boinc/slots/0 [2026-03-15 12:57:51] 2026-03-15 11:57:21,707 | INFO | sufficient remaining disk space (227965665280 B) [2026-03-15 12:57:51] 2026-03-15 11:57:21,707 | WARNING | since timefloor is set to 0, pilot was only allowed to run one job [2026-03-15 12:57:51] 2026-03-15 11:57:21,707 | INFO | current server update state: UPDATING_FINAL [2026-03-15 12:57:51] 2026-03-15 11:57:21,707 | INFO | update_server=False [2026-03-15 12:57:51] 2026-03-15 11:57:21,707 | WARNING | setting graceful_stop since proceed_with_getjob() returned False (pilot will end) [2026-03-15 12:57:51] 2026-03-15 11:57:21,707 | WARNING | data:copytool_out:received graceful stop - abort after this iteration [2026-03-15 12:57:51] 2026-03-15 11:57:21,707 | WARNING | job:queue_monitor:received graceful stop - abort after this iteration [2026-03-15 12:57:51] 2026-03-15 11:57:21,731 | INFO | using path: /local/data/boinc/slots/0/memory_monitor_summary.json (trf name=prmon) [2026-03-15 12:57:51] 2026-03-15 11:57:21,779 | INFO | number of running child processes to parent process 3807355: 1 [2026-03-15 12:57:51] 2026-03-15 11:57:21,779 | INFO | maximum number of monitored processes: 6 [2026-03-15 12:57:51] 2026-03-15 11:57:21,779 | INFO | aborting job monitoring since job object (job id=7046600873) has expired [2026-03-15 12:57:51] 2026-03-15 11:57:21,779 | WARNING | job:job_monitor:received graceful stop - abort after this iteration [2026-03-15 12:57:51] 2026-03-15 11:57:21,779 | INFO | will abort loop [2026-03-15 12:57:51] 2026-03-15 11:57:22,350 | INFO | all data control threads have been joined [2026-03-15 12:57:51] 2026-03-15 11:57:22,655 | INFO | all job control threads have been joined [2026-03-15 12:57:51] 2026-03-15 11:57:22,712 | INFO | [job] retrieve thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:22,713 | INFO | [job] queue monitor thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:22,719 | INFO | [data] copytool_in thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:22,785 | INFO | [job] job monitor thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:22,850 | INFO | [payload] validate_post thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:22,922 | WARNING | data:queue_monitoring:received graceful stop - abort after this iteration [2026-03-15 12:57:51] 2026-03-15 11:57:23,019 | INFO | all payload control threads have been joined [2026-03-15 12:57:51] 2026-03-15 11:57:23,356 | INFO | [data] control thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:23,661 | INFO | [job] control thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:23,713 | INFO | [data] copytool_out thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:23,930 | INFO | [payload] execute_payloads thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:23,949 | INFO | [payload] failed_post thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:24,025 | INFO | [payload] control thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:24,085 | INFO | [job] create_data_payload thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:24,091 | INFO | [job] validate thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:24,167 | INFO | [payload] validate_pre thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:26,928 | INFO | [data] queue_monitor thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:30,381 | INFO | [monitor] cgroup control has ended [2026-03-15 12:57:51] 2026-03-15 11:57:31,949 | INFO | job.realtimelogging is not enabled [2026-03-15 12:57:51] 2026-03-15 11:57:32,955 | INFO | [payload] run_realtimelog thread has finished [2026-03-15 12:57:51] 2026-03-15 11:57:33,630 | INFO | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140329288820544)>', '<ExcThread(monitor, started 140329006413568)>'] [2026-03-15 12:57:51] 2026-03-15 11:57:38,654 | INFO | all workflow threads have been joined [2026-03-15 12:57:51] 2026-03-15 11:57:38,655 | INFO | end of generic workflow (traces error code: 0) [2026-03-15 12:57:51] 2026-03-15 11:57:38,655 | INFO | traces error code: 0 [2026-03-15 12:57:51] 2026-03-15 11:57:38,655 | INFO | pilot has finished (exit code=0, shell exit code=0) [2026-03-15 12:57:51] 2026-03-15 11:57:51,558 | INFO | PID=3802924 has CPU usage=1.8% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P [2026-03-15 12:57:51] 2026-03-15 11:57:51,558 | INFO | found 0 job(s) in 20 queues [2026-03-15 12:57:51] 2026-03-15 11:57:51,558 | WARNING | pilot monitor received instruction that args.graceful_stop has been set [2026-03-15 12:57:51] 2026-03-15 11:57:51,559 | WARNING | will wait for a maximum of 300 s for threads to finish [2026-03-15 12:57:51] 2026-03-15 11:57:51,559 | WARNING | job_aborted has been set - aborting pilot monitoring [2026-03-15 12:57:51] 2026-03-15 11:57:51,559 | INFO | [monitor] control thread has ended [2026-03-15 12:57:51] 2026-03-15 11:57:51,606 [wrapper] ==== pilot stdout END ==== [2026-03-15 12:57:51] 2026-03-15 11:57:51,609 [wrapper] ==== wrapper stdout RESUME ==== [2026-03-15 12:57:51] 2026-03-15 11:57:51,611 [wrapper] pilotpid: 3802924 [2026-03-15 12:57:51] 2026-03-15 11:57:51,614 [wrapper] Pilot exit status: 0 [2026-03-15 12:57:51] 2026-03-15 11:57:51,621 [wrapper] pandaids: 7046600873 [2026-03-15 12:57:51] 2026-03-15 11:57:51,639 [wrapper] cleanup supervisor_pilot 3813243 3802925 [2026-03-15 12:57:51] 2026-03-15 11:57:51,642 [wrapper] Test setup, not cleaning [2026-03-15 12:57:51] 2026-03-15 11:57:51,644 [wrapper] apfmon messages muted [2026-03-15 12:57:51] 2026-03-15 11:57:51,646 [wrapper] ==== wrapper stdout END ==== [2026-03-15 12:57:51] 2026-03-15 11:57:51,649 [wrapper] ==== wrapper stderr END ==== [2026-03-15 12:57:51] *** Error codes and diagnostics *** [2026-03-15 12:57:51] "exeErrorCode": 0, [2026-03-15 12:57:51] "exeErrorDiag": "", [2026-03-15 12:57:51] "pilotErrorCode": 0, [2026-03-15 12:57:51] "pilotErrorDiag": "", [2026-03-15 12:57:51] *** Listing of results directory *** [2026-03-15 12:57:51] total 619536 [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 585013 Mar 14 10:56 pilot3.tar.gz [2026-03-15 12:57:51] -rwx------ 1 boinc boinc 36322 Mar 14 11:01 runpilot2-wrapper.sh [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 5111 Mar 14 11:01 queuedata.json [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 100 Mar 15 11:37 wrapper_26015_x86_64-pc-linux-gnu [2026-03-15 12:57:51] -rwxr-xr-x 1 boinc boinc 7986 Mar 15 11:37 run_atlas [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 105 Mar 15 11:37 job.xml [2026-03-15 12:57:51] -rw-r--r-- 2 boinc boinc 443583462 Mar 15 11:37 EVNT.46701403._002556.pool.root.1 [2026-03-15 12:57:51] -rw-r--r-- 2 boinc boinc 597532 Mar 15 11:37 input.tar.gz [2026-03-15 12:57:51] -rw-r--r-- 2 boinc boinc 15845 Mar 15 11:37 start_atlas.sh [2026-03-15 12:57:51] drwxrwx--x 2 boinc boinc 4096 Mar 15 11:37 shared [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 0 Mar 15 11:37 boinc_setup_complete [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 6438 Mar 15 11:37 init_data.xml [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 0 Mar 15 11:37 boinc_lockfile [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 2572 Mar 15 11:37 pandaJob.out [2026-03-15 12:57:51] -rw------- 1 boinc boinc 1006481 Mar 15 11:37 agis_schedconf.cvmfs.json [2026-03-15 12:57:51] -rw------- 1 boinc boinc 1511579 Mar 15 11:37 agis_ddmendpoints.agis.ALL.json [2026-03-15 12:57:51] -rw------- 1 boinc boinc 427 Mar 15 11:37 workernode_map.json [2026-03-15 12:57:51] drwx------ 5 boinc boinc 4096 Mar 15 11:37 pilot3 [2026-03-15 12:57:51] -rw------- 1 boinc boinc 184934424 Mar 15 12:56 HITS.48955015._024969.pool.root.1 [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 530 Mar 15 12:57 boinc_task_state.xml [2026-03-15 12:57:51] -rw------- 1 boinc boinc 1015 Mar 15 12:57 memory_monitor_summary.json [2026-03-15 12:57:51] -rw------- 1 boinc boinc 280058 Mar 15 12:57 log.48955015._024969.job.log.tgz.1 [2026-03-15 12:57:51] -rw------- 1 boinc boinc 6315 Mar 15 12:57 heartbeat.json [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 27 Mar 15 12:57 wrapper_checkpoint.txt [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 8192 Mar 15 12:57 boinc_mmap_file [2026-03-15 12:57:51] -rw------- 1 boinc boinc 747 Mar 15 12:57 pilotlog.txt [2026-03-15 12:57:51] -rw------- 1 boinc boinc 95 Mar 15 12:57 pilot_heartbeat.json [2026-03-15 12:57:51] -rw------- 1 boinc boinc 704095 Mar 15 12:57 log.48955015._024969.job.log.1 [2026-03-15 12:57:51] -rw------- 1 boinc boinc 357 Mar 15 12:57 output.list [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 620 Mar 15 12:57 runtime_log [2026-03-15 12:57:51] -rw------- 1 boinc boinc 1003520 Mar 15 12:57 result.tar.gz [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 8896 Mar 15 12:57 runtime_log.err [2026-03-15 12:57:51] -rw------- 1 boinc boinc 657 Mar 15 12:57 oGHMDm4yaL9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmHwsLDmvnpFqn.diag [2026-03-15 12:57:51] -rw-r--r-- 1 boinc boinc 11433 Mar 15 12:57 stderr.txt [2026-03-15 12:57:51] HITS file was successfully produced: [2026-03-15 12:57:51] -rw------- 1 boinc boinc 184934424 Mar 15 12:56 shared/HITS.pool.root.1 [2026-03-15 12:57:51] *** Contents of shared directory: *** [2026-03-15 12:57:51] total 615376 [2026-03-15 12:57:51] -rw-r--r-- 2 boinc boinc 443583462 Mar 15 11:37 ATLAS.root_0 [2026-03-15 12:57:51] -rw-r--r-- 2 boinc boinc 597532 Mar 15 11:37 input.tar.gz [2026-03-15 12:57:51] -rw-r--r-- 2 boinc boinc 15845 Mar 15 11:37 start_atlas.sh [2026-03-15 12:57:51] -rw------- 1 boinc boinc 184934424 Mar 15 12:56 HITS.pool.root.1 [2026-03-15 12:57:51] -rw------- 1 boinc boinc 1003520 Mar 15 12:57 result.tar.gz 12:57:52 (3798884): run_atlas exited; CPU time 13383.837484 12:57:52 (3798884): called boinc_finish(0) </stderr_txt> ]]>
©2026 CERN