| Name | 9aLNDme7T48n9Rq4apOajLDm4fhM0noT9bVo0NGKDmpb2KDmO8iHIo_0 |
| Workunit | 238716855 |
| Created | 26 Jan 2026, 5:20:55 UTC |
| Sent | 26 Jan 2026, 9:38:33 UTC |
| Report deadline | 3 Feb 2026, 9:38:33 UTC |
| Received | 26 Jan 2026, 21:12:26 UTC |
| Server state | Over |
| Outcome | Validate error |
| Client state | Done |
| Exit status | 0 (0x00000000) |
| Computer ID | 10878327 |
| Run time | 5 hours 24 min 39 sec |
| CPU time | 22 hours 26 min 0 sec |
| Priority | 28 |
| Validate state | Invalid |
| Credit | 0.00 |
| Device peak FLOPS | 25.18 GFLOPS |
| Application version | ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu |
| Peak working set size | 2.70 GB |
| Peak swap size | 31.64 GB |
| Peak disk usage | 974.35 MB |
<core_client_version>8.1.0</core_client_version>
<![CDATA[
<stderr_txt>
04:48:22 (2020318): wrapper (7.7.26015): starting
04:48:22 (2020318): wrapper: running run_atlas (--nthreads 10)
[2026-01-26 04:48:22] Arguments: --nthreads 10
[2026-01-26 04:48:22] Threads: 10
[2026-01-26 04:48:22] Checking for CVMFS
[2026-01-26 04:48:22] Probing /cvmfs/atlas.cern.ch... OK
[2026-01-26 04:48:22] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-01-26 04:48:22] Running cvmfs_config stat atlas.cern.ch
[2026-01-26 04:48:22] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-01-26 04:48:22] 2.13.3.0 5374 45816 177152 155541 3 545 27545066 39288833 7974 16776704 0 26385438 99.731 22819834 37118 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.237.109:6081 1
[2026-01-26 04:48:22] CVMFS is ok
[2026-01-26 04:48:22] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-01-26 04:48:22] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-01-26 04:48:22] Further information can be found at the LHC@home message board.
[2026-01-26 04:48:22] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-01-26 04:48:22] Checking for apptainer binary...
[2026-01-26 04:48:22] Using apptainer found in PATH at /usr/bin/apptainer
[2026-01-26 04:48:22] Running /usr/bin/apptainer --version
[2026-01-26 04:48:22] apptainer version 1.4.5-2.el9
[2026-01-26 04:48:22] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-01-26 04:48:22] c-210-24.aglt2.org
[2026-01-26 04:48:22] apptainer works
[2026-01-26 04:48:22] Set ATHENA_PROC_NUMBER=10
[2026-01-26 04:48:22] Set ATHENA_CORE_NUMBER=10
[2026-01-26 04:48:22] Starting ATLAS job with PandaID=6984048940
[2026-01-26 04:48:22] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
06:20:04 (2020318): BOINC client no longer exists - exiting
06:20:04 (2020318): timer handler: client dead, exiting
08:17:04 (2313802): wrapper (7.7.26015): starting
08:17:04 (2313802): wrapper: running run_atlas (--nthreads 8)
[2026-01-26 08:17:05] Arguments: --nthreads 8
[2026-01-26 08:17:05] Threads: 8
[2026-01-26 08:17:05] This job has been restarted, cleaning up previous attempt
[2026-01-26 08:17:05] Checking for CVMFS
[2026-01-26 08:17:05] Probing /cvmfs/atlas.cern.ch... OK
[2026-01-26 08:17:06] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-01-26 08:17:06] Running cvmfs_config stat atlas.cern.ch
[2026-01-26 08:17:08] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-01-26 08:17:08] 2.13.3.0 5374 46025 162252 155548 2 19 27888384 39288832 7900 16776704 0 26494674 99.732 22828159 37117 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.237.109:6081 1
[2026-01-26 08:17:08] CVMFS is ok
[2026-01-26 08:17:08] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-01-26 08:17:08] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-01-26 08:17:08] Further information can be found at the LHC@home message board.
[2026-01-26 08:17:08] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-01-26 08:17:08] Checking for apptainer binary...
[2026-01-26 08:17:08] Using apptainer found in PATH at /usr/bin/apptainer
[2026-01-26 08:17:08] Running /usr/bin/apptainer --version
[2026-01-26 08:17:08] apptainer version 1.4.5-2.el9
[2026-01-26 08:17:08] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-01-26 08:17:08] c-210-24.aglt2.org
[2026-01-26 08:17:08] apptainer works
[2026-01-26 08:17:09] Set ATHENA_PROC_NUMBER=8
[2026-01-26 08:17:09] Set ATHENA_CORE_NUMBER=8
[2026-01-26 08:17:09] Starting ATLAS job with PandaID=6984048940
[2026-01-26 08:17:09] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
08:30:35 (2339545): wrapper (7.7.26015): starting
08:30:35 (2339545): wrapper: running run_atlas (--nthreads 8)
[2026-01-26 08:30:35] Arguments: --nthreads 8
[2026-01-26 08:30:35] Threads: 8
[2026-01-26 08:30:35] This job has been restarted, cleaning up previous attempt
[2026-01-26 08:30:35] Checking for CVMFS
[2026-01-26 08:30:35] Probing /cvmfs/atlas.cern.ch... OK
[2026-01-26 08:30:35] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-01-26 08:30:35] Running cvmfs_config stat atlas.cern.ch
[2026-01-26 08:30:36] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-01-26 08:30:36] 2.13.3.0 5374 46038 162608 155548 1 195 27917247 39288833 6160 16776704 0 26520728 99.732 22828229 37117 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.237.109:6081 1
[2026-01-26 08:30:36] CVMFS is ok
[2026-01-26 08:30:36] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-01-26 08:30:36] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-01-26 08:30:36] Further information can be found at the LHC@home message board.
[2026-01-26 08:30:36] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-01-26 08:30:36] Checking for apptainer binary...
[2026-01-26 08:30:36] Using apptainer found in PATH at /usr/bin/apptainer
[2026-01-26 08:30:36] Running /usr/bin/apptainer --version
[2026-01-26 08:30:36] apptainer version 1.4.5-2.el9
[2026-01-26 08:30:36] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-01-26 08:30:36] c-210-24.aglt2.org
[2026-01-26 08:30:36] apptainer works
[2026-01-26 08:30:36] Set ATHENA_PROC_NUMBER=8
[2026-01-26 08:30:36] Set ATHENA_CORE_NUMBER=8
[2026-01-26 08:30:36] Starting ATLAS job with PandaID=6984048940
[2026-01-26 08:30:36] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2026-01-26 16:10:42] *** The last 200 lines of the pilot log: ***
[2026-01-26 16:10:42] 2026-01-26 21:09:36,831 | INFO | will abort job monitoring soon since job state=finished (job is still in queue)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,620 | INFO | time since job start (27532s) is within the limit (349056.0s)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,784 | INFO | job 6984048940 has state=finished
[2026-01-26 16:10:42] 2026-01-26 21:09:37,785 | INFO | preparing for final server update for job 6984048940 in state='finished'
[2026-01-26 16:10:42] 2026-01-26 21:09:37,785 | INFO | reading metadata from: /tmp/boinchome/slots/0/PanDA_Pilot-6984048940/jobReport.json
[2026-01-26 16:10:42] 2026-01-26 21:09:37,787 | INFO | added worker_node to metadata from /tmp/boinchome/slots/0/workernode_map.json
[2026-01-26 16:10:42] 2026-01-26 21:09:37,787 | INFO | this job has now completed (state=finished)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,787 | INFO | pilot will not update the server (heartbeat message will be written to file)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,788 | INFO | log transfer has been attempted: DONE
[2026-01-26 16:10:42] 2026-01-26 21:09:37,788 | INFO | job 6984048940 has finished - writing final server update
[2026-01-26 16:10:42] 2026-01-26 21:09:37,788 | INFO | total number of processed events: 400 (read)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,820 | INFO | using path: /tmp/boinchome/slots/0/PanDA_Pilot-6984048940/memory_monitor_summary.json (trf name=prmon)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,824 | INFO | extracted standard info from prmon json
[2026-01-26 16:10:42] 2026-01-26 21:09:37,825 | INFO | extracted standard memory fields from prmon json
[2026-01-26 16:10:42] 2026-01-26 21:09:37,825 | WARNING | GPU info not found in prmon json: 'gpu'
[2026-01-26 16:10:42] 2026-01-26 21:09:37,825 | WARNING | format EVNTtoHITS has no such key: dbData
[2026-01-26 16:10:42] 2026-01-26 21:09:37,825 | WARNING | format EVNTtoHITS has no such key: dbTime
[2026-01-26 16:10:42] 2026-01-26 21:09:37,838 | INFO | fitting pss+swap vs Time
[2026-01-26 16:10:42] 2026-01-26 21:09:37,842 | INFO | sum of square deviations: 28442593382.79818
[2026-01-26 16:10:42] 2026-01-26 21:09:37,854 | INFO | sum of deviations: 3696663183.18183
[2026-01-26 16:10:42] 2026-01-26 21:09:37,855 | INFO | mean x: 1769447992.9157429
[2026-01-26 16:10:42] 2026-01-26 21:09:37,855 | INFO | mean y: 2552704.6363636362
[2026-01-26 16:10:42] 2026-01-26 21:09:37,855 | INFO | intersect: -227421164.5510565
[2026-01-26 16:10:42] 2026-01-26 21:09:37,856 | INFO | chi2: 1.4869257121141761
[2026-01-26 16:10:42] 2026-01-26 21:09:37,856 | INFO | sum of square deviations: 27507101801.05825
[2026-01-26 16:10:42] 2026-01-26 21:09:37,866 | INFO | sum of deviations: 3029136077.4125934
[2026-01-26 16:10:42] 2026-01-26 21:09:37,870 | INFO | mean x: 1769447840.426009
[2026-01-26 16:10:42] 2026-01-26 21:09:37,870 | INFO | mean y: 2552596.860986547
[2026-01-26 16:10:42] 2026-01-26 21:09:37,871 | INFO | intersect: -192302474.72376394
[2026-01-26 16:10:42] 2026-01-26 21:09:37,871 | INFO | chi2: 1.4863646126868446
[2026-01-26 16:10:42] 2026-01-26 21:09:37,871 | INFO | current chi2=1.4863646126868446 (change=0.03773553868630849 %)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,872 | INFO | right removable region: 445
[2026-01-26 16:10:42] 2026-01-26 21:09:37,872 | INFO | sum of square deviations: 27506849871.058243
[2026-01-26 16:10:42] 2026-01-26 21:09:37,882 | INFO | sum of deviations: -63006450874.08981
[2026-01-26 16:10:42] 2026-01-26 21:09:37,882 | INFO | mean x: 1769448145.426009
[2026-01-26 16:10:42] 2026-01-26 21:09:37,882 | INFO | mean y: 2563520.9215246635
[2026-01-26 16:10:42] 2026-01-26 21:09:37,886 | INFO | intersect: 4055613876.438081
[2026-01-26 16:10:42] 2026-01-26 21:09:37,887 | INFO | chi2: 0.1682338188715679
[2026-01-26 16:10:42] 2026-01-26 21:09:37,887 | INFO | current chi2=0.1682338188715679 (change=88.6857952955588 %)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,888 | INFO | sum of square deviations: 26591850934.2223
[2026-01-26 16:10:42] 2026-01-26 21:09:37,897 | INFO | sum of deviations: -69006054971.93657
[2026-01-26 16:10:42] 2026-01-26 21:09:37,897 | INFO | mean x: 1769448297.936508
[2026-01-26 16:10:42] 2026-01-26 21:09:37,897 | INFO | mean y: 2564512.2879818594
[2026-01-26 16:10:42] 2026-01-26 21:09:37,897 | INFO | intersect: 4594296273.248879
[2026-01-26 16:10:42] 2026-01-26 21:09:37,898 | INFO | chi2: 0.14269382458215696
[2026-01-26 16:10:42] 2026-01-26 21:09:37,902 | INFO | current chi2=0.14269382458215696 (change=15.181248610250323 %)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,902 | INFO | left removable region: 20
[2026-01-26 16:10:42] 2026-01-26 21:09:37,903 | INFO | sum of square deviations: 23801140929.75511
[2026-01-26 16:10:42] 2026-01-26 21:09:37,912 | INFO | sum of deviations: -51323245487.58347
[2026-01-26 16:10:42] 2026-01-26 21:09:37,912 | INFO | mean x: 1769448419.9458823
[2026-01-26 16:10:42] 2026-01-26 21:09:37,912 | INFO | mean y: 2561482.4094117647
[2026-01-26 16:10:42] 2026-01-26 21:09:37,913 | INFO | intersect: 3818085952.5396595
[2026-01-26 16:10:42] 2026-01-26 21:09:37,913 | INFO | chi2: 0.12658380598423272
[2026-01-26 16:10:42] 2026-01-26 21:09:37,913 | INFO | current memory leak: -2.16 B/s (using 425 data points, chi2=0.13)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,918 | INFO | could have reported an average CPU frequency of 2599 MHz (19 samples)
[2026-01-26 16:10:42] 2026-01-26 21:09:37,918 | INFO | ..............................
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . Timing measurements:
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . get job = 0 s
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . initial setup = 1 s
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . payload setup = 5 s
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . stage-in = 0 s
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . payload execution = 27501 s
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . stage-out = 3 s
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | . log creation = 0 s
[2026-01-26 16:10:42] 2026-01-26 21:09:37,919 | INFO | ..............................
[2026-01-26 16:10:42] 2026-01-26 21:09:38,268 | INFO |
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | job summary report
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | --------------------------------------------------
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | PanDA job id: 6984048940
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | task id: 48329534
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | errors: (none)
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | status: LOG_TRANSFER = DONE
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | pilot state: finished
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | transexitcode: 0
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | exeerrorcode: 0
[2026-01-26 16:10:42] 2026-01-26 21:09:38,269 | INFO | exeerrordiag:
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | exitcode: 0
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | exitmsg: OK
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | cpuconsumptiontime: 73277 s
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | nevents: 400
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | neventsw: 0
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | pid: 2352149
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | pgrp: 2352149
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | corecount: 8
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | event service: False
[2026-01-26 16:10:42] 2026-01-26 21:09:38,270 | INFO | sizes: {0: 2283864, 1: 2284381, 12: 2284381, 27511: 2321441, 27512: 2321440, 27515: 2330494, 27517: 2330678, 27520: 2330872}
[2026-01-26 16:10:42] 2026-01-26 21:09:38,271 | INFO | --------------------------------------------------
[2026-01-26 16:10:42] 2026-01-26 21:09:38,271 | INFO |
[2026-01-26 16:10:42] 2026-01-26 21:09:38,271 | INFO | executing command: ls -lF /tmp/boinchome/slots/0
[2026-01-26 16:10:42] 2026-01-26 21:09:38,351 | INFO | queue jobs had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,351 | INFO | queue payloads had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,351 | INFO | queue data_in had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,351 | INFO | queue data_out had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,351 | INFO | queue current_data_in had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,351 | INFO | queue validated_jobs had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue validated_payloads had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue monitored_payloads had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue finished_jobs had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue finished_payloads had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue finished_data_in had 1 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue finished_data_out had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue failed_jobs had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,352 | INFO | queue failed_payloads had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,353 | INFO | queue failed_data_in had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,353 | INFO | queue failed_data_out had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,353 | INFO | queue completed_jobs had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,353 | INFO | queue completed_jobids has 1 job(s)
[2026-01-26 16:10:42] 2026-01-26 21:09:38,353 | INFO | queue realtimelog_payloads had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,353 | INFO | queue messages had 0 job(s) [purged]
[2026-01-26 16:10:42] 2026-01-26 21:09:38,353 | INFO | job 6984048940 has completed (purged errors)
[2026-01-26 16:10:42] 2026-01-26 21:09:38,354 | INFO | overall cleanup function is called
[2026-01-26 16:10:42] 2026-01-26 21:09:39,365 | INFO | --- collectZombieJob: --- 10, [2352149]
[2026-01-26 16:10:42] 2026-01-26 21:09:39,365 | INFO | zombie collector waiting for pid 2352149
[2026-01-26 16:10:42] 2026-01-26 21:09:39,366 | INFO | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-01-26 16:10:42] 2026-01-26 21:09:39,366 | INFO | collected zombie processes
[2026-01-26 16:10:42] 2026-01-26 21:09:39,366 | INFO | will attempt to kill all subprocesses of pid=2352149
[2026-01-26 16:10:42] 2026-01-26 21:09:39,655 | INFO | process IDs to be killed: [2352149] (in reverse order)
[2026-01-26 16:10:42] 2026-01-26 21:09:39,856 | WARNING | found no corresponding commands to process id(s)
[2026-01-26 16:10:42] 2026-01-26 21:09:39,856 | INFO | Do not look for orphan processes in BOINC jobs
[2026-01-26 16:10:42] 2026-01-26 21:09:39,872 | INFO | did not find any defunct processes belonging to 2352149
[2026-01-26 16:10:42] 2026-01-26 21:09:39,883 | INFO | did not find any defunct processes belonging to 2352149
[2026-01-26 16:10:42] 2026-01-26 21:09:39,884 | INFO | ready for new job
[2026-01-26 16:10:42] 2026-01-26 21:09:39,884 | INFO | pilot has finished with previous job - re-establishing logging
[2026-01-26 16:10:42] 2026-01-26 21:09:39,891 | INFO | **************************************
[2026-01-26 16:10:42] 2026-01-26 21:09:39,891 | INFO | *** PanDA Pilot version 3.11.3.9 ***
[2026-01-26 16:10:42] 2026-01-26 21:09:39,891 | INFO | **************************************
[2026-01-26 16:10:42] 2026-01-26 21:09:39,891 | INFO |
[2026-01-26 16:10:42] 2026-01-26 21:09:39,893 | INFO | architecture information:
[2026-01-26 16:10:42] 2026-01-26 21:09:39,896 | INFO | executing command: cat /etc/os-release
[2026-01-26 16:10:42] 2026-01-26 21:09:39,958 | INFO | cat /etc/os-release:
[2026-01-26 16:10:42] NAME="CentOS Linux"
[2026-01-26 16:10:42] VERSION="7 (Core)"
[2026-01-26 16:10:42] ID="centos"
[2026-01-26 16:10:42] ID_LIKE="rhel fedora"
[2026-01-26 16:10:42] VERSION_ID="7"
[2026-01-26 16:10:42] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-01-26 16:10:42] ANSI_COLOR="0;31"
[2026-01-26 16:10:42] CPE_NAME="cpe:/o:centos:centos:7"
[2026-01-26 16:10:42] HOME_URL="https://www.centos.org/"
[2026-01-26 16:10:42] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-01-26 16:10:42]
[2026-01-26 16:10:42] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-01-26 16:10:42] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-01-26 16:10:42] REDHAT_SUPPORT_PRODUCT="centos"
[2026-01-26 16:10:42] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-01-26 16:10:42]
[2026-01-26 16:10:42] 2026-01-26 21:09:39,959 | INFO | **************************************
[2026-01-26 16:10:42] 2026-01-26 21:09:40,462 | INFO | executing command: df -mP /tmp/boinchome/slots/0
[2026-01-26 16:10:42] 2026-01-26 21:09:40,509 | INFO | sufficient remaining disk space (52552531968 B)
[2026-01-26 16:10:42] 2026-01-26 21:09:40,509 | WARNING | since timefloor is set to 0, pilot was only allowed to run one job
[2026-01-26 16:10:42] 2026-01-26 21:09:40,509 | INFO | current server update state: UPDATING_FINAL
[2026-01-26 16:10:42] 2026-01-26 21:09:40,510 | INFO | update_server=False
[2026-01-26 16:10:42] 2026-01-26 21:09:40,510 | WARNING | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-01-26 16:10:42] 2026-01-26 21:09:40,510 | WARNING | job:job_monitor:received graceful stop - abort after this iteration
[2026-01-26 16:10:42] 2026-01-26 21:09:40,510 | INFO | aborting loop
[2026-01-26 16:10:42] 2026-01-26 21:09:40,582 | INFO | all job control threads have been joined
[2026-01-26 16:10:42] 2026-01-26 21:09:40,772 | INFO | all payload control threads have been joined
[2026-01-26 16:10:42] 2026-01-26 21:09:41,210 | WARNING | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-01-26 16:10:42] 2026-01-26 21:09:41,241 | WARNING | data:copytool_out:received graceful stop - abort after this iteration
[2026-01-26 16:10:42] 2026-01-26 21:09:41,515 | INFO | [job] retrieve thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:41,516 | INFO | [job] job monitor thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:41,588 | INFO | [job] control thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:41,606 | INFO | all data control threads have been joined
[2026-01-26 16:10:42] 2026-01-26 21:09:41,778 | INFO | [payload] control thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:41,807 | INFO | [data] copytool_in thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:41,865 | INFO | [job] create_data_payload thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:41,868 | INFO | [job] validate thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:42,272 | INFO | [payload] validate_pre thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:42,277 | INFO | [payload] execute_payloads thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:42,317 | INFO | [payload] validate_post thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:42,432 | INFO | [payload] run_realtimelog thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:42,612 | INFO | [data] control thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:42,895 | INFO | [payload] failed_post thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:43,150 | WARNING | job:queue_monitor:received graceful stop - abort after this iteration
[2026-01-26 16:10:42] 2026-01-26 21:09:43,249 | INFO | [data] copytool_out thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:44,156 | INFO | [job] queue monitor thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:09:45,219 | INFO | [data] queue_monitor thread has finished
[2026-01-26 16:10:42] 2026-01-26 21:10:11,592 | INFO | PID=2346971 has CPU usage=2.2% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P
[2026-01-26 16:10:42] 2026-01-26 21:10:11,592 | INFO | .. there are 7 such processes running
[2026-01-26 16:10:42] 2026-01-26 21:10:11,593 | INFO | found 0 job(s) in 20 queues
[2026-01-26 16:10:42] 2026-01-26 21:10:11,593 | WARNING | pilot monitor received instruction that args.graceful_stop has been set
[2026-01-26 16:10:42] 2026-01-26 21:10:11,593 | WARNING | will wait for a maximum of 300 s for threads to finish
[2026-01-26 16:10:42] 2026-01-26 21:10:34,819 | INFO | [monitor] cgroup control has ended
[2026-01-26 16:10:42] 2026-01-26 21:10:35,940 | INFO | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140684290086720)>', '<ExcThread(monitor, started 140684012304128)>']
[2026-01-26 16:10:42] 2026-01-26 21:10:36,744 | WARNING | job_aborted has been set - aborting pilot monitoring
[2026-01-26 16:10:42] 2026-01-26 21:10:36,745 | INFO | [monitor] control thread has ended
[2026-01-26 16:10:42] 2026-01-26 21:10:40,970 | INFO | all workflow threads have been joined
[2026-01-26 16:10:42] 2026-01-26 21:10:40,975 | INFO | end of generic workflow (traces error code: 0)
[2026-01-26 16:10:42] 2026-01-26 21:10:40,976 | INFO | traces error code: 0
[2026-01-26 16:10:42] 2026-01-26 21:10:40,976 | INFO | pilot has finished (exit code=0, shell exit code=0)
[2026-01-26 16:10:42] 2026-01-26 21:10:41,259 [wrapper] ==== pilot stdout END ====
[2026-01-26 16:10:42] 2026-01-26 21:10:41,280 [wrapper] ==== wrapper stdout RESUME ====
[2026-01-26 16:10:42] 2026-01-26 21:10:41,296 [wrapper] pilotpid: 2346971
[2026-01-26 16:10:42] 2026-01-26 21:10:41,318 [wrapper] Pilot exit status: 0
[2026-01-26 16:10:42] 2026-01-26 21:10:41,412 [wrapper] pandaids: 6984048940 6984048940 6984048940
[2026-01-26 16:10:42] 2026-01-26 21:10:41,473 [wrapper] cleanup supervisor_pilot 2983892 2346972
[2026-01-26 16:10:42] 2026-01-26 21:10:41,475 [wrapper] Test setup, not cleaning
[2026-01-26 16:10:42] 2026-01-26 21:10:41,477 [wrapper] apfmon messages muted
[2026-01-26 16:10:42] 2026-01-26 21:10:41,480 [wrapper] ==== wrapper stdout END ====
[2026-01-26 16:10:42] 2026-01-26 21:10:41,482 [wrapper] ==== wrapper stderr END ====
[2026-01-26 16:10:42] *** Error codes and diagnostics ***
[2026-01-26 16:10:42] "exeErrorCode": 0,
[2026-01-26 16:10:42] "exeErrorDiag": "",
[2026-01-26 16:10:42] "pilotErrorCode": 0,
[2026-01-26 16:10:42] "pilotErrorDiag": "",
[2026-01-26 16:10:42] *** Listing of results directory ***
[2026-01-26 16:10:42] total 730260
[2026-01-26 16:10:42] drwx------. 5 boincer umatlas 4096 Jan 14 05:00 pilot3
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 584422 Jan 26 00:14 pilot3.tar.gz
[2026-01-26 16:10:42] -rwx------. 1 boincer umatlas 36322 Jan 26 00:15 runpilot2-wrapper.sh
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 5111 Jan 26 00:16 queuedata.json
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 100 Jan 26 04:48 wrapper_26015_x86_64-pc-linux-gnu
[2026-01-26 16:10:42] -rwxr-xr-x. 1 boincer umatlas 7986 Jan 26 04:48 run_atlas
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 105 Jan 26 04:48 job.xml
[2026-01-26 16:10:42] -rw-r--r--. 3 boincer umatlas 271466378 Jan 26 04:48 EVNT.48329532._000004.pool.root.1
[2026-01-26 16:10:42] -rw-r--r--. 3 boincer umatlas 271466378 Jan 26 04:48 ATLAS.root_0
[2026-01-26 16:10:42] -rw-r--r--. 2 boincer umatlas 15845 Jan 26 04:48 start_atlas.sh
[2026-01-26 16:10:42] drwxrwx--x. 2 boincer umatlas 4096 Jan 26 04:48 shared
[2026-01-26 16:10:42] -rw-r--r--. 2 boincer umatlas 597770 Jan 26 04:48 input.tar.gz
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 0 Jan 26 04:48 boinc_lockfile
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 0 Jan 26 08:18 wrapper_sigint_2318066
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 2606 Jan 26 08:30 pandaJob.out
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 57 Jan 26 08:30 setup.sh.local
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 1002197 Jan 26 08:30 agis_schedconf.cvmfs.json
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 441 Jan 26 08:30 workernode_map.json
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 6661 Jan 26 16:01 init_data.xml
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 192517703 Jan 26 16:08 HITS.48329534._000077.pool.root.1
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 532 Jan 26 16:08 boinc_task_state.xml
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 1047 Jan 26 16:09 memory_monitor_summary.json
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 1516246 Jan 26 16:09 agis_ddmendpoints.agis.ALL.json
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 613304 Jan 26 16:09 log.48329534._000077.job.log.tgz.1
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 6391 Jan 26 16:09 heartbeat.json
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 97 Jan 26 16:10 pilot_heartbeat.json
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 8192 Jan 26 16:10 boinc_mmap_file
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 28 Jan 26 16:10 wrapper_checkpoint.txt
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 4819 Jan 26 16:10 pilotlog.txt
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 3582149 Jan 26 16:10 log.48329534._000077.job.log.1
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 357 Jan 26 16:10 output.list
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 620 Jan 26 16:10 runtime_log
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 4208640 Jan 26 16:10 result.tar.gz
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 9427 Jan 26 16:10 runtime_log.err
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 911 Jan 26 16:10 9aLNDme7T48n9Rq4apOajLDm4fhM0noT9bVo0NGKDmpb2KDmO8iHIo.diag
[2026-01-26 16:10:42] -rw-r--r--. 1 boincer umatlas 25810 Jan 26 16:10 stderr.txt
[2026-01-26 16:10:42] HITS file was successfully produced:
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 192517703 Jan 26 16:08 shared/HITS.pool.root.1
[2026-01-26 16:10:42] *** Contents of shared directory: ***
[2026-01-26 16:10:42] total 457832
[2026-01-26 16:10:42] -rw-r--r--. 3 boincer umatlas 271466378 Jan 26 04:48 ATLAS.root_0
[2026-01-26 16:10:42] -rw-r--r--. 2 boincer umatlas 15845 Jan 26 04:48 start_atlas.sh
[2026-01-26 16:10:42] -rw-r--r--. 2 boincer umatlas 597770 Jan 26 04:48 input.tar.gz
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 192517703 Jan 26 16:08 HITS.pool.root.1
[2026-01-26 16:10:42] -rw-------. 1 boincer umatlas 4208640 Jan 26 16:10 result.tar.gz
16:10:44 (2339545): run_atlas exited; CPU time 73947.294826
16:10:44 (2339545): called boinc_finish(0)
</stderr_txt>
]]>
©2026 CERN