Name O3lNDmzMeO9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmUk6LDmigzYxn_1
Workunit 240054457
Created 22 Mar 2026, 23:22:11 UTC
Sent 22 Mar 2026, 23:25:23 UTC
Report deadline 30 Mar 2026, 23:25:23 UTC
Received 23 Mar 2026, 12:11:54 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 11039197
Run time 15 min 59 sec
CPU time 11 hours 51 min 26 sec
Priority 28
Validate state Valid
Credit 266.47
Device peak FLOPS 12.00 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 2.74 GB
Peak swap size 3.22 GB
Peak disk usage 672.25 MB

Stderr output

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<stderr_txt>
11:05:05 (2736566): wrapper (7.7.26015): starting
11:05:05 (2736566): wrapper: running run_atlas (--nthreads 12)
[2026-03-23 11:05:05] Arguments: --nthreads 12
[2026-03-23 11:05:05] Threads: 12
[2026-03-23 11:05:05] Checking for CVMFS
[2026-03-23 11:05:05] No cvmfs_config command found, will try listing directly
[2026-03-23 11:05:05] CVMFS is ok
[2026-03-23 11:05:05] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-23 11:05:05] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-03-23 11:05:05] Further information can be found at the LHC@home message board.
[2026-03-23 11:05:05] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-23 11:05:05] Checking for apptainer binary...
[2026-03-23 11:05:05] which: no apptainer in ((null))
[2026-03-23 11:05:05] apptainer is not installed, using version from CVMFS
[2026-03-23 11:05:05] Checking apptainer works with /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-23 11:05:05] WARNING: Environment variable TMPDIR already has value [/var/lib/boinc/var/slot120/slots/1/.apptainertmp], will not forward new value [/tmp] from parent process environment skurut29.grid.cesnet.cz
[2026-03-23 11:05:05] apptainer works
[2026-03-23 11:05:05] Set ATHENA_PROC_NUMBER=12
[2026-03-23 11:05:05] Set ATHENA_CORE_NUMBER=12
[2026-03-23 11:05:05] Starting ATLAS job with PandaID=7062340368
[2026-03-23 11:05:05] Running command: /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs,/var/lib/boinc/var/slot120/slots/1 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2026-03-23 12:10:58]  *** The last 200 lines of the pilot log: ***
[2026-03-23 12:10:58]  mtime=0
[2026-03-23 12:10:58]  protocol_id=None
[2026-03-23 12:10:58]  protocols=[{'endpoint': 'davs://dav.ndgf.org:443', 'flavour': 'WEBDAV', 'id': 331, 'path': '/atlas/disk/atlasdatadisk/rucio/'}]
[2026-03-23 12:10:58]  replicas=None
[2026-03-23 12:10:58]  scope=mc23_13p6TeV
[2026-03-23 12:10:58]  status=None
[2026-03-23 12:10:58]  status_code=0
[2026-03-23 12:10:58]  storage_token=
[2026-03-23 12:10:58]  surl=/var/lib/boinc/var/slot120/slots/1/PanDA_Pilot-7062340368/log.49182303._000212.job.log.tgz.1
[2026-03-23 12:10:58]  turl=davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/mc23_13p6TeV/38/43/log.49182303._000212.job.log.tgz.1
[2026-03-23 12:10:58]  workdir=None
[2026-03-23 12:10:58] ]
[2026-03-23 12:10:58] 2026-03-23 11:10:01,573 | INFO     | transferring file log.49182303._000212.job.log.tgz.1 from /var/lib/boinc/var/slot120/slots/1/PanDA_Pilot-7062340368/log.49182303._000212.job.log.tgz.1 to /var/lib/
[2026-03-23 12:10:58] 2026-03-23 11:10:01,573 | INFO     | executing command: /usr/bin/env mv /var/lib/boinc/var/slot120/slots/1/PanDA_Pilot-7062340368/log.49182303._000212.job.log.tgz.1 /var/lib/boinc/var/slot120/slots/1/
[2026-03-23 12:10:58] 2026-03-23 11:10:01,587 | INFO     | adding to output.list: log.49182303._000212.job.log.tgz.1 davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/mc23_13p6TeV/38/43/log.49182303._000212.job.log.tg
[2026-03-23 12:10:58] 2026-03-23 11:10:01,588 | INFO     | alt stage-out settings: ['pl', 'write_lan', 'w', 'default'], allow_altstageout=False, remain_files=0, has_altstorage=True
[2026-03-23 12:10:58] 2026-03-23 11:10:01,588 | INFO     | summary of transferred files:
[2026-03-23 12:10:58] 2026-03-23 11:10:01,588 | INFO     |  -- lfn=log.49182303._000212.job.log.tgz.1, status_code=0, status=transferred
[2026-03-23 12:10:58] 2026-03-23 11:10:01,588 | INFO     | stage-out finished correctly
[2026-03-23 12:10:58] 2026-03-23 11:10:01,929 | INFO     | finished stage-out for finished payload, adding job to finished_jobs queue
[2026-03-23 12:10:58] 2026-03-23 11:10:01,977 | INFO     | job 7062340368 has state=finished
[2026-03-23 12:10:58] 2026-03-23 11:10:01,977 | INFO     | preparing for final server update for job 7062340368 in state='finished'
[2026-03-23 12:10:58] 2026-03-23 11:10:01,977 | INFO     | reading metadata from: /var/lib/boinc/var/slot120/slots/1/PanDA_Pilot-7062340368/jobReport.json
[2026-03-23 12:10:58] 2026-03-23 11:10:01,979 | INFO     | added worker_node to metadata from /var/lib/boinc/var/slot120/slots/1/workernode_map.json
[2026-03-23 12:10:58] 2026-03-23 11:10:01,980 | INFO     | this job has now completed (state=finished)
[2026-03-23 12:10:58] 2026-03-23 11:10:01,980 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2026-03-23 12:10:58] 2026-03-23 11:10:01,980 | INFO     | log transfer has been attempted: DONE
[2026-03-23 12:10:58] 2026-03-23 11:10:01,980 | INFO     | job 7062340368 has finished - writing final server update
[2026-03-23 12:10:58] 2026-03-23 11:10:01,980 | WARNING  | failed to read HTCondor job classAd: [Errno 2] No such file or directory: '/scratch/condor/execute/dir_2441966/.job.ad'
[2026-03-23 12:10:58] 2026-03-23 11:10:01,980 | INFO     | total number of processed events: 400 (read)
[2026-03-23 12:10:58] 2026-03-23 11:10:01,999 | INFO     | using path: /var/lib/boinc/var/slot120/slots/1/PanDA_Pilot-7062340368/memory_monitor_summary.json (trf name=prmon)
[2026-03-23 12:10:58] 2026-03-23 11:10:01,999 | INFO     | extracted standard info from prmon json
[2026-03-23 12:10:58] 2026-03-23 11:10:01,999 | INFO     | extracted standard memory fields from prmon json
[2026-03-23 12:10:58] 2026-03-23 11:10:01,999 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-23 12:10:58] 2026-03-23 11:10:01,999 | WARNING  | format EVNTtoHITS has no such key: dbData
[2026-03-23 12:10:58] 2026-03-23 11:10:01,999 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2026-03-23 12:10:58] 2026-03-23 11:10:02,000 | INFO     | fitting pss+swap vs Time
[2026-03-23 12:10:58] 2026-03-23 11:10:02,000 | INFO     | sum of square deviations: 57407588.0
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | sum of deviations: 718796244.9999999
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | mean x: 1774262351.0
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | mean y: 2697290.701754386
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | intersect: -22212712901.129887
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | chi2: 0.0011690633578539502
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | current memory leak: 12.52 B/s (using 57 data points, chi2=0.00)
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | could have reported an average CPU frequency of 3620 MHz (6 samples)
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | ..............................
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . Timing measurements:
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . get job = 0 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . initial setup = 4 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . payload setup = 6 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . stage-in = 0 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . payload execution = 3862 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . stage-out = 0 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | . log creation = 0 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,001 | INFO     | ..............................
[2026-03-23 12:10:58] 2026-03-23 11:10:02,072 | INFO     | time since job start (3886s) is within the limit (345600.0s)
[2026-03-23 12:10:58] 2026-03-23 11:10:02,359 | INFO     | 
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | job summary report
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | --------------------------------------------------
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | PanDA job id: 7062340368
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | task id: 49182303
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | errors: (none)
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | status: LOG_TRANSFER = DONE 
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | pilot state: finished 
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | transexitcode: 0
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | exeerrorcode: 0
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | exeerrordiag: 
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | exitcode: 0
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | exitmsg: OK
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | cpuconsumptiontime: 42418 s
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | nevents: 400
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | neventsw: 0
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | pid: 2764913
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | pgrp: 2764913
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | corecount: 12
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | event service: False
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | sizes: {0: 2280289, 11: 2280289, 3872: 2306200, 3873: 2315222, 3874: 2315472}
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | --------------------------------------------------
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | 
[2026-03-23 12:10:58] 2026-03-23 11:10:02,360 | INFO     | executing command: ls -lF /var/lib/boinc/var/slot120/slots/1
[2026-03-23 12:10:58] 2026-03-23 11:10:02,376 | INFO     | queue jobs had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,376 | INFO     | queue payloads had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,376 | INFO     | queue data_in had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,376 | INFO     | queue data_out had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue current_data_in had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue completed_jobids has 1 job(s)
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | queue messages had 0 job(s) [purged]
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | job 7062340368 has completed (purged errors)
[2026-03-23 12:10:58] 2026-03-23 11:10:02,377 | INFO     | overall cleanup function is called
[2026-03-23 12:10:58] 2026-03-23 11:10:03,384 | INFO     | --- collectZombieJob: --- 10, [2764913]
[2026-03-23 12:10:58] 2026-03-23 11:10:03,384 | INFO     | zombie collector waiting for pid 2764913
[2026-03-23 12:10:58] 2026-03-23 11:10:03,384 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-03-23 12:10:58] 2026-03-23 11:10:03,384 | INFO     | collected zombie processes
[2026-03-23 12:10:58] 2026-03-23 11:10:03,384 | INFO     | will attempt to kill all subprocesses of pid=2764913
[2026-03-23 12:10:58] 2026-03-23 11:10:04,168 | INFO     | PID=2745035 has CPU usage=10.0% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i 
[2026-03-23 12:10:58] 2026-03-23 11:10:04,168 | INFO     | .. there are 15 such processes running
[2026-03-23 12:10:58] 2026-03-23 11:10:04,674 | INFO     | process IDs to be killed: [2764913] (in reverse order)
[2026-03-23 12:10:58] 2026-03-23 11:10:04,761 | WARNING  | process 2764913 can no longer be monitored (due to stat problems) - aborting
[2026-03-23 12:10:58] 2026-03-23 11:10:04,843 | WARNING  | found no corresponding commands to process id(s)
[2026-03-23 12:10:58] 2026-03-23 11:10:04,843 | INFO     | Do not look for orphan processes in BOINC jobs
[2026-03-23 12:10:58] 2026-03-23 11:10:04,901 | INFO     | did not find any defunct processes belonging to 2764913
[2026-03-23 12:10:58] 2026-03-23 11:10:04,949 | INFO     | did not find any defunct processes belonging to 2764913
[2026-03-23 12:10:58] 2026-03-23 11:10:04,949 | INFO     | ready for new job
[2026-03-23 12:10:58] 2026-03-23 11:10:04,949 | INFO     | pilot has finished with previous job - re-establishing logging
[2026-03-23 12:10:58] 2026-03-23 11:10:04,950 | INFO     | **************************************
[2026-03-23 12:10:58] 2026-03-23 11:10:04,950 | INFO     | ***  PanDA Pilot version 3.11.5.1  ***
[2026-03-23 12:10:58] 2026-03-23 11:10:04,951 | INFO     | **************************************
[2026-03-23 12:10:58] 2026-03-23 11:10:04,951 | INFO     | 
[2026-03-23 12:10:58] 2026-03-23 11:10:04,958 | INFO     | architecture information:
[2026-03-23 12:10:58] 2026-03-23 11:10:04,958 | INFO     | executing command: cat /etc/os-release
[2026-03-23 12:10:58] 2026-03-23 11:10:04,972 | INFO     | cat /etc/os-release:
[2026-03-23 12:10:58] NAME="CentOS Linux"
[2026-03-23 12:10:58] VERSION="7 (Core)"
[2026-03-23 12:10:58] ID="centos"
[2026-03-23 12:10:58] ID_LIKE="rhel fedora"
[2026-03-23 12:10:58] VERSION_ID="7"
[2026-03-23 12:10:58] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-03-23 12:10:58] ANSI_COLOR="0;31"
[2026-03-23 12:10:58] CPE_NAME="cpe:/o:centos:centos:7"
[2026-03-23 12:10:58] HOME_URL="https://www.centos.org/"
[2026-03-23 12:10:58] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-03-23 12:10:58] 
[2026-03-23 12:10:58] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-03-23 12:10:58] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-03-23 12:10:58] REDHAT_SUPPORT_PRODUCT="centos"
[2026-03-23 12:10:58] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-03-23 12:10:58] 
[2026-03-23 12:10:58] 2026-03-23 11:10:04,972 | INFO     | **************************************
[2026-03-23 12:10:58] 2026-03-23 11:10:05,126 | INFO     | using path: /var/lib/boinc/var/slot120/slots/1/memory_monitor_summary.json (trf name=prmon)
[2026-03-23 12:10:58] 2026-03-23 11:10:05,475 | INFO     | executing command: df -mP /var/lib/boinc/var/slot120/slots/1
[2026-03-23 12:10:58] 2026-03-23 11:10:05,490 | INFO     | sufficient remaining disk space (28011659264 B)
[2026-03-23 12:10:58] 2026-03-23 11:10:05,490 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2026-03-23 12:10:58] 2026-03-23 11:10:05,490 | INFO     | current server update state: UPDATING_FINAL
[2026-03-23 12:10:58] 2026-03-23 11:10:05,490 | INFO     | update_server=False
[2026-03-23 12:10:58] 2026-03-23 11:10:05,490 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-03-23 12:10:58] 2026-03-23 11:10:05,490 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2026-03-23 12:10:58] 2026-03-23 11:10:05,516 | INFO     | number of running child processes to parent process 2764913: 1
[2026-03-23 12:10:58] 2026-03-23 11:10:05,516 | INFO     | maximum number of monitored processes: 6
[2026-03-23 12:10:58] 2026-03-23 11:10:05,516 | INFO     | aborting job monitoring since job object (job id=7062340368) has expired
[2026-03-23 12:10:58] 2026-03-23 11:10:05,516 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2026-03-23 12:10:58] 2026-03-23 11:10:05,516 | INFO     | will abort loop
[2026-03-23 12:10:58] 2026-03-23 11:10:05,968 | INFO     | all job control threads have been joined
[2026-03-23 12:10:58] 2026-03-23 11:10:06,117 | INFO     | all data control threads have been joined
[2026-03-23 12:10:58] 2026-03-23 11:10:06,236 | INFO     | all payload control threads have been joined
[2026-03-23 12:10:58] 2026-03-23 11:10:06,495 | INFO     | [job] retrieve thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,521 | INFO     | [job] job monitor thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,646 | INFO     | [job] validate thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,826 | INFO     | [payload] validate_pre thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,947 | INFO     | [payload] failed_post thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,949 | INFO     | [job] create_data_payload thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,950 | INFO     | [payload] execute_payloads thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,959 | INFO     | [data] copytool_in thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:06,973 | INFO     | [job] control thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:07,061 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2026-03-23 12:10:58] 2026-03-23 11:10:07,122 | INFO     | [data] control thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:07,241 | INFO     | [payload] control thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:07,484 | INFO     | [payload] validate_post thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:07,495 | INFO     | [data] copytool_out thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:07,986 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-03-23 12:10:58] 2026-03-23 11:10:08,066 | INFO     | [job] queue monitor thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:11,991 | INFO     | [data] queue_monitor thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:16,230 | INFO     | job.realtimelogging is not enabled
[2026-03-23 12:10:58] 2026-03-23 11:10:17,235 | INFO     | [payload] run_realtimelog thread has finished
[2026-03-23 12:10:58] 2026-03-23 11:10:22,733 | INFO     | [monitor] cgroup control has ended
[2026-03-23 12:10:58] 2026-03-23 11:10:24,166 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 23029694973760)>', '<ExcThread(monitor, started 23029456582400)>']
[2026-03-23 12:10:58] 2026-03-23 11:10:29,191 | INFO     | all workflow threads have been joined
[2026-03-23 12:10:58] 2026-03-23 11:10:29,191 | INFO     | end of generic workflow (traces error code: 0)
[2026-03-23 12:10:58] 2026-03-23 11:10:29,191 | INFO     | traces error code: 0
[2026-03-23 12:10:58] 2026-03-23 11:10:29,191 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | INFO     | PID=2745035 has CPU usage=8.7% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | INFO     | .. there are 15 such processes running
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | INFO     | found 0 job(s) in 20 queues
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | CRITICAL | thread 'MainThread' is not alive
[2026-03-23 12:10:58] 2026-03-23 11:10:36,716 | INFO     | [monitor] control thread has ended
[2026-03-23 12:10:58] 2026-03-23 11:10:36,901 [wrapper] ==== pilot stdout END ====
[2026-03-23 12:10:58] 2026-03-23 11:10:36,903 [wrapper] ==== wrapper stdout RESUME ====
[2026-03-23 12:10:58] 2026-03-23 11:10:36,905 [wrapper] pilotpid: 2745035
[2026-03-23 12:10:58] 2026-03-23 11:10:36,907 [wrapper] Pilot exit status: 0
[2026-03-23 12:10:58] 2026-03-23 11:10:36,915 [wrapper] pandaids: 7062340368
[2026-03-23 12:10:58] 2026-03-23 11:10:37,029 [wrapper] cleanup supervisor_pilot 3088950 2745036
[2026-03-23 12:10:58] 2026-03-23 11:10:37,031 [wrapper] Test setup, not cleaning
[2026-03-23 12:10:58] 2026-03-23 11:10:37,033 [wrapper] apfmon messages muted
[2026-03-23 12:10:58] 2026-03-23 11:10:37,034 [wrapper] ==== wrapper stdout END ====
[2026-03-23 12:10:58] 2026-03-23 11:10:37,035 [wrapper] ==== wrapper stderr END ====
[2026-03-23 12:10:58]  *** Error codes and diagnostics ***
[2026-03-23 12:10:58]     "exeErrorCode": 0,
[2026-03-23 12:10:58]     "exeErrorDiag": "",
[2026-03-23 12:10:58]     "pilotErrorCode": 0,
[2026-03-23 12:10:58]     "pilotErrorDiag": "",
[2026-03-23 12:10:58]  *** Listing of results directory ***
[2026-03-23 12:10:58] total 462220
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc    585013 Mar 22 20:50 pilot3.tar.gz
[2026-03-23 12:10:58] -rwx------. 1 boinc boinc     36322 Mar 22 21:14 runpilot2-wrapper.sh
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc      5111 Mar 22 21:15 queuedata.json
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc       100 Mar 23 11:05 wrapper_26015_x86_64-pc-linux-gnu
[2026-03-23 12:10:58] -rwxr-xr-x. 1 boinc boinc      7986 Mar 23 11:05 run_atlas
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc       105 Mar 23 11:05 job.xml
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc      6710 Mar 23 11:05 init_data.xml
[2026-03-23 12:10:58] -rw-r--r--. 2 boinc boinc 234919636 Mar 23 11:05 EVNT.49136696._000006.pool.root.1
[2026-03-23 12:10:58] -rw-r--r--. 2 boinc boinc     15845 Mar 23 11:05 start_atlas.sh
[2026-03-23 12:10:58] drwxrwx--x. 2 boinc boinc      4096 Mar 23 11:05 shared
[2026-03-23 12:10:58] -rw-r--r--. 2 boinc boinc    597553 Mar 23 11:05 input.tar.gz
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc         0 Mar 23 11:05 boinc_lockfile
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc      2630 Mar 23 11:05 pandaJob.out
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc   1010858 Mar 23 11:05 agis_schedconf.cvmfs.json
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc   1511579 Mar 23 11:05 agis_ddmendpoints.agis.ALL.json
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc       445 Mar 23 11:05 workernode_map.json
[2026-03-23 12:10:58] drwx------. 5 boinc boinc      4096 Mar 23 11:05 pilot3
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc 232501666 Mar 23 12:09 HITS.49182303._000212.pool.root.1
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc       528 Mar 23 12:09 boinc_task_state.xml
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc        95 Mar 23 12:09 pilot_heartbeat.json
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc      1029 Mar 23 12:09 memory_monitor_summary.json
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc    388060 Mar 23 12:10 log.49182303._000212.job.log.tgz.1
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc      6352 Mar 23 12:10 heartbeat.json
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc       893 Mar 23 12:10 pilotlog.txt
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc    586139 Mar 23 12:10 log.49182303._000212.job.log.1
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc       357 Mar 23 12:10 output.list
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc       620 Mar 23 12:10 runtime_log
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc    993280 Mar 23 12:10 result.tar.gz
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc      9223 Mar 23 12:10 runtime_log.err
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc       663 Mar 23 12:10 O3lNDmzMeO9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmUk6LDmigzYxn.diag
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc        26 Mar 23 12:10 wrapper_checkpoint.txt
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc      8192 Mar 23 12:10 boinc_mmap_file
[2026-03-23 12:10:58] -rw-r--r--. 1 boinc boinc     21629 Mar 23 12:10 stderr.txt
[2026-03-23 12:10:58] HITS file was successfully produced:
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc 232501666 Mar 23 12:09 shared/HITS.pool.root.1
[2026-03-23 12:10:58]  *** Contents of shared directory: ***
[2026-03-23 12:10:58] total 458052
[2026-03-23 12:10:58] -rw-r--r--. 2 boinc boinc 234919636 Mar 23 11:05 ATLAS.root_0
[2026-03-23 12:10:58] -rw-r--r--. 2 boinc boinc     15845 Mar 23 11:05 start_atlas.sh
[2026-03-23 12:10:58] -rw-r--r--. 2 boinc boinc    597553 Mar 23 11:05 input.tar.gz
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc 232501666 Mar 23 12:09 HITS.pool.root.1
[2026-03-23 12:10:58] -rw-------. 1 boinc boinc    993280 Mar 23 12:10 result.tar.gz
12:10:59 (2736566): run_atlas exited; CPU time 42566.225795
12:10:59 (2736566): called boinc_finish(0)

</stderr_txt>
]]>


©2026 CERN