Name kCKNDmvZfO9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmWw6LDmvJAXkm_1
Workunit 240055292
Created 23 Mar 2026, 0:40:45 UTC
Sent 23 Mar 2026, 0:44:11 UTC
Report deadline 31 Mar 2026, 0:44:11 UTC
Received 25 Mar 2026, 7:59:35 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 11039543
Run time 55 min 40 sec
CPU time 12 hours 39 min 59 sec
Priority 28
Validate state Valid
Credit 927.92
Device peak FLOPS 12.00 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 2.75 GB
Peak swap size 3.19 GB
Peak disk usage 574.14 MB

Stderr output

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<stderr_txt>
06:47:42 (3540976): wrapper (7.7.26015): starting
06:47:42 (3540976): wrapper: running run_atlas (--nthreads 12)
[2026-03-25 06:47:42] Arguments: --nthreads 12
[2026-03-25 06:47:42] Threads: 12
[2026-03-25 06:47:42] Checking for CVMFS
[2026-03-25 06:47:42] No cvmfs_config command found, will try listing directly
[2026-03-25 06:47:42] CVMFS is ok
[2026-03-25 06:47:42] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-25 06:47:42] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-03-25 06:47:42] Further information can be found at the LHC@home message board.
[2026-03-25 06:47:42] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-25 06:47:42] Checking for apptainer binary...
[2026-03-25 06:47:42] which: no apptainer in ((null))
[2026-03-25 06:47:42] apptainer is not installed, using version from CVMFS
[2026-03-25 06:47:42] Checking apptainer works with /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-25 06:47:42] WARNING: Environment variable TMPDIR already has value [/var/lib/boinc/var/slot84/slots/3/.apptainertmp], will not forward new value [/tmp] from parent process environment skurut25.grid.cesnet.cz
[2026-03-25 06:47:42] apptainer works
[2026-03-25 06:47:42] Set ATHENA_PROC_NUMBER=12
[2026-03-25 06:47:42] Set ATHENA_CORE_NUMBER=12
[2026-03-25 06:47:42] Starting ATLAS job with PandaID=7062347031
[2026-03-25 06:47:42] Running command: /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs,/var/lib/boinc/var/slot84/slots/3 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2026-03-25 07:58:25]  *** The last 200 lines of the pilot log: ***
[2026-03-25 07:58:25]  guid=b8d3c514-3214-4994-9a09-13d9196470a1
[2026-03-25 07:58:25]  inputddms=['NDGF-T1_DATADISK', 'CERN-PROD_DATADISK']
[2026-03-25 07:58:25]  is_altstaged=None
[2026-03-25 07:58:25]  is_tar=False
[2026-03-25 07:58:25]  lfn=log.49177840._000064.job.log.tgz.1
[2026-03-25 07:58:25]  mtime=0
[2026-03-25 07:58:25]  protocol_id=None
[2026-03-25 07:58:25]  protocols=[{'endpoint': 'davs://dav.ndgf.org:443', 'flavour': 'WEBDAV', 'id': 331, 'path': '/atlas/disk/atlasdatadisk/rucio/'}]
[2026-03-25 07:58:25]  replicas=None
[2026-03-25 07:58:25]  scope=mc23_13p6TeV
[2026-03-25 07:58:25]  status=None
[2026-03-25 07:58:25]  status_code=0
[2026-03-25 07:58:25]  storage_token=
[2026-03-25 07:58:25]  surl=/var/lib/boinc/var/slot84/slots/3/PanDA_Pilot-7062347031/log.49177840._000064.job.log.tgz.1
[2026-03-25 07:58:25]  turl=davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/mc23_13p6TeV/d0/4c/log.49177840._000064.job.log.tgz.1
[2026-03-25 07:58:25]  workdir=None
[2026-03-25 07:58:25] ]
[2026-03-25 07:58:25] 2026-03-25 06:57:37,281 | INFO     | transferring file log.49177840._000064.job.log.tgz.1 from /var/lib/boinc/var/slot84/slots/3/PanDA_Pilot-7062347031/log.49177840._000064.job.log.tgz.1 to /var/lib/b
[2026-03-25 07:58:25] 2026-03-25 06:57:37,282 | INFO     | executing command: /usr/bin/env mv /var/lib/boinc/var/slot84/slots/3/PanDA_Pilot-7062347031/log.49177840._000064.job.log.tgz.1 /var/lib/boinc/var/slot84/slots/3/lo
[2026-03-25 07:58:25] 2026-03-25 06:57:37,300 | INFO     | adding to output.list: log.49177840._000064.job.log.tgz.1 davs://dav.ndgf.org:443/atlas/disk/atlasdatadisk/rucio/mc23_13p6TeV/d0/4c/log.49177840._000064.job.log.tg
[2026-03-25 07:58:25] 2026-03-25 06:57:37,300 | INFO     | alt stage-out settings: ['pl', 'write_lan', 'w', 'default'], allow_altstageout=False, remain_files=0, has_altstorage=True
[2026-03-25 07:58:25] 2026-03-25 06:57:37,301 | INFO     | summary of transferred files:
[2026-03-25 07:58:25] 2026-03-25 06:57:37,301 | INFO     |  -- lfn=log.49177840._000064.job.log.tgz.1, status_code=0, status=transferred
[2026-03-25 07:58:25] 2026-03-25 06:57:37,301 | INFO     | stage-out finished correctly
[2026-03-25 07:58:25] 2026-03-25 06:57:37,425 | INFO     | finished stage-out for finished payload, adding job to finished_jobs queue
[2026-03-25 07:58:25] 2026-03-25 06:57:37,653 | WARNING  | process 3552201 can no longer be monitored (due to stat problems) - aborting
[2026-03-25 07:58:25] 2026-03-25 06:57:37,653 | INFO     | using path: /var/lib/boinc/var/slot84/slots/3/PanDA_Pilot-7062347031/memory_monitor_summary.json (trf name=prmon)
[2026-03-25 07:58:25] 2026-03-25 06:57:38,019 | INFO     | number of running child processes to parent process 3552201: 1
[2026-03-25 07:58:25] 2026-03-25 06:57:38,019 | INFO     | maximum number of monitored processes: 6
[2026-03-25 07:58:25] 2026-03-25 06:57:38,277 | INFO     | time since job start (4185s) is within the limit (345600.0s)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,093 | INFO     | job 7062347031 has state=finished
[2026-03-25 07:58:25] 2026-03-25 06:57:40,094 | INFO     | preparing for final server update for job 7062347031 in state='finished'
[2026-03-25 07:58:25] 2026-03-25 06:57:40,094 | INFO     | reading metadata from: /var/lib/boinc/var/slot84/slots/3/PanDA_Pilot-7062347031/jobReport.json
[2026-03-25 07:58:25] 2026-03-25 06:57:40,095 | INFO     | added worker_node to metadata from /var/lib/boinc/var/slot84/slots/3/workernode_map.json
[2026-03-25 07:58:25] 2026-03-25 06:57:40,096 | INFO     | this job has now completed (state=finished)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,096 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,096 | INFO     | log transfer has been attempted: DONE
[2026-03-25 07:58:25] 2026-03-25 06:57:40,096 | INFO     | job 7062347031 has finished - writing final server update
[2026-03-25 07:58:25] 2026-03-25 06:57:40,096 | WARNING  | failed to read HTCondor job classAd: [Errno 2] No such file or directory: '/scratch/condor/execute/dir_3428672/.job.ad'
[2026-03-25 07:58:25] 2026-03-25 06:57:40,096 | INFO     | total number of processed events: 400 (read)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,113 | INFO     | using path: /var/lib/boinc/var/slot84/slots/3/PanDA_Pilot-7062347031/memory_monitor_summary.json (trf name=prmon)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,114 | INFO     | extracted standard info from prmon json
[2026-03-25 07:58:25] 2026-03-25 06:57:40,114 | INFO     | extracted standard memory fields from prmon json
[2026-03-25 07:58:25] 2026-03-25 06:57:40,114 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-25 07:58:25] 2026-03-25 06:57:40,114 | WARNING  | format EVNTtoHITS has no such key: dbData
[2026-03-25 07:58:25] 2026-03-25 06:57:40,114 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2026-03-25 07:58:25] 2026-03-25 06:57:40,115 | INFO     | fitting pss+swap vs Time
[2026-03-25 07:58:25] 2026-03-25 06:57:40,115 | INFO     | sum of square deviations: 73882315.5
[2026-03-25 07:58:25] 2026-03-25 06:57:40,115 | INFO     | sum of deviations: -468862622.5
[2026-03-25 07:58:25] 2026-03-25 06:57:40,115 | INFO     | mean x: 1774419860.5
[2026-03-25 07:58:25] 2026-03-25 06:57:40,115 | INFO     | mean y: 2701555.370967742
[2026-03-25 07:58:25] 2026-03-25 06:57:40,115 | INFO     | intersect: 11263300841.950096
[2026-03-25 07:58:25] 2026-03-25 06:57:40,115 | INFO     | chi2: 0.003156920062389868
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | current memory leak: -6.35 B/s (using 62 data points, chi2=0.00)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | could have reported an average CPU frequency of 3613 MHz (6 samples)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | ..............................
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . Timing measurements:
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . get job = 0 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . initial setup = 4 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . payload setup = 6 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . stage-in = 0 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . payload execution = 4160 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . stage-out = 0 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | . log creation = 0 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,116 | INFO     | ..............................
[2026-03-25 07:58:25] 2026-03-25 06:57:40,165 | INFO     | 
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | job summary report
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | --------------------------------------------------
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | PanDA job id: 7062347031
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | task id: 49177840
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | errors: (none)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | status: LOG_TRANSFER = DONE 
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | pilot state: finished 
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | transexitcode: 0
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | exeerrorcode: 0
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | exeerrordiag: 
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | exitcode: 0
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | exitmsg: OK
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | cpuconsumptiontime: 45310 s
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | nevents: 400
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | neventsw: 0
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | pid: 3552201
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | pgrp: 3552201
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | corecount: 12
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | event service: False
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | sizes: {0: 2280285, 10: 2280285, 4171: 2306270, 4173: 2315346, 4176: 2315484}
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | --------------------------------------------------
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | 
[2026-03-25 07:58:25] 2026-03-25 06:57:40,166 | INFO     | executing command: ls -lF /var/lib/boinc/var/slot84/slots/3
[2026-03-25 07:58:25] 2026-03-25 06:57:40,182 | INFO     | queue jobs had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue payloads had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue data_in had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue data_out had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue current_data_in had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue completed_jobids has 1 job(s)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | queue messages had 0 job(s) [purged]
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | job 7062347031 has completed (purged errors)
[2026-03-25 07:58:25] 2026-03-25 06:57:40,183 | INFO     | overall cleanup function is called
[2026-03-25 07:58:25] 2026-03-25 06:57:41,191 | INFO     | --- collectZombieJob: --- 10, [3552201]
[2026-03-25 07:58:25] 2026-03-25 06:57:41,191 | INFO     | zombie collector waiting for pid 3552201
[2026-03-25 07:58:25] 2026-03-25 06:57:41,191 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-03-25 07:58:25] 2026-03-25 06:57:41,191 | INFO     | collected zombie processes
[2026-03-25 07:58:25] 2026-03-25 06:57:41,191 | INFO     | will attempt to kill all subprocesses of pid=3552201
[2026-03-25 07:58:25] 2026-03-25 06:57:41,556 | INFO     | process IDs to be killed: [3552201] (in reverse order)
[2026-03-25 07:58:25] 2026-03-25 06:57:41,720 | WARNING  | found no corresponding commands to process id(s)
[2026-03-25 07:58:25] 2026-03-25 06:57:41,720 | INFO     | Do not look for orphan processes in BOINC jobs
[2026-03-25 07:58:25] 2026-03-25 06:57:41,745 | INFO     | did not find any defunct processes belonging to 3552201
[2026-03-25 07:58:25] 2026-03-25 06:57:41,767 | INFO     | did not find any defunct processes belonging to 3552201
[2026-03-25 07:58:25] 2026-03-25 06:57:41,768 | INFO     | ready for new job
[2026-03-25 07:58:25] 2026-03-25 06:57:41,768 | INFO     | pilot has finished with previous job - re-establishing logging
[2026-03-25 07:58:25] 2026-03-25 06:57:41,769 | INFO     | **************************************
[2026-03-25 07:58:25] 2026-03-25 06:57:41,769 | INFO     | ***  PanDA Pilot version 3.11.5.1  ***
[2026-03-25 07:58:25] 2026-03-25 06:57:41,769 | INFO     | **************************************
[2026-03-25 07:58:25] 2026-03-25 06:57:41,769 | INFO     | 
[2026-03-25 07:58:25] 2026-03-25 06:57:41,774 | INFO     | architecture information:
[2026-03-25 07:58:25] 2026-03-25 06:57:41,774 | INFO     | executing command: cat /etc/os-release
[2026-03-25 07:58:25] 2026-03-25 06:57:41,787 | INFO     | cat /etc/os-release:
[2026-03-25 07:58:25] NAME="CentOS Linux"
[2026-03-25 07:58:25] VERSION="7 (Core)"
[2026-03-25 07:58:25] ID="centos"
[2026-03-25 07:58:25] ID_LIKE="rhel fedora"
[2026-03-25 07:58:25] VERSION_ID="7"
[2026-03-25 07:58:25] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-03-25 07:58:25] ANSI_COLOR="0;31"
[2026-03-25 07:58:25] CPE_NAME="cpe:/o:centos:centos:7"
[2026-03-25 07:58:25] HOME_URL="https://www.centos.org/"
[2026-03-25 07:58:25] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-03-25 07:58:25] 
[2026-03-25 07:58:25] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-03-25 07:58:25] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-03-25 07:58:25] REDHAT_SUPPORT_PRODUCT="centos"
[2026-03-25 07:58:25] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-03-25 07:58:25] 
[2026-03-25 07:58:25] 2026-03-25 06:57:41,788 | INFO     | **************************************
[2026-03-25 07:58:25] 2026-03-25 06:57:42,291 | INFO     | executing command: df -mP /var/lib/boinc/var/slot84/slots/3
[2026-03-25 07:58:25] 2026-03-25 06:57:42,308 | INFO     | sufficient remaining disk space (160850509824 B)
[2026-03-25 07:58:25] 2026-03-25 06:57:42,308 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2026-03-25 07:58:25] 2026-03-25 06:57:42,308 | INFO     | current server update state: UPDATING_FINAL
[2026-03-25 07:58:25] 2026-03-25 06:57:42,308 | INFO     | update_server=False
[2026-03-25 07:58:25] 2026-03-25 06:57:42,308 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-03-25 07:58:25] 2026-03-25 06:57:42,308 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2026-03-25 07:58:25] 2026-03-25 06:57:42,309 | INFO     | aborting loop
[2026-03-25 07:58:25] 2026-03-25 06:57:42,765 | INFO     | all data control threads have been joined
[2026-03-25 07:58:25] 2026-03-25 06:57:42,859 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2026-03-25 07:58:25] 2026-03-25 06:57:43,313 | INFO     | [job] job monitor thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,313 | INFO     | [job] retrieve thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,353 | INFO     | [payload] run_realtimelog thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,480 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-03-25 07:58:25] 2026-03-25 06:57:43,510 | INFO     | [job] validate thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,601 | INFO     | [payload] failed_post thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,730 | INFO     | [job] create_data_payload thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,749 | INFO     | all payload control threads have been joined
[2026-03-25 07:58:25] 2026-03-25 06:57:43,770 | INFO     | [data] control thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,866 | INFO     | [payload] validate_pre thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:43,907 | INFO     | [payload] execute_payloads thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:44,003 | INFO     | all job control threads have been joined
[2026-03-25 07:58:25] 2026-03-25 06:57:44,177 | INFO     | [data] copytool_in thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:44,510 | INFO     | [payload] validate_post thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:44,754 | INFO     | [payload] control thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:44,865 | INFO     | [data] copytool_out thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:45,008 | INFO     | [job] control thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:45,176 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2026-03-25 07:58:25] 2026-03-25 06:57:46,181 | INFO     | [job] queue monitor thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:47,486 | INFO     | [data] queue_monitor thread has finished
[2026-03-25 07:58:25] 2026-03-25 06:57:59,874 | INFO     | [monitor] cgroup control has ended
[2026-03-25 07:58:25] 2026-03-25 06:58:00,560 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 22464690374464)>', '<ExcThread(monitor, started 22464452376320)>']
[2026-03-25 07:58:25] 2026-03-25 06:58:05,584 | INFO     | all workflow threads have been joined
[2026-03-25 07:58:25] 2026-03-25 06:58:05,584 | INFO     | end of generic workflow (traces error code: 0)
[2026-03-25 07:58:25] 2026-03-25 06:58:05,584 | INFO     | traces error code: 0
[2026-03-25 07:58:25] 2026-03-25 06:58:05,584 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2026-03-25 07:58:25] 2026-03-25 06:58:11,834 | INFO     | PID=3545612 has CPU usage=6.9% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P
[2026-03-25 07:58:25] 2026-03-25 06:58:11,834 | INFO     | .. there are 14 such processes running
[2026-03-25 07:58:25] 2026-03-25 06:58:11,834 | INFO     | found 0 job(s) in 20 queues
[2026-03-25 07:58:25] 2026-03-25 06:58:11,834 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2026-03-25 07:58:25] 2026-03-25 06:58:11,834 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2026-03-25 07:58:25] 2026-03-25 06:58:11,834 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2026-03-25 07:58:25] 2026-03-25 06:58:11,834 | INFO     | [monitor] control thread has ended
[2026-03-25 07:58:25] 2026-03-25 06:58:11,962 [wrapper] ==== pilot stdout END ====
[2026-03-25 07:58:25] 2026-03-25 06:58:11,964 [wrapper] ==== wrapper stdout RESUME ====
[2026-03-25 07:58:25] 2026-03-25 06:58:11,966 [wrapper] pilotpid: 3545612
[2026-03-25 07:58:25] 2026-03-25 06:58:11,969 [wrapper] Pilot exit status: 0
[2026-03-25 07:58:25] 2026-03-25 06:58:11,977 [wrapper] pandaids: 7062347031
[2026-03-25 07:58:25] 2026-03-25 06:58:12,085 [wrapper] cleanup supervisor_pilot 3836149 3545613
[2026-03-25 07:58:25] 2026-03-25 06:58:12,088 [wrapper] Test setup, not cleaning
[2026-03-25 07:58:25] 2026-03-25 06:58:12,091 [wrapper] apfmon messages muted
[2026-03-25 07:58:25] 2026-03-25 06:58:12,093 [wrapper] ==== wrapper stdout END ====
[2026-03-25 07:58:25] 2026-03-25 06:58:12,095 [wrapper] ==== wrapper stderr END ====
[2026-03-25 07:58:25]  *** Error codes and diagnostics ***
[2026-03-25 07:58:25]     "exeErrorCode": 0,
[2026-03-25 07:58:25]     "exeErrorDiag": "",
[2026-03-25 07:58:25]     "pilotErrorCode": 0,
[2026-03-25 07:58:25]     "pilotErrorDiag": "",
[2026-03-25 07:58:25]  *** Listing of results directory ***
[2026-03-25 07:58:25] total 421324
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc    585013 Mar 22 22:31 pilot3.tar.gz
[2026-03-25 07:58:25] -rwx------. 1 boinc boinc     36322 Mar 22 22:31 runpilot2-wrapper.sh
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc      5111 Mar 22 22:33 queuedata.json
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc       100 Mar 25 06:47 wrapper_26015_x86_64-pc-linux-gnu
[2026-03-25 07:58:25] -rwxr-xr-x. 1 boinc boinc      7986 Mar 25 06:47 run_atlas
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc       105 Mar 25 06:47 job.xml
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc      6709 Mar 25 06:47 init_data.xml
[2026-03-25 07:58:25] -rw-r--r--. 2 boinc boinc 221838503 Mar 25 06:47 EVNT.49127495._000003.pool.root.1
[2026-03-25 07:58:25] -rw-r--r--. 2 boinc boinc     15845 Mar 25 06:47 start_atlas.sh
[2026-03-25 07:58:25] drwxrwx--x. 2 boinc boinc      4096 Mar 25 06:47 shared
[2026-03-25 07:58:25] -rw-r--r--. 2 boinc boinc    597534 Mar 25 06:47 input.tar.gz
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc         0 Mar 25 06:47 boinc_lockfile
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc      2593 Mar 25 06:47 pandaJob.out
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc   1010836 Mar 25 06:47 agis_schedconf.cvmfs.json
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc   1511579 Mar 25 06:47 agis_ddmendpoints.agis.ALL.json
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc       444 Mar 25 06:47 workernode_map.json
[2026-03-25 07:58:25] drwx------. 5 boinc boinc      4096 Mar 25 06:47 pilot3
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc 203646245 Mar 25 07:57 HITS.49177840._000064.pool.root.1
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc        95 Mar 25 07:57 pilot_heartbeat.json
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc      1030 Mar 25 07:57 memory_monitor_summary.json
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc    381744 Mar 25 07:57 log.49177840._000064.job.log.tgz.1
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc      6348 Mar 25 07:57 heartbeat.json
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc       529 Mar 25 07:57 boinc_task_state.xml
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc        27 Mar 25 07:58 wrapper_checkpoint.txt
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc       823 Mar 25 07:58 pilotlog.txt
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc    625914 Mar 25 07:58 log.49177840._000064.job.log.1
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc       357 Mar 25 07:58 output.list
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc       620 Mar 25 07:58 runtime_log
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc   1024000 Mar 25 07:58 result.tar.gz
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc      9206 Mar 25 07:58 runtime_log.err
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc       663 Mar 25 07:58 kCKNDmvZfO9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmWw6LDmvJAXkm.diag
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc      8192 Mar 25 07:58 boinc_mmap_file
[2026-03-25 07:58:25] -rw-r--r--. 1 boinc boinc     21276 Mar 25 07:58 stderr.txt
[2026-03-25 07:58:25] HITS file was successfully produced:
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc 203646245 Mar 25 07:57 shared/HITS.pool.root.1
[2026-03-25 07:58:25]  *** Contents of shared directory: ***
[2026-03-25 07:58:25] total 417124
[2026-03-25 07:58:25] -rw-r--r--. 2 boinc boinc 221838503 Mar 25 06:47 ATLAS.root_0
[2026-03-25 07:58:25] -rw-r--r--. 2 boinc boinc     15845 Mar 25 06:47 start_atlas.sh
[2026-03-25 07:58:25] -rw-r--r--. 2 boinc boinc    597534 Mar 25 06:47 input.tar.gz
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc 203646245 Mar 25 07:57 HITS.pool.root.1
[2026-03-25 07:58:25] -rw-------. 1 boinc boinc   1024000 Mar 25 07:58 result.tar.gz
07:58:26 (3540976): run_atlas exited; CPU time 45459.835375
07:58:26 (3540976): called boinc_finish(0)

</stderr_txt>
]]>


©2026 CERN