Name eh9KDmVY6I9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmAuVLDmiVR21m_1
Workunit 239677555
Created 7 Mar 2026, 17:26:38 UTC
Sent 7 Mar 2026, 17:27:48 UTC
Report deadline 15 Mar 2026, 17:27:48 UTC
Received 7 Mar 2026, 18:20:11 UTC
Server state Over
Outcome Validate error
Client state Done
Exit status 0 (0x00000000)
Computer ID 10824859
Run time 18 min 25 sec
CPU time 36 min 19 sec
Priority 28
Validate state Invalid
Credit 0.00
Device peak FLOPS 19.07 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu
Peak working set size 2.50 GB
Peak swap size 2.96 GB
Peak disk usage 639.35 MB

Stderr output

<core_client_version>7.7.0</core_client_version>
<![CDATA[
<stderr_txt>
12:28:05 (3207): wrapper (7.7.26015): starting
12:28:05 (3207): wrapper: running run_atlas (--nthreads 8)
[2026-03-07 12:28:05] Arguments: --nthreads 8
[2026-03-07 12:28:05] Threads: 8
[2026-03-07 12:28:05] Checking for CVMFS
[2026-03-07 12:28:05] Probing /cvmfs/atlas.cern.ch... OK
[2026-03-07 12:28:06] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-03-07 12:28:06] Running cvmfs_config stat atlas.cern.ch
[2026-03-07 12:28:06] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-03-07 12:28:06] 2.11.2.0 2847 219039 74596 157009 3 51 14934638 20275200 1 130560 0 60423722 99.849 15139584 27063 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.237.109:6081 1
[2026-03-07 12:28:06] CVMFS is ok
[2026-03-07 12:28:06] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-07 12:28:06] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-03-07 12:28:06] Further information can be found at the LHC@home message board.
[2026-03-07 12:28:06] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-07 12:28:06] Checking for apptainer binary...
[2026-03-07 12:28:06] Using apptainer found in PATH at /usr/bin/apptainer
[2026-03-07 12:28:06] Running /usr/bin/apptainer --version
[2026-03-07 12:28:06] apptainer version 1.3.2-1.el7
[2026-03-07 12:28:06] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-07 12:28:07] c-204-19.aglt2.org
[2026-03-07 12:28:07] apptainer works
[2026-03-07 12:28:07] Set ATHENA_PROC_NUMBER=8
[2026-03-07 12:28:07] Set ATHENA_CORE_NUMBER=8
[2026-03-07 12:28:07] Starting ATLAS job with PandaID=7045007157
[2026-03-07 12:28:07] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
13:48:22 (2970): wrapper (7.7.26015): starting
13:48:22 (2970): wrapper: running run_atlas (--nthreads 8)
[2026-03-07 13:48:22] Arguments: --nthreads 8
[2026-03-07 13:48:22] Threads: 8
[2026-03-07 13:48:22] This job has been restarted, cleaning up previous attempt
[2026-03-07 13:48:22] Checking for CVMFS
[2026-03-07 13:48:22] Probing /cvmfs/atlas.cern.ch... OK
[2026-03-07 13:48:23] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-03-07 13:48:23] Running cvmfs_config stat atlas.cern.ch
[2026-03-07 13:48:23] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-03-07 13:48:23] 2.11.2.0 2886 0 28812 157009 3 1 14956040 20275200 1 130560 0 3 100.000 0 n/a http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.237.109:6081 1
[2026-03-07 13:48:23] CVMFS is ok
[2026-03-07 13:48:23] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-07 13:48:23] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-03-07 13:48:23] Further information can be found at the LHC@home message board.
[2026-03-07 13:48:23] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-07 13:48:23] Checking for apptainer binary...
[2026-03-07 13:48:23] Using apptainer found in PATH at /usr/bin/apptainer
[2026-03-07 13:48:23] Running /usr/bin/apptainer --version
[2026-03-07 12:48:24] apptainer version 1.3.2-1.el7
[2026-03-07 12:48:24] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-07 12:48:27] c-204-19.aglt2.org
[2026-03-07 12:48:27] apptainer works
[2026-03-07 12:48:27] Set ATHENA_PROC_NUMBER=8
[2026-03-07 12:48:27] Set ATHENA_CORE_NUMBER=8
[2026-03-07 12:48:27] Starting ATLAS job with PandaID=7045007157
[2026-03-07 12:48:27] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
13:03:06 (21227): wrapper (7.7.26015): starting
13:03:06 (21227): wrapper: running run_atlas (--nthreads 8)
[2026-03-07 13:03:06] Arguments: --nthreads 8
[2026-03-07 13:03:06] Threads: 8
[2026-03-07 13:03:06] This job has been restarted, cleaning up previous attempt
[2026-03-07 13:03:06] Checking for CVMFS
[2026-03-07 13:03:07] Probing /cvmfs/atlas.cern.ch... OK
[2026-03-07 13:03:07] Probing /cvmfs/atlas-condb.cern.ch... OK
[2026-03-07 13:03:07] Running cvmfs_config stat atlas.cern.ch
[2026-03-07 13:03:08] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2026-03-07 13:03:08] 2.11.2.0 2886 307445734561825815 53168 157009 1 186 14973565 20275200 917 130560 0 45046 100.000 2 2000 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.237.109:6081 1
[2026-03-07 13:03:08] CVMFS is ok
[2026-03-07 13:03:08] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2026-03-07 13:03:08] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2026-03-07 13:03:08] Further information can be found at the LHC@home message board.
[2026-03-07 13:03:08] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2026-03-07 13:03:08] Checking for apptainer binary...
[2026-03-07 13:03:08] Using apptainer found in PATH at /usr/bin/apptainer
[2026-03-07 13:03:08] Running /usr/bin/apptainer --version
[2026-03-07 13:03:08] apptainer version 1.3.2-1.el7
[2026-03-07 13:03:08] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2026-03-07 13:03:16] c-204-19.aglt2.org
[2026-03-07 13:03:16] apptainer works
[2026-03-07 13:03:16] Set ATHENA_PROC_NUMBER=8
[2026-03-07 13:03:16] Set ATHENA_CORE_NUMBER=8
[2026-03-07 13:03:16] Starting ATLAS job with PandaID=7045007157
[2026-03-07 13:03:16] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2026-03-07 13:09:13]  *** The last 200 lines of the pilot log: ***
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | WARNING  | wrong length of table data, x=[1772906634.0, 1772906695.0, 1772906756.0, 1772906817.0], y=[1187.0, 1211747.0, 2063208.0, 2231876.0] (must be same and length>=4)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | INFO     | could have reported an average CPU frequency of 1729 MHz (4 samples)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | INFO     | ..............................
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . Timing measurements:
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . get job = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . initial setup = 2 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . payload setup = 9 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . stage-in = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . payload execution = 985 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . stage-out = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . log creation = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | ..............................
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/pilotlog.txt
[2026-03-07 13:09:13] 2026-03-07 18:08:39,903 | WARNING  | detected the following tail of warning/fatal messages in the pilot log:
[2026-03-07 13:09:13] - Log from pilotlog.txt -
[2026-03-07 13:09:13] 2026-03-07 18:08:39,877 | WARNING  | making sure that job.state is set to failed since a pilot error code is set
[2026-03-07 13:09:13] 2026-03-07 18:08:39,877 | INFO     | payload/TRF did not report the number of read events
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/memory_monitor_summary.json (trf name=prmon)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | INFO     | extracted standard info from prmon json
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | INFO     | extracted standard memory fields from prmon json
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | WARNING  | wrong length of table data, x=[1772906634.0, 1772906695.0, 1772906756.0, 1772906817.0], y=[1187.0, 1211747.0, 2063208.0, 2231876.0] (must be same and length>=4)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | INFO     | could have reported an average CPU frequency of 1729 MHz (4 samples)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | INFO     | ..............................
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . Timing measurements:
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . get job = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . initial setup = 2 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . payload setup = 9 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . stage-in = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . payload execution = 985 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . stage-out = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . log creation = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | ..............................
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/pilotlog.txt
[2026-03-07 13:09:13] 2026-03-07 18:08:39,904 | WARNING  | 
[2026-03-07 13:09:13] [begin log extracts]
[2026-03-07 13:09:13] - Log from pilotlog.txt -
[2026-03-07 13:09:13] 2026-03-07 18:08:39,877 | WARNING  | making sure that job.state is set to failed since a pilot error code is set
[2026-03-07 13:09:13] 2026-03-07 18:08:39,877 | INFO     | payload/TRF did not report the number of read events
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/memory_monitor_summary.json (trf name=prmon)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | INFO     | extracted standard info from prmon json
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | INFO     | extracted standard memory fields from prmon json
[2026-03-07 13:09:13] 2026-03-07 18:08:39,879 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | WARNING  | wrong length of table data, x=[1772906634.0, 1772906695.0, 1772906756.0, 1772906817.0], y=[1187.0, 1211747.0, 2063208.0, 2231876.0] (must be same and length>=4)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | INFO     | could have reported an average CPU frequency of 1729 MHz (4 samples)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,880 | INFO     | ..............................
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . Timing measurements:
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . get job = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . initial setup = 2 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . payload setup = 9 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . stage-in = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . payload execution = 985 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . stage-out = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,881 | INFO     | . log creation = 0 s
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | ..............................
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2026-03-07 13:09:13] 2026-03-07 18:08:39,882 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/pilotlog.txt
[2026-03-07 13:09:13] [end log extracts]
[2026-03-07 13:09:13] 2026-03-07 18:08:39,904 | WARNING  | pilotErrorCodes = [1315, 1187] (will report primary/first error code)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,905 | WARNING  | pilotErrorDiags = ['Unknown transform failure', 'Payload metadata does not exist'] (will report primary/first error diag)
[2026-03-07 13:09:13] 2026-03-07 18:08:39,997 | INFO     | 
[2026-03-07 13:09:13] 2026-03-07 18:08:39,998 | INFO     | job summary report
[2026-03-07 13:09:13] 2026-03-07 18:08:39,998 | INFO     | --------------------------------------------------
[2026-03-07 13:09:13] 2026-03-07 18:08:39,998 | INFO     | PanDA job id: 7045007157
[2026-03-07 13:09:13] 2026-03-07 18:08:39,998 | INFO     | task id: 48967011
[2026-03-07 13:09:13] 2026-03-07 18:08:39,999 | INFO     | error 1/2: 1315: Unknown transform failure
[2026-03-07 13:09:13] 2026-03-07 18:08:39,999 | INFO     | error 2/2: 1187: Payload metadata does not exist
[2026-03-07 13:09:13] 2026-03-07 18:08:39,999 | INFO     | pilot error code: 1187
[2026-03-07 13:09:13] 2026-03-07 18:08:39,999 | INFO     | pilot error diag: metadata does not exist: /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/metadata.xml
[2026-03-07 13:09:13] 2026-03-07 18:08:39,999 | INFO     | status: LOG_TRANSFER = DONE 
[2026-03-07 13:09:13] 2026-03-07 18:08:39,999 | INFO     | pilot state: failed 
[2026-03-07 13:09:13] 2026-03-07 18:08:40,000 | INFO     | transexitcode: 251
[2026-03-07 13:09:13] 2026-03-07 18:08:40,000 | INFO     | exeerrorcode: 0
[2026-03-07 13:09:13] 2026-03-07 18:08:40,000 | INFO     | exeerrordiag: 
[2026-03-07 13:09:13] 2026-03-07 18:08:40,000 | INFO     | exitcode: 0
[2026-03-07 13:09:13] 2026-03-07 18:08:40,000 | INFO     | exitmsg: 
[2026-03-07 13:09:13] 2026-03-07 18:08:40,001 | INFO     | cpuconsumptiontime: 3938 s
[2026-03-07 13:09:13] 2026-03-07 18:08:40,001 | INFO     | nevents: 0
[2026-03-07 13:09:13] 2026-03-07 18:08:40,001 | INFO     | neventsw: 0
[2026-03-07 13:09:13] 2026-03-07 18:08:40,001 | INFO     | pid: 10862
[2026-03-07 13:09:13] 2026-03-07 18:08:40,001 | INFO     | pgrp: 10862
[2026-03-07 13:09:13] 2026-03-07 18:08:40,001 | INFO     | corecount: 8
[2026-03-07 13:09:13] 2026-03-07 18:08:40,002 | INFO     | event service: False
[2026-03-07 13:09:13] 2026-03-07 18:08:40,002 | INFO     | sizes: {0: 2294725, 1: 2295843, 12: 2295843, 1059: 2305185, 1063: 2305185, 1178: 2305209}
[2026-03-07 13:09:13] 2026-03-07 18:08:40,002 | INFO     | --------------------------------------------------
[2026-03-07 13:09:13] 2026-03-07 18:08:40,002 | INFO     | 
[2026-03-07 13:09:13] 2026-03-07 18:08:40,002 | INFO     | executing command: ls -lF /tmp/boinchome/slots/0
[2026-03-07 13:09:13] 2026-03-07 18:08:40,033 | INFO     | queue jobs had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,034 | INFO     | queue payloads had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,034 | INFO     | queue data_in had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,034 | INFO     | queue data_out had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,034 | INFO     | queue current_data_in had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,035 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,035 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,035 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,035 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,035 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,036 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,036 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,036 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,036 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,036 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,036 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,037 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,037 | INFO     | queue completed_jobids has 1 job(s)
[2026-03-07 13:09:13] 2026-03-07 18:08:40,037 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,037 | INFO     | queue messages had 0 job(s) [purged]
[2026-03-07 13:09:13] 2026-03-07 18:08:40,037 | INFO     | job 7045007157 has completed (purged errors)
[2026-03-07 13:09:13] 2026-03-07 18:08:40,038 | INFO     | overall cleanup function is called
[2026-03-07 13:09:13] 2026-03-07 18:08:41,045 | INFO     | --- collectZombieJob: --- 10, [10862]
[2026-03-07 13:09:13] 2026-03-07 18:08:41,045 | INFO     | zombie collector waiting for pid 10862
[2026-03-07 13:09:13] 2026-03-07 18:08:41,045 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-03-07 13:09:13] 2026-03-07 18:08:41,045 | INFO     | collected zombie processes
[2026-03-07 13:09:13] 2026-03-07 18:08:41,045 | INFO     | will attempt to kill all subprocesses of pid=10862
[2026-03-07 13:09:13] 2026-03-07 18:08:41,141 | INFO     | process IDs to be killed: [10862] (in reverse order)
[2026-03-07 13:09:13] 2026-03-07 18:08:41,211 | WARNING  | found no corresponding commands to process id(s)
[2026-03-07 13:09:13] 2026-03-07 18:08:41,211 | INFO     | Do not look for orphan processes in BOINC jobs
[2026-03-07 13:09:13] 2026-03-07 18:08:41,233 | INFO     | did not find any defunct processes belonging to 10862
[2026-03-07 13:09:13] 2026-03-07 18:08:41,236 | INFO     | did not find any defunct processes belonging to 10862
[2026-03-07 13:09:13] 2026-03-07 18:08:41,236 | INFO     | ready for new job
[2026-03-07 13:09:13] 2026-03-07 18:08:41,236 | INFO     | pilot has finished with previous job - re-establishing logging
[2026-03-07 13:09:13] 2026-03-07 18:08:41,243 | INFO     | **************************************
[2026-03-07 13:09:13] 2026-03-07 18:08:41,243 | INFO     | ***  PanDA Pilot version 3.11.4.1  ***
[2026-03-07 13:09:13] 2026-03-07 18:08:41,243 | INFO     | **************************************
[2026-03-07 13:09:13] 2026-03-07 18:08:41,243 | INFO     | 
[2026-03-07 13:09:13] 2026-03-07 18:08:41,243 | INFO     | architecture information:
[2026-03-07 13:09:13] 2026-03-07 18:08:41,244 | INFO     | executing command: cat /etc/os-release
[2026-03-07 13:09:13] 2026-03-07 18:08:41,262 | INFO     | cat /etc/os-release:
[2026-03-07 13:09:13] NAME="CentOS Linux"
[2026-03-07 13:09:13] VERSION="7 (Core)"
[2026-03-07 13:09:13] ID="centos"
[2026-03-07 13:09:13] ID_LIKE="rhel fedora"
[2026-03-07 13:09:13] VERSION_ID="7"
[2026-03-07 13:09:13] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-03-07 13:09:13] ANSI_COLOR="0;31"
[2026-03-07 13:09:13] CPE_NAME="cpe:/o:centos:centos:7"
[2026-03-07 13:09:13] HOME_URL="https://www.centos.org/"
[2026-03-07 13:09:13] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-03-07 13:09:13] 
[2026-03-07 13:09:13] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-03-07 13:09:13] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-03-07 13:09:13] REDHAT_SUPPORT_PRODUCT="centos"
[2026-03-07 13:09:13] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-03-07 13:09:13] 
[2026-03-07 13:09:13] 2026-03-07 18:08:41,263 | INFO     | **************************************
[2026-03-07 13:09:13] 2026-03-07 18:08:41,766 | INFO     | executing command: df -mP /tmp/boinchome/slots/0
[2026-03-07 13:09:13] 2026-03-07 18:08:41,786 | INFO     | sufficient remaining disk space (18722324480 B)
[2026-03-07 13:09:13] 2026-03-07 18:08:41,786 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2026-03-07 13:09:13] 2026-03-07 18:08:41,787 | INFO     | current server update state: UPDATING_FINAL
[2026-03-07 13:09:13] 2026-03-07 18:08:41,787 | INFO     | update_server=False
[2026-03-07 13:09:13] 2026-03-07 18:08:41,787 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-03-07 13:09:13] 2026-03-07 18:08:41,788 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2026-03-07 13:09:13] 2026-03-07 18:08:41,826 | INFO     | all data control threads have been joined
[2026-03-07 13:09:13] 2026-03-07 18:08:41,892 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2026-03-07 13:09:13] 2026-03-07 18:08:42,172 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2026-03-07 13:09:13] 2026-03-07 18:08:42,173 | INFO     | aborting loop
[2026-03-07 13:09:13] 2026-03-07 18:08:42,416 | INFO     | all job control threads have been joined
[2026-03-07 13:09:13] 2026-03-07 18:08:42,657 | INFO     | all payload control threads have been joined
[2026-03-07 13:09:13] 2026-03-07 18:08:42,789 | INFO     | [job] retrieve thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:42,793 | INFO     | [job] queue monitor thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:42,832 | INFO     | [data] control thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:42,957 | INFO     | [payload] validate_pre thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:42,990 | INFO     | [payload] validate_post thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,171 | INFO     | [payload] run_realtimelog thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,177 | INFO     | [job] job monitor thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,266 | INFO     | [data] copytool_in thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,367 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-03-07 13:09:13] 2026-03-07 18:08:43,421 | INFO     | [job] control thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,519 | INFO     | [payload] execute_payloads thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,663 | INFO     | [payload] control thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,713 | INFO     | [payload] failed_post thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:43,898 | INFO     | [data] copytool_out thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:44,046 | INFO     | [job] validate thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:44,292 | INFO     | [job] create_data_payload thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:47,373 | INFO     | [data] queue_monitor thread has finished
[2026-03-07 13:09:13] 2026-03-07 18:08:51,241 | INFO     | [monitor] cgroup control has ended
[2026-03-07 13:09:13] 2026-03-07 18:08:51,754 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140443489507136)>', '<ExcThread(monitor, started 140442452608768)>']
[2026-03-07 13:09:13] 2026-03-07 18:08:56,756 | INFO     | all workflow threads have been joined
[2026-03-07 13:09:13] 2026-03-07 18:08:56,756 | INFO     | end of generic workflow (traces error code: 0)
[2026-03-07 13:09:13] 2026-03-07 18:08:56,757 | INFO     | traces error code: 0
[2026-03-07 13:09:13] 2026-03-07 18:08:56,757 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2026-03-07 13:09:13] 2026-03-07 18:09:12,955 | INFO     | PID=7285 has CPU usage=1.0% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i PR -
[2026-03-07 13:09:13] 2026-03-07 18:09:12,955 | INFO     | .. there are 2 such processes running
[2026-03-07 13:09:13] 2026-03-07 18:09:12,956 | INFO     | found 0 job(s) in 20 queues
[2026-03-07 13:09:13] 2026-03-07 18:09:12,956 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2026-03-07 13:09:13] 2026-03-07 18:09:12,956 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2026-03-07 13:09:13] 2026-03-07 18:09:12,956 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2026-03-07 13:09:13] 2026-03-07 18:09:12,957 | INFO     | [monitor] control thread has ended
[2026-03-07 13:09:13] 2026-03-07 18:09:13,074 [wrapper] ==== pilot stdout END ====
[2026-03-07 13:09:13] 2026-03-07 18:09:13,079 [wrapper] ==== wrapper stdout RESUME ====
[2026-03-07 13:09:13] 2026-03-07 18:09:13,084 [wrapper] pilotpid: 7285
[2026-03-07 13:09:13] 2026-03-07 18:09:13,088 [wrapper] Pilot exit status: 0
[2026-03-07 13:09:13] 2026-03-07 18:09:13,115 [wrapper] pandaids: 7045007157 7045007157 7045007157
[2026-03-07 13:09:13] 2026-03-07 18:09:13,156 [wrapper] cleanup supervisor_pilot 27102 7286
[2026-03-07 13:09:13] 2026-03-07 18:09:13,160 [wrapper] Test setup, not cleaning
[2026-03-07 13:09:13] 2026-03-07 18:09:13,165 [wrapper] apfmon messages muted
[2026-03-07 13:09:13] 2026-03-07 18:09:13,170 [wrapper] ==== wrapper stdout END ====
[2026-03-07 13:09:13] 2026-03-07 18:09:13,174 [wrapper] ==== wrapper stderr END ====
[2026-03-07 13:09:13]  *** Error codes and diagnostics ***
[2026-03-07 13:09:13]     "exeErrorCode": 0,
[2026-03-07 13:09:13]     "exeErrorDiag": "",
[2026-03-07 13:09:13]     "pilotErrorCode": 1315,
[2026-03-07 13:09:13]     "pilotErrorDiag": "Unknown transform failure",
[2026-03-07 13:09:13]  *** Listing of results directory ***
[2026-03-07 13:09:13] total 432436
[2026-03-07 13:09:13] drwx------ 5 boincer umatlas      4096 Feb  2 10:00 pilot3
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas    584627 Mar  7 08:06 pilot3.tar.gz
[2026-03-07 13:09:13] -rwx------ 1 boincer umatlas     36322 Mar  7 08:07 runpilot2-wrapper.sh
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas      5111 Mar  7 08:07 queuedata.json
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas       100 Mar  7 12:28 wrapper_26015_x86_64-pc-linux-gnu
[2026-03-07 13:09:13] -rwxr-xr-x 1 boincer umatlas      7986 Mar  7 12:28 run_atlas
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas       105 Mar  7 12:28 job.xml
[2026-03-07 13:09:13] -rw-r--r-- 3 boincer umatlas 219146985 Mar  7 12:28 EVNT.48967008._000134.pool.root.1
[2026-03-07 13:09:13] -rw-r--r-- 3 boincer umatlas 219146985 Mar  7 12:28 ATLAS.root_0
[2026-03-07 13:09:13] -rw-r--r-- 2 boincer umatlas     15845 Mar  7 12:28 start_atlas.sh
[2026-03-07 13:09:13] drwxrwx--x 2 boincer umatlas      4096 Mar  7 12:28 shared
[2026-03-07 13:09:13] -rw-r--r-- 2 boincer umatlas    597952 Mar  7 12:28 input.tar.gz
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas         0 Mar  7 12:28 boinc_lockfile
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas   1521291 Mar  7 12:28 agis_ddmendpoints.agis.ALL.json
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas      5907 Mar  7 13:03 init_data.xml
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas      2563 Mar  7 13:03 pandaJob.out
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas        57 Mar  7 13:03 setup.sh.local
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas    999177 Mar  7 13:03 agis_schedconf.cvmfs.json
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas       433 Mar  7 13:03 workernode_map.json
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas      1023 Mar  7 13:07 memory_monitor_summary.json
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas     95262 Mar  7 13:07 log.48967011._003412.job.log.tgz.1
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas       527 Mar  7 13:07 boinc_task_state.xml
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas      8236 Mar  7 13:08 heartbeat.json
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas      8192 Mar  7 13:09 boinc_mmap_file
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas        26 Mar  7 13:09 wrapper_checkpoint.txt
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas        96 Mar  7 13:09 pilot_heartbeat.json
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas     76034 Mar  7 13:09 pilotlog.txt
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas    218973 Mar  7 13:09 log.48967011._003412.job.log.1
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas       376 Mar  7 13:09 output.list
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas       655 Mar  7 13:09 runtime_log
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas    337920 Mar  7 13:09 result.tar.gz
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas      9649 Mar  7 13:09 runtime_log.err
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas       903 Mar  7 13:09 eh9KDmVY6I9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmAuVLDmiVR21m.diag
[2026-03-07 13:09:13] -rw-r--r-- 1 boincer umatlas     25981 Mar  7 13:09 stderr.txt
[2026-03-07 13:09:13] No HITS result produced
[2026-03-07 13:09:13]  *** Contents of shared directory: ***
[2026-03-07 13:09:13] total 214944
[2026-03-07 13:09:13] -rw-r--r-- 3 boincer umatlas 219146985 Mar  7 12:28 ATLAS.root_0
[2026-03-07 13:09:13] -rw-r--r-- 2 boincer umatlas     15845 Mar  7 12:28 start_atlas.sh
[2026-03-07 13:09:13] -rw-r--r-- 2 boincer umatlas    597952 Mar  7 12:28 input.tar.gz
[2026-03-07 13:09:13] -rw------- 1 boincer umatlas    337920 Mar  7 13:09 result.tar.gz
[2026-03-07 13:10:36]  *** The last 200 lines of the pilot log: ***
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | WARNING  | wrong length of table data, x=[1772906634.0, 1772906695.0, 1772906756.0, 1772906817.0], y=[1187.0, 1211747.0, 2063208.0, 2231876.0] (must be same and length>=4)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | INFO     | could have reported an average CPU frequency of 1729 MHz (4 samples)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | INFO     | ..............................
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . Timing measurements:
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . get job = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . initial setup = 2 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . payload setup = 9 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . stage-in = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . payload execution = 985 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . stage-out = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . log creation = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | ..............................
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/pilotlog.txt
[2026-03-07 13:10:36] 2026-03-07 18:08:39,903 | WARNING  | detected the following tail of warning/fatal messages in the pilot log:
[2026-03-07 13:10:36] - Log from pilotlog.txt -
[2026-03-07 13:10:36] 2026-03-07 18:08:39,877 | WARNING  | making sure that job.state is set to failed since a pilot error code is set
[2026-03-07 13:10:36] 2026-03-07 18:08:39,877 | INFO     | payload/TRF did not report the number of read events
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/memory_monitor_summary.json (trf name=prmon)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | INFO     | extracted standard info from prmon json
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | INFO     | extracted standard memory fields from prmon json
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | WARNING  | wrong length of table data, x=[1772906634.0, 1772906695.0, 1772906756.0, 1772906817.0], y=[1187.0, 1211747.0, 2063208.0, 2231876.0] (must be same and length>=4)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | INFO     | could have reported an average CPU frequency of 1729 MHz (4 samples)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | INFO     | ..............................
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . Timing measurements:
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . get job = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . initial setup = 2 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . payload setup = 9 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . stage-in = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . payload execution = 985 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . stage-out = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . log creation = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | ..............................
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/pilotlog.txt
[2026-03-07 13:10:36] 2026-03-07 18:08:39,904 | WARNING  | 
[2026-03-07 13:10:36] [begin log extracts]
[2026-03-07 13:10:36] - Log from pilotlog.txt -
[2026-03-07 13:10:36] 2026-03-07 18:08:39,877 | WARNING  | making sure that job.state is set to failed since a pilot error code is set
[2026-03-07 13:10:36] 2026-03-07 18:08:39,877 | INFO     | payload/TRF did not report the number of read events
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/memory_monitor_summary.json (trf name=prmon)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | INFO     | extracted standard info from prmon json
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | INFO     | extracted standard memory fields from prmon json
[2026-03-07 13:10:36] 2026-03-07 18:08:39,879 | WARNING  | GPU info not found in prmon json: 'gpu'
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | WARNING  | wrong length of table data, x=[1772906634.0, 1772906695.0, 1772906756.0, 1772906817.0], y=[1187.0, 1211747.0, 2063208.0, 2231876.0] (must be same and length>=4)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | INFO     | could have reported an average CPU frequency of 1729 MHz (4 samples)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,880 | INFO     | ..............................
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . Timing measurements:
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . get job = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . initial setup = 2 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . payload setup = 9 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . stage-in = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . payload execution = 985 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . stage-out = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,881 | INFO     | . log creation = 0 s
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | ..............................
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2026-03-07 13:10:36] 2026-03-07 18:08:39,882 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/pilotlog.txt
[2026-03-07 13:10:36] [end log extracts]
[2026-03-07 13:10:36] 2026-03-07 18:08:39,904 | WARNING  | pilotErrorCodes = [1315, 1187] (will report primary/first error code)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,905 | WARNING  | pilotErrorDiags = ['Unknown transform failure', 'Payload metadata does not exist'] (will report primary/first error diag)
[2026-03-07 13:10:36] 2026-03-07 18:08:39,997 | INFO     | 
[2026-03-07 13:10:36] 2026-03-07 18:08:39,998 | INFO     | job summary report
[2026-03-07 13:10:36] 2026-03-07 18:08:39,998 | INFO     | --------------------------------------------------
[2026-03-07 13:10:36] 2026-03-07 18:08:39,998 | INFO     | PanDA job id: 7045007157
[2026-03-07 13:10:36] 2026-03-07 18:08:39,998 | INFO     | task id: 48967011
[2026-03-07 13:10:36] 2026-03-07 18:08:39,999 | INFO     | error 1/2: 1315: Unknown transform failure
[2026-03-07 13:10:36] 2026-03-07 18:08:39,999 | INFO     | error 2/2: 1187: Payload metadata does not exist
[2026-03-07 13:10:36] 2026-03-07 18:08:39,999 | INFO     | pilot error code: 1187
[2026-03-07 13:10:36] 2026-03-07 18:08:39,999 | INFO     | pilot error diag: metadata does not exist: /tmp/boinchome/slots/0/PanDA_Pilot-7045007157/metadata.xml
[2026-03-07 13:10:36] 2026-03-07 18:08:39,999 | INFO     | status: LOG_TRANSFER = DONE 
[2026-03-07 13:10:36] 2026-03-07 18:08:39,999 | INFO     | pilot state: failed 
[2026-03-07 13:10:36] 2026-03-07 18:08:40,000 | INFO     | transexitcode: 251
[2026-03-07 13:10:36] 2026-03-07 18:08:40,000 | INFO     | exeerrorcode: 0
[2026-03-07 13:10:36] 2026-03-07 18:08:40,000 | INFO     | exeerrordiag: 
[2026-03-07 13:10:36] 2026-03-07 18:08:40,000 | INFO     | exitcode: 0
[2026-03-07 13:10:36] 2026-03-07 18:08:40,000 | INFO     | exitmsg: 
[2026-03-07 13:10:36] 2026-03-07 18:08:40,001 | INFO     | cpuconsumptiontime: 3938 s
[2026-03-07 13:10:36] 2026-03-07 18:08:40,001 | INFO     | nevents: 0
[2026-03-07 13:10:36] 2026-03-07 18:08:40,001 | INFO     | neventsw: 0
[2026-03-07 13:10:36] 2026-03-07 18:08:40,001 | INFO     | pid: 10862
[2026-03-07 13:10:36] 2026-03-07 18:08:40,001 | INFO     | pgrp: 10862
[2026-03-07 13:10:36] 2026-03-07 18:08:40,001 | INFO     | corecount: 8
[2026-03-07 13:10:36] 2026-03-07 18:08:40,002 | INFO     | event service: False
[2026-03-07 13:10:36] 2026-03-07 18:08:40,002 | INFO     | sizes: {0: 2294725, 1: 2295843, 12: 2295843, 1059: 2305185, 1063: 2305185, 1178: 2305209}
[2026-03-07 13:10:36] 2026-03-07 18:08:40,002 | INFO     | --------------------------------------------------
[2026-03-07 13:10:36] 2026-03-07 18:08:40,002 | INFO     | 
[2026-03-07 13:10:36] 2026-03-07 18:08:40,002 | INFO     | executing command: ls -lF /tmp/boinchome/slots/0
[2026-03-07 13:10:36] 2026-03-07 18:08:40,033 | INFO     | queue jobs had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,034 | INFO     | queue payloads had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,034 | INFO     | queue data_in had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,034 | INFO     | queue data_out had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,034 | INFO     | queue current_data_in had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,035 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,035 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,035 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,035 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,035 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,036 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,036 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,036 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,036 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,036 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,036 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,037 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,037 | INFO     | queue completed_jobids has 1 job(s)
[2026-03-07 13:10:36] 2026-03-07 18:08:40,037 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,037 | INFO     | queue messages had 0 job(s) [purged]
[2026-03-07 13:10:36] 2026-03-07 18:08:40,037 | INFO     | job 7045007157 has completed (purged errors)
[2026-03-07 13:10:36] 2026-03-07 18:08:40,038 | INFO     | overall cleanup function is called
[2026-03-07 13:10:36] 2026-03-07 18:08:41,045 | INFO     | --- collectZombieJob: --- 10, [10862]
[2026-03-07 13:10:36] 2026-03-07 18:08:41,045 | INFO     | zombie collector waiting for pid 10862
[2026-03-07 13:10:36] 2026-03-07 18:08:41,045 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2026-03-07 13:10:36] 2026-03-07 18:08:41,045 | INFO     | collected zombie processes
[2026-03-07 13:10:36] 2026-03-07 18:08:41,045 | INFO     | will attempt to kill all subprocesses of pid=10862
[2026-03-07 13:10:36] 2026-03-07 18:08:41,141 | INFO     | process IDs to be killed: [10862] (in reverse order)
[2026-03-07 13:10:36] 2026-03-07 18:08:41,211 | WARNING  | found no corresponding commands to process id(s)
[2026-03-07 13:10:36] 2026-03-07 18:08:41,211 | INFO     | Do not look for orphan processes in BOINC jobs
[2026-03-07 13:10:36] 2026-03-07 18:08:41,233 | INFO     | did not find any defunct processes belonging to 10862
[2026-03-07 13:10:36] 2026-03-07 18:08:41,236 | INFO     | did not find any defunct processes belonging to 10862
[2026-03-07 13:10:36] 2026-03-07 18:08:41,236 | INFO     | ready for new job
[2026-03-07 13:10:36] 2026-03-07 18:08:41,236 | INFO     | pilot has finished with previous job - re-establishing logging
[2026-03-07 13:10:36] 2026-03-07 18:08:41,243 | INFO     | **************************************
[2026-03-07 13:10:36] 2026-03-07 18:08:41,243 | INFO     | ***  PanDA Pilot version 3.11.4.1  ***
[2026-03-07 13:10:36] 2026-03-07 18:08:41,243 | INFO     | **************************************
[2026-03-07 13:10:36] 2026-03-07 18:08:41,243 | INFO     | 
[2026-03-07 13:10:36] 2026-03-07 18:08:41,243 | INFO     | architecture information:
[2026-03-07 13:10:36] 2026-03-07 18:08:41,244 | INFO     | executing command: cat /etc/os-release
[2026-03-07 13:10:36] 2026-03-07 18:08:41,262 | INFO     | cat /etc/os-release:
[2026-03-07 13:10:36] NAME="CentOS Linux"
[2026-03-07 13:10:36] VERSION="7 (Core)"
[2026-03-07 13:10:36] ID="centos"
[2026-03-07 13:10:36] ID_LIKE="rhel fedora"
[2026-03-07 13:10:36] VERSION_ID="7"
[2026-03-07 13:10:36] PRETTY_NAME="CentOS Linux 7 (Core)"
[2026-03-07 13:10:36] ANSI_COLOR="0;31"
[2026-03-07 13:10:36] CPE_NAME="cpe:/o:centos:centos:7"
[2026-03-07 13:10:36] HOME_URL="https://www.centos.org/"
[2026-03-07 13:10:36] BUG_REPORT_URL="https://bugs.centos.org/"
[2026-03-07 13:10:36] 
[2026-03-07 13:10:36] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2026-03-07 13:10:36] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2026-03-07 13:10:36] REDHAT_SUPPORT_PRODUCT="centos"
[2026-03-07 13:10:36] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2026-03-07 13:10:36] 
[2026-03-07 13:10:36] 2026-03-07 18:08:41,263 | INFO     | **************************************
[2026-03-07 13:10:36] 2026-03-07 18:08:41,766 | INFO     | executing command: df -mP /tmp/boinchome/slots/0
[2026-03-07 13:10:36] 2026-03-07 18:08:41,786 | INFO     | sufficient remaining disk space (18722324480 B)
[2026-03-07 13:10:36] 2026-03-07 18:08:41,786 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2026-03-07 13:10:36] 2026-03-07 18:08:41,787 | INFO     | current server update state: UPDATING_FINAL
[2026-03-07 13:10:36] 2026-03-07 18:08:41,787 | INFO     | update_server=False
[2026-03-07 13:10:36] 2026-03-07 18:08:41,787 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2026-03-07 13:10:36] 2026-03-07 18:08:41,788 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2026-03-07 13:10:36] 2026-03-07 18:08:41,826 | INFO     | all data control threads have been joined
[2026-03-07 13:10:36] 2026-03-07 18:08:41,892 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2026-03-07 13:10:36] 2026-03-07 18:08:42,172 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2026-03-07 13:10:36] 2026-03-07 18:08:42,173 | INFO     | aborting loop
[2026-03-07 13:10:36] 2026-03-07 18:08:42,416 | INFO     | all job control threads have been joined
[2026-03-07 13:10:36] 2026-03-07 18:08:42,657 | INFO     | all payload control threads have been joined
[2026-03-07 13:10:36] 2026-03-07 18:08:42,789 | INFO     | [job] retrieve thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:42,793 | INFO     | [job] queue monitor thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:42,832 | INFO     | [data] control thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:42,957 | INFO     | [payload] validate_pre thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:42,990 | INFO     | [payload] validate_post thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,171 | INFO     | [payload] run_realtimelog thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,177 | INFO     | [job] job monitor thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,266 | INFO     | [data] copytool_in thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,367 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2026-03-07 13:10:36] 2026-03-07 18:08:43,421 | INFO     | [job] control thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,519 | INFO     | [payload] execute_payloads thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,663 | INFO     | [payload] control thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,713 | INFO     | [payload] failed_post thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:43,898 | INFO     | [data] copytool_out thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:44,046 | INFO     | [job] validate thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:44,292 | INFO     | [job] create_data_payload thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:47,373 | INFO     | [data] queue_monitor thread has finished
[2026-03-07 13:10:36] 2026-03-07 18:08:51,241 | INFO     | [monitor] cgroup control has ended
[2026-03-07 13:10:36] 2026-03-07 18:08:51,754 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140443489507136)>', '<ExcThread(monitor, started 140442452608768)>']
[2026-03-07 13:10:36] 2026-03-07 18:08:56,756 | INFO     | all workflow threads have been joined
[2026-03-07 13:10:36] 2026-03-07 18:08:56,756 | INFO     | end of generic workflow (traces error code: 0)
[2026-03-07 13:10:36] 2026-03-07 18:08:56,757 | INFO     | traces error code: 0
[2026-03-07 13:10:36] 2026-03-07 18:08:56,757 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2026-03-07 13:10:36] 2026-03-07 18:09:12,955 | INFO     | PID=7285 has CPU usage=1.0% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i PR -
[2026-03-07 13:10:36] 2026-03-07 18:09:12,955 | INFO     | .. there are 2 such processes running
[2026-03-07 13:10:36] 2026-03-07 18:09:12,956 | INFO     | found 0 job(s) in 20 queues
[2026-03-07 13:10:36] 2026-03-07 18:09:12,956 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2026-03-07 13:10:36] 2026-03-07 18:09:12,956 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2026-03-07 13:10:36] 2026-03-07 18:09:12,956 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2026-03-07 13:10:36] 2026-03-07 18:09:12,957 | INFO     | [monitor] control thread has ended
[2026-03-07 13:10:36] 2026-03-07 18:09:13,074 [wrapper] ==== pilot stdout END ====
[2026-03-07 13:10:36] 2026-03-07 18:09:13,079 [wrapper] ==== wrapper stdout RESUME ====
[2026-03-07 13:10:36] 2026-03-07 18:09:13,084 [wrapper] pilotpid: 7285
[2026-03-07 13:10:36] 2026-03-07 18:09:13,088 [wrapper] Pilot exit status: 0
[2026-03-07 13:10:36] 2026-03-07 18:09:13,115 [wrapper] pandaids: 7045007157 7045007157 7045007157
[2026-03-07 13:10:36] 2026-03-07 18:09:13,156 [wrapper] cleanup supervisor_pilot 27102 7286
[2026-03-07 13:10:36] 2026-03-07 18:09:13,160 [wrapper] Test setup, not cleaning
[2026-03-07 13:10:36] 2026-03-07 18:09:13,165 [wrapper] apfmon messages muted
[2026-03-07 13:10:36] 2026-03-07 18:09:13,170 [wrapper] ==== wrapper stdout END ====
[2026-03-07 13:10:36] 2026-03-07 18:09:13,174 [wrapper] ==== wrapper stderr END ====
[2026-03-07 13:10:36]  *** Error codes and diagnostics ***
[2026-03-07 13:10:36]     "exeErrorCode": 65,
[2026-03-07 13:10:36]     "exeErrorDiag": "Non-zero return code from EVNTtoHITS (1); Logfile error in log.EVNTtoHITS: \"EventSelector                                                0   FATAL in sysStart(): exception with tag=EventSelector is caught\"",
[2026-03-07 13:10:36]     "pilotErrorCode": 1305,
[2026-03-07 13:10:36]     "pilotErrorDiag": "Failed to execute payload:PyJobTransforms.transform.execute  CRITICAL Transform executor raised TransformValidationException: Non-zero return code from EVNTtoHITS (1); Logfile error in log.EVNTtoHITS: \"EventSelector                                                0   FATAL in sysStart(",
[2026-03-07 13:10:36]  *** Listing of results directory ***
[2026-03-07 13:10:36] total 432456
[2026-03-07 13:10:36] drwx------ 5 boincer umatlas      4096 Feb  2 10:00 pilot3
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas    584627 Mar  7 08:06 pilot3.tar.gz
[2026-03-07 13:10:36] -rwx------ 1 boincer umatlas     36322 Mar  7 08:07 runpilot2-wrapper.sh
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas      5111 Mar  7 08:07 queuedata.json
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas       100 Mar  7 12:28 wrapper_26015_x86_64-pc-linux-gnu
[2026-03-07 13:10:36] -rwxr-xr-x 1 boincer umatlas      7986 Mar  7 12:28 run_atlas
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas       105 Mar  7 12:28 job.xml
[2026-03-07 13:10:36] -rw-r--r-- 3 boincer umatlas 219146985 Mar  7 12:28 EVNT.48967008._000134.pool.root.1
[2026-03-07 13:10:36] -rw-r--r-- 3 boincer umatlas 219146985 Mar  7 12:28 ATLAS.root_0
[2026-03-07 13:10:36] -rw-r--r-- 2 boincer umatlas     15845 Mar  7 12:28 start_atlas.sh
[2026-03-07 13:10:36] -rw-r--r-- 2 boincer umatlas    597952 Mar  7 12:28 input.tar.gz
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas         0 Mar  7 12:28 boinc_lockfile
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas   1521291 Mar  7 12:28 agis_ddmendpoints.agis.ALL.json
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas      2563 Mar  7 13:03 pandaJob.out
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas        57 Mar  7 13:03 setup.sh.local
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas    999177 Mar  7 13:03 agis_schedconf.cvmfs.json
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas       433 Mar  7 13:03 workernode_map.json
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas      1023 Mar  7 13:07 memory_monitor_summary.json
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas     95262 Mar  7 13:07 log.48967011._003412.job.log.tgz.1
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas       527 Mar  7 13:07 boinc_task_state.xml
[2026-03-07 13:10:36] drwxrwx--x 2 boincer umatlas      4096 Mar  7 13:09 shared
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas        96 Mar  7 13:09 pilot_heartbeat.json
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas      2575 Mar  7 13:09 heartbeat.json
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas      5907 Mar  7 13:09 init_data.xml
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas      8192 Mar  7 13:10 boinc_mmap_file
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas        26 Mar  7 13:10 wrapper_checkpoint.txt
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas      4742 Mar  7 13:10 pilotlog.txt
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas    218973 Mar  7 13:10 log.48967011._003412.job.log.1
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas       376 Mar  7 13:10 output.list
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas       655 Mar  7 13:10 runtime_log
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas    327680 Mar  7 13:10 result.tar.gz
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas      9649 Mar  7 13:10 runtime_log.err
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas       996 Mar  7 13:10 eh9KDmVY6I9nsSi4ap6QjLDmwznN0nGgGQJmIVkRDmAuVLDmiVR21m.diag
[2026-03-07 13:10:36] -rw-r--r-- 1 boincer umatlas     49803 Mar  7 13:10 stderr.txt
[2026-03-07 13:10:36] No HITS result produced
[2026-03-07 13:10:36]  *** Contents of shared directory: ***
[2026-03-07 13:10:36] total 214932
[2026-03-07 13:10:36] -rw-r--r-- 3 boincer umatlas 219146985 Mar  7 12:28 ATLAS.root_0
[2026-03-07 13:10:36] -rw-r--r-- 2 boincer umatlas     15845 Mar  7 12:28 start_atlas.sh
[2026-03-07 13:10:36] -rw-r--r-- 2 boincer umatlas    597952 Mar  7 12:28 input.tar.gz
[2026-03-07 13:10:36] -rw------- 1 boincer umatlas    327680 Mar  7 13:10 result.tar.gz
13:10:37 (21227): run_atlas exited; CPU time 213.523750
13:10:37 (21227): called boinc_finish(0)

</stderr_txt>
]]>

