Name: Ps2NDmOlVz7nsSi4ap6QjLDmwznN0nGgGQJmUXzaDmKQPLDmnl2dkn_0
Workunit: 233788471
Created: 25 Jul 2025, 11:28:15 UTC
Sent: 25 Jul 2025, 14:23:16 UTC
Report deadline: 2 Aug 2025, 14:23:16 UTC
Received: 25 Jul 2025, 14:37:15 UTC
Server state: Over
Outcome: Validate error
Client state: Done
Exit status: 0 (0x00000000)
Computer ID: 10877980
Run time: 10 min 31 sec
CPU time: 20 min 30 sec
Validate state: Invalid
Credit: 0.00
Device peak FLOPS: 12.00 GFLOPS
Application version: ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu
Peak working set size: 2.63 GB
Peak swap size: 2.96 GB
Peak disk usage: 287.81 MB

Stderr output

<core_client_version>8.1.0</core_client_version>
<![CDATA[
<stderr_txt>
10:23:27 (1013088): wrapper (7.7.26015): starting
10:23:27 (1013088): wrapper: running run_atlas (--nthreads 8)
[2025-07-25 10:23:27] Arguments: --nthreads 8
[2025-07-25 10:23:27] Threads: 8
[2025-07-25 10:23:27] Checking for CVMFS
[2025-07-25 10:23:27] Probing /cvmfs/atlas.cern.ch... OK
[2025-07-25 10:23:27] Probing /cvmfs/atlas-condb.cern.ch... OK
[2025-07-25 10:23:27] Running cvmfs_config stat atlas.cern.ch
[2025-07-25 10:23:28] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2025-07-25 10:23:28] 2.13.1.0 645510 11419 253168 148744 3 211 23449682 40960000 22887 16776704 0 30741009 99.710 36789577 36303 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.237.109:6081 1
[2025-07-25 10:23:28] CVMFS is ok
[2025-07-25 10:23:28] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2025-07-25 10:23:28] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2025-07-25 10:23:28] Further information can be found at the LHC@home message board.
[2025-07-25 10:23:28] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2025-07-25 10:23:28] Checking for apptainer binary...
[2025-07-25 10:23:28] Using apptainer found in PATH at /usr/bin/apptainer
[2025-07-25 10:23:28] Running /usr/bin/apptainer --version
[2025-07-25 10:23:28] apptainer version 1.4.1-1.el9
[2025-07-25 10:23:28] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2025-07-25 10:23:28] c-212-5.aglt2.org
[2025-07-25 10:23:28] apptainer works
[2025-07-25 10:23:28] Set ATHENA_PROC_NUMBER=8
[2025-07-25 10:23:28] Set ATHENA_CORE_NUMBER=8
[2025-07-25 10:23:28] Starting ATLAS job with PandaID=6745015445
[2025-07-25 10:23:28] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2025-07-25 10:34:16]  *** The last 200 lines of the pilot log: ***
[2025-07-25 10:34:16] 2025-07-25 14:33:42,305 | INFO     | could have reported an average CPU frequency of 3009 MHz (4 samples)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | ..............................
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . Timing measurements:
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . get job = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . initial setup = 2 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . payload setup = 6 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . stage-in = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . payload execution = 445 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . stage-out = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . log creation = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | ..............................
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-6745015445/pilotlog.txt
[2025-07-25 10:34:16] 2025-07-25 14:33:42,327 | WARNING  | detected the following tail of warning/fatal messages in the pilot log:
[2025-07-25 10:34:16] - Log from pilotlog.txt -
[2025-07-25 10:34:16] 2025-07-25 14:33:42,303 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-6745015445/memory_monitor_summary.json (trf name=prmon)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | INFO     | extracted standard info from prmon json
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | INFO     | extracted standard memory fields from prmon json
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | WARNING  | GPU info not found in prmon json: 'gpu'
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | WARNING  | format EVNTtoHITS has no such key: dbData
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2025-07-25 10:34:16] 2025-07-25 14:33:42,305 | WARNING  | wrong length of table data, x=[1753453753.0], y=[2865334.0] (must be same and length>=4)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,305 | INFO     | could have reported an average CPU frequency of 3009 MHz (4 samples)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | ..............................
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . Timing measurements:
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . get job = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . initial setup = 2 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . payload setup = 6 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . stage-in = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . payload execution = 445 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . stage-out = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . log creation = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | ..............................
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-6745015445/pilotlog.txt
[2025-07-25 10:34:16] 2025-07-25 14:33:42,327 | WARNING  | 
[2025-07-25 10:34:16] [begin log extracts]
[2025-07-25 10:34:16] - Log from pilotlog.txt -
[2025-07-25 10:34:16] 2025-07-25 14:33:42,303 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-6745015445/memory_monitor_summary.json (trf name=prmon)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | INFO     | extracted standard info from prmon json
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | INFO     | extracted standard memory fields from prmon json
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | WARNING  | GPU info not found in prmon json: 'gpu'
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | WARNING  | format EVNTtoHITS has no such key: dbData
[2025-07-25 10:34:16] 2025-07-25 14:33:42,304 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2025-07-25 10:34:16] 2025-07-25 14:33:42,305 | WARNING  | wrong length of table data, x=[1753453753.0], y=[2865334.0] (must be same and length>=4)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,305 | INFO     | could have reported an average CPU frequency of 3009 MHz (4 samples)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | ..............................
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . Timing measurements:
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . get job = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . initial setup = 2 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . payload setup = 6 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . stage-in = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . payload execution = 445 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . stage-out = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | . log creation = 0 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | ..............................
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | building log extracts (sent to the server as 'pilotLog')
[2025-07-25 10:34:16] 2025-07-25 14:33:42,306 | INFO     | executing command: tail -n 20 /tmp/boinchome/slots/0/PanDA_Pilot-6745015445/pilotlog.txt
[2025-07-25 10:34:16] [end log extracts]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,327 | WARNING  | pilotErrorCodes = [1305, 1328] (will report primary/first error code)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,328 | WARNING  | pilotErrorDiags = ['Failed to execute payload:PyJobTransforms.transform.execute  CRITICAL Transform executor raised TransformValidationException: EVNTtoHITS got a 
[2025-07-25 10:34:16] 2025-07-25 14:33:42,733 | INFO     | 
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | job summary report
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | --------------------------------------------------
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | PanDA job id: 6745015445
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | task id: 45743349
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | error 1/2: 1305: Failed to execute payload:PyJobTransforms.transform.execute 2025-07-25 14:31:24,179 CRITICAL Transform executor raised TransformValidationExceptio
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | error 2/2: 1328: Invalid memory reference or a segmentation fault in payload: EVNTtoHITS got a SIGSEGV signal (exit code 139); Logfile error in log.EVNTtoHITS: "IS
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | pilot error code: 1328
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | pilot error diag: Invalid memory reference or a segmentation fault in payload: EVNTtoHITS got a SIGSEGV signal (exit code 139); Logfile error in log.EVNTtoHITS: "I
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | status: LOG_TRANSFER = DONE 
[2025-07-25 10:34:16] 2025-07-25 14:33:42,734 | INFO     | pilot state: failed 
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | transexitcode: 65
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | exeerrorcode: 0
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | exeerrordiag: 
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | exitcode: 65
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | exitmsg: EVNTtoHITS got a SIGSEGV signal (exit code 139); Logfile error in log.EVNTtoHITS: "ISF_Kernel_FullG4MT_QS                                16     4   FATAL 
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | cpuconsumptiontime: 1219 s
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | nevents: 0
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | neventsw: 0
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | pid: 1023158
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | pgrp: 1023158
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | corecount: 8
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | event service: False
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | sizes: {0: 2397471, 5: 2397733, 11: 2397789, 456: 2428205, 457: 2432500, 460: 2432556, 586: 2432878}
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | --------------------------------------------------
[2025-07-25 10:34:16] 2025-07-25 14:33:42,735 | INFO     | 
[2025-07-25 10:34:16] 2025-07-25 14:33:42,736 | INFO     | executing command: ls -lF /tmp/boinchome/slots/0
[2025-07-25 10:34:16] 2025-07-25 14:33:42,802 | INFO     | queue jobs had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,802 | INFO     | queue payloads had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,802 | INFO     | queue data_in had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,802 | INFO     | queue data_out had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue current_data_in had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue completed_jobids has 1 job(s)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,803 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,804 | INFO     | queue messages had 0 job(s) [purged]
[2025-07-25 10:34:16] 2025-07-25 14:33:42,804 | INFO     | job 6745015445 has completed (purged errors)
[2025-07-25 10:34:16] 2025-07-25 14:33:42,804 | INFO     | overall cleanup function is called
[2025-07-25 10:34:16] 2025-07-25 14:33:43,829 | INFO     | --- collectZombieJob: --- 10, [1023158]
[2025-07-25 10:34:16] 2025-07-25 14:33:43,830 | INFO     | zombie collector waiting for pid 1023158
[2025-07-25 10:34:16] 2025-07-25 14:33:43,834 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2025-07-25 10:34:16] 2025-07-25 14:33:43,834 | INFO     | collected zombie processes
[2025-07-25 10:34:16] 2025-07-25 14:33:43,834 | INFO     | will attempt to kill all subprocesses of pid=1023158
[2025-07-25 10:34:16] 2025-07-25 14:33:44,534 | INFO     | process IDs to be killed: [1023158] (in reverse order)
[2025-07-25 10:34:16] 2025-07-25 14:33:44,870 | WARNING  | found no corresponding commands to process id(s)
[2025-07-25 10:34:16] 2025-07-25 14:33:44,871 | INFO     | Do not look for orphan processes in BOINC jobs
[2025-07-25 10:34:16] 2025-07-25 14:33:44,912 | INFO     | did not find any defunct processes belonging to 1023158
[2025-07-25 10:34:16] 2025-07-25 14:33:44,927 | INFO     | did not find any defunct processes belonging to 1023158
[2025-07-25 10:34:16] 2025-07-25 14:33:44,927 | INFO     | ready for new job
[2025-07-25 10:34:16] 2025-07-25 14:33:44,927 | INFO     | pilot has finished with previous job - re-establishing logging
[2025-07-25 10:34:16] 2025-07-25 14:33:44,929 | INFO     | ***************************************
[2025-07-25 10:34:16] 2025-07-25 14:33:44,930 | INFO     | ***  PanDA Pilot version 3.10.4.12  ***
[2025-07-25 10:34:16] 2025-07-25 14:33:44,930 | INFO     | ***************************************
[2025-07-25 10:34:16] 2025-07-25 14:33:44,930 | INFO     | 
[2025-07-25 10:34:16] 2025-07-25 14:33:44,957 | INFO     | architecture information:
[2025-07-25 10:34:16] 2025-07-25 14:33:44,972 | INFO     | executing command: cat /etc/os-release
[2025-07-25 10:34:16] 2025-07-25 14:33:45,024 | INFO     | cat /etc/os-release:
[2025-07-25 10:34:16] NAME="CentOS Linux"
[2025-07-25 10:34:16] VERSION="7 (Core)"
[2025-07-25 10:34:16] ID="centos"
[2025-07-25 10:34:16] ID_LIKE="rhel fedora"
[2025-07-25 10:34:16] VERSION_ID="7"
[2025-07-25 10:34:16] PRETTY_NAME="CentOS Linux 7 (Core)"
[2025-07-25 10:34:16] ANSI_COLOR="0;31"
[2025-07-25 10:34:16] CPE_NAME="cpe:/o:centos:centos:7"
[2025-07-25 10:34:16] HOME_URL="https://www.centos.org/"
[2025-07-25 10:34:16] BUG_REPORT_URL="https://bugs.centos.org/"
[2025-07-25 10:34:16] 
[2025-07-25 10:34:16] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2025-07-25 10:34:16] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2025-07-25 10:34:16] REDHAT_SUPPORT_PRODUCT="centos"
[2025-07-25 10:34:16] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2025-07-25 10:34:16] 
[2025-07-25 10:34:16] 2025-07-25 14:33:45,024 | INFO     | ***************************************
[2025-07-25 10:34:16] 2025-07-25 14:33:45,528 | INFO     | executing command: df -mP /tmp/boinchome/slots/0
[2025-07-25 10:34:16] 2025-07-25 14:33:45,559 | INFO     | sufficient remaining disk space (129380646912 B)
[2025-07-25 10:34:16] 2025-07-25 14:33:45,560 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2025-07-25 10:34:16] 2025-07-25 14:33:45,560 | INFO     | current server update state: UPDATING_FINAL
[2025-07-25 10:34:16] 2025-07-25 14:33:45,560 | INFO     | update_server=False
[2025-07-25 10:34:16] 2025-07-25 14:33:45,560 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2025-07-25 10:34:16] 2025-07-25 14:33:45,644 | INFO     | all payload control threads have been joined
[2025-07-25 10:34:16] 2025-07-25 14:33:45,718 | INFO     | all data control threads have been joined
[2025-07-25 10:34:16] 2025-07-25 14:33:45,905 | WARNING  | job monitor detected an abort_job request (signal=args.signal)
[2025-07-25 10:34:16] 2025-07-25 14:33:45,906 | WARNING  | cannot recover job monitoring - aborting pilot
[2025-07-25 10:34:16] 2025-07-25 14:33:45,906 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2025-07-25 10:34:16] 2025-07-25 14:33:45,906 | INFO     | will abort loop
[2025-07-25 10:34:16] 2025-07-25 14:33:46,566 | INFO     | [job] retrieve thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:46,645 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2025-07-25 10:34:16] 2025-07-25 14:33:46,650 | INFO     | [payload] control thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:46,725 | INFO     | [data] control thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:46,791 | INFO     | [payload] execute_payloads thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:46,820 | INFO     | [job] validate thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:46,912 | INFO     | [job] job monitor thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:47,017 | INFO     | [job] create_data_payload thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:47,228 | INFO     | all job control threads have been joined
[2025-07-25 10:34:16] 2025-07-25 14:33:47,266 | INFO     | [payload] validate_pre thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:47,433 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2025-07-25 10:34:16] 2025-07-25 14:33:47,548 | INFO     | [payload] run_realtimelog thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:47,671 | INFO     | [data] copytool_in thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:47,952 | INFO     | [payload] failed_post thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:48,112 | INFO     | [payload] validate_post thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:48,235 | INFO     | [job] control thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:48,439 | INFO     | [job] queue monitor thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:48,650 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2025-07-25 10:34:16] 2025-07-25 14:33:48,651 | INFO     | [data] copytool_out thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:52,658 | INFO     | [data] queue_monitor thread has finished
[2025-07-25 10:34:16] 2025-07-25 14:33:54,389 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140245262763840)>', '<ExcThread(monitor, started 140244624070400)>']
[2025-07-25 10:34:16] 2025-07-25 14:33:59,416 | INFO     | all workflow threads have been joined
[2025-07-25 10:34:16] 2025-07-25 14:33:59,418 | INFO     | end of generic workflow (traces error code: 0)
[2025-07-25 10:34:16] 2025-07-25 14:33:59,419 | INFO     | traces error code: 0
[2025-07-25 10:34:16] 2025-07-25 14:33:59,419 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2025-07-25 10:34:16] 2025-07-25 14:34:15,417 | INFO     | PID=1016677 has CPU usage=2.4% CMD=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/x86_64/python/3.9.20-x86_64-centos7/bin/python3 pilot3/pilot.py -q BOINC_MCORE -i P
[2025-07-25 10:34:16] 2025-07-25 14:34:15,419 | INFO     | .. there are 21 such processes running
[2025-07-25 10:34:16] 2025-07-25 14:34:15,420 | INFO     | found 0 job(s) in 20 queues
[2025-07-25 10:34:16] 2025-07-25 14:34:15,420 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2025-07-25 10:34:16] 2025-07-25 14:34:15,420 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2025-07-25 10:34:16] 2025-07-25 14:34:15,420 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2025-07-25 10:34:16] 2025-07-25 14:34:15,421 | INFO     | [monitor] control thread has ended
[2025-07-25 10:34:16] 2025-07-25 14:34:15,798 [wrapper] ==== pilot stdout END ====
[2025-07-25 10:34:16] 2025-07-25 14:34:15,804 [wrapper] ==== wrapper stdout RESUME ====
[2025-07-25 10:34:16] 2025-07-25 14:34:15,828 [wrapper] pilotpid: 1016677
[2025-07-25 10:34:16] 2025-07-25 14:34:15,831 [wrapper] Pilot exit status: 0
[2025-07-25 10:34:16] 2025-07-25 14:34:15,854 [wrapper] pandaids: 6745015445
[2025-07-25 10:34:16] 2025-07-25 14:34:15,949 [wrapper] cleanup supervisor_pilot 1016683 1016678
[2025-07-25 10:34:16] 2025-07-25 14:34:15,961 [wrapper] Test setup, not cleaning
[2025-07-25 10:34:16] 2025-07-25 14:34:15,988 [wrapper] apfmon messages muted
[2025-07-25 10:34:16] 2025-07-25 14:34:16,018 [wrapper] ==== wrapper stdout END ====
[2025-07-25 10:34:16] 2025-07-25 14:34:16,022 [wrapper] ==== wrapper stderr END ====
[2025-07-25 10:34:16]  *** Error codes and diagnostics ***
[2025-07-25 10:34:16]     "exeErrorCode": 0,
[2025-07-25 10:34:16]     "exeErrorDiag": "",
[2025-07-25 10:34:16]     "pilotErrorCode": 1305,
[2025-07-25 10:34:16]     "pilotErrorDiag": "Failed to execute payload:PyJobTransforms.transform.execute  CRITICAL Transform executor raised TransformValidationException: EVNTtoHITS got a SIGSEGV signal (exit code 139); Logfile error in log.EVNTtoHITS: \"ISF_Kernel_FullG4MT_QS                                16     4   FATAL  Stan",
[2025-07-25 10:34:16]  *** Listing of results directory ***
[2025-07-25 10:34:16] total 146504
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas    530466 Jul 25 07:25 pilot3.tar.gz
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas      5112 Jul 25 07:26 queuedata.json
[2025-07-25 10:34:16] -rwx------. 1 boincer umatlas     36292 Jul 25 07:27 runpilot2-wrapper.sh
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas       100 Jul 25 10:23 wrapper_26015_x86_64-pc-linux-gnu
[2025-07-25 10:34:16] -rwxr-xr-x. 1 boincer umatlas      7986 Jul 25 10:23 run_atlas
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas       105 Jul 25 10:23 job.xml
[2025-07-25 10:34:16] -rw-r--r--. 2 boincer umatlas 145481413 Jul 25 10:23 EVNT.45743347._001075.pool.root.1
[2025-07-25 10:34:16] -rw-r--r--. 2 boincer umatlas     15095 Jul 25 10:23 start_atlas.sh
[2025-07-25 10:34:16] drwxrwx--x. 2 boincer umatlas      4096 Jul 25 10:23 shared
[2025-07-25 10:34:16] -rw-r--r--. 2 boincer umatlas    542887 Jul 25 10:23 input.tar.gz
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas         0 Jul 25 10:23 boinc_lockfile
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas      2561 Jul 25 10:23 pandaJob.out
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas    979325 Jul 25 10:23 agis_schedconf.cvmfs.json
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas   1593052 Jul 25 10:23 agis_ddmendpoints.agis.ALL.json
[2025-07-25 10:34:16] drwx------. 4 boincer umatlas      4096 Jul 25 10:23 pilot3
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas      6937 Jul 25 10:27 init_data.xml
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas       527 Jul 25 10:31 boinc_task_state.xml
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas      1033 Jul 25 10:31 memory_monitor_summary.json
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas    218854 Jul 25 10:31 log.45743349._020362.job.log.tgz.1
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas     11601 Jul 25 10:33 heartbeat.json
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas        25 Jul 25 10:34 wrapper_checkpoint.txt
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas      8192 Jul 25 10:34 boinc_mmap_file
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas        95 Jul 25 10:34 pilot_heartbeat.json
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas       823 Jul 25 10:34 pilotlog.txt
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas    111642 Jul 25 10:34 log.45743349._020362.job.log.1
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas       188 Jul 25 10:34 output.list
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas       620 Jul 25 10:34 runtime_log
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas    358400 Jul 25 10:34 result.tar.gz
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas      8502 Jul 25 10:34 runtime_log.err
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas       644 Jul 25 10:34 Ps2NDmOlVz7nsSi4ap6QjLDmwznN0nGgGQJmUXzaDmKQPLDmnl2dkn.diag
[2025-07-25 10:34:16] -rw-r--r--. 1 boincer umatlas     16437 Jul 25 10:34 stderr.txt
[2025-07-25 10:34:16] No HITS result produced
[2025-07-25 10:34:16]  *** Contents of shared directory: ***
[2025-07-25 10:34:16] total 142972
[2025-07-25 10:34:16] -rw-r--r--. 2 boincer umatlas 145481413 Jul 25 10:23 ATLAS.root_0
[2025-07-25 10:34:16] -rw-r--r--. 2 boincer umatlas     15095 Jul 25 10:23 start_atlas.sh
[2025-07-25 10:34:16] -rw-r--r--. 2 boincer umatlas    542887 Jul 25 10:23 input.tar.gz
[2025-07-25 10:34:16] -rw-------. 1 boincer umatlas    358400 Jul 25 10:34 result.tar.gz
10:34:18 (1013088): run_atlas exited; CPU time 1220.351898
10:34:18 (1013088): called boinc_finish(0)

</stderr_txt>
]]>

