Name leqKDm5WvO5n7Olcko1bjSoqABFKDmABFKDmOtvXDmHqVKDmjh3NIm_0
Workunit 222848304
Created 8 May 2024, 12:25:46 UTC
Sent 8 May 2024, 12:25:48 UTC
Report deadline 16 May 2024, 12:25:48 UTC
Received 13 May 2024, 7:31:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 10691224
Run time 1 day 16 hours 49 min 33 sec
CPU time 6 days 17 hours 35 min 23 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 11.86 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu
Peak working set size 1.87 GB
Peak swap size 36.52 GB
Peak disk usage 4.08 GB
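
The run time and CPU time fields above can be combined into a rough measure of how fully the task used its threads. The sketch below is only an illustration: the helper name `to_seconds` is hypothetical, and the thread count of 4 is taken from the `--nthreads 4` argument in the stderr log rather than from the summary fields.

```python
def to_seconds(days: int, hours: int, minutes: int, seconds: int) -> int:
    """Convert a 'D days H hours M min S sec' duration to seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


# Values copied from the summary fields of this task.
run_time_s = to_seconds(1, 16, 49, 33)   # "Run time"
cpu_time_s = to_seconds(6, 17, 35, 23)   # "CPU time"
threads = 4                              # assumption: from "--nthreads 4" in the log

# Fraction of the available thread-seconds that was spent on CPU.
efficiency = cpu_time_s / (run_time_s * threads)
print(f"run time: {run_time_s} s, CPU time: {cpu_time_s} s")
print(f"thread utilisation: {efficiency:.1%}")
```

The CPU time computed this way agrees with the `CPU time 581723.837413` reported by the wrapper at the end of the log, once the fractional part is truncated.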

Stderr output

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<stderr_txt>
16:29:35 (1284993): wrapper (7.7.26015): starting
16:29:35 (1284993): wrapper: running run_atlas (--nthreads 4)
[2024-05-11 16:29:35] Arguments: --nthreads 4
[2024-05-11 16:29:35] Threads: 4
[2024-05-11 16:29:35] Checking for CVMFS
[2024-05-11 16:29:35] No cvmfs_config command found, will try listing directly
[2024-05-11 16:29:35] CVMFS is ok
[2024-05-11 16:29:35] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2024-05-11 16:29:35] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2024-05-11 16:29:35] Further information can be found at the LHC@home message board.
[2024-05-11 16:29:35] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2024-05-11 16:29:35] Checking for apptainer binary...
[2024-05-11 16:29:35] which: no apptainer in ((null))
[2024-05-11 16:29:35] apptainer is not installed, using version from CVMFS
[2024-05-11 16:29:35] Checking apptainer works with /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2024-05-11 16:29:37] WARNING: Environment variable TMPDIR already has value [/scratch/boinc/var/slot40/slots/1/.apptainertmp], will not forward new value [/tmp] from parent process environment skurut06.grid.cesnet.cz
[2024-05-11 16:29:37] apptainer works
[2024-05-11 16:29:37] Set ATHENA_PROC_NUMBER=4
[2024-05-11 16:29:37] Set ATHENA_CORE_NUMBER=4
[2024-05-11 16:29:37] Starting ATLAS job with PandaID=6199047905
[2024-05-11 16:29:37] Running command: /cvmfs/atlas.cern.ch/repo/containers/sw/apptainer/x86_64-el7/current/bin/apptainer exec -B /cvmfs,/scratch/boinc/var/slot40/slots/1 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2024-05-13 09:19:05]  *** The last 200 lines of the pilot log: ***
[2024-05-13 09:19:05] 2024-05-13 07:18:40,337 | INFO     | preparing for final server update for job 6199047905 in state='finished'
[2024-05-13 09:19:05] 2024-05-13 07:18:40,338 | INFO     | this job has now completed (state=finished)
[2024-05-13 09:19:05] 2024-05-13 07:18:40,338 | INFO     | pilot will not update the server (heartbeat message will be written to file)
[2024-05-13 09:19:05] 2024-05-13 07:18:40,338 | INFO     | job 6199047905 has finished - writing final server update
[2024-05-13 09:19:05] 2024-05-13 07:18:40,338 | WARNING  | failed to read HTCondor job classAd: [Errno 2] No such file or directory: '/scratch/condor/execute/dir_19761/.job.ad'
[2024-05-13 09:19:05] 2024-05-13 07:18:40,338 | INFO     | total number of processed events: 3800 (read)
[2024-05-13 09:19:05] 2024-05-13 07:18:40,343 | INFO     | executing command: lscpu
[2024-05-13 09:19:05] 2024-05-13 07:18:40,386 | INFO     | found 32 cores (16 cores per socket, 2 sockets)
[2024-05-13 09:19:05] 2024-05-13 07:18:40,386 | INFO     | executing command: grep -o 'avx2[^ ]*\|AVX2[^ ]*' /proc/cpuinfo
[2024-05-13 09:19:05] 2024-05-13 07:18:40,417 | INFO     | executing command: export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase;source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh --quiet;lsetup
[2024-05-13 09:19:05] 2024-05-13 07:18:42,081 | INFO     | monitor loop #11176: job 0:6199047905 is in state 'finished'
[2024-05-13 09:19:05] 2024-05-13 07:18:42,081 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-13 09:19:05] 2024-05-13 07:18:42,915 | INFO     | 146938s have passed since pilot start
[2024-05-13 09:19:05] 2024-05-13 07:18:43,513 | INFO     | CPU arch script returned: x86-64-v4
[2024-05-13 09:19:05] 2024-05-13 07:18:43,513 | INFO     | using path: /scratch/boinc/var/slot40/slots/1/PanDA_Pilot-6199047905/memory_monitor_summary.json (trf name=prmon)
[2024-05-13 09:19:05] 2024-05-13 07:18:43,514 | INFO     | extracted standard info from prmon json
[2024-05-13 09:19:05] 2024-05-13 07:18:43,514 | INFO     | extracted standard memory fields from prmon json
[2024-05-13 09:19:05] 2024-05-13 07:18:43,514 | WARNING  | GPU info not found in prmon json
[2024-05-13 09:19:05] 2024-05-13 07:18:43,534 | INFO     | fitting pss+swap vs Time
[2024-05-13 09:19:05] 2024-05-13 07:18:43,536 | INFO     | model: linear, x: [1715437813.0, 1715437874.0, 1715437935.0, 1715437996.0, 1715438057.0, 1715438118.0, 1715438179.0, 1715438240.0, 1715438301.0, 1715438362.0, 1715
[2024-05-13 09:19:05] 2024-05-13 07:18:43,536 | INFO     | sum of square deviations: 4334996479660.0
[2024-05-13 09:19:05] 2024-05-13 07:18:43,606 | INFO     | sum of deviations: 4585231646570.998
[2024-05-13 09:19:05] 2024-05-13 07:18:43,607 | INFO     | mean x: 1715511257.0
[2024-05-13 09:19:05] 2024-05-13 07:18:43,607 | INFO     | mean y: 2772818.870900789
[2024-05-13 09:19:05] 2024-05-13 07:18:43,607 | INFO     | -- intersect: -1811765334.1709526
[2024-05-13 09:19:05] 2024-05-13 07:18:43,607 | INFO     | intersect: -1811765334.1709526
[2024-05-13 09:19:05] 2024-05-13 07:18:43,608 | INFO     | chi2: 15.955635684425918
[2024-05-13 09:19:05] 2024-05-13 07:18:43,610 | INFO     | model: linear, x: [1715437813.0, 1715437874.0, 1715437935.0, 1715437996.0, 1715438057.0, 1715438118.0, 1715438179.0, 1715438240.0, 1715438301.0, 1715438362.0, 1715
[2024-05-13 09:19:05] 2024-05-13 07:18:43,610 | INFO     | sum of square deviations: 4308059956005.0
[2024-05-13 09:19:05] 2024-05-13 07:18:43,679 | INFO     | sum of deviations: 5356699267002.0
[2024-05-13 09:19:05] 2024-05-13 07:18:43,679 | INFO     | mean x: 1715511104.5
[2024-05-13 09:19:05] 2024-05-13 07:18:43,679 | INFO     | mean y: 2777186.0690515805
[2024-05-13 09:19:05] 2024-05-13 07:18:43,679 | INFO     | -- intersect: -2130312225.3723419
[2024-05-13 09:19:05] 2024-05-13 07:18:43,679 | INFO     | intersect: -2130312225.3723419
[2024-05-13 09:19:05] 2024-05-13 07:18:43,681 | INFO     | chi2: 13.001890095330884
[2024-05-13 09:19:05] 2024-05-13 07:18:43,681 | INFO     | current chi2=13.001890095330884 (change=18.51224011073495 %)
[2024-05-13 09:19:05] 2024-05-13 07:18:43,681 | INFO     | right removable region: 2403
[2024-05-13 09:19:05] 2024-05-13 07:18:43,682 | INFO     | model: linear, x: [1715438118.0, 1715438179.0, 1715438240.0, 1715438301.0, 1715438362.0, 1715438423.0, 1715438484.0, 1715438545.0, 1715438606.0, 1715438667.0, 1715
[2024-05-13 09:19:05] 2024-05-13 07:18:43,683 | INFO     | sum of square deviations: 4308059956005.0
[2024-05-13 09:19:05] 2024-05-13 07:18:43,752 | INFO     | sum of deviations: 3809527369685.499
[2024-05-13 09:19:05] 2024-05-13 07:18:43,752 | INFO     | mean x: 1715511409.5
[2024-05-13 09:19:05] 2024-05-13 07:18:43,752 | INFO     | mean y: 2777209.8065723795
[2024-05-13 09:19:05] 2024-05-13 07:18:43,752 | INFO     | -- intersect: -1514213671.0627775
[2024-05-13 09:19:05] 2024-05-13 07:18:43,752 | INFO     | intersect: -1514213671.0627775
[2024-05-13 09:19:05] 2024-05-13 07:18:43,754 | INFO     | chi2: 12.957348275042397
[2024-05-13 09:19:05] 2024-05-13 07:18:43,754 | INFO     | current chi2=12.957348275042397 (change=18.791400535110668 %)
[2024-05-13 09:19:05] 2024-05-13 07:18:43,754 | INFO     | left removable region: 10
[2024-05-13 09:19:05] 2024-05-13 07:18:43,755 | INFO     | model: linear, x: [1715438423.0, 1715438484.0, 1715438545.0, 1715438606.0, 1715438667.0, 1715438728.0, 1715438789.0, 1715438850.0, 1715438911.0, 1715438972.0, 1715
[2024-05-13 09:19:05] 2024-05-13 07:18:43,755 | INFO     | sum of square deviations: 4249192869012.0
[2024-05-13 09:19:05] 2024-05-13 07:18:43,824 | INFO     | sum of deviations: 4076480783630.001
[2024-05-13 09:19:05] 2024-05-13 07:18:43,824 | INFO     | mean x: 1715511379.0
[2024-05-13 09:19:05] 2024-05-13 07:18:43,824 | INFO     | mean y: 2786167.968240702
[2024-05-13 09:19:05] 2024-05-13 07:18:43,824 | INFO     | -- intersect: -1642996780.975222
[2024-05-13 09:19:05] 2024-05-13 07:18:43,824 | INFO     | intersect: -1642996780.975222
[2024-05-13 09:19:05] 2024-05-13 07:18:43,825 | INFO     | chi2: 7.409426407718181
[2024-05-13 09:19:05] 2024-05-13 07:18:43,825 | INFO     | -- intersect: -1642996780.975222
[2024-05-13 09:19:05] 2024-05-13 07:18:43,826 | INFO     | current memory leak: 0.96 B/s (using 2393 data points, chi2=7.41)
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | ..............................
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . Timing measurements:
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . get job = 0 s
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . initial setup = 1 s
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . payload setup = 7 s
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . stage-in = 0 s
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . payload execution = 146900 s
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . stage-out = 5 s
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | . log creation = 2 s
[2024-05-13 09:19:05] 2024-05-13 07:18:43,827 | INFO     | ..............................
[2024-05-13 09:19:05] 2024-05-13 07:18:44,103 | INFO     | 
[2024-05-13 09:19:05] 2024-05-13 07:18:44,103 | INFO     | job summary report
[2024-05-13 09:19:05] 2024-05-13 07:18:44,103 | INFO     | --------------------------------------------------
[2024-05-13 09:19:05] 2024-05-13 07:18:44,103 | INFO     | PanDA job id: 6199047905
[2024-05-13 09:19:05] 2024-05-13 07:18:44,103 | INFO     | task id: 38734053
[2024-05-13 09:19:05] 2024-05-13 07:18:44,103 | INFO     | errors: (none)
[2024-05-13 09:19:05] 2024-05-13 07:18:44,103 | INFO     | status: LOG_TRANSFER = DONE 
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | pilot state: finished 
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | transexitcode: 0
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | exeerrorcode: 0
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | exeerrordiag: 
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | exitcode: 0
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | exitmsg: OK
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | cpuconsumptiontime: 591716 s
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | nevents: 3800
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | neventsw: 0
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | pid: 1297308
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | pgrp: 1297308
[2024-05-13 09:19:05] 2024-05-13 07:18:44,104 | INFO     | corecount: 4
[2024-05-13 09:19:05] 2024-05-13 07:18:44,105 | INFO     | event service: False
[2024-05-13 09:19:05] 2024-05-13 07:18:44,105 | INFO     | sizes: {0: 2386088, 4: 2386294, 11: 2386322, 146912: 2502465, 146917: 2511447, 146918: 2511503, 146924: 2511853}
[2024-05-13 09:19:05] 2024-05-13 07:18:44,105 | INFO     | --------------------------------------------------
[2024-05-13 09:19:05] 2024-05-13 07:18:44,105 | INFO     | 
[2024-05-13 09:19:05] 2024-05-13 07:18:44,105 | INFO     | executing command: ls -lF /scratch/boinc/var/slot40/slots/1
[2024-05-13 09:19:05] 2024-05-13 07:18:44,126 | INFO     | queue jobs had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue payloads had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue data_in had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue data_out had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue current_data_in had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,127 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | queue completed_jobids has 1 job(s)
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | queue messages had 0 job(s) [purged]
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | job 6199047905 has completed (purged errors)
[2024-05-13 09:19:05] 2024-05-13 07:18:44,128 | INFO     | overall cleanup function is called
[2024-05-13 09:19:05] 2024-05-13 07:18:45,141 | INFO     | --- collectZombieJob: --- 10, [1297308]
[2024-05-13 09:19:05] 2024-05-13 07:18:45,141 | INFO     | zombie collector waiting for pid 1297308
[2024-05-13 09:19:05] 2024-05-13 07:18:45,141 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2024-05-13 09:19:05] 2024-05-13 07:18:46,145 | INFO     | collected zombie processes
[2024-05-13 09:19:05] 2024-05-13 07:18:46,145 | INFO     | will now attempt to kill all subprocesses of pid=1297308
[2024-05-13 09:19:05] 2024-05-13 07:18:46,284 | INFO     | process IDs to be killed: [1297308] (in reverse order)
[2024-05-13 09:19:05] 2024-05-13 07:18:46,382 | WARNING  | found no corresponding commands to process id(s)
[2024-05-13 09:19:05] 2024-05-13 07:18:46,383 | INFO     | Do not look for orphan processes in BOINC jobs
[2024-05-13 09:19:05] 2024-05-13 07:18:46,390 | INFO     | did not find any defunct processes belonging to 1297308
[2024-05-13 09:19:05] 2024-05-13 07:18:46,396 | INFO     | did not find any defunct processes belonging to 1297308
[2024-05-13 09:19:05] 2024-05-13 07:18:46,396 | INFO     | ready for new job
[2024-05-13 09:19:05] 2024-05-13 07:18:46,396 | INFO     | pilot has finished with previous job - re-establishing logging
[2024-05-13 09:19:05] 2024-05-13 07:18:46,408 | INFO     | **************************************
[2024-05-13 09:19:05] 2024-05-13 07:18:46,408 | INFO     | ***  PanDA Pilot version 3.7.3.84  ***
[2024-05-13 09:19:05] 2024-05-13 07:18:46,408 | INFO     | **************************************
[2024-05-13 09:19:05] 2024-05-13 07:18:46,408 | INFO     | 
[2024-05-13 09:19:05] 2024-05-13 07:18:46,411 | INFO     | architecture information:
[2024-05-13 09:19:05] 2024-05-13 07:18:46,411 | INFO     | executing command: cat /etc/os-release
[2024-05-13 09:19:05] 2024-05-13 07:18:46,425 | INFO     | cat /etc/os-release:
[2024-05-13 09:19:05] NAME="CentOS Linux"
[2024-05-13 09:19:05] VERSION="7 (Core)"
[2024-05-13 09:19:05] ID="centos"
[2024-05-13 09:19:05] ID_LIKE="rhel fedora"
[2024-05-13 09:19:05] VERSION_ID="7"
[2024-05-13 09:19:05] PRETTY_NAME="CentOS Linux 7 (Core)"
[2024-05-13 09:19:05] ANSI_COLOR="0;31"
[2024-05-13 09:19:05] CPE_NAME="cpe:/o:centos:centos:7"
[2024-05-13 09:19:05] HOME_URL="https://www.centos.org/"
[2024-05-13 09:19:05] BUG_REPORT_URL="https://bugs.centos.org/"
[2024-05-13 09:19:05] 
[2024-05-13 09:19:05] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2024-05-13 09:19:05] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2024-05-13 09:19:05] REDHAT_SUPPORT_PRODUCT="centos"
[2024-05-13 09:19:05] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2024-05-13 09:19:05] 
[2024-05-13 09:19:05] 2024-05-13 07:18:46,426 | INFO     | **************************************
[2024-05-13 09:19:05] 2024-05-13 07:18:46,928 | INFO     | executing command: df -mP /scratch/boinc/var/slot40/slots/1
[2024-05-13 09:19:05] 2024-05-13 07:18:46,942 | INFO     | sufficient remaining disk space (642519138304 B)
[2024-05-13 09:19:05] 2024-05-13 07:18:46,943 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2024-05-13 09:19:05] 2024-05-13 07:18:46,943 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2024-05-13 09:19:05] 2024-05-13 07:18:47,091 | WARNING  | job monitor detected an abort_job request (signal=args.signal)
[2024-05-13 09:19:05] 2024-05-13 07:18:47,091 | WARNING  | cannot recover job monitoring - aborting pilot
[2024-05-13 09:19:05] 2024-05-13 07:18:47,091 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2024-05-13 09:19:05] 2024-05-13 07:18:47,091 | INFO     | will abort loop
[2024-05-13 09:19:05] 2024-05-13 07:18:47,481 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2024-05-13 09:19:05] 2024-05-13 07:18:47,856 | INFO     | all data control threads have been joined
[2024-05-13 09:19:05] 2024-05-13 07:18:47,931 | INFO     | found 0 job(s) in 20 queues
[2024-05-13 09:19:05] 2024-05-13 07:18:47,931 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2024-05-13 09:19:05] 2024-05-13 07:18:47,931 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2024-05-13 09:19:05] 2024-05-13 07:18:47,948 | INFO     | [payload] validate_pre thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:47,948 | INFO     | [job] retrieve thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:47,999 | INFO     | all payload control threads have been joined
[2024-05-13 09:19:05] 2024-05-13 07:18:48,096 | INFO     | [job] job monitor thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:48,101 | INFO     | all job control threads have been joined
[2024-05-13 09:19:05] 2024-05-13 07:18:48,270 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2024-05-13 09:19:05] 2024-05-13 07:18:48,584 | INFO     | [data] copytool_in thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:48,788 | INFO     | [payload] validate_post thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:48,861 | INFO     | [data] control thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:48,892 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2024-05-13 09:19:05] 2024-05-13 07:18:49,004 | INFO     | [payload] control thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:49,082 | INFO     | [job] create_data_payload thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:49,103 | INFO     | [job] control thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:49,229 | INFO     | [payload] execute_payloads thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:49,365 | INFO     | [payload] failed_post thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:49,381 | INFO     | [job] validate thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:49,486 | INFO     | [data] copytool_out thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:49,897 | INFO     | [job] queue monitor thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:52,275 | INFO     | [data] queue_monitor thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:57,181 | INFO     | job.realtimelogging is not enabled
[2024-05-13 09:19:05] 2024-05-13 07:18:58,184 | INFO     | [payload] run_realtimelog thread has finished
[2024-05-13 09:19:05] 2024-05-13 07:18:59,339 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 47486417519680)>', '<ExcThread(monitor, started 47486597175040)>']
[2024-05-13 09:19:05] 2024-05-13 07:18:59,984 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2024-05-13 09:19:05] 2024-05-13 07:18:59,984 | INFO     | [monitor] control thread has ended
[2024-05-13 09:19:05] 2024-05-13 07:19:04,364 | INFO     | all workflow threads have been joined
[2024-05-13 09:19:05] 2024-05-13 07:19:04,364 | INFO     | end of generic workflow (traces error code: 0)
[2024-05-13 09:19:05] 2024-05-13 07:19:04,364 | INFO     | traces error code: 0
[2024-05-13 09:19:05] 2024-05-13 07:19:04,364 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2024-05-13 09:19:05] 2024-05-13 07:19:04,437 [wrapper] ==== pilot stdout END ====
[2024-05-13 09:19:05] 2024-05-13 07:19:04,438 [wrapper] ==== wrapper stdout RESUME ====
[2024-05-13 09:19:05] 2024-05-13 07:19:04,440 [wrapper] pilotpid: 1289330
[2024-05-13 09:19:05] 2024-05-13 07:19:04,443 [wrapper] Pilot exit status: 0
[2024-05-13 09:19:05] 2024-05-13 07:19:04,450 [wrapper] pandaids: 6199047905
[2024-05-13 09:19:05] 2024-05-13 07:19:04,509 [wrapper] cleanup: SIGTERM to supervisor_pilot 2238617 1289331
[2024-05-13 09:19:05] 2024-05-13 07:19:04,512 [wrapper] Test setup, not cleaning
[2024-05-13 09:19:05] 2024-05-13 07:19:04,514 [wrapper] ==== wrapper stdout END ====
[2024-05-13 09:19:05] 2024-05-13 07:19:04,517 [wrapper] ==== wrapper stderr END ====
[2024-05-13 09:19:05] 2024-05-13 07:19:04,521 [wrapper] apfmon messages muted
[2024-05-13 09:19:05]  *** Error codes and diagnostics ***
[2024-05-13 09:19:05]     "exeErrorCode": 0,
[2024-05-13 09:19:05]     "exeErrorDiag": "",
[2024-05-13 09:19:05]     "pilotErrorCode": 0,
[2024-05-13 09:19:05]     "pilotErrorDiag": "",
[2024-05-13 09:19:05]  *** Listing of results directory ***
[2024-05-13 09:19:05] total 2676996
[2024-05-13 09:19:05] -rwx------ 1 boinc boinc      32251 May  8 14:24 runpilot2-wrapper.sh
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc     468928 May  8 14:24 pilot3.tar.gz
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc       5634 May  8 14:25 queuedata.json
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc        100 May 11 16:29 wrapper_26015_x86_64-pc-linux-gnu
[2024-05-13 09:19:05] -rwxr-xr-x 1 boinc boinc       7986 May 11 16:29 run_atlas
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc        105 May 11 16:29 job.xml
[2024-05-13 09:19:05] -rw-r--r-- 2 boinc boinc  326815254 May 11 16:29 EVNT.38734051._000500.pool.root.1
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc       6661 May 11 16:29 init_data.xml
[2024-05-13 09:19:05] -rw-r--r-- 2 boinc boinc      17539 May 11 16:29 start_atlas.sh
[2024-05-13 09:19:05] drwxrwx--x 2 boinc boinc       4096 May 11 16:29 shared
[2024-05-13 09:19:05] -rw-r--r-- 2 boinc boinc     481747 May 11 16:29 input.tar.gz
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc          0 May 11 16:29 boinc_lockfile
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc       2851 May 11 16:29 pandaJob.out
[2024-05-13 09:19:05] -rw------- 1 boinc boinc        424 May 11 16:29 setup.sh.local
[2024-05-13 09:19:05] -rw------- 1 boinc boinc    1002363 May 11 16:29 agis_schedconf.cvmfs.json
[2024-05-13 09:19:05] -rw------- 1 boinc boinc    1319809 May 11 16:29 cric_ddmendpoints.json
[2024-05-13 09:19:05] drwx------ 4 boinc boinc       4096 May 11 16:29 pilot3
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc        534 May 13 09:08 boinc_task_state.xml
[2024-05-13 09:19:05] -rw------- 1 boinc boinc 2319284826 May 13 09:17 HITS.38734053._001935.pool.root.1
[2024-05-13 09:19:05] -rw------- 1 boinc boinc       1065 May 13 09:18 memory_monitor_summary.json
[2024-05-13 09:19:05] -rw------- 1 boinc boinc          0 May 13 09:18 agis_ddmendpoints.agis.ALL.json
[2024-05-13 09:19:05] -rw------- 1 boinc boinc    7697587 May 13 09:18 log.38734053._001935.job.log.tgz.1
[2024-05-13 09:19:05] -rw------- 1 boinc boinc         95 May 13 09:18 pilot_heartbeat.json
[2024-05-13 09:19:05] -rw------- 1 boinc boinc      27943 May 13 09:18 heartbeat.json
[2024-05-13 09:19:05] -rw------- 1 boinc boinc       4483 May 13 09:19 pilotlog.txt
[2024-05-13 09:19:05] -rw------- 1 boinc boinc   38098371 May 13 09:19 log.38734053._001935.job.log.1
[2024-05-13 09:19:05] -rw------- 1 boinc boinc        353 May 13 09:19 output.list
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc        620 May 13 09:19 runtime_log
[2024-05-13 09:19:05] -rw------- 1 boinc boinc   45834240 May 13 09:19 result.tar.gz
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc      11412 May 13 09:19 runtime_log.err
[2024-05-13 09:19:05] -rw------- 1 boinc boinc        671 May 13 09:19 leqKDm5WvO5n7Olcko1bjSoqABFKDmABFKDmOtvXDmHqVKDmjh3NIm.diag
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc       8192 May 13 09:19 boinc_mmap_file
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc         30 May 13 09:19 wrapper_checkpoint.txt
[2024-05-13 09:19:05] -rw-r--r-- 1 boinc boinc      21638 May 13 09:19 stderr.txt
[2024-05-13 09:19:05] HITS file was successfully produced:
[2024-05-13 09:19:05] -rw------- 1 boinc boinc 2319284826 May 13 09:17 shared/HITS.pool.root.1
[2024-05-13 09:19:05]  *** Contents of shared directory: ***
[2024-05-13 09:19:05] total 2629344
[2024-05-13 09:19:05] -rw-r--r-- 2 boinc boinc  326815254 May 11 16:29 ATLAS.root_0
[2024-05-13 09:19:05] -rw-r--r-- 2 boinc boinc      17539 May 11 16:29 start_atlas.sh
[2024-05-13 09:19:05] -rw-r--r-- 2 boinc boinc     481747 May 11 16:29 input.tar.gz
[2024-05-13 09:19:05] -rw------- 1 boinc boinc 2319284826 May 13 09:17 HITS.pool.root.1
[2024-05-13 09:19:05] -rw------- 1 boinc boinc   45834240 May 13 09:19 result.tar.gz
09:19:07 (1284993): run_atlas exited; CPU time 581723.837413
09:19:07 (1284993): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>leqKDm5WvO5n7Olcko1bjSoqABFKDmABFKDmOtvXDmHqVKDmjh3NIm_0_r2109484840_ATLAS_hits</file_name>
  <error_code>-131 (file size too big)</error_code>
</file_xfer_error>
</message>
]]>
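
The pilot finished with exit code 0, yet the task is marked as a computation error. The only failure in the report is the upload error above. A minimal sketch for pulling the upload error and the HITS output size out of a saved copy of this report is shown below. The function name `summarize_upload_failure` and the `SAMPLE` excerpt are illustrative, and the regular expressions assume the `ls -l` layout seen in this particular log. The report does not state the server-side size limit, so the sketch does not guess it.

```python
import re

# Excerpt copied from the report above; a full saved report works the same way.
SAMPLE = """\
[2024-05-13 09:19:05] HITS file was successfully produced:
[2024-05-13 09:19:05] -rw------- 1 boinc boinc 2319284826 May 13 09:17 shared/HITS.pool.root.1
<message>
upload failure: <file_xfer_error>
  <file_name>leqKDm5WvO5n7Olcko1bjSoqABFKDmABFKDmOtvXDmHqVKDmjh3NIm_0_r2109484840_ATLAS_hits</file_name>
  <error_code>-131 (file size too big)</error_code>
</file_xfer_error>
</message>
"""


def summarize_upload_failure(report: str) -> dict:
    """Collect the upload error and the HITS file size from a task report."""
    name = re.search(r"<file_name>(.*?)</file_name>", report)
    code = re.search(r"<error_code>(-?\d+)\s*\((.*?)\)</error_code>", report)
    # In `ls -l` output the size precedes the month, day and time columns.
    hits = re.search(
        r"(\d+) \w{3} +\d+ [\d:]+ shared/HITS\.pool\.root\.1", report
    )
    size = int(hits.group(1)) if hits else None
    return {
        "file_name": name.group(1) if name else None,
        "error_code": int(code.group(1)) if code else None,
        "error_text": code.group(2) if code else None,
        "size_bytes": size,
        "size_gib": round(size / 2**30, 2) if size is not None else None,
    }


result = summarize_upload_failure(SAMPLE)
for key, value in result.items():
    print(f"{key}: {value}")
```

Reading the two values side by side shows that the rejected upload is the `ATLAS_hits` output file, which is 2319284826 bytes on disk.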


©2024 CERN