Name 3UmKDmDWrR5n9Rq4apoT9bVoABFKDmABFKDmlqFKDmiOKKDmUf2wZm_0
Workunit 223048783
Created 16 May 2024, 14:44:39 UTC
Sent 16 May 2024, 18:21:48 UTC
Report deadline 24 May 2024, 18:21:48 UTC
Received 17 May 2024, 3:27:08 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 10804872
Run time 8 hours 53 min 8 sec
CPU time 1 days 7 hours 21 min 23 sec
Validate state Valid
Credit 2,096.68
Device peak FLOPS 44.07 GFLOPS
Application version ATLAS Simulation v3.01 (native_mt)
x86_64-pc-linux-gnu
Peak working set size 2.53 GB
Peak swap size 31.46 GB
Peak disk usage 1.49 GB

Stderr output

<core_client_version>7.7.0</core_client_version>
<![CDATA[
<stderr_txt>
14:22:41 (68850): wrapper (7.7.26015): starting
14:22:41 (68850): wrapper: running run_atlas (--nthreads 8)
[2024-05-16 14:22:41] Arguments: --nthreads 8
[2024-05-16 14:22:41] Threads: 8
[2024-05-16 14:22:41] Checking for CVMFS
[2024-05-16 14:22:41] Probing /cvmfs/atlas.cern.ch... OK
[2024-05-16 14:22:42] Probing /cvmfs/atlas-condb.cern.ch... OK
[2024-05-16 14:22:42] Running cvmfs_config stat atlas.cern.ch
[2024-05-16 14:22:43] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2024-05-16 14:22:43] 2.11.2.0 29010 1135 81588 132809 1 98 16508196 18432000 21615 130560 0 9293529 98.282 32067532 53630 http://cvmfs-s1fnal.opensciencegrid.org:8000/cvmfs/atlas.cern.ch http://192.41.231.237:6081 1
[2024-05-16 14:22:43] CVMFS is ok
[2024-05-16 14:22:43] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2024-05-16 14:22:43] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2024-05-16 14:22:43] Further information can be found at the LHC@home message board.
[2024-05-16 14:22:43] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2024-05-16 14:22:43] Checking for apptainer binary...
[2024-05-16 14:22:43] Using apptainer found in PATH at /usr/bin/apptainer
[2024-05-16 14:22:43] Running /usr/bin/apptainer --version
[2024-05-16 14:22:43] apptainer version 1.3.0-1.el7
[2024-05-16 14:22:43] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname
[2024-05-16 14:22:55] c7-16-9.aglt2.org
[2024-05-16 14:22:55] apptainer works
[2024-05-16 14:22:55] Set ATHENA_PROC_NUMBER=8
[2024-05-16 14:22:55] Set ATHENA_CORE_NUMBER=8
[2024-05-16 14:22:55] Starting ATLAS job with PandaID=6207395569
[2024-05-16 14:22:55] Running command: /usr/bin/apptainer exec -B /cvmfs,/tmp/boinchome/slots/0 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh
[2024-05-16 23:15:45]  *** The last 200 lines of the pilot log: ***
[2024-05-16 23:15:45] 2024-05-17 03:15:09,458 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-16 23:15:45] 2024-05-17 03:15:11,962 | INFO     | monitor loop #2263: job 0:6207395569 is in state 'finished'
[2024-05-16 23:15:45] 2024-05-17 03:15:11,962 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-16 23:15:45] 2024-05-17 03:15:14,468 | INFO     | monitor loop #2264: job 0:6207395569 is in state 'finished'
[2024-05-16 23:15:45] 2024-05-17 03:15:14,469 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-16 23:15:45] 2024-05-17 03:15:16,973 | INFO     | monitor loop #2265: job 0:6207395569 is in state 'finished'
[2024-05-16 23:15:45] 2024-05-17 03:15:16,974 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-16 23:15:45] 2024-05-17 03:15:19,478 | INFO     | monitor loop #2266: job 0:6207395569 is in state 'finished'
[2024-05-16 23:15:45] 2024-05-17 03:15:19,478 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-16 23:15:45] 2024-05-17 03:15:21,983 | INFO     | monitor loop #2267: job 0:6207395569 is in state 'finished'
[2024-05-16 23:15:45] 2024-05-17 03:15:21,984 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-16 23:15:45] 2024-05-17 03:15:24,488 | INFO     | monitor loop #2268: job 0:6207395569 is in state 'finished'
[2024-05-16 23:15:45] 2024-05-17 03:15:24,488 | INFO     | will abort job monitoring soon since job state=finished (job is still in queue)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,657 | INFO     | CPU arch script returned: x86-64-v3
[2024-05-16 23:15:45] 2024-05-17 03:15:25,658 | INFO     | using path: /tmp/boinchome/slots/0/PanDA_Pilot-6207395569/memory_monitor_summary.json (trf name=prmon)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,659 | INFO     | extracted standard info from prmon json
[2024-05-16 23:15:45] 2024-05-17 03:15:25,659 | INFO     | extracted standard memory fields from prmon json
[2024-05-16 23:15:45] 2024-05-17 03:15:25,659 | WARNING  | GPU info not found in prmon json
[2024-05-16 23:15:45] 2024-05-17 03:15:25,660 | WARNING  | format EVNTtoHITS has no such key: dbData
[2024-05-16 23:15:45] 2024-05-17 03:15:25,660 | WARNING  | format EVNTtoHITS has no such key: dbTime
[2024-05-16 23:15:45] 2024-05-17 03:15:25,668 | INFO     | fitting pss+swap vs Time
[2024-05-16 23:15:45] 2024-05-17 03:15:25,669 | INFO     | model: linear, x: [1715884003.0, 1715884064.0, 1715884125.0, 1715884186.0, 1715884247.0, 1715884308.0, 1715884369.0, 1715884430.0, 1715884491.0, 1715884552.0, 1715
[2024-05-16 23:15:45] 2024-05-17 03:15:25,669 | INFO     | sum of square deviations: 43599953991.877
[2024-05-16 23:15:45] 2024-05-17 03:15:25,674 | INFO     | sum of deviations: 841821502246.3219
[2024-05-16 23:15:45] 2024-05-17 03:15:25,674 | INFO     | mean x: 1715899832.5153847
[2024-05-16 23:15:45] 2024-05-17 03:15:25,674 | INFO     | mean y: 2436540.5846153847
[2024-05-16 23:15:45] 2024-05-17 03:15:25,674 | INFO     | -- intersect: -33127905179.074795
[2024-05-16 23:15:45] 2024-05-17 03:15:25,674 | INFO     | intersect: -33127905179.074795
[2024-05-16 23:15:45] 2024-05-17 03:15:25,675 | INFO     | chi2: 15.918451146859
[2024-05-16 23:15:45] 2024-05-17 03:15:25,675 | INFO     | model: linear, x: [1715884003.0, 1715884064.0, 1715884125.0, 1715884186.0, 1715884247.0, 1715884308.0, 1715884369.0, 1715884430.0, 1715884491.0, 1715884552.0, 1715
[2024-05-16 23:15:45] 2024-05-17 03:15:25,675 | INFO     | sum of square deviations: 42354314471.875946
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | sum of deviations: 905970572949.8331
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | mean x: 1715899680.015534
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | mean y: 2444341.7708737864
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | -- intersect: -36701127315.854744
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | intersect: -36701127315.854744
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | chi2: 14.301741337592002
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | current chi2=14.301741337592002 (change=10.156200464176472 %)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,680 | INFO     | right removable region: 514
[2024-05-16 23:15:45] 2024-05-17 03:15:25,681 | INFO     | model: linear, x: [1715884308.0, 1715884369.0, 1715884430.0, 1715884491.0, 1715884552.0, 1715884613.0, 1715884674.0, 1715884735.0, 1715884796.0, 1715884857.0, 1715
[2024-05-16 23:15:45] 2024-05-17 03:15:25,681 | INFO     | sum of square deviations: 42354309591.875946
[2024-05-16 23:15:45] 2024-05-17 03:15:25,686 | INFO     | sum of deviations: 694001193005.3665
[2024-05-16 23:15:45] 2024-05-17 03:15:25,686 | INFO     | mean x: 1715899985.015534
[2024-05-16 23:15:45] 2024-05-17 03:15:25,686 | INFO     | mean y: 2454619.8291262137
[2024-05-16 23:15:45] 2024-05-17 03:15:25,686 | INFO     | -- intersect: -28113613099.21799
[2024-05-16 23:15:45] 2024-05-17 03:15:25,686 | INFO     | intersect: -28113613099.21799
[2024-05-16 23:15:45] 2024-05-17 03:15:25,687 | INFO     | chi2: 12.624887887955767
[2024-05-16 23:15:45] 2024-05-17 03:15:25,687 | INFO     | current chi2=12.624887887955767 (change=20.69022437244539 %)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,687 | INFO     | left removable region: 10
[2024-05-16 23:15:45] 2024-05-17 03:15:25,687 | INFO     | model: linear, x: [1715884613.0, 1715884674.0, 1715884735.0, 1715884796.0, 1715884857.0, 1715884918.0, 1715884979.0, 1715885040.0, 1715885101.0, 1715885162.0, 1715
[2024-05-16 23:15:45] 2024-05-17 03:15:25,687 | INFO     | sum of square deviations: 39697888179.873024
[2024-05-16 23:15:45] 2024-05-17 03:15:25,691 | INFO     | sum of deviations: 639360555691.5879
[2024-05-16 23:15:45] 2024-05-17 03:15:25,691 | INFO     | mean x: 1715899954.515873
[2024-05-16 23:15:45] 2024-05-17 03:15:25,691 | INFO     | mean y: 2477515.301587302
[2024-05-16 23:15:45] 2024-05-17 03:15:25,691 | INFO     | -- intersect: -27633217951.91125
[2024-05-16 23:15:45] 2024-05-17 03:15:25,691 | INFO     | intersect: -27633217951.91125
[2024-05-16 23:15:45] 2024-05-17 03:15:25,692 | INFO     | chi2: 9.041321880464984
[2024-05-16 23:15:45] 2024-05-17 03:15:25,692 | INFO     | -- intersect: -27633217951.91125
[2024-05-16 23:15:45] 2024-05-17 03:15:25,692 | INFO     | current memory leak: 16.11 B/s (using 504 data points, chi2=9.04)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | ..............................
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . Timing measurements:
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . get job = 0 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . initial setup = 4 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . payload setup = 39 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . stage-in = 0 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . payload execution = 31708 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . stage-out = 5 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | . log creation = 0 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,693 | INFO     | ..............................
[2024-05-16 23:15:45] 2024-05-17 03:15:25,763 | INFO     | 
[2024-05-16 23:15:45] 2024-05-17 03:15:25,767 | INFO     | job summary report
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | --------------------------------------------------
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | PanDA job id: 6207395569
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | task id: 38921001
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | errors: (none)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | status: LOG_TRANSFER = DONE 
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | pilot state: finished 
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | transexitcode: 0
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | exeerrorcode: 0
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | exeerrordiag: 
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | exitcode: 0
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | exitmsg: OK
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | cpuconsumptiontime: 113251 s
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | nevents: 400
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | neventsw: 0
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | pid: 37649
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | pgrp: 37649
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | corecount: 8
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | event service: False
[2024-05-16 23:15:45] 2024-05-17 03:15:25,768 | INFO     | sizes: {0: 2391986, 1: 2392185, 12: 2392185, 22: 2392213, 31: 2392447, 33: 2392475, 31751: 2430310, 31752: 2430309, 31757: 2439325, 31759: 2439381, 31800: 2439551}
[2024-05-16 23:15:45] 2024-05-17 03:15:25,769 | INFO     | --------------------------------------------------
[2024-05-16 23:15:45] 2024-05-17 03:15:25,769 | INFO     | 
[2024-05-16 23:15:45] 2024-05-17 03:15:25,769 | INFO     | executing command: ls -lF /tmp/boinchome/slots/0
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue jobs had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue payloads had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue data_in had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue data_out had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue current_data_in had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue validated_jobs had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue validated_payloads had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue monitored_payloads had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue finished_jobs had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue finished_payloads had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue finished_data_in had 1 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue finished_data_out had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,860 | INFO     | queue failed_jobs had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | queue failed_payloads had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | queue failed_data_in had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | queue failed_data_out had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | queue completed_jobs had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | queue completed_jobids has 1 job(s)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | queue realtimelog_payloads had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | queue messages had 0 job(s) [purged]
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | job 6207395569 has completed (purged errors)
[2024-05-16 23:15:45] 2024-05-17 03:15:25,861 | INFO     | overall cleanup function is called
[2024-05-16 23:15:45] 2024-05-17 03:15:26,894 | INFO     | --- collectZombieJob: --- 10, [37649]
[2024-05-16 23:15:45] 2024-05-17 03:15:26,894 | INFO     | zombie collector waiting for pid 37649
[2024-05-16 23:15:45] 2024-05-17 03:15:26,895 | INFO     | harmless exception when collecting zombies: [Errno 10] No child processes
[2024-05-16 23:15:45] 2024-05-17 03:15:27,900 | INFO     | collected zombie processes
[2024-05-16 23:15:45] 2024-05-17 03:15:27,900 | INFO     | will now attempt to kill all subprocesses of pid=37649
[2024-05-16 23:15:45] 2024-05-17 03:15:28,481 | INFO     | process IDs to be killed: [37649] (in reverse order)
[2024-05-16 23:15:45] 2024-05-17 03:15:28,915 | WARNING  | found no corresponding commands to process id(s)
[2024-05-16 23:15:45] 2024-05-17 03:15:28,916 | INFO     | Do not look for orphan processes in BOINC jobs
[2024-05-16 23:15:45] 2024-05-17 03:15:28,928 | INFO     | did not find any defunct processes belonging to 37649
[2024-05-16 23:15:45] 2024-05-17 03:15:28,941 | INFO     | did not find any defunct processes belonging to 37649
[2024-05-16 23:15:45] 2024-05-17 03:15:28,942 | INFO     | ready for new job
[2024-05-16 23:15:45] 2024-05-17 03:15:28,942 | INFO     | pilot has finished with previous job - re-establishing logging
[2024-05-16 23:15:45] 2024-05-17 03:15:28,944 | INFO     | *************************************
[2024-05-16 23:15:45] 2024-05-17 03:15:28,944 | INFO     | ***  PanDA Pilot version 3.7.5.4  ***
[2024-05-16 23:15:45] 2024-05-17 03:15:28,944 | INFO     | *************************************
[2024-05-16 23:15:45] 2024-05-17 03:15:28,944 | INFO     | 
[2024-05-16 23:15:45] 2024-05-17 03:15:28,949 | INFO     | architecture information:
[2024-05-16 23:15:45] 2024-05-17 03:15:28,949 | INFO     | executing command: cat /etc/os-release
[2024-05-16 23:15:45] 2024-05-17 03:15:29,062 | INFO     | cat /etc/os-release:
[2024-05-16 23:15:45] NAME="CentOS Linux"
[2024-05-16 23:15:45] VERSION="7 (Core)"
[2024-05-16 23:15:45] ID="centos"
[2024-05-16 23:15:45] ID_LIKE="rhel fedora"
[2024-05-16 23:15:45] VERSION_ID="7"
[2024-05-16 23:15:45] PRETTY_NAME="CentOS Linux 7 (Core)"
[2024-05-16 23:15:45] ANSI_COLOR="0;31"
[2024-05-16 23:15:45] CPE_NAME="cpe:/o:centos:centos:7"
[2024-05-16 23:15:45] HOME_URL="https://www.centos.org/"
[2024-05-16 23:15:45] BUG_REPORT_URL="https://bugs.centos.org/"
[2024-05-16 23:15:45] 
[2024-05-16 23:15:45] CENTOS_MANTISBT_PROJECT="CentOS-7"
[2024-05-16 23:15:45] CENTOS_MANTISBT_PROJECT_VERSION="7"
[2024-05-16 23:15:45] REDHAT_SUPPORT_PRODUCT="centos"
[2024-05-16 23:15:45] REDHAT_SUPPORT_PRODUCT_VERSION="7"
[2024-05-16 23:15:45] 
[2024-05-16 23:15:45] 2024-05-17 03:15:29,062 | INFO     | *************************************
[2024-05-16 23:15:45] 2024-05-17 03:15:29,569 | INFO     | executing command: df -mP /tmp/boinchome/slots/0
[2024-05-16 23:15:45] 2024-05-17 03:15:29,726 | INFO     | sufficient remaining disk space (98901688320 B)
[2024-05-16 23:15:45] 2024-05-17 03:15:29,726 | WARNING  | since timefloor is set to 0, pilot was only allowed to run one job
[2024-05-16 23:15:45] 2024-05-17 03:15:29,727 | WARNING  | setting graceful_stop since proceed_with_getjob() returned False (pilot will end)
[2024-05-16 23:15:45] 2024-05-17 03:15:29,727 | WARNING  | data:copytool_out:received graceful stop - abort after this iteration
[2024-05-16 23:15:45] 2024-05-17 03:15:29,727 | WARNING  | aborting monitor loop since graceful_stop has been set (timing out remaining threads)
[2024-05-16 23:15:45] 2024-05-17 03:15:29,727 | INFO     | found 0 job(s) in 20 queues
[2024-05-16 23:15:45] 2024-05-17 03:15:29,727 | WARNING  | pilot monitor received instruction that args.graceful_stop has been set
[2024-05-16 23:15:45] 2024-05-17 03:15:29,727 | WARNING  | will wait for a maximum of 300 s for threads to finish
[2024-05-16 23:15:45] 2024-05-17 03:15:30,377 | INFO     | all data control threads have been joined
[2024-05-16 23:15:45] 2024-05-17 03:15:30,510 | WARNING  | job:job_monitor:received graceful stop - abort after this iteration
[2024-05-16 23:15:45] 2024-05-17 03:15:30,510 | INFO     | aborting loop
[2024-05-16 23:15:45] 2024-05-17 03:15:30,733 | INFO     | [job] retrieve thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:30,777 | WARNING  | job:queue_monitor:received graceful stop - abort after this iteration
[2024-05-16 23:15:45] 2024-05-17 03:15:30,818 | INFO     | all payload control threads have been joined
[2024-05-16 23:15:45] 2024-05-17 03:15:31,125 | INFO     | all job control threads have been joined
[2024-05-16 23:15:45] 2024-05-17 03:15:31,389 | INFO     | [data] control thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:31,516 | INFO     | [job] job monitor thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:31,579 | INFO     | [payload] run_realtimelog thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:31,583 | WARNING  | data:queue_monitoring:received graceful stop - abort after this iteration
[2024-05-16 23:15:45] 2024-05-17 03:15:31,733 | INFO     | [data] copytool_out thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:31,801 | INFO     | [job] queue monitor thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:31,825 | INFO     | [payload] control thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,098 | INFO     | [payload] validate_pre thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,107 | INFO     | [payload] execute_payloads thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,109 | INFO     | [data] copytool_in thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,112 | INFO     | [job] create_data_payload thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,116 | INFO     | [job] validate thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,131 | INFO     | [job] control thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,213 | INFO     | [payload] failed_post thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:32,338 | INFO     | [payload] validate_post thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:35,593 | INFO     | [data] queue_monitor thread has finished
[2024-05-16 23:15:45] 2024-05-17 03:15:37,223 | INFO     | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 139975943079744)>', '<ExcThread(monitor, started 139974997440256)>']
[2024-05-16 23:15:45] 2024-05-17 03:15:37,774 | WARNING  | job_aborted has been set - aborting pilot monitoring
[2024-05-16 23:15:45] 2024-05-17 03:15:37,775 | INFO     | [monitor] control thread has ended
[2024-05-16 23:15:45] 2024-05-17 03:15:42,255 | INFO     | all workflow threads have been joined
[2024-05-16 23:15:45] 2024-05-17 03:15:42,256 | INFO     | end of generic workflow (traces error code: 0)
[2024-05-16 23:15:45] 2024-05-17 03:15:42,259 | INFO     | traces error code: 0
[2024-05-16 23:15:45] 2024-05-17 03:15:42,260 | INFO     | pilot has finished (exit code=0, shell exit code=0)
[2024-05-16 23:15:45] 2024-05-17 03:15:42,887 [wrapper] ==== pilot stdout END ====
[2024-05-16 23:15:45] 2024-05-17 03:15:42,893 [wrapper] ==== wrapper stdout RESUME ====
[2024-05-16 23:15:45] 2024-05-17 03:15:42,900 [wrapper] pilotpid: 2464
[2024-05-16 23:15:45] 2024-05-17 03:15:42,932 [wrapper] Pilot exit status: 0
[2024-05-16 23:15:45] 2024-05-17 03:15:43,077 [wrapper] pandaids: 6207395569
[2024-05-16 23:15:45] 2024-05-17 03:15:43,408 [wrapper] cleanup: SIGTERM to supervisor_pilot 55955 2465
[2024-05-16 23:15:45] 2024-05-17 03:15:43,448 [wrapper] Test setup, not cleaning
[2024-05-16 23:15:45] 2024-05-17 03:15:43,484 [wrapper] ==== wrapper stdout END ====
[2024-05-16 23:15:45] 2024-05-17 03:15:43,570 [wrapper] ==== wrapper stderr END ====
[2024-05-16 23:15:45] 2024-05-17 03:15:43,691 [wrapper] apfmon messages muted
[2024-05-16 23:15:45]  *** Error codes and diagnostics ***
[2024-05-16 23:15:45]     "exeErrorCode": 0,
[2024-05-16 23:15:45]     "exeErrorDiag": "",
[2024-05-16 23:15:45]     "pilotErrorCode": 0,
[2024-05-16 23:15:45]     "pilotErrorDiag": "",
[2024-05-16 23:15:45]  *** Listing of results directory ***
[2024-05-16 23:15:45] total 960864
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas    467845 May 16 10:20 pilot3.tar.gz
[2024-05-16 23:15:45] -rwx------ 1 boincer umatlas     32251 May 16 10:44 runpilot2-wrapper.sh
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas      5633 May 16 10:44 queuedata.json
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas       100 May 16 14:22 wrapper_26015_x86_64-pc-linux-gnu
[2024-05-16 23:15:45] -rwxr-xr-x 1 boincer umatlas      7986 May 16 14:22 run_atlas
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas       105 May 16 14:22 job.xml
[2024-05-16 23:15:45] -rw-r--r-- 2 boincer umatlas 615768110 May 16 14:22 EVNT.38776190._000035.pool.root.1
[2024-05-16 23:15:45] -rw-r--r-- 2 boincer umatlas     17537 May 16 14:22 start_atlas.sh
[2024-05-16 23:15:45] drwxrwx--x 2 boincer umatlas      4096 May 16 14:22 shared
[2024-05-16 23:15:45] -rw-r--r-- 2 boincer umatlas    479999 May 16 14:22 input.tar.gz
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas         0 May 16 14:22 boinc_lockfile
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas      2649 May 16 14:22 pandaJob.out
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas       467 May 16 14:23 setup.sh.local
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas   1018966 May 16 14:24 agis_schedconf.cvmfs.json
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas   1324701 May 16 14:24 cric_ddmendpoints.json
[2024-05-16 23:15:45] drwx------ 4 boincer umatlas      4096 May 16 14:24 pilot3
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas      6389 May 16 23:09 init_data.xml
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas 357130056 May 16 23:12 HITS.38921001._000968.pool.root.1
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas       533 May 16 23:12 boinc_task_state.xml
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas      1095 May 16 23:14 memory_monitor_summary.json
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas         0 May 16 23:14 agis_ddmendpoints.agis.ALL.json
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas    603432 May 16 23:14 log.38921001._000968.job.log.tgz.1
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas        95 May 16 23:14 pilot_heartbeat.json
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas      7773 May 16 23:15 heartbeat.json
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas        29 May 16 23:15 wrapper_checkpoint.txt
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas      8192 May 16 23:15 boinc_mmap_file
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas      4334 May 16 23:15 pilotlog.txt
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas   3141083 May 16 23:15 log.38921001._000968.job.log.1
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas       357 May 16 23:15 output.list
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas       620 May 16 23:15 runtime_log
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas   3768320 May 16 23:15 result.tar.gz
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas     11400 May 16 23:15 runtime_log.err
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas       653 May 16 23:15 3UmKDmDWrR5n9Rq4apoT9bVoABFKDmABFKDmlqFKDmiOKKDmUf2wZm.diag
[2024-05-16 23:15:45] -rw-r--r-- 1 boincer umatlas     21940 May 16 23:15 stderr.txt
[2024-05-16 23:15:45] HITS file was successfully produced:
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas 357130056 May 16 23:12 shared/HITS.pool.root.1
[2024-05-16 23:15:45]  *** Contents of shared directory: ***
[2024-05-16 23:15:45] total 954280
[2024-05-16 23:15:45] -rw-r--r-- 2 boincer umatlas 615768110 May 16 14:22 ATLAS.root_0
[2024-05-16 23:15:45] -rw-r--r-- 2 boincer umatlas     17537 May 16 14:22 start_atlas.sh
[2024-05-16 23:15:45] -rw-r--r-- 2 boincer umatlas    479999 May 16 14:22 input.tar.gz
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas 357130056 May 16 23:12 HITS.pool.root.1
[2024-05-16 23:15:45] -rw------- 1 boincer umatlas   3768320 May 16 23:15 result.tar.gz
23:15:47 (68850): run_atlas exited; CPU time 110894.705614
23:15:47 (68850): called boinc_finish(0)

</stderr_txt>
]]>


©2024 CERN