Name | LPXLDm9UDG6nsSi4ap6QjLDmwznN0nGgGQJmDC4LDmFYnKDmHqDwQo_0 |
Workunit | 225776204 |
Created | 2 Oct 2024, 12:24:51 UTC |
Sent | 2 Oct 2024, 12:28:16 UTC |
Report deadline | 10 Oct 2024, 12:28:16 UTC |
Received | 3 Oct 2024, 19:56:02 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 10852171 |
Run time | 2 hours 47 min 45 sec |
CPU time | 10 hours 17 min 17 sec |
Validate state | Valid |
Credit | 397.12 |
Device peak FLOPS | 8.00 GFLOPS |
Application version | ATLAS Simulation v3.01 (native_mt) x86_64-pc-linux-gnu |
Peak working set size | 2.30 GB |
Peak swap size | 14.27 GB |
Peak disk usage | 995.25 MB |
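As a rough reading of the run-time and CPU-time figures above, the sketch below computes how much of the available thread-time was spent on CPU work. It assumes the common definition CPU time / (run time × threads) and takes the thread count of 8 from `--nthreads 8` in the task log; `cpu_efficiency` is a hypothetical helper and not something BOINC or the project reports.

```python
from datetime import timedelta

# Values copied from the task summary; the thread count is taken from
# "--nthreads 8" in the task log (an assumption about how to pair them).
run_time = timedelta(hours=2, minutes=47, seconds=45)
cpu_time = timedelta(hours=10, minutes=17, seconds=17)
threads = 8


def cpu_efficiency(cpu, wall, n_threads):
    """Fraction of available thread-time spent on CPU work (hypothetical helper)."""
    return cpu.total_seconds() / (wall.total_seconds() * n_threads)


efficiency = cpu_efficiency(cpu_time, run_time, threads)
print(f"run time: {run_time.total_seconds():.0f} s")   # 10065 s
print(f"CPU time: {cpu_time.total_seconds():.0f} s")   # 37037 s
print(f"CPU efficiency: {efficiency:.1%}")             # 46.0%
```

Under these assumptions the task used a little under half of the thread-time nominally available to it.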
<core_client_version>7.20.2</core_client_version>
<![CDATA[
<stderr_txt>
16:48:36 (16278): wrapper (7.7.26015): starting
16:48:36 (16278): wrapper: running run_atlas (--nthreads 8)
[2024-10-03 16:48:36] Arguments: --nthreads 8
[2024-10-03 16:48:36] Threads: 8
[2024-10-03 16:48:36] Checking for CVMFS
[2024-10-03 16:48:37] Probing /cvmfs/atlas.cern.ch... OK
[2024-10-03 16:48:38] Probing /cvmfs/atlas-condb.cern.ch... OK
[2024-10-03 16:48:38] Running cvmfs_config stat atlas.cern.ch
[2024-10-03 16:48:39] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2024-10-03 16:48:39] 2.11.2.0 20139 54379 73140 137930 0 59 14687094 25165824 4517 130560 0 37322395 99.954 19390970 8295 http://cvmfs-stratum-one.cern.ch:8000/cvmfs/atlas.cern.ch http://192.168.73.11:3128/ 1
[2024-10-03 16:48:39] CVMFS is ok
[2024-10-03 16:48:39] Efficiency of ATLAS tasks can be improved by the following measure(s):
[2024-10-03 16:48:39] The CVMFS client on this computer should be configured to use Cloudflare's openhtc.io.
[2024-10-03 16:48:39] Further information can be found at the LHC@home message board.
[2024-10-03 16:48:39] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7
[2024-10-03 16:48:39] Checking for apptainer binary...
[2024-10-03 16:48:39] Using apptainer found in PATH at /usr/bin/apptainer [2024-10-03 16:48:39] Running /usr/bin/apptainer --version [2024-10-03 16:48:39] apptainer version 1.3.2-1.el7 [2024-10-03 16:48:39] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname [2024-10-03 17:19:01] cg-vwn363.simple-grid.lan [2024-10-03 17:19:01] apptainer works [2024-10-03 17:19:01] Set ATHENA_PROC_NUMBER=8 [2024-10-03 17:19:01] Set ATHENA_CORE_NUMBER=8 [2024-10-03 17:19:02] Starting ATLAS job with PandaID=6354136299 [2024-10-03 17:19:02] Running command: /usr/bin/apptainer exec -B /cvmfs,/var/lib/boinc/slots/4 /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 sh start_atlas.sh [2024-10-03 20:54:11] *** The last 200 lines of the pilot log: *** [2024-10-03 20:54:11] 2024-10-03 18:31:13,231 | INFO | model: linear, x: [1727969380.0, 1727969441.0, 1727969502.0, 1727969563.0, 1727969624.0, 1727969685.0, 1727969746.0, 1727969807.0, 1727969868.0, 1727969929.0, 1727 [2024-10-03 20:54:11] 2024-10-03 18:31:13,232 | INFO | sum of square deviations: 1577789583.0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,233 | INFO | sum of deviations: 123658867885.50003 [2024-10-03 20:54:11] 2024-10-03 18:31:13,233 | INFO | mean x: 1727974595.5 [2024-10-03 20:54:11] 2024-10-03 18:31:13,233 | INFO | mean y: 2365425.9360465114 [2024-10-03 20:54:11] 2024-10-03 18:31:13,233 | INFO | -- intersect: -135427215626.40309 [2024-10-03 20:54:11] 2024-10-03 18:31:13,233 | INFO | intersect: -135427215626.40309 [2024-10-03 20:54:11] 2024-10-03 18:31:13,234 | INFO | chi2: 6.802319543696328 [2024-10-03 20:54:11] 2024-10-03 18:31:13,234 | INFO | current chi2=6.802319543696328 (change=10.036595085269278 %) [2024-10-03 20:54:11] 2024-10-03 18:31:13,234 | INFO | right removable region: 171 [2024-10-03 20:54:11] 2024-10-03 18:31:13,234 | INFO | model: linear, x: [1727969685.0, 1727969746.0, 1727969807.0, 1727969868.0, 
1727969929.0, 1727969990.0, 1727970051.0, 1727970112.0, 1727970173.0, 1727970234.0, 1727 [2024-10-03 20:54:11] 2024-10-03 18:31:13,234 | INFO | sum of square deviations: 1577789583.0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,236 | INFO | sum of deviations: 62939378703.500015 [2024-10-03 20:54:11] 2024-10-03 18:31:13,236 | INFO | mean x: 1727974900.5 [2024-10-03 20:54:11] 2024-10-03 18:31:13,236 | INFO | mean y: 2413786.656976744 [2024-10-03 20:54:11] 2024-10-03 18:31:13,236 | INFO | -- intersect: -68927985947.58455 [2024-10-03 20:54:11] 2024-10-03 18:31:13,236 | INFO | intersect: -68927985947.58455 [2024-10-03 20:54:11] 2024-10-03 18:31:13,237 | INFO | chi2: 3.6385690492500684 [2024-10-03 20:54:11] 2024-10-03 18:31:13,237 | INFO | current chi2=3.6385690492500684 (change=51.87846460532231 %) [2024-10-03 20:54:11] 2024-10-03 18:31:13,237 | INFO | model: linear, x: [1727969990.0, 1727970051.0, 1727970112.0, 1727970173.0, 1727970234.0, 1727970295.0, 1727970356.0, 1727970417.0, 1727970478.0, 1727970539.0, 1727 [2024-10-03 20:54:11] 2024-10-03 18:31:13,237 | INFO | sum of square deviations: 1444149868.0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,239 | INFO | sum of deviations: 30416935811.999992 [2024-10-03 20:54:11] 2024-10-03 18:31:13,239 | INFO | mean x: 1727975053.0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,239 | INFO | mean y: 2450866.1856287424 [2024-10-03 20:54:11] 2024-10-03 18:31:13,239 | INFO | -- intersect: -36392460379.85292 [2024-10-03 20:54:11] 2024-10-03 18:31:13,239 | INFO | intersect: -36392460379.85292 [2024-10-03 20:54:11] 2024-10-03 18:31:13,240 | INFO | chi2: 2.3140691829458664 [2024-10-03 20:54:11] 2024-10-03 18:31:13,240 | INFO | current chi2=2.3140691829458664 (change=36.40166912806532 %) [2024-10-03 20:54:11] 2024-10-03 18:31:13,240 | INFO | model: linear, x: [1727970295.0, 1727970356.0, 1727970417.0, 1727970478.0, 1727970539.0, 1727970600.0, 1727970661.0, 1727970722.0, 1727970783.0, 1727970844.0, 1727 [2024-10-03 20:54:11] 2024-10-03 18:31:13,240 
| INFO | sum of square deviations: 1318277740.5 [2024-10-03 20:54:11] 2024-10-03 18:31:13,241 | INFO | sum of deviations: -1807311019.499998 [2024-10-03 20:54:11] 2024-10-03 18:31:13,241 | INFO | mean x: 1727975205.5 [2024-10-03 20:54:11] 2024-10-03 18:31:13,242 | INFO | mean y: 2489853.6481481483 [2024-10-03 20:54:11] 2024-10-03 18:31:13,242 | INFO | -- intersect: 2371481253.8506794 [2024-10-03 20:54:11] 2024-10-03 18:31:13,242 | INFO | intersect: 2371481253.8506794 [2024-10-03 20:54:11] 2024-10-03 18:31:13,242 | INFO | chi2: 0.9109103422885911 [2024-10-03 20:54:11] 2024-10-03 18:31:13,242 | INFO | current chi2=0.9109103422885911 (change=60.635993556209066 %) [2024-10-03 20:54:11] 2024-10-03 18:31:13,243 | INFO | model: linear, x: [1727970600.0, 1727970661.0, 1727970722.0, 1727970783.0, 1727970844.0, 1727970905.0, 1727970966.0, 1727971027.0, 1727971088.0, 1727971149.0, 1727 [2024-10-03 20:54:11] 2024-10-03 18:31:13,243 | INFO | sum of square deviations: 1199940638.0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,244 | INFO | sum of deviations: -8543573685.0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,244 | INFO | mean x: 1727975358.0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,244 | INFO | mean y: 2498412.7579617836 [2024-10-03 20:54:11] 2024-10-03 18:31:13,244 | INFO | -- intersect: 12305677694.646116 [2024-10-03 20:54:11] 2024-10-03 18:31:13,244 | INFO | intersect: 12305677694.646116 [2024-10-03 20:54:11] 2024-10-03 18:31:13,245 | INFO | chi2: 0.8114981311985365 [2024-10-03 20:54:11] 2024-10-03 18:31:13,245 | INFO | current chi2=0.8114981311985365 (change=10.913501194891385 %) [2024-10-03 20:54:11] 2024-10-03 18:31:13,245 | INFO | left removable region: 40 [2024-10-03 20:54:11] 2024-10-03 18:31:13,245 | INFO | model: linear, x: [1727971820.0, 1727971881.0, 1727971942.0, 1727972003.0, 1727972064.0, 1727972125.0, 1727972186.0, 1727972247.0, 1727972308.0, 1727972369.0, 1727 [2024-10-03 20:54:11] 2024-10-03 18:31:13,245 | INFO | sum of square deviations: 697054930.0 
[2024-10-03 20:54:11] 2024-10-03 18:31:13,246 | INFO | sum of deviations: 2604206265.9999995
[2024-10-03 20:54:11] 2024-10-03 18:31:13,246 | INFO | mean x: 1727975785.0
[2024-10-03 20:54:11] 2024-10-03 18:31:13,247 | INFO | mean y: 2511492.3740458013
[2024-10-03 20:54:11] 2024-10-03 18:31:13,247 | INFO | -- intersect: -6453228468.884485
[2024-10-03 20:54:11] 2024-10-03 18:31:13,247 | INFO | intersect: -6453228468.884485
[2024-10-03 20:54:11] 2024-10-03 18:31:13,247 | INFO | chi2: 0.0007307409933873609
[2024-10-03 20:54:11] 2024-10-03 18:31:13,247 | INFO | -- intersect: -6453228468.884485
[2024-10-03 20:54:11] 2024-10-03 18:31:13,247 | INFO | current memory leak: 3.74 B/s (using 131 data points, chi2=0.00)
[2024-10-03 20:54:11] 2024-10-03 18:31:13,248 | INFO | ..............................
[2024-10-03 20:54:11] 2024-10-03 18:31:13,248 | INFO | . Timing measurements:
[2024-10-03 20:54:11] 2024-10-03 18:31:13,248 | INFO | . get job = 0 s
[2024-10-03 20:54:11] 2024-10-03 18:31:13,249 | INFO | . initial setup = 6 s
[2024-10-03 20:54:11] 2024-10-03 18:31:13,249 | INFO | . payload setup = 128 s
[2024-10-03 20:54:11] 2024-10-03 18:31:13,249 | INFO | . stage-in = 0 s
[2024-10-03 20:54:11] 2024-10-03 18:31:13,249 | INFO | . payload execution = 10947 s
[2024-10-03 20:54:11] 2024-10-03 18:31:13,249 | INFO | . stage-out = 3 s
[2024-10-03 20:54:11] 2024-10-03 18:31:13,249 | INFO | . log creation = 1 s
[2024-10-03 20:54:11] 2024-10-03 18:31:13,249 | INFO | ..............................
[2024-10-03 20:54:11] 2024-10-03 18:31:13,400 | INFO | [2024-10-03 20:54:11] 2024-10-03 18:31:13,400 | INFO | job summary report [2024-10-03 20:54:11] 2024-10-03 18:31:13,400 | INFO | -------------------------------------------------- [2024-10-03 20:54:11] 2024-10-03 18:31:13,400 | INFO | PanDA job id: 6354136299 [2024-10-03 20:54:11] 2024-10-03 18:31:13,400 | INFO | task id: 41079461 [2024-10-03 20:54:11] 2024-10-03 18:31:13,400 | INFO | errors: (none) [2024-10-03 20:54:11] 2024-10-03 18:31:13,400 | INFO | status: LOG_TRANSFER = DONE [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | pilot state: finished [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | transexitcode: 0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | exeerrorcode: 0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | exeerrordiag: [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | exitcode: 0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | exitmsg: OK [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | cpuconsumptiontime: 37058 s [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | nevents: 400 [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | neventsw: 0 [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | pid: 24223 [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | pgrp: 24223 [2024-10-03 20:54:11] 2024-10-03 18:31:13,401 | INFO | corecount: 8 [2024-10-03 20:54:11] 2024-10-03 18:31:13,402 | INFO | event service: False [2024-10-03 20:54:11] 2024-10-03 18:31:13,402 | INFO | sizes: {0: 2435262, 1: 2435517, 12: 2435517, 23: 2435545, 34: 2435573, 44: 2435601, 55: 2435757, 66: 2435785, 77: 2435813, 87: 2436103, 11080: 2464862, 11081: 2465 [2024-10-03 20:54:11] 2024-10-03 18:31:13,402 | INFO | -------------------------------------------------- [2024-10-03 20:54:11] 2024-10-03 18:31:13,402 | INFO | [2024-10-03 20:54:11] 2024-10-03 18:31:13,402 | INFO | executing command: ls -lF /var/lib/boinc/slots/4 [2024-10-03 20:54:11] 2024-10-03 18:31:13,909 | INFO | 
queue jobs had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,909 | INFO | queue payloads had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,909 | INFO | queue data_in had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,909 | INFO | queue data_out had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,909 | INFO | queue current_data_in had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,909 | INFO | queue validated_jobs had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,909 | INFO | queue validated_payloads had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue monitored_payloads had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue finished_jobs had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue finished_payloads had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue finished_data_in had 1 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue finished_data_out had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue failed_jobs had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue failed_payloads had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue failed_data_in had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue failed_data_out had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue completed_jobs had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,910 | INFO | queue completed_jobids has 1 job(s) [2024-10-03 20:54:11] 2024-10-03 18:31:13,911 | INFO | queue realtimelog_payloads had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,911 | INFO | queue messages had 0 job(s) [purged] [2024-10-03 20:54:11] 2024-10-03 18:31:13,911 | INFO | job 6354136299 has completed (purged errors) [2024-10-03 
20:54:11] 2024-10-03 18:31:13,911 | INFO | overall cleanup function is called [2024-10-03 20:54:11] 2024-10-03 18:31:14,925 | INFO | --- collectZombieJob: --- 10, [24223] [2024-10-03 20:54:11] 2024-10-03 18:31:14,926 | INFO | zombie collector waiting for pid 24223 [2024-10-03 20:54:11] 2024-10-03 18:31:14,926 | INFO | harmless exception when collecting zombies: [Errno 10] No child processes [2024-10-03 20:54:11] 2024-10-03 18:31:14,926 | INFO | collected zombie processes [2024-10-03 20:54:11] 2024-10-03 18:31:14,926 | INFO | will attempt to kill all subprocesses of pid=24223 [2024-10-03 20:54:11] 2024-10-03 18:31:15,564 | INFO | process IDs to be killed: [24223] (in reverse order) [2024-10-03 20:54:11] 2024-10-03 18:31:16,165 | WARNING | found no corresponding commands to process id(s) [2024-10-03 20:54:11] 2024-10-03 18:31:16,165 | INFO | Do not look for orphan processes in BOINC jobs [2024-10-03 20:54:11] 2024-10-03 18:31:16,169 | INFO | did not find any defunct processes belonging to 24223 [2024-10-03 20:54:11] 2024-10-03 18:31:16,172 | INFO | did not find any defunct processes belonging to 24223 [2024-10-03 20:54:11] 2024-10-03 18:31:16,172 | INFO | ready for new job [2024-10-03 20:54:11] 2024-10-03 18:31:16,172 | INFO | pilot has finished with previous job - re-establishing logging [2024-10-03 20:54:11] 2024-10-03 18:31:16,174 | INFO | ************************************* [2024-10-03 20:54:11] 2024-10-03 18:31:16,174 | INFO | *** PanDA Pilot version 3.8.2.8 *** [2024-10-03 20:54:11] 2024-10-03 18:31:16,174 | INFO | ************************************* [2024-10-03 20:54:11] 2024-10-03 18:31:16,174 | INFO | [2024-10-03 20:54:11] 2024-10-03 18:31:16,175 | INFO | pilot is running in a VM [2024-10-03 20:54:11] 2024-10-03 18:31:16,175 | INFO | architecture information: [2024-10-03 20:54:11] 2024-10-03 18:31:16,176 | INFO | executing command: cat /etc/os-release [2024-10-03 20:54:11] 2024-10-03 18:31:16,390 | INFO | cat /etc/os-release: [2024-10-03 20:54:11] 
NAME="CentOS Linux" [2024-10-03 20:54:11] VERSION="7 (Core)" [2024-10-03 20:54:11] ID="centos" [2024-10-03 20:54:11] ID_LIKE="rhel fedora" [2024-10-03 20:54:11] VERSION_ID="7" [2024-10-03 20:54:11] PRETTY_NAME="CentOS Linux 7 (Core)" [2024-10-03 20:54:11] ANSI_COLOR="0;31" [2024-10-03 20:54:11] CPE_NAME="cpe:/o:centos:centos:7" [2024-10-03 20:54:11] HOME_URL="https://www.centos.org/" [2024-10-03 20:54:11] BUG_REPORT_URL="https://bugs.centos.org/" [2024-10-03 20:54:11] [2024-10-03 20:54:11] CENTOS_MANTISBT_PROJECT="CentOS-7" [2024-10-03 20:54:11] CENTOS_MANTISBT_PROJECT_VERSION="7" [2024-10-03 20:54:11] REDHAT_SUPPORT_PRODUCT="centos" [2024-10-03 20:54:11] REDHAT_SUPPORT_PRODUCT_VERSION="7" [2024-10-03 20:54:11] [2024-10-03 20:54:11] 2024-10-03 18:31:16,390 | INFO | ************************************* [2024-10-03 20:54:11] 2024-10-03 18:31:16,893 | INFO | executing command: df -mP /var/lib/boinc/slots/4 [2024-10-03 20:54:11] 2024-10-03 18:31:17,134 | INFO | sufficient remaining disk space (486872711168 B) [2024-10-03 20:54:11] 2024-10-03 18:31:17,134 | WARNING | since timefloor is set to 0, pilot was only allowed to run one job [2024-10-03 20:54:11] 2024-10-03 18:31:17,134 | WARNING | setting graceful_stop since proceed_with_getjob() returned False (pilot will end) [2024-10-03 20:54:11] 2024-10-03 18:31:17,375 | WARNING | job monitor detected an abort_job request (signal=args.signal) [2024-10-03 20:54:11] 2024-10-03 18:31:17,375 | WARNING | cannot recover job monitoring - aborting pilot [2024-10-03 20:54:11] 2024-10-03 18:31:17,376 | WARNING | job:job_monitor:received graceful stop - abort after this iteration [2024-10-03 20:54:11] 2024-10-03 18:31:17,376 | INFO | will abort loop [2024-10-03 20:54:11] 2024-10-03 18:31:17,452 | WARNING | data:copytool_out:received graceful stop - abort after this iteration [2024-10-03 20:54:11] 2024-10-03 18:31:17,635 | INFO | all data control threads have been joined [2024-10-03 20:54:11] 2024-10-03 18:31:17,691 | INFO | found 0 
job(s) in 20 queues [2024-10-03 20:54:11] 2024-10-03 18:31:17,692 | WARNING | pilot monitor received instruction that args.graceful_stop has been set [2024-10-03 20:54:11] 2024-10-03 18:31:17,692 | WARNING | will wait for a maximum of 300 s for threads to finish [2024-10-03 20:54:11] 2024-10-03 18:31:18,085 | INFO | all job control threads have been joined [2024-10-03 20:54:11] 2024-10-03 18:31:18,142 | INFO | [job] retrieve thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:18,155 | INFO | [data] copytool_in thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:18,163 | WARNING | data:queue_monitoring:received graceful stop - abort after this iteration [2024-10-03 20:54:11] 2024-10-03 18:31:18,353 | INFO | [payload] validate_post thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:18,380 | INFO | [job] job monitor thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:18,415 | WARNING | job:queue_monitor:received graceful stop - abort after this iteration [2024-10-03 20:54:11] 2024-10-03 18:31:18,486 | INFO | [job] validate thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:18,551 | INFO | all payload control threads have been joined [2024-10-03 20:54:11] 2024-10-03 18:31:18,640 | INFO | [data] control thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:18,663 | INFO | [payload] execute_payloads thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:18,867 | INFO | [payload] validate_pre thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:19,091 | INFO | [job] control thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:19,204 | INFO | [job] create_data_payload thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:19,422 | INFO | [job] queue monitor thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:19,459 | INFO | [data] copytool_out thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:19,470 | INFO | [payload] run_realtimelog thread has finished [2024-10-03 20:54:11] 2024-10-03 
18:31:19,557 | INFO | [payload] control thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:19,627 | INFO | [payload] failed_post thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:22,172 | INFO | [data] queue_monitor thread has finished [2024-10-03 20:54:11] 2024-10-03 18:31:23,446 | INFO | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140620919482176)>', '<ExcThread(monitor, started 140620492424960)>'] [2024-10-03 20:54:11] 2024-10-03 18:31:23,727 | WARNING | job_aborted has been set - aborting pilot monitoring [2024-10-03 20:54:11] 2024-10-03 18:31:23,728 | INFO | [monitor] control thread has ended [2024-10-03 20:54:11] 2024-10-03 18:31:28,472 | INFO | all workflow threads have been joined [2024-10-03 20:54:11] 2024-10-03 18:31:28,473 | INFO | end of generic workflow (traces error code: 0) [2024-10-03 20:54:11] 2024-10-03 18:31:28,473 | INFO | traces error code: 0 [2024-10-03 20:54:11] 2024-10-03 18:31:28,473 | INFO | pilot has finished (exit code=0, shell exit code=0) [2024-10-03 20:54:11] 2024-10-03 18:31:28,830 [wrapper] ==== pilot stdout END ==== [2024-10-03 20:54:11] 2024-10-03 18:31:28,925 [wrapper] ==== wrapper stdout RESUME ==== [2024-10-03 20:54:11] 2024-10-03 18:31:28,966 [wrapper] pilotpid: 16384 [2024-10-03 20:54:11] 2024-10-03 18:31:29,038 [wrapper] Pilot exit status: 0 [2024-10-03 20:54:11] 2024-10-03 18:31:29,423 [wrapper] pandaids: 6354136299 [2024-10-03 20:54:11] 2024-10-03 18:31:30,092 [wrapper] cleanup supervisor_pilot 30050 16385 [2024-10-03 20:54:11] 2024-10-03 18:31:30,191 [wrapper] Test setup, not cleaning [2024-10-03 20:54:11] 2024-10-03 18:31:30,306 [wrapper] ==== wrapper stdout END ==== [2024-10-03 20:54:11] 2024-10-03 18:31:30,414 [wrapper] ==== wrapper stderr END ==== [2024-10-03 20:54:11] 2024-10-03 18:31:30,572 [wrapper] apfmon messages muted [2024-10-03 20:54:11] *** Error codes and diagnostics *** [2024-10-03 20:54:11] "exeErrorCode": 0, [2024-10-03 20:54:11] 
"exeErrorDiag": "", [2024-10-03 20:54:11] "pilotErrorCode": 0, [2024-10-03 20:54:11] "pilotErrorDiag": "", [2024-10-03 20:54:11] *** Listing of results directory *** [2024-10-03 20:54:11] total 583504 [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 484599 Oct 2 14:18 pilot3.tar.gz [2024-10-03 20:54:11] -rwx------. 1 boinc boinc 33814 Oct 2 14:19 runpilot2-wrapper.sh [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 5653 Oct 2 14:22 queuedata.json [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 100 Oct 3 16:48 wrapper_26015_x86_64-pc-linux-gnu [2024-10-03 20:54:11] -rwxr-xr-x. 1 boinc boinc 7986 Oct 3 16:48 run_atlas [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 105 Oct 3 16:48 job.xml [2024-10-03 20:54:11] -rw-r--r--. 2 boinc boinc 441778985 Oct 3 16:48 EVNT.41079459._000008.pool.root.1 [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 6347 Oct 3 16:48 init_data.xml [2024-10-03 20:54:11] drwxrwx--x. 2 boinc boinc 68 Oct 3 16:48 shared [2024-10-03 20:54:11] -rw-r--r--. 2 boinc boinc 498172 Oct 3 16:48 input.tar.gz [2024-10-03 20:54:11] -rw-r--r--. 2 boinc boinc 17537 Oct 3 16:48 start_atlas.sh [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 0 Oct 3 16:48 boinc_lockfile [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 2574 Oct 3 17:19 pandaJob.out [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 424 Oct 3 17:19 setup.sh.local [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 1024549 Oct 3 17:21 agis_schedconf.cvmfs.json [2024-10-03 20:54:11] drwx------. 4 boinc boinc 4096 Oct 3 17:21 pilot3 [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 149329420 Oct 3 20:28 HITS.41079461._000182.pool.root.1 [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 532 Oct 3 20:28 boinc_task_state.xml [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 1017 Oct 3 20:29 memory_monitor_summary.json [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 1615815 Oct 3 20:29 agis_ddmendpoints.agis.ALL.json [2024-10-03 20:54:11] -rw-------. 
1 boinc boinc 347095 Oct 3 20:29 log.41079461._000182.job.log.tgz.1 [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 95 Oct 3 20:31 pilot_heartbeat.json [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 7688 Oct 3 20:31 heartbeat.json [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 4460 Oct 3 20:31 pilotlog.txt [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 926044 Oct 3 20:31 log.41079461._000182.job.log.1 [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 357 Oct 3 20:31 output.list [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 620 Oct 3 20:31 runtime_log [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 1290240 Oct 3 20:31 result.tar.gz [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 10999 Oct 3 20:31 runtime_log.err [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 645 Oct 3 20:31 LPXLDm9UDG6nsSi4ap6QjLDmwznN0nGgGQJmDC4LDmFYnKDmHqDwQo.diag [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 8192 Oct 3 20:54 boinc_mmap_file [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 28 Oct 3 20:54 wrapper_checkpoint.txt [2024-10-03 20:54:11] -rw-r--r--. 1 boinc boinc 21587 Oct 3 20:54 stderr.txt [2024-10-03 20:54:11] HITS file was successfully produced: [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 149329420 Oct 3 20:28 shared/HITS.pool.root.1 [2024-10-03 20:54:11] *** Contents of shared directory: *** [2024-10-03 20:54:11] total 579028 [2024-10-03 20:54:11] -rw-r--r--. 2 boinc boinc 441778985 Oct 3 16:48 ATLAS.root_0 [2024-10-03 20:54:11] -rw-r--r--. 2 boinc boinc 498172 Oct 3 16:48 input.tar.gz [2024-10-03 20:54:11] -rw-r--r--. 2 boinc boinc 17537 Oct 3 16:48 start_atlas.sh [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 149329420 Oct 3 20:28 HITS.pool.root.1 [2024-10-03 20:54:11] -rw-------. 1 boinc boinc 1290240 Oct 3 20:31 result.tar.gz 20:54:13 (16278): run_atlas exited; CPU time 37030.276932 20:54:13 (16278): called boinc_finish(0) </stderr_txt> ]]>
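The pilot log's final memory fit (the one reporting `current memory leak: 3.74 B/s`) prints the sums and means of a linear model. The sketch below shows how a least-squares slope and intercept would follow from those four numbers. It assumes that "sum of deviations" is the cross term Σ(x − x̄)(y − ȳ) and "sum of square deviations" is Σ(x − x̄)², which the log does not state explicitly; `linear_fit` is a hypothetical helper, not the pilot's own code.

```python
# Values copied from the last linear fit in the pilot log.
sum_sq_dev = 697054930.0       # "sum of square deviations", assumed sum((x - mean_x)**2)
sum_dev = 2604206265.9999995   # "sum of deviations", assumed sum((x - mean_x)*(y - mean_y))
mean_x = 1727975785.0          # "mean x" (Unix timestamps, seconds)
mean_y = 2511492.3740458013    # "mean y" (memory reading)


def linear_fit(s_xy, s_xx, x_bar, y_bar):
    """Ordinary least-squares slope and intercept from precomputed sums (hypothetical helper)."""
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return slope, intercept


slope, intercept = linear_fit(sum_dev, sum_sq_dev, mean_x, mean_y)
print(f"slope: {slope:.2f}")          # 3.74, matching the logged leak rate
print(f"intercept: {intercept:.2f}")  # compare with the logged "intersect"
```

If the assumptions hold, the slope agrees with the logged leak rate and the intercept with the logged `intersect` value up to floating-point rounding. The intercept is large and negative only because x is an absolute timestamp.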
©2024 CERN