Message boards : ATLAS application : Problem of the day ATLAS
Message board moderation

To post messages, you must log in.

AuthorMessage
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 801
Credit: 649,406,237
RAC: 241,543
Message 46810 - Posted: 23 May 2022, 16:56:56 UTC
Last modified: 23 May 2022, 16:58:14 UTC

ID: 46810 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2090
Credit: 158,503,681
RAC: 128,226
Message 46813 - Posted: 24 May 2022, 5:48:49 UTC - in response to Message 46810.  

Yes, saw this also, but only in a small number of Atlas-Tasks, also Guru Meditation, last week.
We can only control the Error-Tasks of Atlas or this one with too long runtime and deleting this Tasks.
ID: 46813 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1269
Credit: 8,478,478
RAC: 2,509
Message 46814 - Posted: 24 May 2022, 10:00:25 UTC - in response to Message 46813.  

When you see this happen, you could revive the task:

1. Suspend the task in BOINC with "leave in memory" not selected. The VM will be saved to disk.
2. With Virtual Box Manager:
- delete the saved state
- start the VM and let it run until the first events are processing
- stop the VM with writing the saved state to disk
3. Resume the task in BOINC
ID: 46814 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2090
Credit: 158,503,681
RAC: 128,226
Message 46822 - Posted: 25 May 2022, 17:48:53 UTC - in response to Message 46814.  
Last modified: 25 May 2022, 17:49:59 UTC

2022-05-25 16:02:06 (11660): Guest Log: Running cvmfs_config stat atlas.cern.ch

2022-05-25 16:02:06 (11660): Guest Log: VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE

2022-05-25 16:02:06 (11660): Guest Log: 2.6.3.0 1781 307445734561825742 32288 104734 4 1 1492424 4096000 0 65024 0 0 n/a 0 0 http://s1cern-cvmfs.openhtc.io/cvmfs/atlas.cern.ch http://xx.yyy.zzz.aa:3128 1

2022-05-25 16:02:06 (11660): Guest Log: ATHENA_PROC_NUMBER=12

2022-05-25 16:02:06 (11660): Guest Log: *** Starting ATLAS job. (PandaID=5463929576 taskID=29107814) ***

2022-05-25 16:12:56 (11660): VM is no longer is a running state. It is in 'GuruMeditation'.
2022-05-25 16:12:56 (11660): VM state change detected. (old = 'Running', new = 'GuruMeditation')

2022-05-25 16:12:56 (11660): Powering off VM.
2022-05-25 16:12:56 (11660): Deregistering VM. (boinc_d20a7b32445566aa, slot#5)
2022-05-25 16:13:38 (11660): Removing network bandwidth throttle group from VM.
2022-05-25 16:13:39 (11660): Removing VM from VirtualBox.
2022-05-25 16:14:17 (11660): Virtual machine exited.
16:14:27 (11660): called boinc_finish(0)
ID: 46822 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 801
Credit: 649,406,237
RAC: 241,543
Message 46823 - Posted: 25 May 2022, 20:35:21 UTC

ID: 46823 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2090
Credit: 158,503,681
RAC: 128,226
Message 46917 - Posted: 21 Jun 2022, 22:59:06 UTC

2022-06-21 20:26:53 (52460): Guest Log: 2.6.3.0 1852 307445734561825742 32172 105817 3 1 1492435 4096000 0 65024 0 0 n/a 0 0 http://s1cern-cvmfs.openhtc.io/cvmfs/atlas.cern.ch http://xx.xxx.xxx.xx:3128 1
2022-06-21 20:26:53 (52460): Guest Log: ATHENA_PROC_NUMBER=12
2022-06-21 20:26:55 (52460): Guest Log: *** Starting ATLAS job. (PandaID=5497406599 taskID=29339193) ***
2022-06-21 22:02:59 (52460): Status Report: Elapsed Time: '6000.000000'
2022-06-21 22:02:59 (52460): Status Report: CPU Time: '29828.796875'
2022-06-21 23:43:05 (52460): Status Report: Elapsed Time: '12000.000000'
2022-06-21 23:43:05 (52460): Status Report: CPU Time: '66942.609375'
2022-06-22 00:41:18 (52460): Guest Log: *** Job finished ***

Computer ID 10795955 https://lhcathome.cern.ch/lhcathome/result.php?resultid=358429345
Laufzeit 4 hours 18 min. 52 sek.
CPU Zeit 23 hours 48 min. 43 sek.
Prüfungsstatus Gültig
Punkte 871.48

12 CPU's: 4 hours x 12 = 48 Hours.
CPU Time 23 hours 48 min. 43 sek??
ID: 46917 · Report as offensive     Reply Quote

Message boards : ATLAS application : Problem of the day ATLAS


©2024 CERN