several Atlas tasks had to be aborted
Author | Message |
---|---|
Joined: 2 Sep 04 Posts: 453 Credit: 193,369,412 RAC: 10,065 |
Hi, I found several ATLAS tasks that were doing nothing useful, and I had to abort them. Examples: https://lhcathome.cern.ch/lhcathome/result.php?resultid=207556366 https://lhcathome.cern.ch/lhcathome/result.php?resultid=207610403 https://lhcathome.cern.ch/lhcathome/result.php?resultid=207570427 https://lhcathome.cern.ch/lhcathome/result.php?resultid=207559971 Supporting BOINC, a great concept ! |
Joined: 2 Sep 04 Posts: 453 Credit: 193,369,412 RAC: 10,065 |
|
Joined: 15 Jun 08 Posts: 2386 Credit: 223,041,230 RAC: 136,850 |
Line 2 points out a possible "hardware error". As ATLAS runs on virtual hardware, this may indicate a corrupt vdi file. You may reset the project to get a fresh vdi file. If the error persists, you'd have to dig deeper, e.g. for a corrupt VirtualBox installation or a real hardware error. |
Joined: 2 Sep 04 Posts: 453 Credit: 193,369,412 RAC: 10,065 |
|
Joined: 2 May 07 Posts: 2071 Credit: 156,192,791 RAC: 103,819 |
Yeti, you have 32 GB of RAM for this 16-core Ryzen. Do you run four or five ATLAS tasks at the same time? BOINC 7.12.1 is now the default. It seems that an ATLAS task sometimes runs into a RAM disaster and never finishes. You have more than 170 tasks finished successfully so far. It is not easy to see what is going wrong. |
Joined: 2 Sep 04 Posts: 453 Credit: 193,369,412 RAC: 10,065 |
"Do you run four or five ATLAS tasks at the same time?" Nope, only 2x4 or 3x4. "BOINC 7.12.1 is now the default." But this doesn't mean that my version is bad, and newer BOINC versions have some very bad restrictions (or bugs, I don't know). "It seems that an ATLAS task sometimes runs into a RAM disaster and never finishes." As the tasks that I have to abort are spread over several of my machines and started to appear some days ago, I don't think it is a problem with my boxes. Supporting BOINC, a great concept ! |
Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
Hi, regarding the "faulty tasks" hypothesis: The first example failed on 2 other hosts, but neither of those hosts shows any successful results, only failures, and the most recent iteration is still in progress -> inconclusive. The second example validated on the next iteration -> good task. The third example is in its second iteration, which is in progress -> inconclusive. The fourth example validated on the third iteration -> good task. That makes 2 good-task indicators versus 2 inconclusive ones. I bet that in time the 2 still in progress will validate too. If the tasks are indeed faulty, then why are they failing only on your hosts? Also, a couple of the examples ran for more than 48 hours. Do ATLAS tasks not have a time limit? |
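
The reasoning in the last post (a task that validates on any iteration is good, a task with an iteration still in progress is inconclusive) can be written down as a small sketch. Everything named here is an assumption made for illustration: the outcome labels, the `classify_workunit` helper, and the "possibly faulty" verdict for tasks where every iteration failed are not taken from the thread or from the LHC@home pages, and an aborted run is simply counted as a failure. The per-example outcome lists are a reading of the post, with the unstated second iteration of the fourth example assumed to have failed.

```python
from collections import Counter

# Hypothetical outcome labels, not the names shown on the LHC@home result pages.
VALIDATED = "validated"
IN_PROGRESS = "in_progress"
FAILED = "failed"  # assumption: an aborted run is counted as a failure


def classify_workunit(outcomes):
    """Classify one workunit from the outcomes of all its iterations."""
    if VALIDATED in outcomes:
        # Some host finished it successfully, so the task itself is fine.
        return "good task"
    if IN_PROGRESS in outcomes:
        # No success yet, but an iteration is still running.
        return "inconclusive"
    # Assumption: this verdict is not discussed in the thread.
    return "possibly faulty"


# The four examples as described in the post (first entry = the aborted run).
examples = {
    "first": [FAILED, FAILED, FAILED, IN_PROGRESS],
    "second": [FAILED, VALIDATED],
    "third": [FAILED, IN_PROGRESS],
    "fourth": [FAILED, FAILED, VALIDATED],  # second iteration assumed failed
}

verdicts = {name: classify_workunit(runs) for name, runs in examples.items()}
tally = Counter(verdicts.values())

for name, verdict in verdicts.items():
    print(f"{name}: {verdict}")
print(dict(tally))
```

With these inputs the tally matches the post: two good tasks and two inconclusive ones. The sketch deliberately ignores host reliability, so it cannot express the extra caveat that the two other hosts in the first example show only failures.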
©2024 CERN