Message boards :
ATLAS application :
several Atlas tasks had to be aborted
Message board moderation
| Author | Message |
|---|---|
YetiSend message Joined: 2 Sep 04 Posts: 468 Credit: 215,197,406 RAC: 1,966 |
Hi, I found several Atlas-Tasks that were doing nothing usefull and I had to abort them. Examples: https://lhcathome.cern.ch/lhcathome/result.php?resultid=207556366 https://lhcathome.cern.ch/lhcathome/result.php?resultid=207610403 https://lhcathome.cern.ch/lhcathome/result.php?resultid=207570427 https://lhcathome.cern.ch/lhcathome/result.php?resultid=207559971 Supporting BOINC, a great concept ! |
YetiSend message Joined: 2 Sep 04 Posts: 468 Credit: 215,197,406 RAC: 1,966 |
|
|
Send message Joined: 15 Jun 08 Posts: 2710 Credit: 291,681,477 RAC: 144,140 |
Line 2 points out a possible "hardware error". As ATLAS runs on virtual hardware it may indicate a corrupt vdi file. You may reset the project to get a fresh vdi file. If the error persists you'd have to dig deeper, e.g. for a corrupt VirtualBox installation or a real hardware error. |
YetiSend message Joined: 2 Sep 04 Posts: 468 Credit: 215,197,406 RAC: 1,966 |
|
|
Send message Joined: 2 May 07 Posts: 2278 Credit: 178,775,457 RAC: 2,306 |
Yeti, you have 32 GByte for this Ryzen with 16 Cores. Do you let 4 or five Atlas running at the same time? Boinc 7.12.1 is now the default. Seam so, that one Atlas is running sometime in a RAM-Desaster and find no end. You have more than 170 Tasks finished successful for the moment. Not easy to see what is going wrong. |
YetiSend message Joined: 2 Sep 04 Posts: 468 Credit: 215,197,406 RAC: 1,966 |
Do you let 4 or five Atlas running at the same time?Nope, only 2x4 or 3x4 Boinc 7.12.1 is now the default.But this doesn't mean that my version is bad. And newer BOINC-Versions have some very bad restrictions (or bugs, don't know) Seam so, that one Atlas is running sometime in a RAM-Desaster and find no end.As these tasks that I have to abort are spread over several of my machines and have started to appear some days ago I don't think it is a problem of my boxes Supporting BOINC, a great concept ! |
|
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
Hi, To the "faulty tasks" hypothesis: The first example failed on 2 other hosts but neither of those 2 shows successful results only failures. The most recent iteration is still in progress -> inconclusive The second example validated on the next iteration -> good task The third example is in second iteration which is in progress -> inconclusive The fourth example validated on the third iteration -> good task 2 good task indicators versus 2 inconclusive indicators. I bet in time the 2 still in progress validate too. If the tasks are indeed faulty then why are they failing only on your hosts? Also a couple of the examples ran for more than 48 hours. Do ATLAS tasks not have a time limit? |
©2025 CERN