Website showing list with "bad" sherpa tasks
Author | Message |
---|---|
Joined: 18 Dec 15 Posts: 1716 Credit: 106,627,606 RAC: 73,993 |
I remember that a few years ago, someone here in the forum posted a URL to a website showing the names of "bad" (faulty) sherpa tasks, so that one could check after a task download whether that task will succeed or probably fail. Unfortunately, I don't remember this URL and I can't find it anywhere. Could someone please post it here? |
Joined: 15 Jun 08 Posts: 2443 Credit: 231,372,593 RAC: 121,950 |
"few years ago": That might not be useful any more, since there were major changes recently. Most important is the switch from cvm3 to cvm4, which provides a completely different base environment. According to mcplots, the overall failure rate is 1.27 % for revision 2687. |
Joined: 2 May 07 Posts: 2152 Credit: 161,229,638 RAC: 59,060 |
Erich56, remember that the revision on this page can change. At the moment rev=2687: http://mcplots2cc7.cern.ch/production.php?view=runs&rev=2687&display=fail |
Joined: 18 Dec 15 Posts: 1716 Credit: 106,627,606 RAC: 73,993 |
On one of my hosts, a Sherpa task [Boinc pp winclusive 7000 10 - sherpa 2.2.9 default 6000 223] has been running for more than 2 days. Console F2 says "lean back and enjoy" ... but no events have been shown since then. The CPU is active at 99+ % in console F3. Since this task is shown in the list linked in the posting above, I guess that I should kill the task, right? |
Joined: 2 May 07 Posts: 2152 Credit: 161,229,638 RAC: 59,060 |
Therefore no task of them was successful, yes. Btw, no new Theory tasks are available. Best time to find the network issues ;-)) |
Joined: 15 Jun 08 Posts: 2443 Credit: 231,372,593 RAC: 121,950 |
Although the task in question might indeed have got stuck, the "failed" list is not helpful for deciding whether a task should be killed. That's because one important fact has not been mentioned: "Therefore no task of them was successful so far..." Especially when a new mcplots revision starts, there are always a couple of runspecs that fail or get lost before they report their first success. To get removed from the "failed" list, a runspec needs to report at least 1 successful result. |
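
For anyone who wants to reason about the "failed" list offline, here is a rough Python sketch of the rule described above. It is only an illustration, not how mcplots itself is implemented: the function names and the exact listing rule (at least one unsuccessful attempt and zero successes) are my assumptions, and the counts in the usage note are made up. Only the URL pattern is taken from the link posted in this thread.

```python
from urllib.parse import urlencode

# Host and query parameters as they appear in the link posted in this thread.
BASE_URL = "http://mcplots2cc7.cern.ch/production.php"


def failed_list_url(rev):
    """Build the URL of the "failed" runs page for a given mcplots revision."""
    query = urlencode({"view": "runs", "rev": rev, "display": "fail"})
    return f"{BASE_URL}?{query}"


def on_failed_list(successes, failures):
    """Assumed rule: a runspec is listed while it has failed or lost
    attempts but has not yet reported a single successful result."""
    return failures > 0 and successes == 0


if __name__ == "__main__":
    print(failed_list_url(2687))
    # Hypothetical counts, for illustration only.
    print(on_failed_list(successes=0, failures=3))
    print(on_failed_list(successes=1, failures=3))
```

Under this assumed rule, a runspec with 3 failures and 0 successes is listed, while one with 3 failures and 1 success is not. That is why a listed runspec only means "no success so far" and does not prove that a task currently running on your host will fail.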
©2024 CERN