Last days a lot of validate errors or No Hits file produced
Author | Message |
---|---|
Joined: 11 Jul 06 Posts: 6 Credit: 2,915,386 RAC: 1,785 |
The native version has also produced many results that are valid but report "No HITS result produced":
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553020
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552887
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552906
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552965
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553015
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553016
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553017
https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552861
As far as I can see: CRITICAL | max running time (10000s) minus grace time (180s) has been exceeded - time to abort pilot |
Joined: 27 Sep 08 Posts: 850 Credit: 692,823,409 RAC: 68,497 |
Around 50% of my ATLAS tasks are not valid; this is quite high compared with the normal range of errors. |
Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 1,266 |
> Around 50% of my ATLAS tasks are not valid; this is quite high compared with the normal range of errors.

And the valid ones don't produce a valid HITS file, e.g. https://lhcathome.cern.ch/lhcathome/result.php?resultid=415615260 |
Joined: 27 Sep 08 Posts: 850 Credit: 692,823,409 RAC: 68,497 |
I didn't check the valid ones ;). In that case, it seems ATLAS is quite broken at the moment. |
Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 456 |
2024-10-31 12:36:42 (13544): Guest Log: Looking for outputfile HITS.41843707._006391.pool.root.1
2024-10-31 12:36:42 (13544): Guest Log: HITS file was successfully produced
No problems for me so far. BOINC 8.0.2, VirtualBox 7.0.14 |
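For anyone who wants to check many task logs for this marker, here is a minimal sketch. It assumes the task's stderr text has been saved or copied locally and that the two marker strings appear verbatim, as quoted in this thread. The function name `hits_status` is hypothetical and not part of BOINC or ATLAS.

```python
def hits_status(log_text: str) -> str:
    """Classify a task log by the HITS markers quoted in this thread."""
    # Both marker strings are taken from posts above; that they appear
    # verbatim in every task log is an assumption.
    if "HITS file was successfully produced" in log_text:
        return "hits"
    if "No HITS result produced" in log_text:
        return "no-hits"
    return "unknown"


sample = (
    "2024-10-31 12:36:42 (13544): Guest Log: Looking for outputfile "
    "HITS.41843707._006391.pool.root.1\n"
    "2024-10-31 12:36:42 (13544): Guest Log: HITS file was successfully produced\n"
)
print(hits_status(sample))
```

A log containing neither marker is reported as "unknown" rather than guessed at, since a task can fail before the HITS check is ever reached.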
Joined: 3 Nov 12 Posts: 59 Credit: 142,182,531 RAC: 42,802 |
+1 HITS file was successfully produced (native_mt) x86_64-pc-linux-gnu |
Joined: 28 Sep 04 Posts: 732 Credit: 49,365,372 RAC: 17,171 |
If you follow the link from the ATLAS jobs graph page to the original Grafana data, you'll find that Boinc_mcore is producing only about 5% successful results. The rest are not valid. |
Joined: 24 May 23 Posts: 43 Credit: 2,624,143 RAC: 8,228 |
> With native version also produced many valid but "No HITS result produced" results:

Apparently I've been having the same kind of issue since 26/27 Oct, with quite a few tasks running on my slow computer (where I also fake more CPUs than it actually has, so as to run more tasks concurrently and avoid dead time in the "starting up" and "finishing" stages). On my fast one, though, very few. -- Bye |
Joined: 7 Aug 14 Posts: 27 Credit: 10,000,233 RAC: 290 |
I haven't looked at all the logs, but the 20+ native tasks I have checked all had a HITS file. |
Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 456 |
> Apparently I've been having this same kind of issue since 26/27 Oct, with quite some tasks running on my slow computer (where I also fake more CPUs than the actual ones, so to run more tasks concurrently and avoid dead times in the "starting up" and "finishing" stages). On my fast one, though, very very few.

You can change your prefs to run only ATLAS. You can also run just one ATLAS task at a time, with for example 4 CPUs, to see if it works. |
Joined: 24 May 23 Posts: 43 Credit: 2,624,143 RAC: 8,228 |
> You can change your prefs to run only Atlas.

When I last checked, I had the impression things were already getting much better. I'm running only ATLAS on my slow PC (which is not too slow, however) whenever ATLAS work is available. IMHO the better option is to give each ATLAS task as much CPU time and as many cores as possible, since this issue seems to be time-related. -- Bye |
Joined: 7 Aug 11 Posts: 104 Credit: 25,221,969 RAC: 17,297 |
I'm up to 670+ invalid tasks with my modest hardware, all failing around the 1-minute mark. Others will have far more, with significant amounts of time and power wasted. Could someone perhaps stop sending out these misconfigured units until things get fixed? |
Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 456 |
The best idea is to deselect this project until CERN IT has an answer. I have stopped working on this project. No idea how long these problems will last. |
Joined: 4 Mar 11 Posts: 29 Credit: 3,848,900 RAC: 16 |
No need to disconnect from the project: simply select "no new tasks" and abort any tasks you have. Then sit back and wait until the project staff announce that the problem is solved. |
Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 1,266 |
The same old story:
2024-11-14 22:40:37 (3440): Guest Log: *** Error codes and diagnostics ***
2024-11-14 22:40:37 (3440): Guest Log: "exeErrorCode": 65,
2024-11-14 22:40:37 (3440): Guest Log: "exeErrorDiag": "Non-zero return code from EVNTtoHITS (8); Logfile error in log.EVNTtoHITS: \"Unable to identify specific exception\"",
2024-11-14 22:40:37 (3440): Guest Log: "pilotErrorCode": 1305,
2024-11-14 22:40:37 (3440): Guest Log: "pilotErrorDiag": "Failed to execute payload:PyJobTransforms.transform.execute 2024-11-14 21:40:01,625 CRITICAL Transform executor raised TransformValidationException: Non-zero return code from EVNTtoHITS (8); Logfile error in log.EVNTtoHITS: \"Unable to identify specific exception\"",
https://lhcathome.cern.ch/lhcathome/result.php?resultid=416572140 |
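To compare failures across many results, the numeric codes in a diagnostics block like the one above can be pulled out with a short script. This is a rough sketch only: it assumes the codes always appear as `"<name>ErrorCode": <number>` pairs, as in the excerpt, and the helper name `extract_error_codes` is made up for illustration.

```python
import re

# Matches fragments such as:  "exeErrorCode": 65,
# Assumption: codes always use this "<name>ErrorCode": <digits> shape.
ERROR_CODE_RE = re.compile(r'"(\w+ErrorCode)":\s*(\d+)')


def extract_error_codes(log_text: str) -> dict[str, int]:
    """Return every "<name>ErrorCode": <number> pair found in the log text."""
    return {name: int(value) for name, value in ERROR_CODE_RE.findall(log_text)}


sample = (
    '2024-11-14 22:40:37 (3440): Guest Log: "exeErrorCode": 65,\n'
    '2024-11-14 22:40:37 (3440): Guest Log: "pilotErrorCode": 1305,\n'
)
print(extract_error_codes(sample))
```

The `...ErrorDiag` lines are deliberately ignored here, because their text contains escaped quotes and is easier to read by eye than to parse.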
Joined: 18 Dec 15 Posts: 1821 Credit: 118,943,725 RAC: 21,026 |
I am surprised that nobody over there has noticed by now that all the tasks sent out recently are faulty. How can this be? |
Joined: 27 Jun 06 Posts: 8 Credit: 2,592,725 RAC: 2,536 |
My desktop successfully completed 2 ATLAS WUs today and 1 yesterday. Seems it's back OK now, no? https://lhcathome.cern.ch/lhcathome/results.php?hostid=10859153 |
Joined: 18 Dec 15 Posts: 1821 Credit: 118,943,725 RAC: 21,026 |
> ...Seems it's back OK now, no?

Except that, according to the server status page, there are no tasks available for download. |
Joined: 18 Dec 15 Posts: 1821 Credit: 118,943,725 RAC: 21,026 |
I've been trying for a few hours now to get ATLAS tasks, without success :-(

> ...Seems it's back OK now, no?
>
> except that according to the server status page there are no tasks available for download

How did you manage to get tasks? |
Joined: 18 Dec 15 Posts: 1821 Credit: 118,943,725 RAC: 21,026 |
After some time, one of my hosts did receive a task, and it worked well.

> I've been trying for a few hours now to get Atlas tasks - without success :-(
>
> ...Seems it's back OK now, no?
>
> except that according to the server status page there are no tasks available for download |
©2024 CERN