Message boards :
ATLAS application :
Last days a lot of validate errors or No Hits file produced
Message board moderation
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Send message Joined: 11 Jul 06 Posts: 6 Credit: 2,915,386 RAC: 313 |
With native version also produced many valid but "No HITS result produced" results: https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553020 https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552887 https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552906 https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552965 https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553015 https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553016 https://lhcathome.cern.ch/lhcathome/result.php?resultid=415553017 https://lhcathome.cern.ch/lhcathome/result.php?resultid=415552861 As I see: CRITICAL | max running time (10000s) minus grace time (180s) has been exceeded - time to abort pilot |
Send message Joined: 27 Sep 08 Posts: 852 Credit: 694,261,272 RAC: 112,817 |
Around 50% of my ATLAS task are not vaild, this is quite high cf normal range of errors. |
Send message Joined: 14 Jan 10 Posts: 1427 Credit: 9,510,310 RAC: 2,545 |
Around 50% of my ATLAS task are not vaild, this is quite high cf normal range of errors.And the valid ones don't produce a valid HITS-file like e.g. https://lhcathome.cern.ch/lhcathome/result.php?resultid=415615260 |
Send message Joined: 27 Sep 08 Posts: 852 Credit: 694,261,272 RAC: 112,817 |
I didn't check the vaild ones ;), seems like ATLAS is quite broken at the moment in this case. |
Send message Joined: 2 May 07 Posts: 2245 Credit: 174,006,243 RAC: 8,727 |
2024-10-31 12:36:42 (13544): Guest Log: Looking for outputfile HITS.41843707._006391.pool.root.1 2024-10-31 12:36:42 (13544): Guest Log: HITS file was successfully produced For me no problems so far. Boinc 8.0.2 Virtualbox 7.0.14 |
Send message Joined: 3 Nov 12 Posts: 59 Credit: 142,485,943 RAC: 30,775 |
+1 HITS file was successfully produced (native_mt) x86_64-pc-linux-gnu |
Send message Joined: 28 Sep 04 Posts: 735 Credit: 49,844,204 RAC: 35,579 |
If you follow the link to original grafana data from the Atlas jobs graph page you'll find that Boinc_mcore is producing about 5 % successful results. The rest are not valid. |
Send message Joined: 24 May 23 Posts: 46 Credit: 2,632,622 RAC: 2,016 |
With native version also produced many valid but "No HITS result produced" results: Apparently I've been having this same kind of issue since 26/27 Oct, with quite some tasks running on my slow computer (where I also fake more CPUs than the actual ones, so to run more tasks concurrently and avoid dead times in the "starting up" and "finishing" stages). On my fast one, though, very very few. -- Bye |
Send message Joined: 7 Aug 14 Posts: 27 Credit: 10,000,233 RAC: 51 |
Haven't looked at all the logs but of the 20+ native tasks I have checked they all had a HITS file. |
Send message Joined: 2 May 07 Posts: 2245 Credit: 174,006,243 RAC: 8,727 |
Apparently I've been having this same kind of issue since 26/27 Oct, with quite some tasks running on my slow computer (where I also fake more CPUs than the actual ones, so to run more tasks concurrently and avoid dead times in the "starting up" and "finishing" stages). On my fast one, though, very very few. You can change your prefs to run only Atlas. Also you can select only one Atlas-Task with for example 4 CPU's to see if it work. |
Send message Joined: 24 May 23 Posts: 46 Credit: 2,632,622 RAC: 2,016 |
You can change your prefs to run only Atlas. When I checked last time, I had the impression things were already getting much better. I'm running only Atlas on my slow pc (which is not too slow, however), when Atlas work is available. IMHO the better option is to give each Atlas task as much CPU time and as many cores as possible, since this issue seems to be time-related. -- Bye |
Send message Joined: 7 Aug 11 Posts: 105 Credit: 25,519,766 RAC: 24,199 |
I'm up to 670+ invalid tasks with my modest hardware, all failing around the 1 minute mark. Others will have far more with significant amounts of time and power wasted. Could someone perhaps stop loading these miss-configured units until things get fixed? |
Send message Joined: 2 May 07 Posts: 2245 Credit: 174,006,243 RAC: 8,727 |
The best idea, deselect this project, until Cern-IT have an answer. Have stopped work with this Project. No idea, how long this problems will avalaible. |
Send message Joined: 4 Mar 11 Posts: 29 Credit: 3,848,900 RAC: 3 |
No need to disconnect from the project, just simply select "no new tasks", abort any tasks you have. Then sit back and wait until the project staff announce that the problem is solved. |
Send message Joined: 14 Jan 10 Posts: 1427 Credit: 9,510,310 RAC: 2,545 |
The same old story: 2024-11-14 22:40:37 (3440): Guest Log: *** Error codes and diagnostics *** 2024-11-14 22:40:37 (3440): Guest Log: "exeErrorCode": 65, 2024-11-14 22:40:37 (3440): Guest Log: "exeErrorDiag": "Non-zero return code from EVNTtoHITS (8); Logfile error in log.EVNTtoHITS: \"Unable to identify specific exception\"", 2024-11-14 22:40:37 (3440): Guest Log: "pilotErrorCode": 1305, 2024-11-14 22:40:37 (3440): Guest Log: "pilotErrorDiag": "Failed to execute payload:PyJobTransforms.transform.execute 2024-11-14 21:40:01,625 CRITICAL Transform executor raised TransformValidationException: Non-zero return code from EVNTtoHITS (8); Logfile error in log.EVNTtoHITS: \"Unable to identify specific exception\"", https://lhcathome.cern.ch/lhcathome/result.php?resultid=416572140 |
Send message Joined: 18 Dec 15 Posts: 1827 Credit: 119,561,229 RAC: 44,109 |
I am wondering that no one over there by now has noticed that all the tasks which were sent out within the recent past are faulty. How can this be? |
Send message Joined: 27 Jun 06 Posts: 8 Credit: 2,592,725 RAC: 444 |
My desktop completed successfully 2 ATLAS WUs today and 1 yesterday. Seems it's back OK now, no? https://lhcathome.cern.ch/lhcathome/results.php?hostid=10859153 |
Send message Joined: 18 Dec 15 Posts: 1827 Credit: 119,561,229 RAC: 44,109 |
...Seems it's back OK now, no?except that according to the server status page there are no tasks available for download |
Send message Joined: 18 Dec 15 Posts: 1827 Credit: 119,561,229 RAC: 44,109 |
I've been trying for a few hours now to get Atlas tasks - without success :-(...Seems it's back OK now, no?except that according to the server status page there are no tasks available for download How did you manage to get tasks ? |
Send message Joined: 18 Dec 15 Posts: 1827 Credit: 119,561,229 RAC: 44,109 |
after some time, one of my hosts did receive a task - and it worked wellI've been trying for a few hours now to get Atlas tasks - without success :-(...Seems it's back OK now, no?except that according to the server status page there are no tasks available for download |
©2025 CERN