Message boards : ATLAS application : 20 minutes remaining to 2 days
Keter

Joined: 5 May 22
Posts: 1
Credit: 136,414
RAC: 1,653
Message 46773 - Posted: 14 May 2022, 16:48:10 UTC
Last modified: 14 May 2022, 16:50:40 UTC

I'm curious why the 6-CPU task said 20 minutes were remaining, yet the last fraction of a percent after 99.75% has taken days to complete. Is this normal for LHC ATLAS tasks?
I've paused other tasks to possibly free up extra resources for the LHC application, with no observable effect.
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 728
Credit: 493,891,267
RAC: 309,312
Message 46774 - Posted: 14 May 2022, 17:27:34 UTC - in response to Message 46773.  

I think that if you haven't done many tasks yet, there is an error in how the remaining time is calculated. It should get better over time.
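To illustrate why a remaining-time figure can be misleading, here is a minimal sketch of one simple estimator that extrapolates from elapsed time and the reported fraction done. This is a simplified model and an assumption for illustration, not the BOINC client's actual calculation; the function name and the numbers are hypothetical. If the reported progress stalls near 99.75% while the task keeps running, such an estimate stays small even though the real finish may be far away.

```python
def remaining_seconds(elapsed_s: float, fraction_done: float) -> float:
    """Extrapolate remaining runtime from elapsed time and fraction done.

    Simplified illustrative model (an assumption, not BOINC's real code):
    it assumes progress grows linearly with elapsed time.
    """
    if not 0.0 < fraction_done <= 1.0:
        raise ValueError("fraction_done must be in (0, 1]")
    total_estimate = elapsed_s / fraction_done
    return total_estimate - elapsed_s


# Hypothetical numbers: 2 hours elapsed, progress reported as 99.75%.
print(round(remaining_seconds(2 * 3600, 0.9975)))  # 18 (seconds)
```

An estimator like this is only as good as the progress value it is fed, which is one reason the displayed time and the real runtime can diverge.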
7v7.pl

Joined: 11 May 22
Posts: 3
Credit: 235,455
RAC: 3,873
Message 46785 - Posted: 18 May 2022, 6:46:50 UTC - in response to Message 46773.  

I have the same problem: a 5-CPU task shows 100% done, the running time so far is 3 days 4:57:58 and still counting, although the remaining time shows 00:00:00. All other tasks are stopped ...
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Joined: 15 Jun 08
Posts: 2028
Credit: 148,620,117
RAC: 115,781
Message 46787 - Posted: 18 May 2022, 6:55:55 UTC - in response to Message 46785.  

Please make your computers visible to other volunteers first:
https://lhcathome.cern.ch/lhcathome/prefs.php?subset=project
7v7.pl

Joined: 11 May 22
Posts: 3
Credit: 235,455
RAC: 3,873
Message 46788 - Posted: 18 May 2022, 7:13:09 UTC - in response to Message 46787.  

Done.
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Joined: 15 Jun 08
Posts: 2028
Credit: 148,620,117
RAC: 115,781
Message 46791 - Posted: 18 May 2022, 10:39:00 UTC - in response to Message 46785.  

I have the same problem: a 5-CPU task shows 100% done, the running time so far is 3 days 4:57:58 and still counting, although the remaining time shows 00:00:00. All other tasks are stopped ...

This is most likely caused by too many suspend/resume cycles.
See the logs from ATLAS/CMS.


You got a mix of different tasks.

Theory: works fine
see:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=354205816
https://lhcathome.cern.ch/lhcathome/result.php?resultid=354202802



CMS: works fine
https://lhcathome.cern.ch/lhcathome/result.php?resultid=354214457
except that CMS (and ATLAS) don't like to be suspended too often or for long periods.
This may become a problem in the future.
2022-05-12 16:57:06 (5864): VM state change detected. (old = 'running', new = 'paused')
2022-05-12 17:10:39 (5864): VM state change detected. (old = 'paused', new = 'running')
2022-05-12 17:12:52 (5864): VM state change detected. (old = 'running', new = 'paused')
2022-05-12 17:19:05 (5864): VM state change detected. (old = 'paused', new = 'running')
.
.
.
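To see how often a task was suspended, the "VM state change detected" lines can be counted. The following is a minimal sketch, assuming the log keeps the exact line format shown in the excerpt above; the function name and the regular expression are my own and not part of any BOINC tool.

```python
import re
from datetime import datetime

# Sample lines copied from the task log excerpt above.
LOG = """\
2022-05-12 16:57:06 (5864): VM state change detected. (old = 'running', new = 'paused')
2022-05-12 17:10:39 (5864): VM state change detected. (old = 'paused', new = 'running')
2022-05-12 17:12:52 (5864): VM state change detected. (old = 'running', new = 'paused')
2022-05-12 17:19:05 (5864): VM state change detected. (old = 'paused', new = 'running')
"""

# Assumed line format: "<date> <time> (<pid>): VM state change detected. (old = 'x', new = 'y')"
PATTERN = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) \(\d+\): "
    r"VM state change detected\. \(old = '(\w+)', new = '(\w+)'\)"
)


def summarize_pauses(log_text):
    """Return (number of pauses, total paused seconds) found in log_text."""
    pauses = 0
    paused_seconds = 0.0
    paused_since = None
    for line in log_text.splitlines():
        match = PATTERN.match(line.strip())
        if not match:
            continue  # ignore unrelated log lines
        stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
        new_state = match.group(3)
        if new_state == "paused":
            pauses += 1
            paused_since = stamp
        elif new_state == "running" and paused_since is not None:
            paused_seconds += (stamp - paused_since).total_seconds()
            paused_since = None
    return pauses, paused_seconds


print(summarize_pauses(LOG))  # (2, 1186.0)
```

For the four lines above this reports 2 pauses totalling 1186 seconds (about 20 minutes). A full log with many more such pairs would point to the suspend/resume problem.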




ATLAS
WUs like this are caused by a CERN configuration error:
https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=188417240
Just ignore them.




These are fine as they returned a HITS file:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=354294703
https://lhcathome.cern.ch/lhcathome/result.php?resultid=354295020
https://lhcathome.cern.ch/lhcathome/result.php?resultid=354211799
2022-05-12 16:40:00 (11588): Guest Log: HITS file was successfully produced
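A quick way to check a saved task log for this marker is sketched below. It assumes the success message appears verbatim as in the line above; the function name is hypothetical.

```python
SUCCESS_MARKER = "HITS file was successfully produced"


def produced_hits(log_text: str) -> bool:
    """Return True if any line of the log contains the HITS success message."""
    return any(SUCCESS_MARKER in line for line in log_text.splitlines())


# Sample line copied from the log excerpt above.
sample = "2022-05-12 16:40:00 (11588): Guest Log: HITS file was successfully produced"
print(produced_hits(sample))  # True
```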




The last one suffered from the suspend/resume cycles described above but finally succeeded.
Nonetheless, it came close to crashing.
2022-05-11 23:03:46 (9856): Error in stop VM for VM: -108
7v7.pl

Joined: 11 May 22
Posts: 3
Credit: 235,455
RAC: 3,873
Message 46792 - Posted: 18 May 2022, 18:54:04 UTC - in response to Message 46791.  

Thank you for the detailed explanation.


©2022 CERN