Message boards : Theory Application : NO_SUB_TASKS for Theory

bronco

Joined: 13 Apr 18
Posts: 443
Credit: 8,438,885
RAC: 0
Message 36930 - Posted: 1 Oct 2018, 3:37:55 UTC - in response to Message 36929.  

The one I received at 30 Sep 2018, 21:40:57 UTC got a job and it's running fine now. Maybe there is a pattern but I don't see it. It seems like it comes and goes randomly, unpredictably.
ID: 36930
bronco

Joined: 13 Apr 18
Posts: 443
Credit: 8,438,885
RAC: 0
Message 36961 - Posted: 6 Oct 2018, 0:45:12 UTC

Uh-oh. Condor's gone rogue too? Has anybody seen Laurence?
ID: 36961
Erich56

Joined: 18 Dec 15
Posts: 1687
Credit: 103,106,422
RAC: 127,158
Message 37952 - Posted: 8 Feb 2019, 17:18:05 UTC

This had not occurred for a long time, but I had such a case today:
207 (0x000000CF) EXIT_NO_SUB_TASKS

https://lhcathome.cern.ch/lhcathome/result.php?resultid=215619010
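
For reference, the value in parentheses is the same exit code written as zero-padded hexadecimal (0xCF = 12 × 16 + 15 = 207). A minimal sketch in Python, assuming the `code (0xHEX) NAME` layout seen above; the `format_exit` helper and its lookup table are hypothetical and cover only the one code reported in this thread:

```python
# Hypothetical lookup table: contains only the code reported in this thread.
EXIT_NAMES = {207: "EXIT_NO_SUB_TASKS"}


def format_exit(code: int) -> str:
    """Render an exit code as 'decimal (0xHEX) NAME', mirroring the line above."""
    name = EXIT_NAMES.get(code, "UNKNOWN")
    # :08X pads the uppercase hexadecimal form to eight digits.
    return f"{code} (0x{code:08X}) {name}"


print(format_exit(207))
```

Reading the hexadecimal form back with `int("0x000000CF", 16)` gives 207 again, which is a quick way to match codes when a log shows only one of the two notations.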
ID: 37952
Harri Liljeroos

Joined: 28 Sep 04
Posts: 675
Credit: 43,541,645
RAC: 15,578
Message 37953 - Posted: 8 Feb 2019, 19:05:09 UTC - in response to Message 37952.  

The server status page shows 0 tasks in the ready-to-send queue, so at least the problem shouldn't spread. I have three Theory tasks currently running, and they don't look likely to succeed, but let's see what happens.
ID: 37953
Erich56

Joined: 18 Dec 15
Posts: 1687
Credit: 103,106,422
RAC: 127,158
Message 39946 - Posted: 17 Sep 2019, 10:57:40 UTC

I've had several EXIT_NO_SUB_TASKS failures after 22 minutes from start, within the past 2 hours.
ID: 39946
Laurence
Project administrator
Project developer

Joined: 20 Jun 14
Posts: 373
Credit: 238,712
RAC: 0
Message 39947 - Posted: 17 Sep 2019, 11:17:09 UTC - in response to Message 39946.  

The MCPlots server ran out of disk space. It has been fixed. Jobs should be flowing again soon.
ID: 39947
Erich56

Joined: 18 Dec 15
Posts: 1687
Credit: 103,106,422
RAC: 127,158
Message 39948 - Posted: 17 Sep 2019, 11:59:57 UTC - in response to Message 39947.  

Thanks, Laurence, for the quick information :-)
ID: 39948
Erich56

Joined: 18 Dec 15
Posts: 1687
Credit: 103,106,422
RAC: 127,158
Message 39951 - Posted: 17 Sep 2019, 14:39:39 UTC
Last modified: 17 Sep 2019, 14:45:20 UTC

Even if jobs are available now, there are still no tasks ready for download :-(
ID: 39951
Magic Quantum Mechanic

Joined: 24 Oct 04
Posts: 1117
Credit: 49,725,007
RAC: 13,926
Message 39955 - Posted: 17 Sep 2019, 18:23:17 UTC

207 (0x000000CF) EXIT_NO_SUB_TASKS

I started the day with 20 of these on 4 different hosts.

With this bizarre stderr output:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=245437184
ID: 39955
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 804
Credit: 650,180,708
RAC: 246,438
Message 39956 - Posted: 17 Sep 2019, 18:27:37 UTC - in response to Message 39955.  
Last modified: 17 Sep 2019, 18:27:55 UTC

I think the "ring 2 stack in use" message comes from trying to start too many VMs at once.
ID: 39956
Magic Quantum Mechanic

Joined: 24 Oct 04
Posts: 1117
Credit: 49,725,007
RAC: 13,926
Message 39957 - Posted: 17 Sep 2019, 20:10:27 UTC - in response to Message 39956.  

I think the "ring 2 stack in use" message comes from trying to start too many VMs at once.


No, that isn't what happened with mine, Toby.

They are running the same as they always do, but scroll down that stderr until you see what I mean.

(it has to be that dizzy redhat6 server again)
ID: 39957
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 804
Credit: 650,180,708
RAC: 246,438
Message 39959 - Posted: 18 Sep 2019, 6:14:29 UTC - in response to Message 39957.  

Maybe the ring 2 part is due to the VM not powering off when requested, but you already had no jobs, so based on the first error it wasn't important.
ID: 39959