Message boards :
CMS Application :
EXIT_NO_SUB_TASKS
Message board moderation
Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · 15 . . . 16 · Next
Author | Message |
---|---|
Send message Joined: 15 Jun 08 Posts: 2411 Credit: 226,247,371 RAC: 130,486 |
Don't know if this helps to identify the problem. ATM I don't have any CMS tasks but this morning I had some still running from last night. I noticed that those tasks got fresh jobs after they finished the one before. At the same time all fresh tasks failed with EXIT_NO_SUB_TASKS. Might be that the Condor server doesn't accept new clients. |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,128 RAC: 397 |
|
Send message Joined: 3 May 20 Posts: 10 Credit: 600,070 RAC: 0 |
I just checked my work results, and it seems that I have 70 CMS faulty tasks that all errored out after ~1000 sec. So can I assume this is connected to the said condor problem? If so, why weren't jobs cancelled then? Still, happy to see that the issue has been resolved meanwhile and CMS WU are crunching fine again. |
Send message Joined: 15 Jun 08 Posts: 2411 Credit: 226,247,371 RAC: 130,486 |
So can I assume this is connected to the said condor problem? Most likely yes, but since your computers are hidden the logfiles can't be checked for other reasons. If so, why weren't jobs cancelled then? See here: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5090&postid=44381 |
Send message Joined: 3 May 20 Posts: 10 Credit: 600,070 RAC: 0 |
Thanks for the pointer! Didn't realize mine were hidden. I usually never do this. Now they should be visible. I'll keep running CMS tasks in the meantime :) |
Send message Joined: 18 Nov 17 Posts: 120 Credit: 52,002,925 RAC: 25,902 |
EXIT_NO_SUB_TASKS again |
Send message Joined: 16 Aug 05 Posts: 5 Credit: 2,795,425 RAC: 0 |
Same here. Question: can I just put the tasks in my queue on hold, and wait for better times? |
Send message Joined: 15 Jun 08 Posts: 2411 Credit: 226,247,371 RAC: 130,486 |
Yes, but you may be aware that if just 1 task from a project is set on hold your BOINC client will not download any new task from that project. |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,128 RAC: 397 |
|
Send message Joined: 27 Sep 08 Posts: 807 Credit: 652,142,562 RAC: 290,359 |
Looks like this time again. Is the effort to link the back end to BOINC job generation so much effort cf manually filling the queue and the server load when tasks run out? |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,128 RAC: 397 |
|
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,128 RAC: 397 |
|
Send message Joined: 18 Nov 17 Posts: 120 Credit: 52,002,925 RAC: 25,902 |
Looks like EXIT_NO_SUB_TASKS again. |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,686,320 RAC: 121,744 |
Looks like EXIT_NO_SUB_TASKS again.yes, and the download of new tasks has stopped automatically, which makes sense. So let's hope that CMS will soon be running again |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,128 RAC: 397 |
|
Send message Joined: 24 Oct 04 Posts: 1127 Credit: 49,749,586 RAC: 10,234 |
(I must not have hit the right "post" button earlier...) Not quite the same as the Kentucky Derby (glad I didn't make a bet) I have just been running the CMS here one at a time lately and so far all have been Valid. But I did have back to back time wasters over at -dev ( same version as far as I know) Back to running one again so I think it was just the usual slow internet speed and mine tried to start all 3 CMS tasks I have running at the same time. ( I hope you get that F1 GP in HDTV Ivan) |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,686,320 RAC: 121,744 |
now though all downloaded tasks error out after a few minutes with: 2021-05-02 20:17:03 (1516): VM Completion Message: Could not connect to Condor server on port 9618 . for example, see: https://lhcathome.cern.ch/lhcathome/result.php?resultid=315934632 |
Send message Joined: 24 Oct 04 Posts: 1127 Credit: 49,749,586 RAC: 10,234 |
Looks like the condor is taking sunday off again. I have one here that has been running 20% so far (3hrs 42mins) and this host doesn't have enough ram to try a new one and the ones I have this CMS version are -dev hosts so I would have to mess it all up by d/ling this version over here for hours but when one of the 2 CMS I have running is finished I will give this one a try again ( I have one of these on this host running from both here and at -dev at the same time) But both of them started 25 mins apart on this one and are actually running. Maybe Ivan will get this back to normal on monday. http://localhost:50606/logs/running.log |
Send message Joined: 24 Oct 04 Posts: 1127 Credit: 49,749,586 RAC: 10,234 |
Well I finally finished the 2 I had running .....one from here and one from over at -dev and started 2 more of the same 2 hours ago and they are running as they should so maybe if there was a problem it was fixed. (it is monday here for 3 minutes so that means it is the start of a new day in Geneva and London) |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,128 RAC: 397 |
( I hope you get that F1 GP in HDTV Ivan) I'm not that much of a fan, besides my broadband is lousy... Tuned my TV to Radio 5 Sports Extra for the sound, and my laptop to bbc.co.uk/sports for the comments and leaderboard. As an ex-racer myself, I do wish that the motorcycling GPs were available free-to-air here, I haven't seen one in 5 or 6 years (last time I visited my brother in Oz). |
©2024 CERN