[ERROR] No jobs were available to run.
Author | Message |
---|---|
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
I see we have hundreds of these again today with Theory and CMS tasks. I have over 20 so far, and I checked another member who runs lots of these too and saw the same thing, so I have to go suspend all of mine so they don't all end up doing this. |
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
You may as well suspend your LHCb tasks too because they're not gonna get any sub-tasks either unless they hooked up with Condor before this latest problem started. |
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
Yeah, I would have done that, but the only LHCb tasks I have running are the beta version (1.07), and they are still working. |
Send message Joined: 18 Dec 15 Posts: 1691 Credit: 104,595,111 RAC: 114,771 |
I, too, had a few Theory tasks without jobs last night. Now it seems to work again. No problem with LHCb so far. As for CMS: yesterday, this sub-project was removed from the list on the Project Status Page. So, obviously, it's dead for the time being (which is too bad). |
Send message Joined: 18 Dec 15 Posts: 1691 Credit: 104,595,111 RAC: 114,771 |
Last night, same thing: there were a few Theory tasks not getting jobs. Why is it that we always temporarily run out of jobs during the night hours? |
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
And again... 8 in a row so far. |
Send message Joined: 18 Dec 15 Posts: 1691 Credit: 104,595,111 RAC: 114,771 |
I had a few again yesterday afternoon and this afternoon. |
Send message Joined: 2 May 07 Posts: 2117 Credit: 159,924,350 RAC: 80,112 |
No new tasks since 11:00 UTC. |
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
No new tasks again, and the ones we do have are once again failing with: [ERROR] Condor exited after 4751s without running a job. They had been running well for almost a week, but they tend to do this on a Saturday. So far I have 20 in a row doing this and another 84 to run. |
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
https://lhcathome.cern.ch/lhcathome/results.php?hostid=9930008&offset=0&show_names=0&state=6&appid=13 Again today we have thousands of these. |
Send message Joined: 18 Dec 15 Posts: 1691 Credit: 104,595,111 RAC: 114,771 |
https://lhcathome.cern.ch/lhcathome/results.php?hostid=9930008&offset=0&show_names=0&state=6&appid=13 Again, I really can't understand why no-one at LHC takes care of these recurring problems :-( |
Send message Joined: 13 Jul 05 Posts: 167 Credit: 14,945,019 RAC: 255 |
Isn't selecting only failed tasks cheating a bit? https://lhcathome.cern.ch/lhcathome/results.php?hostid=9930008&offset=0&show_names=0&state=4&appid=13 shows that there are some valid subtasks out there. "https://lhcathome.cern.ch/lhcathome/results.php?hostid=9930008&offset=0&show_names=0&state=6&appid=13 Again today we have thousands of these." "Again, I really can't understand why no-one at LHC takes care of these recurring problems :-(" OK: so what's your suggestion for when a project has only a small amount of work available? Should it just give up on BOINC and run the work privately? I suppose they could drastically reduce the number of pilots so that each is more likely to actually get a sub-task - but there'd still be whining here about the lack of WUs instead. Edit: although telling us what's going on wouldn't do any harm! |
Send message Joined: 18 Dec 15 Posts: 1691 Credit: 104,595,111 RAC: 114,771 |
"Edit: although telling us what's going on wouldn't do any harm!" Yes, at least this could be done, and it would be nice for us crunchers. |
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
Henry Nebrensky, nothing you said makes any sense, and it doesn't even belong here. |
Send message Joined: 18 Dec 15 Posts: 1691 Credit: 104,595,111 RAC: 114,771 |
This morning, same thing: all tasks without jobs :-( "https://lhcathome.cern.ch/lhcathome/results.php?hostid=9930008&offset=0&show_names=0&state=6&appid=13 Again, I really can't understand why no-one at LHC takes care of these recurring problems :-(" And it is rather annoying by now. Why do the LHC@Home people not solve this never-ending problem? |
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
Yes, same thing here, Erich. As always, I checked the stats of other members who do lots of these, and they had the same problem, so that tells us it is at the CERN server end again. After a couple of tries I reloaded all my hosts for another run. I think once in a while all we need to do is mention it here, and they usually take care of the problem. |
Send message Joined: 18 Dec 15 Posts: 1691 Credit: 104,595,111 RAC: 114,771 |
"Yes same thing here Erich ..." So far they haven't, though. From what I could see this morning: tons of failed tasks due to lack of jobs. These tasks run for about 20 minutes, then they fail - what a waste :-( Something is running very wrong over there, and obviously they don't have the experts to get that fixed. |
Send message Joined: 20 Jun 14 Posts: 378 Credit: 238,712 RAC: 0 |
The number of queued jobs has been increased. This should resolve any issues with no sub tasks. |
Send message Joined: 24 Oct 04 Posts: 1129 Credit: 49,762,040 RAC: 5,592 |
Thanks, Laurence. So far today 23 valids and many more running with no problems. (Erich, I am sending you a PM about this.) |
Send message Joined: 2 May 07 Posts: 2117 Credit: 159,924,350 RAC: 80,112 |
"Something is running very wrong over there, and obviously they don't have the experts to get that fixed." Erich, please show more respect for the CERN IT and project teams! |
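
For anyone comparing the two result views linked in this thread: the links differ only in the `state` query parameter. Judging from the posts above, `state=6` lists failed tasks and `state=4` lists valid ones; that mapping is inferred from the thread, not taken from BOINC documentation. A minimal Python sketch for switching between the two views (the helper name `with_state` is hypothetical):

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_state(url: str, state: int) -> str:
    """Return the same results URL with its `state` query parameter replaced.

    Assumption: the meaning of each state value (6 = failed, 4 = valid)
    is inferred from the forum posts, not from official documentation.
    """
    parts = urlsplit(url)
    pairs = parse_qsl(parts.query, keep_blank_values=True)
    # Replace the value in place so the parameter order is preserved.
    updated = [(k, str(state) if k == "state" else v) for k, v in pairs]
    # If the URL had no `state` parameter at all, append one.
    if not any(k == "state" for k, _ in pairs):
        updated.append(("state", str(state)))
    return urlunsplit(parts._replace(query=urlencode(updated)))


failed_url = (
    "https://lhcathome.cern.ch/lhcathome/results.php"
    "?hostid=9930008&offset=0&show_names=0&state=6&appid=13"
)
print(with_state(failed_url, 4))
```

Looking at both views side by side gives a fairer picture of a host than the failed-only list does.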
©2024 CERN