Message boards :
Number crunching :
Server not reporting "No Work from project"
Message board moderation
Author | Message |
---|---|
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
Main page says no work. Scheduler contacts give the following: 10/18/2007 7:17:44 PM|lhcathome|Sending scheduler request: To fetch work 10/18/2007 7:17:44 PM|lhcathome|Requesting 2526 seconds of new work 10/18/2007 7:17:49 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/18/2007 7:17:49 PM|lhcathome|Deferring communication for 30 min 18 sec 10/18/2007 7:17:49 PM|lhcathome|Reason: requested by project 10/18/2007 7:48:09 PM|lhcathome|Sending scheduler request: To fetch work 10/18/2007 7:48:09 PM|lhcathome|Requesting 2550 seconds of new work 10/18/2007 7:48:14 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/18/2007 7:48:14 PM|lhcathome|Deferring communication for 30 min 18 sec 10/18/2007 7:48:14 PM|lhcathome|Reason: requested by project |
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
OK... It's getting a bit strange... You'll notice that within the same session there are two deferrals with the no work coming second. The next scheduler connect only has the one entry "requested by project" and goes back to the 30 minute interval instead of the longer backoff... 10/18/2007 9:49:50 PM|lhcathome|Sending scheduler request: To fetch work 10/18/2007 9:49:50 PM|lhcathome|Requesting 2960 seconds of new work 10/18/2007 9:49:55 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/18/2007 9:49:55 PM|lhcathome|Deferring communication for 30 min 18 sec 10/18/2007 9:49:55 PM|lhcathome|Reason: requested by project 10/18/2007 9:49:55 PM|lhcathome|Deferring communication for 41 min 9 sec 10/18/2007 9:49:55 PM|lhcathome|Reason: no work from project 10/18/2007 10:31:06 PM|lhcathome|Sending scheduler request: To fetch work 10/18/2007 10:31:06 PM|lhcathome|Requesting 3001 seconds of new work 10/18/2007 10:31:11 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/18/2007 10:31:11 PM|lhcathome|Deferring communication for 30 min 18 sec 10/18/2007 10:31:11 PM|lhcathome|Reason: requested by project |
Send message Joined: 14 Jul 05 Posts: 275 Credit: 49,291 RAC: 0 |
The two deferral messages are a client bug. Upgrade to a newer version. |
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
The two deferral messages are a client bug. Upgrade to a newer version. Hmmm... This only started as of today... Why would a client bug manifest itself after several months of operation? Do you have a Trac bug number and a way I can look at the bug log? I used to be able to browse the list without logging in, but I can't seem to find how I did that... |
Send message Joined: 14 Jul 05 Posts: 275 Credit: 49,291 RAC: 0 |
Hmmm... This only started as of today... Why would a client bug manifest itself after several months of operation? Do you have a Trac bug number and a way I can look at the bug log? I don't think it was reported on Trac. Devs probably found it themselves without any user reporting it. Anyway, the latest version's logs look like this: 19/10/2007 00:45:07|boincsimap|Sending scheduler request: To fetch work. Requesting 5556 seconds of work, reporting 0 completed tasks 19/10/2007 00:45:22|boincsimap|Scheduler request succeeded: got 1 new tasks 19/10/2007 00:45:32|boincsimap|Sending scheduler request: To fetch work. Requesting 1151 seconds of work, reporting 0 completed tasks 19/10/2007 00:45:42|boincsimap|Scheduler request succeeded: got 1 new tasksYep, doesn't show deferral at all. Not a bug, it was removed on purpose. I already gave up on trying to get the change reverted. Complain yourself. I used to be able to browse the list without logging in, but I can't seem to find how I did that... And even though I have permission to do *whatever* on the wiki (I helped migrate the old documentation into the wiki, so I even have delete permissions) and *whatever* with the bug tickets, I suddenly lost even read permission for the source code and changesets. So I think somebody recently broke Trac permissions. Wouldn't surprise me if tickets are now blocked for everybody, or if only spammers can access the wiki :P |
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
Hmmm... This only started as of today... Why would a client bug manifest itself after several months of operation? Do you have a Trac bug number and a way I can look at the bug log? Well, I decided to restart BOINC and see what happens. Next connect in 4 minutes. I'll edit this with the results and further comment... Well, same thing... This is 5.8.16, so it's not super-old... I wasn't fond of what I was hearing about 5.10.x versions, so I hung back with 5.8.16... This is a minor nuisance...and I'd prefer to be able to see the deferral, so I'll just hang out for a while and see what happens... |
Send message Joined: 18 Sep 04 Posts: 143 Credit: 27,645 RAC: 0 |
I suddenly lost even read permission for the source code and changesets. So I think somebody recently broke Trac permissions. Nah, it's super secret code that they are adding. Only visible on a need-to-know-basis. :-D Jord BOINC FAQ Service |
Send message Joined: 1 Mar 07 Posts: 47 Credit: 32,356 RAC: 0 |
I used to be able to browse the list without logging in, but I can't seem to find how I did that... Matt Lebofsky, SETI@home server admin, also does work on the BOINC servers. In his recent SAH Tech News he stated that he accidentally broke Trac a few days ago.
See http://setiathome.berkeley.edu/tech_news.php#134 and http://setiathome.berkeley.edu/forum_thread.php?id=42913 for more. Keith. |
Send message Joined: 14 Jul 05 Posts: 275 Credit: 49,291 RAC: 0 |
Matt Lebofsky, SETI@home server admin, also does work on the BOINC servers. I saw that, and it was a few days ago. But the source permissions got broken just last night. |
Send message Joined: 24 Jul 05 Posts: 56 Credit: 5,602,899 RAC: 8 |
One of my PC's, the VISTA box using 5.10.20, is getting this message stream when left unattended and looking for work: 10/27/2007 1:05:23 PM|lhcathome|Message from server: Not sending work - last request too recent: 65 sec 10/27/2007 1:05:23 PM|lhcathome|Deferring communication for 1 min 0 sec 10/27/2007 1:05:23 PM|lhcathome|Reason: no work from project 10/27/2007 1:06:24 PM|lhcathome|Sending scheduler request: To fetch work 10/27/2007 1:06:24 PM|lhcathome|Requesting 1199 seconds of new work 10/27/2007 1:06:29 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/27/2007 1:06:29 PM|lhcathome|Message from server: Not sending work - last request too recent: 65 sec 10/27/2007 1:06:29 PM|lhcathome|Deferring communication for 1 min 44 sec 10/27/2007 1:06:29 PM|lhcathome|Reason: no work from project 10/27/2007 1:08:14 PM|lhcathome|Sending scheduler request: To fetch work 10/27/2007 1:08:14 PM|lhcathome|Requesting 1199 seconds of new work 10/27/2007 1:08:19 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/27/2007 1:08:19 PM|lhcathome|Message from server: Not sending work - last request too recent: 111 sec 10/27/2007 1:08:19 PM|lhcathome|Deferring communication for 1 min 0 sec 10/27/2007 1:08:19 PM|lhcathome|Reason: no work from project 10/27/2007 1:09:24 PM|lhcathome|Sending scheduler request: To fetch work 10/27/2007 1:09:24 PM|lhcathome|Requesting 1199 seconds of new work 10/27/2007 1:09:29 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/27/2007 1:09:29 PM|lhcathome|Message from server: Not sending work - last request too recent: 71 sec 10/27/2007 1:09:29 PM|lhcathome|Deferring communication for 5 min 16 sec 10/27/2007 1:09:29 PM|lhcathome|Reason: no work from project 10/27/2007 1:14:50 PM|lhcathome|Sending scheduler request: To fetch work 10/27/2007 1:14:50 PM|lhcathome|Requesting 1200 seconds of new work 10/27/2007 1:14:55 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/27/2007 1:14:55 PM|lhcathome|Message from server: Not sending work - last request too recent: 325 sec 10/27/2007 1:14:55 PM|lhcathome|Deferring communication for 19 min 10 sec 10/27/2007 1:14:55 PM|lhcathome|Reason: no work from project I have detatched and rebooted with the same result, "no work from project" because of too recent a request. I am currently focusing on POEM and LHC on my home PC's and this is the only one doing this. the other 3 PC's are XP(2) and 2k(1). |
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
Your issue is one where the server *IS* reporting no work, so it might be beneficial to start a new post instead of using this thread... FYI, the current minimum time between scheduler contacts is a little over 30 minutes, so you need to simply set that box to no new tasks for 31 minutes and then try again... Brian One of my PC's, the VISTA box using 5.10.20, is getting this message stream when left unattended and looking for work: |
Send message Joined: 29 Dec 06 Posts: 100 Credit: 184,937 RAC: 0 |
Great advice from Brian Silvers! It would be nice if BOINC manager would always set the next try for 30 minutes, 18 seconds... but that's a subject for another thread. Thanks again, Brian. I'm using the "no new tasks" technique and setting my wrist watch for 31 minutes... so simple, yet so smart. _______ "Three quarks for Muster Mark!" . . . . . . . - James Joyce, Finnegans Wake . . . . |
Send message Joined: 29 Dec 06 Posts: 100 Credit: 184,937 RAC: 0 |
...and yes, this issue is technically off topic (I just realized.) If anyone has other questions about the "Server IS reporting No Work from Project saying that my last request was too recent" issue, start a new thread. Cheers! _______ "Three quarks for Muster Mark!" . . . . . . . - James Joyce, Finnegans Wake . . . . |
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
It would be nice if BOINC manager would always set the next try for 30 minutes, 18 seconds... but that's a subject for another thread. It depends on various conditions (that I don't know/understand) as to what delay time is set for the next comm attempt. IMO, with the current work quota set so low, I don't see the need for the backoff anyway... I would think that the current 30 minute policy probably increased the load on the scheduler. Edit: OK, I figure perhaps the 30-minute backoff was to try to get reporting done before forcing the 24-hour backoff. Dunno what to say. I'll check to see what happens with the 4 units I pick up over the next 2 hours, but I think the 30-minute rule doesn't get the final 2 results reported until tomorrow unless I hit the update button myself... Brian |
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
Yep, that's exactly what happened. I had one short running result, so it got reported on the contact that happened 30 minutes after the download of the first two results. The 2nd result finished at about 40 minutes after it was first downloaded, so the 2nd result went up in the report just a few minutes ago. However, here's what the scheduler did: 10/27/2007 8:41:44 PM|lhcathome|Sending scheduler request: To fetch work 10/27/2007 8:41:44 PM|lhcathome|Requesting 256151 seconds of new work, and reporting 1 completed tasks 10/27/2007 8:41:54 PM|lhcathome|Scheduler RPC succeeded [server version 505] 10/27/2007 8:41:54 PM|lhcathome|Message from server: No work sent 10/27/2007 8:41:54 PM|lhcathome|Message from server: (reached daily quota of 4 results) 10/27/2007 8:41:54 PM|lhcathome|Deferring communication for 22 hr 27 min 48 sec 10/27/2007 8:41:54 PM|lhcathome|Reason: requested by project So, to report the remaining 2 results, I'll have to manually push the update button or wait until the next contact some 22 hours away... Brian |
Send message Joined: 24 Jul 05 Posts: 56 Credit: 5,602,899 RAC: 8 |
OK, so I misread the POST header. But still the back off times were not for the reason of reaching the daily quota, I had see that before. But the systems were unattended and I was not controling the times of the WAIT between communication events, it was all on the software and the server. If I go back far enough in the logs I will find an item similar to the one shown above. When I looked at the web site after the last message I saw that there was work, about 90,000 units at that time and rising as I refreshed. |
Send message Joined: 3 Jan 07 Posts: 124 Credit: 7,065 RAC: 0 |
I understood your issue. I was merely pointing out that this subject is not reflective of your actual issue. Anyway, to further explain what's going on in your case, the project has implemented a rule that says that no computer can contact the server more than once in any 30 minute and 18 second interval. If BOINC decides it is going to connect again in 1 minute, then I believe that your 30 minutes and 18 seconds are reset. In other words, if you were 20 minutes into the 30 minute wait and your system contacted the scheduler, you would then be told to wait 30 more minutes, not 10 more. This behavior is different from the 24-hour quota timer, which appears to keep track of specifically when the results were sent and adjusts accordingly. I don't know if there is anything that can be done about BOINC deciding to automatically connect again in less than the set limit by LHC. It is a different issue than what I was originally reporting, that's why I'm suggesting that you create a new thread to discuss the issue. Brian
|
©2024 CERN