Message boards :
News :
CMS@Home accidentally shut down -- Please set No New Tasks
Message board moderation
Author | Message |
---|---|
Send message Joined: 29 Aug 05 Posts: 1060 Credit: 7,737,455 RAC: 1,317 |
We need to upgrade the CMS@Home WMAgent before Thursday, so I tried to set the workflows to drain down. Unfortunately, I misunderstood the batch states and killed off most of them instead. :-(. There's one still left with about 200 jobs, so that won't last long. Please set your CMS projects to No New Tasks to avoid getting lots of computation errors. I'll let you know when the upgrade is done and jobs are flowing again. |
Send message Joined: 15 Jun 08 Posts: 2534 Credit: 253,851,031 RAC: 37,958 |
Ah, this explains why most of my running CMS tasks crashed within a few minutes short ago. Thought it was caused by the nasty storm. Thanks for the post. |
Send message Joined: 29 Aug 05 Posts: 1060 Credit: 7,737,455 RAC: 1,317 |
|
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,589,655 RAC: 2,832 |
We need to upgrade the CMS@Home WMAgent before Thursday, so I tried to set the workflows to drain down. Unfortunately, I misunderstood the batch states and killed off most of them instead. :-(. There's one still left with about 200 jobs, so that won't last long. I'm not sure I understand this. If you're shutting the server down, surely my client just won't be able to get any tasks anyway? Why do we have to tell it not to fetch? Shouldn't I just get "no work available" back from the server, or perhaps no response? |
Send message Joined: 14 Jan 10 Posts: 1418 Credit: 9,464,929 RAC: 2,680 |
I'm not sure I understand this. If you're shutting the server down, surely my client just won't be able to get any tasks anyway? Why do we have to tell it not to fetch? Shouldn't I just get "no work available" back from the server, or perhaps no response?Your client requests a task from the BOINC server. That task creates a CMS-VM on your machine and that VM requests CMS-jobs from another server and that's out of jobs or unreachable. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,589,655 RAC: 2,832 |
Your client requests a task from the BOINC server. That task creates a CMS-VM on your machine and that VM requests CMS-jobs from another server and that's out of jobs or unreachable. Ah that makes sense. Although couldn't the Boinc server have been told not to hand out any CMS? |
Send message Joined: 15 Jun 08 Posts: 2534 Credit: 253,851,031 RAC: 37,958 |
Although couldn't the Boinc server have been told not to hand out any CMS? It sometimes happens that the BOINC server doesn't immediately stop sending task envelopes, especially when the subtask shortage is not very long. In this case Ivan sends out an NNT info. This ensures a faster CMS restart when fresh work is available. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,589,655 RAC: 2,832 |
Although couldn't the Boinc server have been told not to hand out any CMS? Ok, although chances are most people won't see that message for a day or so. I usually don't even notice them at all. |
Send message Joined: 21 Mar 11 Posts: 1 Credit: 45,962 RAC: 0 |
My LHC@home has stopped working since the server failed. So it doesn't make a difference. :-/ |
Send message Joined: 29 Aug 05 Posts: 1060 Credit: 7,737,455 RAC: 1,317 |
Although couldn't the Boinc server have been told not to hand out any CMS? Yeah, sure, but I have to do my best. The problem is that when no jobs are available, then the BOINC task will return an error. Your BOINC manager will flag that as an error, and reduce the quota of tasks you can request per day, until this gets down to unity. The quota will only increase when you run a successful task. So if you don't set NNT then your machine gets reduced to one task request per day, until new jobs are available, allowing a task to complete, thus increasing the per-day quota. So, it takes several days to ramp up to full production again if you allow your machine to deplete its quota. |
Send message Joined: 29 Aug 05 Posts: 1060 Credit: 7,737,455 RAC: 1,317 |
|
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 |
So, it takes several days to ramp up to full production again if you allow your machine to deplete its quota. Thanks. It makes sense now. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,589,655 RAC: 2,832 |
Although couldn't the Boinc server have been told not to hand out any CMS? Ok, thanks for explaining in detail. I take it some people run only CMS on their clients? I have Androids that only do Sixtrack (that's all that gets given to them, I assume Androids can't do the others), but my main desktop does any LHC task, and rarely gets CMS, presumably since Theory and Atlas have a lot more tasks available. My smaller three desktops produce a lot of errors when running virtual machine LHC (they're only 8GB RAM with old processors), so I don't use LHC on those. You could email everyone if there's something urgent - we all have the option to tick (or not) the thing that says it's ok for admins to email us. Then we'd see it quicker. |
Send message Joined: 16 Jan 20 Posts: 3 Credit: 7,502 RAC: 0 |
"My smaller three desktops produce a lot of errors when running virtual machine LHC (they're only 8GB RAM with old processors)" 4gb ram on 9 year old cpu still working fine for me with LHC. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,589,655 RAC: 2,832 |
"My smaller three desktops produce a lot of errors when running virtual machine LHC (they're only 8GB RAM with old processors)" 4gb ram on 9 year old cpu still working fine for me with LHC. Which subprojects does it run? I didn't bother selecting, I let them do anything. I got computation errors on about 75% of them, so thought it a waste of time. And what CPUs do you have? One of mine for example is a Q8400 with 8GB DDR2. And what OS? Mine all run Windows 10. |
Send message Joined: 15 Jun 08 Posts: 2534 Credit: 253,851,031 RAC: 37,958 |
You could email everyone if there's something urgent - we all have the option to tick (or not) the thing that says it's ok for admins to email us. Then we'd see it quicker. Are you aware that you can subscribe to MB threads or get them via RSS feed? The way this works can be configured on the MB pages and on this page: https://lhcathome.cern.ch/lhcathome/edit_forum_preferences_form.php It's far more than simply allow an admin to send you a mail. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,589,655 RAC: 2,832 |
You could email everyone if there's something urgent - we all have the option to tick (or not) the thing that says it's ok for admins to email us. Then we'd see it quicker. Sorry, I have no idea what the point of RSS is and have never felt the need to use it. Everything I use emails me when something needs my attention (eg when you replied in this conversation). I really don't want yet another source to check and another program to run. In my life I have four inputs - phonecalls, texts, emails, and physical post (mail). I don't want five. |
©2024 CERN