Message boards :
News :
Many queued tasks - server status page erratic
Message board moderation
Author | Message |
---|---|
Send message Joined: 15 Jul 05 Posts: 242 Credit: 5,800,306 RAC: 0 |
Due to the very high number of queued Sixtrack tasks, we have enabled 4 load-balanced scheduler/feeder servers to handle the demand. (Our bottleneck is the database, but several schedulers can cache more tasks to be dispatched.) Our server status page does not currently show in real time the daemon status on remote servers. Hence the server status page may indicate a varying number of processes, depending on which web server is active. Please also be patient if you are not getting tasks for your preferred application quickly enough. After a few retries, there will be some tasks. Thanks for your understanding and happy crunching! ---the team |
Send message Joined: 2 May 07 Posts: 2101 Credit: 159,817,517 RAC: 132,770 |
Hi Nils, there are Atlas Dowonload Error now mostly three at a time for one PC. Is there a possibility to find a way to reduce this? https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=122732033 |
Send message Joined: 15 Jul 05 Posts: 242 Credit: 5,800,306 RAC: 0 |
Not sure what causes this. Our 3 file servers are fine and not with much load. It could be a local ISP issue too. |
Send message Joined: 15 Jun 08 Posts: 2413 Credit: 226,471,672 RAC: 131,976 |
I doubt that it's an ISP issue. At least in the past those ATLAS download errors were typically caused by faulty WU batches. |
Send message Joined: 2 May 07 Posts: 2101 Credit: 159,817,517 RAC: 132,770 |
Not sure what causes this. Our 3 file servers are fine and not with much load. It could be a local ISP issue too. After 5:30 UTC today, no more download Error for Atlas so long. |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,877,813 RAC: 121,583 |
Nils Høimyr wrote on Aug. 21: ...Nils, when will be back to "normal" ? |
Send message Joined: 2 May 07 Posts: 2101 Credit: 159,817,517 RAC: 132,770 |
Erich, be patient and wait after this 3.0 Mio. Tasks are in the past. You can see this on the server status-page. |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,877,813 RAC: 121,583 |
Erich, be patient and wait after this 3.0 Mio. Tasks are in the past.well, we are below 3 Mio tasks now, and the situation has not changed :-( It is rather laborius to have to manually retry for 20 or 30 minutes until finally new tasks are being downloaded. I am questioning the rationale behind loading over 3 Mio Sixtrack tasks on the server, if - in turn - this blocks all other sub-projects. |
Send message Joined: 2 May 07 Posts: 2101 Credit: 159,817,517 RAC: 132,770 |
We have to wait if ALL tasks (3 Mio.) are finished and the system went to normally. So be patient.... Scheduler boincai12 feeder boincai12 Download server lhcathome-upload Upload server lhcathome-upload This four Server are new to do so many work. |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,877,813 RAC: 121,583 |
We have to wait if ALL tasks (3 Mio.) are finishedyour are not kidding, are you? |
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 |
I just attached a machine, and can't get native ATLAS or Theory. EDIT: But I switched that machine to SixTrack too, and it immediately picked up work. So the moral is simple. If they want SixTrack, do SixTrack. But another machine is getting a steady supply of SixTrack. So once you are in, you are in. |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,877,813 RAC: 121,583 |
I am not surprised that it's NOT a problem to download Sixtrack. But the problem exists with all other sub-projects. On one of my machines, this morning I first tried about 20 minutes, without success, and then almost 30 minutes, again no success. Just now, I tried some 10 minutes, nothing. Who has the time to sit there for hours? A big part of these almost three million Sixtrack tasks should be transferred away from the download server and interimely parked somewhere else. What sense does it make that non of the other subtasks now cannot be downloaded for quite some time ahead? |
Send message Joined: 2 May 07 Posts: 2101 Credit: 159,817,517 RAC: 132,770 |
We have to wait if ALL tasks (3 Mio.) are finishedyour are not kidding, are you? Erich, you have more than 100k Cobblestones per day! |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,877,813 RAC: 121,583 |
Erich, you have more than 100k Cobblestones per day!no, about half is the correct figure. But what has this to do with this nonsense of swamping the server with 3 Mio Sixtrack tasks, so that, as a consequence, all other subtasks get stuck ? Can anyone explain that to me? |
Send message Joined: 2 May 07 Posts: 2101 Credit: 159,817,517 RAC: 132,770 |
Erich, this stats is your in Boinc-stats for today from LHCatHome: Cobblestones 143,734 Duration per day 51,976 Let us crunch and do the work they give us. Cern-IT know how to handle so many tasks. I have a lot of respect for their work. |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,877,813 RAC: 121,583 |
Cern-IT know how to handle so many tasks.in this case: definitely not, sorry to say that :-( |
Send message Joined: 24 Oct 04 Posts: 1127 Credit: 49,751,535 RAC: 8,764 |
8/24/2019 2:36:39 PM | LHC@home | Not requesting tasks: too many uploads in progress First time I have ever seen that and I can't send or receive in the last 6 hours |
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 |
8/24/2019 2:36:39 PM | LHC@home | Not requesting tasks: too many uploads in progress If it is the first time that you have seen that, then it is probably the first time in the history of the world. I am going to give it a break. |
Send message Joined: 24 Oct 04 Posts: 1127 Credit: 49,751,535 RAC: 8,764 |
Thanks for that important reply Jim Btw I finally got all of mine to return to Cern and got all the new ones to d/l |
Send message Joined: 18 Dec 15 Posts: 1688 Credit: 103,877,813 RAC: 121,583 |
I am not surprised that it's NOT a problem to download Sixtrack.for how long will this bad situation still last? I have been trying to download ATLAS tasks for more than 1 hour now, sitting here and pushing the "update" button every 5 seconds. This is more than annoying by now :-( |
©2024 CERN