Message boards :
Number crunching :
not sending out SixTrack
Message board moderation
Author | Message |
---|---|
Send message Joined: 29 Sep 04 Posts: 42 Credit: 11,505,632 RAC: 0 |
As off now there are 336,450 WUs of SixTrack supposed to be ready to be sent out, but my machines don't get any of them. Addendum: Now I am flooded with WUs :) |
Send message Joined: 24 Oct 04 Posts: 1156 Credit: 52,775,698 RAC: 62,554 |
That good old server is doing that with Theory tasks too. The Event Log says 1/21/2017 2:59:42 PM | LHC@home | No tasks are available for Theory Simulation Same with the Atlas tasks. |
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 |
Also no CMS or LHCb. We are out of luck. (But at least I am getting ATLAS from ATLAS@home). |
Send message Joined: 29 Aug 05 Posts: 1048 Credit: 7,498,441 RAC: 7,594 |
Also no CMS or LHCb. We are out of luck. I was wondering about that, given the drop off in the running jobs graph for CMS. Laurence's little cluster hasn't changed its number of jobs, but it's not running BOINC. Both my machines (running -dev as well as production) have tasks, but the differences in timing between tasks being reported and being sent suggests a problem somewhere. I'll go take a look at the Condor server; having tasks doesn't necessarily mean being served jobs, although a lack of tasks usually results in a task timeout after ten minutes. Later: I have jobs for most of my 3x2-core -dev tasks, one has an idle slot. One of my two machines running production has jobs for its two tasks, the other (which is the machine running -dev) has none: [eesridr@pion:~] > condor_status -pool vccondor01.cern.ch|grep 9-1054 slot1@9-1054-23839 LINUX X86_64 Claimed Busy 1.130 3000 0+00:19:05 slot2@9-1054-23839 LINUX X86_64 Claimed Busy 1.090 3000 0+00:19:06 slot1@9-1054-27215 LINUX X86_64 Unclaimed Idle 0.000 3000 0+00:12:31 slot2@9-1054-27215 LINUX X86_64 Claimed Busy 0.770 3000 0+00:12:52 slot1@9-1054-32714 LINUX X86_64 Claimed Busy 1.120 3000 0+00:08:17 slot2@9-1054-32714 LINUX X86_64 Claimed Busy 1.080 3000 0+00:48:48 [eesridr@pion:~] > condor_status -pool vccondor01.cern.ch|grep '14095-' 14095-10412491-270 LINUX X86_64 Claimed Busy 1.890 3000 0+01:05:00 14095-10412491-320 LINUX X86_64 Claimed Busy 2.060 3000 0+01:21:21 I logged onto my two machines. The one running both projects just got a new -dev task, so -dev is serving tasks, but production is telling it there are no jobs available, so the problem seems to be just with production. The machine running production tasks got them 7 or 8 hours ago, so the problem has arisen since then. |
Send message Joined: 14 Jan 10 Posts: 1374 Credit: 9,156,225 RAC: 4,980 |
The title of this thread is: "not sending out SixTrack", but since Sixtrack has 351689 unsent tasks (still increasing) and is sending tasks now, it looks like the project with massive Sixtrack tasks in queue, has troubles with serving tasks to the others sub-projects. |
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 |
I just started getting LHCb again, but still no Theory or CMS. (I don't have SixTrack selected, so don't know). |
Send message Joined: 29 Aug 05 Posts: 1048 Credit: 7,498,441 RAC: 7,594 |
I just started getting LHCb again, but still no Theory or CMS. (I don't have SixTrack selected, so don't know). Quota back-off for assumed computer errors is probably going to stop people getting tasks for a while. I was in touch with the CERN crew and I guess they made some adjustments -- both my production machines are now running two tasks. The CMS job graphs aren't showing great recovery yet, but there is a hint. |
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 |
I am getting both CMS and Theory now, which is nice, since the ATLAS feeder is down. But that is what redundancy is for. |
Send message Joined: 14 Feb 14 Posts: 5 Credit: 17,818,305 RAC: 0 |
I'm getting SixTrack tasks now. Server status says 214K unsent, so it looks like it's resolved and the queue is decreasing.. |
Send message Joined: 19 Feb 08 Posts: 708 Credit: 4,336,250 RAC: 0 |
I am getting mostly LHCb tasks on my Windows 10 PC and SixTrack tasks on my main Linux box, all accompanied by SETI@home and SETI Beta GPU tasks. I have a nVidia GTX 1050 on the Windows PC and a GTX 750 on the Linux box. Tullio |
Send message Joined: 12 Feb 14 Posts: 72 Credit: 4,639,155 RAC: 0 |
There are two ATLAS feeders. The one for the ATLAS@home project attached to at http://atlasathome.cern.ch/ is up according to http://atlasathome.cern.ch/server_status.php, while the feeder for the ATLAS@home project attached to at https://lhcathome.cern.ch/ATLAS/ is down according to https://lhcathome.cern.ch/ATLAS/server_status.php. |
Send message Joined: 29 Sep 04 Posts: 42 Credit: 11,505,632 RAC: 0 |
From time to time the server is very closefisted. More than half a million of sixtrack WUs, but not giving them away. |
Send message Joined: 17 Feb 07 Posts: 86 Credit: 968,855 RAC: 0 |
There are many many tasks available at the moment, but I do not get any. Why? I have not installed the virtual box but I guess there are many sixtracks WU's available as the server page shows and no VB needed. Or is that wrong thinking of me? Greetings from, TJ |
Send message Joined: 19 Feb 08 Posts: 708 Credit: 4,336,250 RAC: 0 |
I am getting many of them on my 32-Bit Linux laptop. Tullio |
Send message Joined: 17 Feb 07 Posts: 86 Credit: 968,855 RAC: 0 |
I am getting many of them on my 32-Bit Linux laptop. That sounds good. So there must be something wrong on my system. But what? Greetings from, TJ |
Send message Joined: 17 Feb 07 Posts: 86 Credit: 968,855 RAC: 0 |
this are the messages: 3/14/2017 5:56:42 PM | LHC@home 1.0 | Requesting new tasks for CPU 3/14/2017 5:56:43 PM | LHC@home 1.0 | Scheduler request completed: got 0 new tasks 3/14/2017 5:56:44 PM | LHC@home 1.0 | No tasks sent 3/14/2017 5:56:54 PM | LHC@home 1.0 | Sending scheduler request: To fetch work. 3/14/2017 5:56:54 PM | LHC@home 1.0 | Requesting new tasks for CPU 3/14/2017 5:56:55 PM | LHC@home 1.0 | Scheduler request completed: got 0 new tasks 3/14/2017 5:56:55 PM | LHC@home 1.0 | No tasks sent 3/14/2017 6:00:44 PM | LHC@home 1.0 | Resetting project 3/14/2017 6:00:46 PM | LHC@home 1.0 | Master file download succeeded 3/14/2017 6:00:51 PM | LHC@home 1.0 | Sending scheduler request: To fetch work. 3/14/2017 6:00:51 PM | LHC@home 1.0 | Requesting new tasks for CPU 3/14/2017 6:00:52 PM | LHC@home 1.0 | Scheduler request completed: got 0 new tasks 3/14/2017 6:00:52 PM | LHC@home 1.0 | No tasks sent and goes so on and on... Greetings from, TJ |
Send message Joined: 24 Apr 06 Posts: 1 Credit: 529,771 RAC: 0 |
I'm getting jobs but they are completing in under 10 seconds. |
Send message Joined: 4 Mar 17 Posts: 22 Credit: 9,852,654 RAC: 6,810 |
T.J. do you use the old project Url? The new: LHC@home https://lhcathome.cern.ch/lhcathome/ |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 198,635,553 RAC: 77,327 |
|
Send message Joined: 17 Feb 07 Posts: 86 Credit: 968,855 RAC: 0 |
Yesterday I saw a lot of tasks to crunch but I did not get any. Today there are even more tasks. But I do not get any so I removed the project and add it again. Then I got only one task that ran fast. And now again no new ones... Somewhere something is wrong I think. Thanks. Edit: when I manually request new tasks I get 8. Greetings from, TJ |
©2024 CERN