Message boards : Number crunching : No Tasks ???
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Profile Robert Pick

Send message
Joined: 1 Dec 05
Posts: 62
Credit: 11,441,610
RAC: 0
Message 24210 - Posted: 9 Jul 2012, 22:29:12 UTC

I haven't posted since LHC@home started up again. I've been merrily crunching along since 2005! Within the last year I upgraded my old machine to an i7 950. For the last month or so every thing was going smoooooth. This morning I had 18 WU to crunch. As one would finish another would down load, just like clockwork! AND THEN--- no tasks sent---no tasks sent---no tasks sent. That has happened before during all the upgrades being made at your end but never lasted all day long. I also know that an i7 950 is capable of sse2,sse3, sse4,8,13,32,95/ just kidding but I never get any, how come. Hope all these things get worked out. Pick
ID: 24210 · Report as offensive     Reply Quote
Profile Robert Pick

Send message
Joined: 1 Dec 05
Posts: 62
Credit: 11,441,610
RAC: 0
Message 24213 - Posted: 10 Jul 2012, 1:10:24 UTC

Well, while puttsing about I saw my system start to download. As fast as they did they were done. Then the download errors started. I clicked on the transfer button and a half dozen were hung up in there. One click and they downloaded and just as fast they were gone. 30 or so in about 15 sec. .25% download errors. What's up? Pick

ID: 24213 · Report as offensive     Reply Quote
Teilchen

Send message
Joined: 8 Jun 07
Posts: 13
Credit: 250,850
RAC: 0
Message 24216 - Posted: 10 Jul 2012, 7:07:49 UTC

I gst no tasks since yesterday evening.
And 3 tasks hang in download than they become download error.
Greetings.
ID: 24216 · Report as offensive     Reply Quote
Nuadormrac

Send message
Joined: 26 Sep 05
Posts: 85
Credit: 421,130
RAC: 0
Message 24218 - Posted: 10 Jul 2012, 11:47:32 UTC

After seeming to resolve itself last weekend, this problem started up again yeserday, and has been unable to get work since through today. This isn't just 7.0.28, as beings that I've got a new laptop scheduled to arrive in 2 days (only ever built desktops, not laptops) I haven't bothered to update software on a system that's getting replaced/tossed out. This is occuring on 6.12.15 on Windows 7 Pro x64 also....

14,000 + available in queue, no work sent. There's also not much in the queue besides as I've been cleaning the queue out, to wind stuff down. They say tomarrow, but Fedex hasn't shown it yet, so I'm planning on 1 or possibly 2 days left to crunch (I'll check the trackinng latter), but it doesn't take that many hours to crunch these, so....
ID: 24218 · Report as offensive     Reply Quote
Neal Watkins
Avatar

Send message
Joined: 14 Nov 05
Posts: 32
Credit: 715,125
RAC: 0
Message 24219 - Posted: 10 Jul 2012, 13:19:32 UTC

After getting a few WUs, I am now back to zero WUs but there
are ~14k WUs available to send. Whatever it was - it's back...
ID: 24219 · Report as offensive     Reply Quote
Profile Kathryn Tombaugh Weber

Send message
Joined: 12 Sep 11
Posts: 38
Credit: 218,154
RAC: 0
Message 24271 - Posted: 11 Jul 2012, 18:17:11 UTC

I think I am back to normal now. Happily crunching 4 tasks at the moment. It's just been a heck of a week for everybody!

I never had any doubt that the problems would get resolved, and I don't worry about loss of credit because it's all for science anyway. And such great science it is. I'm with this project all the way.
ID: 24271 · Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 6 Jul 06
Posts: 108
Credit: 663,175
RAC: 0
Message 24327 - Posted: 13 Jul 2012, 13:03:24 UTC
Last modified: 13 Jul 2012, 13:04:30 UTC

Strange that after stopping LHC for a week or two due to the wrong applications being downloaded and Linux being so slow compared to Windows, that when I allow tasks I am unable to get any at all.

Server says over 8,000 WUs ready to send.

Keeps saying "Got 0 new tasks", "No tasks sent".

Tried a reset and a detach and reattach but still the same, Linux and Windows.

Oh well I did finally get to the 50,000 milestone the other week, guess I wont be getting to the 100,000 milestone any time soon, never mind I gave it a go.

Conan
ID: 24327 · Report as offensive     Reply Quote
Tex1954

Send message
Joined: 24 Apr 11
Posts: 37
Credit: 1,295,012
RAC: 0
Message 24328 - Posted: 13 Jul 2012, 14:15:17 UTC - in response to Message 24327.  
Last modified: 13 Jul 2012, 14:15:46 UTC

Yup, me too. What was working no longer works... sigh... not even the i7-950 box gets any more tasks...

Have to smile about that! It's a sure sign folks are tweaking the system.

LOL!

:)
ID: 24328 · Report as offensive     Reply Quote
Tex1954

Send message
Joined: 24 Apr 11
Posts: 37
Credit: 1,295,012
RAC: 0
Message 24332 - Posted: 13 Jul 2012, 17:38:30 UTC - in response to Message 24328.  

Woopsy!

Just got a load of (pni) work on the compaq...

Running now!

:)
ID: 24332 · Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 6 Jul 06
Posts: 108
Credit: 663,175
RAC: 0
Message 24346 - Posted: 14 Jul 2012, 0:38:53 UTC

Allowed work fetch again and this time work downloaded.
PNI work units on 3 machines and SSE2 on the other. A number of work units downloaded on the Windows machines but only 1 each for the Linux machines.
I am interested to see how they run as the Linux WUs before were way behind the 8 ball.

Conan
ID: 24346 · Report as offensive     Reply Quote
Tex1954

Send message
Joined: 24 Apr 11
Posts: 37
Credit: 1,295,012
RAC: 0
Message 24349 - Posted: 14 Jul 2012, 0:51:23 UTC - in response to Message 24332.  

Just got another load on the i7-950 system... so looks like things moving along well.

This load was SSE3 and PNI both...

On the Compaq box (running Linux at the moment), the tasks take 4 hours instead of the usual 7 or so... so much improved.

I LIKE what they are doing! The tweaks are improving things all around!

Hip Hip Hurray for smart developers! I bet this project get tuned perfectly very soon!

:)
ID: 24349 · Report as offensive     Reply Quote
Profile Igor Zacharov
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 16 May 11
Posts: 79
Credit: 111,419
RAC: 0
Message 24350 - Posted: 14 Jul 2012, 0:53:23 UTC - in response to Message 24346.  

interesting.

There should be no difference between linux and windows for the #tasks downloaded. Of course, there is an influence of the past results. So, if the linux boxes were much slower, their average turnaround parameter is higher and they are considered "less reliable" by the system and therefore get less work.

This should correct itself over time, as new results come in.


Allowed work fetch again and this time work downloaded.
PNI work units on 3 machines and SSE2 on the other. A number of work units downloaded on the Windows machines but only 1 each for the Linux machines.
I am interested to see how they run as the Linux WUs before were way behind the 8 ball.

Conan

skype id: igor-zacharov
ID: 24350 · Report as offensive     Reply Quote
Tex1954

Send message
Joined: 24 Apr 11
Posts: 37
Credit: 1,295,012
RAC: 0
Message 24353 - Posted: 14 Jul 2012, 1:57:51 UTC
Last modified: 14 Jul 2012, 1:58:43 UTC

Okay, have the new Linux installed on the 800D box with the Sabertooth Mobo... had to update the Realtek driver to get the LAN to work right and now it's humming fine.

It has also downloaded only PNI tasks and they are running fine.

I like running linux since many projects run significantly faster using Linux. For instance, Correlizer@home tasks take 11:35 under windows on this box, but the same tasks take only 9:54 under Linux. Same for many others..

Thanks for all your hard work! Looks like all my systems talking and working well now!

And the SSE3/PNI tasks run almost twice as fast... 7+ hrs old tasks vs. 4hrs SSE3/PNI on the 1055T box (3.25GHz) for instance.. WOOHOO!

:)
ID: 24353 · Report as offensive     Reply Quote
Filipe

Send message
Joined: 9 Aug 05
Posts: 36
Credit: 7,698,293
RAC: 0
Message 24429 - Posted: 24 Jul 2012, 20:53:15 UTC
Last modified: 24 Jul 2012, 20:53:31 UTC

We've got a lot of work to do! Nice to see 50000 tasks in progress and an adicional 20000 to send.

Congratulations.
ID: 24429 · Report as offensive     Reply Quote
Profile ritterm
Avatar

Send message
Joined: 30 May 08
Posts: 93
Credit: 5,160,246
RAC: 0
Message 24430 - Posted: 25 Jul 2012, 1:31:40 UTC - in response to Message 24429.  

We've got a lot of work to do! Nice to see 50000 tasks in progress and an adicional 20000 to send.

Yes, indeed... Great to see the steady work. Keep it coming! :-)

ID: 24430 · Report as offensive     Reply Quote
[AF>FAH-Addict.net]toTOW

Send message
Joined: 9 Oct 10
Posts: 77
Credit: 3,671,357
RAC: 0
Message 24432 - Posted: 25 Jul 2012, 9:11:24 UTC

It didn't last long ... they have been all distributed.
ID: 24432 · Report as offensive     Reply Quote
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 12 Jul 11
Posts: 857
Credit: 1,619,050
RAC: 0
Message 24439 - Posted: 25 Jul 2012, 17:50:16 UTC

Well I am doing my best,but we have some limitations on the
user side at CERN. Also we have to introduce some new elements
(physics) in he next few days, generate an sse4 executable, fix
the MAC executable etc etc.

However, since "you" are delivering the equivalent CPU capacity
of the entire CERN computing centre, but really "only" 50%, since
we run every task twice, I shall make a further effort.
As suggested here on the Message Boards I am submitting work in batches.
I shall try increasing the number of batches.

Incidentally, Igor has just reported some interesting numbers which I
hope he will pass on. Basically, if I remember well, we are rejecting results from
something like 50 hosts out of 25,000 (he will correct), which means I am rather
confident about numeric portability. I am also planning to send a test to these only 50
volunteers. In fact, Igor had the idea of including a test of a few turns in each
Work Unit. If this worked I could then run each case only once, with
perhaps a few random duplicates, in order to double the capacity! :-)
Well it is perhaps a dream. This would be easier than programming GPUs
as SixTrack is Fortran.

However, the main thing at the moment is to document, maintain the quality
of service to both users and volunteers, and get on with the studies of the
various LHC upgrade options. Eric.

ID: 24439 · Report as offensive     Reply Quote
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 12 Jul 11
Posts: 857
Credit: 1,619,050
RAC: 0
Message 24440 - Posted: 25 Jul 2012, 17:50:22 UTC

Well I am doing my best,but we have some limitations on the
user side at CERN. Also we have to introduce some new elements
(physics) in he next few days, generate an sse4 executable, fix
the MAC executable etc etc.

However, since "you" are delivering the equivalent CPU capacity
of the entire CERN computing centre, but really "only" 50%, since
we run every task twice, I shall make a further effort.
As suggested here on the Message Boards I am submitting work in batches.
I shall try increasing the number of batches.

Incidentally, Igor has just reported some interesting numbers which I
hope he will pass on. Basically, if I remember well, we are rejecting results from
something like 50 hosts out of 25,000 (he will correct), which means I am rather
confident about numeric portability. I am also planning to send a test to these only 50
volunteers. In fact, Igor had the idea of including a test of a few turns in each
Work Unit. If this worked I could then run each case only once, with
perhaps a few random duplicates, in order to double the capacity! :-)
Well it is perhaps a dream. This would be easier than programming GPUs
as SixTrack is Fortran.

However, the main thing at the moment is to document, maintain the quality
of service to both users and volunteers, and get on with the studies of the
various LHC upgrade options. Eric.

ID: 24440 · Report as offensive     Reply Quote
Christoph

Send message
Joined: 25 Aug 05
Posts: 69
Credit: 306,627
RAC: 0
Message 24442 - Posted: 25 Jul 2012, 18:03:59 UTC - in response to Message 24439.  

Great news Eric, Thank You!

Now imagine, if you send an EMail to all users (or all inactive last 30 days) and only half of the currently inactive are coming back.......

I guess you should increase the available discspace before that message to not run out of it again ;-)

AND you get it to produce a GPU app. (Dreaming on? Or not?)
Christoph
ID: 24442 · Report as offensive     Reply Quote
Profile ritterm
Avatar

Send message
Joined: 30 May 08
Posts: 93
Credit: 5,160,246
RAC: 0
Message 24443 - Posted: 25 Jul 2012, 18:20:20 UTC - in response to Message 24440.  

Well I am doing my best...I shall make a further effort...I shall try increasing the number of batches.

Thanks for all the hard work, Eric. I'm just glad to see the significant increase in work recently and happy to crunch away whenever the tasks are available. :-)

Cheers,

MarkR

ID: 24443 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Number crunching : No Tasks ???


©2024 CERN