Message boards :
Number crunching :
Host messing up tons of results
Message board moderation
Author | Message |
---|---|
Send message Joined: 17 Jul 05 Posts: 102 Credit: 542,016 RAC: 0 |
10137504 currently has 13152 inconclusive results and 16 valid ones. |
Send message Joined: 27 Sep 08 Posts: 847 Credit: 691,620,446 RAC: 112,812 |
That computer was the same before, I believe someone was going to talk to the owner. |
Send message Joined: 9 Oct 10 Posts: 77 Credit: 3,671,357 RAC: 0 |
I have two WUs in validation inconclusive because of this host too ... Someone should tell him about these errors :( |
Send message Joined: 10 Sep 08 Posts: 6 Credit: 6,350,253 RAC: 0 |
Now 21303 inconclusive - ridiculous! |
Send message Joined: 29 Nov 13 Posts: 59 Credit: 4,012,100 RAC: 0 |
Yea! Weird thing is he has 0 errored atm! Team AnandTech - WCG, Uni@H, F@H, MW@H, Ast@H, LHC@H, R@H, CPDN, E@H. Main rig - Ryzen 3600, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RTX 3060 Ti 8GB, Win10 64bit 2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64 |
Send message Joined: 12 Jul 11 Posts: 857 Credit: 1,619,050 RAC: 0 |
I'll look at this tomorrow. My IT support has some new scripts (I hope) for monitoring errors. In any case I shall turn off this host if that is the solution. Eric. |
Send message Joined: 12 Jul 11 Posts: 857 Credit: 1,619,050 RAC: 0 |
......and I have just realised that since we are "all" waiting for a 3rd run to get validation.......these additonal runs seem to be scheduled at end of the one million job queue!!!!! This could explain a lot, especially why the return of results seems slow. Eric. |
Send message Joined: 29 Nov 13 Posts: 59 Credit: 4,012,100 RAC: 0 |
I've just realised I've got 6 WUs held up by this host! Looking at his times it's now obvious to me they will be errored results. I've just looked through my pending results & found a user's PC with every single of its 478 tasks errored! This 1 http://lhcathomeclassic.cern.ch/sixtrack/results.php?hostid=9973913 1 of rhurlin's PCs. And all but maybe 1 of this ones! http://lhcathomeclassic.cern.ch/sixtrack/results.php?hostid=10200322 (Kevin Arth) And all of this ones! http://lhcathomeclassic.cern.ch/sixtrack/show_host_detail.php?hostid=10313550 ([AF>Libristes>Gentoo]JujuBickoille) Team AnandTech - WCG, Uni@H, F@H, MW@H, Ast@H, LHC@H, R@H, CPDN, E@H. Main rig - Ryzen 3600, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RTX 3060 Ti 8GB, Win10 64bit 2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64 |
Send message Joined: 12 Jul 11 Posts: 857 Credit: 1,619,050 RAC: 0 |
Your results will not be invalidated, if correct........the other ones will be. Eric. |
Send message Joined: 29 Nov 13 Posts: 59 Credit: 4,012,100 RAC: 0 |
Roger that, just some point latter on I assume? Was mainly just pointing out some other dodgy hosts. Team AnandTech - WCG, Uni@H, F@H, MW@H, Ast@H, LHC@H, R@H, CPDN, E@H. Main rig - Ryzen 3600, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RTX 3060 Ti 8GB, Win10 64bit 2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64 |
Send message Joined: 27 Sep 08 Posts: 847 Credit: 691,620,446 RAC: 112,812 |
You can fix that Eric if you change the settings that we talked about in the other thread, accelerate re-tries I think it was called? |
Send message Joined: 27 Oct 07 Posts: 186 Credit: 3,297,640 RAC: 0 |
You can fix that Eric if you change the settings that we talked about in the other thread, accelerate re-tries I think it was called? And as I said in that other thread - message 26567 - I don't think that accelerating retries would help bring additional tasks forward from the end of the queue (that's where they go), but it would help to make sure they're dealt with quickly and effectively when we do reach them. At least the B1 injection run seems to have a low average runtime, so the queue is shrinking rapidly. |
Send message Joined: 29 Sep 04 Posts: 281 Credit: 11,866,264 RAC: 0 |
I'm starting to see a few ..job_corr_bb..... _2 and _3 resends now so maybe we've got to the tail, of that batch anyway. I would hope that the resends come from the tail of each study rather than going all the way to the end of the whole queue. (These are from ordinary errors, not inconclusives from that rogue host, but might be a sign that we will soon be working through those.) |
Send message Joined: 29 Nov 13 Posts: 59 Credit: 4,012,100 RAC: 0 |
That host now has 'Validation inconclusive (27935)'! You'd think he/she would notice the huge dearth in points by now! I just sent him a PM, see if he replies..... Team AnandTech - WCG, Uni@H, F@H, MW@H, Ast@H, LHC@H, R@H, CPDN, E@H. Main rig - Ryzen 3600, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RTX 3060 Ti 8GB, Win10 64bit 2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64 |
Send message Joined: 27 Sep 08 Posts: 847 Credit: 691,620,446 RAC: 112,812 |
Sorry I thought it put them at the front of queue |
Send message Joined: 29 Sep 04 Posts: 281 Credit: 11,866,264 RAC: 0 |
One would have hoped that any wus requiring to be resent would go to the front of the queue but my earliest invalids have been waiting since 2nd July and have still not been resent to a third contributor. Current batch of longer-running tasks (although good to see) might slow up the resends even more. I had a 15 hour job overnight. |
Send message Joined: 1 Dec 12 Posts: 11 Credit: 5,844,526 RAC: 0 |
I can confirm: I have 55 tasks waiting as "validation inconclusive", most of them if not all from host 10137504. The first of them has been waiting since 2nd of July too. |
Send message Joined: 1 Dec 12 Posts: 11 Credit: 5,844,526 RAC: 0 |
I can confirm: I have 55 tasks waiting as "validation inconclusive", most of them if not all from host 10137504. The first of them has been waiting since 2nd of July too. |
Send message Joined: 29 Nov 13 Posts: 59 Credit: 4,012,100 RAC: 0 |
Hi Eric, that host 10137504 (not the user) is becoming a menace, can't you cut it off? Having asked that, it seems to only have 3 WUs in progress now........ Team AnandTech - WCG, Uni@H, F@H, MW@H, Ast@H, LHC@H, R@H, CPDN, E@H. Main rig - Ryzen 3600, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RTX 3060 Ti 8GB, Win10 64bit 2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64 |
Send message Joined: 12 Jul 11 Posts: 857 Credit: 1,619,050 RAC: 0 |
I have e-mailed and we shall see. Situation now very complicated due to "new" multiple "ERR_RESULT_DOWLOAD" errors. I am watching over weekend and we shall see. Eric. |
©2024 CERN