Message boards : Number crunching : Well That Was Strange
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1114
Credit: 49,502,974
RAC: 4,007
Message 27021 - Posted: 5 Dec 2014, 5:44:33 UTC
Last modified: 5 Dec 2014, 6:17:57 UTC

http://lhcathomeclassic.cern.ch/sixtrack/results.php?hostid=10337530&offset=0&show_names=0&state=5&appid=

Just happened to be sitting here watching these tasks complete and get sent in each one run about 1.5 hours and they end up being *Completed, can't validate*

I have 4 more running and I wonder if this will continue.

This was after just having 29 complete and validated.

edit: the next one ended up *Completed, validation inconclusive*

....and another one of those......guess I will abort the last 2 to save the time for something else.

http://lhcathomeclassic.cern.ch/sixtrack/results.php?hostid=10337530
Volunteer Mad Scientist For Life
ID: 27021 · Report as offensive     Reply Quote
Stick

Send message
Joined: 21 Aug 07
Posts: 46
Credit: 1,503,630
RAC: 0
Message 27022 - Posted: 5 Dec 2014, 15:14:26 UTC - in response to Message 27021.  

ID: 27022 · Report as offensive     Reply Quote
Stick

Send message
Joined: 21 Aug 07
Posts: 46
Credit: 1,503,630
RAC: 0
Message 27023 - Posted: 5 Dec 2014, 21:03:39 UTC - in response to Message 27022.  
Last modified: 5 Dec 2014, 21:04:14 UTC

I have a few more of these now and I noticed that all of them are WU's beginning with w-binl_1_job.B1injection_binl_ and they all failed with the following message: Too many results (may be nondeterministic).
ID: 27023 · Report as offensive     Reply Quote
Uffe F

Send message
Joined: 9 Jan 08
Posts: 66
Credit: 727,923
RAC: 0
Message 27024 - Posted: 5 Dec 2014, 22:36:58 UTC - in response to Message 27023.  

Had 41 errors with the same here.
ID: 27024 · Report as offensive     Reply Quote
Profile Viking69
Avatar

Send message
Joined: 24 Jul 05
Posts: 56
Credit: 5,602,722
RAC: 4
Message 27025 - Posted: 6 Dec 2014, 19:47:19 UTC

I saw a few of these too. I'm not getting any work now. They must have stopped distribution, but they have not posted to the home page since OCT 31st.

Let's crunch for our future.
ID: 27025 · Report as offensive     Reply Quote
m

Send message
Joined: 6 Sep 08
Posts: 116
Credit: 10,927,002
RAC: 2,464
Message 27026 - Posted: 8 Dec 2014, 1:20:03 UTC
Last modified: 8 Dec 2014, 1:39:02 UTC

The failures seem to have the quorum set lower than the initial replication. In some cases tasks returned after the quorum had been met (but are still within the deadline) fail "...can't validate". Similar to what is described here. Some of mine failed like this but in other cases all the results failed. BOINC should try to validate running tasks completed within deadline, even if the quorum has been met - delaying deletion of the canonical result - but this doesn't always seem to work when set up this way.

John.
ID: 27026 · Report as offensive     Reply Quote
Profile Robert Pick

Send message
Joined: 1 Dec 05
Posts: 62
Credit: 11,398,274
RAC: 261
Message 27030 - Posted: 9 Dec 2014, 21:39:41 UTC

We are getting the Can't validate s--t again!!!!!!!
Pick
ID: 27030 · Report as offensive     Reply Quote
Uffe F

Send message
Joined: 9 Jan 08
Posts: 66
Credit: 727,923
RAC: 0
Message 27032 - Posted: 10 Dec 2014, 16:00:09 UTC - in response to Message 27030.  

Same here. Got some new that can't validate.
ID: 27032 · Report as offensive     Reply Quote
Stick

Send message
Joined: 21 Aug 07
Posts: 46
Credit: 1,503,630
RAC: 0
Message 27033 - Posted: 10 Dec 2014, 21:50:47 UTC - in response to Message 27032.  

I've got 3 more now, too.
ID: 27033 · Report as offensive     Reply Quote
Profile Robert Pick

Send message
Joined: 1 Dec 05
Posts: 62
Credit: 11,398,274
RAC: 261
Message 27034 - Posted: 11 Dec 2014, 0:04:07 UTC

Well let's see! You keep sending (w-binl_1_job.B1injection) WU's, I keep running them and sending them back. Then you reject them because TOOOOOOO many crunchers get the right answer!!! It's a very funny game we're playing. There must be a better of doing this, don't ya think. Pick
ID: 27034 · Report as offensive     Reply Quote
USTL-FIL (Lille Fr)

Send message
Joined: 11 Dec 09
Posts: 27
Credit: 236,744,737
RAC: 70
Message 27036 - Posted: 11 Dec 2014, 14:47:36 UTC

Hi!
I have 1500 'can't validate task'....

État: Tous (3894) · En cours (121) · Validation pending (220) · Validation inconclusive (127) · Valide (1958) · Invalide (1464) · Erreur (4)

What can we do???
ID: 27036 · Report as offensive     Reply Quote
Uffe F

Send message
Joined: 9 Jan 08
Posts: 66
Credit: 727,923
RAC: 0
Message 27037 - Posted: 11 Dec 2014, 17:20:10 UTC - in response to Message 27036.  

They just keep coming and wasting power. I think i will put on hold for a little time (until it's fixed), because its a lot of wasted resources. Then I will happely come back and chrunch.
ID: 27037 · Report as offensive     Reply Quote
Stick

Send message
Joined: 21 Aug 07
Posts: 46
Credit: 1,503,630
RAC: 0
Message 27038 - Posted: 11 Dec 2014, 19:19:31 UTC - in response to Message 27037.  

And, compared to when it was first reported, it seems like the problem is getting worse (i.e. it seems that problem WU's are now a higher percentage of all new WU's being issued).

What is very frustrating to me is the lack of any acknowledgement by the project that this issue exists.
ID: 27038 · Report as offensive     Reply Quote
[AF>FAH-Addict.net]toTOW

Send message
Joined: 9 Oct 10
Posts: 77
Credit: 3,671,357
RAC: 0
Message 27039 - Posted: 11 Dec 2014, 22:36:28 UTC
Last modified: 11 Dec 2014, 22:38:02 UTC

All the WUs I got today (18) ended in Invalid status ...

When the last one ended, I was monitoring the machine and I noticed a lot of disk accesses when it was finilizing the works after reaching 100% and before sending the results.
ID: 27039 · Report as offensive     Reply Quote
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 12 Jul 11
Posts: 857
Credit: 1,619,050
RAC: 0
Message 27041 - Posted: 20 Dec 2014, 10:50:45 UTC

I am looking at these; strange indeed.
I suspect a validation problem with the w-
when there are a smaller number of particle pairs.....
Eric.

ID: 27041 · Report as offensive     Reply Quote

Message boards : Number crunching : Well That Was Strange


©2024 CERN