Message boards : News : No RESULTS accepted from Linux Kernel 4.8.*



AuthorMessage
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31563 - Posted: 23 Jul 2017, 18:05:22 UTC - in response to Message 31561.  

I have banned this host with reluctance.

I had a look at the validator log,
1980483 valid results and 159343 invalids,
very roughly 8% invalid.
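As a quick check of the "very roughly 8%" figure, the two counts from the validator log can be divided either way. This is only a sketch of the arithmetic; the variable names are made up for illustration.

```python
# Counts quoted from the validator log above.
valid = 1_980_483
invalid = 159_343

# Two ways of reading "8% invalid": as a share of all validated
# results, or relative to the number of valid results.
share_of_all = invalid / (valid + invalid)
ratio_to_valid = invalid / valid

print(f"invalid share of all results: {share_of_all:.1%}")
print(f"invalid relative to valid:    {ratio_to_valid:.1%}")
```

The share of all results comes out at about 7.4%, and the ratio of invalids to valids at about 8.0%, so "very roughly 8%" holds either way.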

I made a list of hosts with invalids.

The hostid 9841071 is 28th on the list.
This host has 279 valids and 674 invalids.

I can't just ban them all!

Here are the sorted counts for the first 20 hosts with invalids:
count hostid
33722 10388131
19209 10487841
18677 10486162
15130 10485156
11890 10452223
8421 10480022
5223 10484659
4548 10484606
4224 10486251
4200 10483458
2631 10481907
2284 10484752
2137 10487436
2019 10454365
1852 10484503
1789 10485912
1721 10484663
1615 10485911
1607 10455705
1550 10487212
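
A list like the one above could be produced by tallying invalid results per host and sorting by count. The sketch below assumes the log has already been reduced to (hostid, outcome) pairs; that record shape, the function name and the sample host IDs are all hypothetical, since the real validator log format is not shown in this thread.

```python
from collections import Counter


def rank_hosts_by_invalids(records, top=20):
    """Return [(count, hostid), ...] for the hosts with the most invalids.

    `records` is an iterable of (hostid, outcome) pairs, an assumed
    pre-parsed form of the log, not the real log format.
    """
    counts = Counter(hostid for hostid, outcome in records if outcome == "invalid")
    return [(n, hostid) for hostid, n in counts.most_common(top)]


# Made-up records, for illustration only.
sample = [
    (101, "invalid"), (101, "invalid"), (101, "valid"),
    (102, "invalid"), (103, "valid"),
]

print("count hostid")
for count, hostid in rank_hosts_by_invalids(sample):
    print(count, hostid)
```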

The 91st host on the list has 10 invalids BUT 110 valids!
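
The raw count of invalids says little without the number of valids next to it. A minimal sketch, using only the two hosts whose figures are quoted in this post (the function name is made up):

```python
def invalid_fraction(valid, invalid):
    """Fraction of a host's validated results that were invalid."""
    return invalid / (valid + invalid)


# (valids, invalids) as quoted in this post.
hosts = {
    "host 9841071": (279, 674),
    "91st host on the list": (110, 10),
}

for label, (valid, invalid) in hosts.items():
    print(f"{label}: {invalid_fraction(valid, invalid):.0%} invalid")
```

That is about 71% invalid for host 9841071 against about 8% for the 91st host, which is why banning by position on the list alone would be too blunt.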

Of the total of 787 hosts with invalids, 75 are already banned.

It is vital that I prioritise finding and solving the reason
for the invalids rather than just making ad hoc bans!

I am now trying, in desperation, to delete all existing results
from banned hosts or from 2015 and earlier...
I SEEM to have deleted too much though...
Ah well we shall see. Easy to resubmit.
The project status looks really bad, and I await the Project Management
Status. We shall see.
Eric.





OK, maybe my list is incomplete. This host is already banned though. Eric.

This one is not (yet) banned, but does not seem trustworthy: Host 9841071

The host mentioned is still getting tasks, or getting tasks again, and is producing only inconclusives, invalids and errors.

State: All (2145) · In progress (0) · Validation pending (518) · Validation inconclusive (1039) · Valid (1) · Invalid (285) · Error (302)

ID: 31563
Olivier Fehr

Joined: 1 Jun 17
Posts: 9
Credit: 964,242
RAC: 0
Message 31564 - Posted: 23 Jul 2017, 18:22:45 UTC - in response to Message 31563.  

I have had a couple of tasks 'aborted by project' today, also on Windows and Ubuntu hosts. Both worked fine (I thought) before. I also have two Fedora servers (one 25 and one 26) that I am struggling to make VBox work on. Tasks on CentOS 7.3, on the other hand, are running just fine; the Windows and Ubuntu boxes were the ones with tasks cancelled. Did anything change lately or is it me?
ID: 31564
Robert Pick

Joined: 1 Dec 05
Posts: 61
Credit: 7,313,757
RAC: 6,635
Message 31565 - Posted: 23 Jul 2017, 18:25:27 UTC
Last modified: 23 Jul 2017, 18:27:29 UTC

This may not belong here but I don't see anywhere else to put it. I just had all of my work units aborted from all four of my machines. Well over 100 WU's! What gives?
ID: 31565
White Mountain Wes

Joined: 1 Jan 09
Posts: 32
Credit: 895,135
RAC: 508
Message 31567 - Posted: 23 Jul 2017, 18:33:11 UTC - in response to Message 31565.  

This may not belong here but I don't see anywhere else to put it. I just had all of my work units aborted from all four of my machines. Well over 100 WU's! What gives?

Me too.
ID: 31567
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist

Joined: 29 Aug 05
Posts: 621
Credit: 4,693,772
RAC: 1,888
Message 31570 - Posted: 23 Jul 2017, 19:01:10 UTC - in response to Message 31567.  
Last modified: 23 Jul 2017, 19:02:02 UTC

No idea if it's connected, but I'm seeing strange behaviour in one of my monitors. Since about 1645 UTC there has been a noticeable up-tick in test4theory jobs (945->1431), lhcbpilot (345->511) and CMS (654->791). Maybe someone with a large compute farm has joined the project?
ID: 31570
Crystal Pellet
Volunteer moderator
Volunteer tester

Joined: 14 Jan 10
Posts: 750
Credit: 6,028,092
RAC: 700
Message 31571 - Posted: 23 Jul 2017, 19:05:35 UTC - in response to Message 31563.  

ivan wrote:

I SEEM to have deleted too much though...
Ah well we shall see. Easy to resubmit.
The project status looks really bad, and I await the Project Management
Status. We shall see.
Eric.

Is this the reason several users have seen task aborts by the project? Also for VM-subprojects.
ID: 31571
MAGIC Quantum Mechanic

Joined: 24 Oct 04
Posts: 852
Credit: 37,599,441
RAC: 25,196
Message 31580 - Posted: 23 Jul 2017, 20:44:33 UTC

Another of the CERN servers has been down for 4 hours.

I got on the message board there for a couple of minutes, but it still will not send/receive.
Volunteer Mad Scientist For Life
ID: 31580
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31581 - Posted: 23 Jul 2017, 21:11:35 UTC - in response to Message 31571.  

Well, I am still checking; I don't see why active tasks should have been deleted.
They are still in the database. There may be another problem.
It is difficult to be sure, as both the Project Status and Project
Management pages are being updated very, very slowly.
More will be clear tomorrow morning and I shall post more news
soonest. Eric.
ID: 31581
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 562
Credit: 349,699,765
RAC: 363,987
Message 31586 - Posted: 23 Jul 2017, 22:11:50 UTC - in response to Message 31571.  

My super old WU's from 2013 are gone now, so you did what you wanted to, in addition.
ID: 31586
Harri Liljeroos

Joined: 28 Sep 04
Posts: 374
Credit: 20,108,005
RAC: 20,541
Message 31588 - Posted: 23 Jul 2017, 22:33:59 UTC

My "Validation inconclusive" Work units don't have the third task anymore which used to show "Unsent". Here's an example https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=72699356
ID: 31588
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31595 - Posted: 24 Jul 2017, 12:14:04 UTC - in response to Message 31588.  

Interesting. I shall issue a full report shortly.
I believe this WU (and others) had passed its
"sell by" date: "deadline passed" or something.
Very useful feedback. Eric.

My "Validation inconclusive" Work units don't have the third task anymore which used to show "Unsent". Here's an example https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=72699356

ID: 31595
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31596 - Posted: 24 Jul 2017, 12:15:44 UTC - in response to Message 31586.  

Well that's good. Nobody else was going to do anything about them! :-)
Pity about side effects, but full report to follow. Eric.

My super old WU's from 2013 are gone now, so you did what you wanted to, in addition.

ID: 31596
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31597 - Posted: 24 Jul 2017, 12:18:25 UTC - in response to Message 31567.  

Sorry about that; they will be back under another WU. I am looking into all that.
I currently think they had exceeded their report_deadline.
Full report to follow. Eric.

This may not belong here but I don't see anywhere else to put it. I just had all of my work units aborted from all four of my machines. Well over 100 WU's! What gives?

Me too.

ID: 31597
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31598 - Posted: 24 Jul 2017, 12:21:20 UTC - in response to Message 31571.  

I am afraid so. I should have limited my actions to SixTrack only, with an
AND clause of some kind. My current thinking is that, as a side effect, something
(the Assimilator??) deleted tasks which had passed their deadline.
Full report to follow. Eric.
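
The idea of restricting a clean-up to SixTrack with an extra AND condition can be sketched on a toy table. Everything here is an assumption made for illustration: the table layout, the column names, the application id and the host id are hypothetical, this is not the project's real database schema, and it is not the statement that was actually run.

```python
import sqlite3

# Hypothetical values, chosen only for this toy example.
SIXTRACK_APPID = 1
BANNED_HOSTID = 502
CUTOFF_YEAR = 2015

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE result (id INTEGER PRIMARY KEY, appid INTEGER, "
    "hostid INTEGER, create_year INTEGER)"
)
conn.executemany(
    "INSERT INTO result VALUES (?, ?, ?, ?)",
    [
        (1, 1, 500, 2013),  # old SixTrack result: should be deleted
        (2, 1, 501, 2017),  # recent SixTrack result, host not banned: keep
        (3, 2, 500, 2013),  # old result of another application: keep
        (4, 2, 502, 2017),  # banned host, but another application: keep
        (5, 1, 502, 2017),  # SixTrack result from the banned host: should be deleted
    ],
)

# The parentheses matter: without them AND binds tighter than OR, and the
# application filter would apply to the date condition only.
where = "(hostid = ? OR create_year <= ?) AND appid = ?"
params = (BANNED_HOSTID, CUTOFF_YEAR, SIXTRACK_APPID)

# Dry run first: list what would go before deleting anything.
would_delete = [row[0] for row in conn.execute(
    f"SELECT id FROM result WHERE {where} ORDER BY id", params)]
print("would delete:", would_delete)

conn.execute(f"DELETE FROM result WHERE {where}", params)
remaining = [row[0] for row in conn.execute("SELECT id FROM result ORDER BY id")]
print("remaining:", remaining)
```

Running the same WHERE clause as a SELECT before the DELETE is a cheap way to see whether results from other applications would be caught.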

ivan wrote:

I SEEM to have deleted too much though...
Ah well we shall see. Easy to resubmit.
The project status looks really bad, and I await the Project Management
Status. We shall see.
Eric.

Is this the reason several users have seen task aborts by the project? Also for VM-subprojects.

ID: 31598
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31599 - Posted: 24 Jul 2017, 12:23:20 UTC - in response to Message 31564.  

I can't help with VBox. I am trying to find out why Tasks were aborted.
Full report to follow. I did NOT delete them but it was surely a side effect of
deleting rubbish going back to 2013.

I have had a couple of tasks 'aborted by project' today, also on Windows and Ubuntu hosts. Both worked fine (I thought) before. I also have two Fedora servers (one 25 and one 26) that I am struggling to make VBox work on. Tasks on CentOS 7.3, on the other hand, are running just fine; the Windows and Ubuntu boxes were the ones with tasks cancelled. Did anything change lately or is it me?

ID: 31599
Olivier Fehr

Joined: 1 Jun 17
Posts: 9
Credit: 964,242
RAC: 0
Message 31605 - Posted: 24 Jul 2017, 19:31:42 UTC - in response to Message 31599.  

Don't worry about Vbox - I think I have that figured out now. I was just wondering yesterday if the aborted tasks were due to my configuration changes or something else, but that question has been answered now. Thanks for your efforts.
ID: 31605
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 851
Credit: 1,616,240
RAC: 66
Message 31614 - Posted: 25 Jul 2017, 4:05:07 UTC - in response to Message 31605.  

Thanks for your understanding. I goofed. It was MY fault. I aborted
the Tasks, not so much because of MySQL as because of my lack of understanding
of the Server states.

Don't worry about Vbox - I think I have that figured out now. I was just wondering yesterday if the aborted tasks were due to my configuration changes or something else, but that question has been answered now. Thanks for your efforts.

ID: 31614
Dirk Broer

Joined: 20 Sep 05
Posts: 29
Credit: 870,485
RAC: 117
Message 31886 - Posted: 7 Aug 2017, 22:34:53 UTC

No RESULTS accepted from Linux Kernel 4.8.*

No RESULTS accepted from Linux Kernel 4.10.* either.....
ID: 31886
Dirk Broer

Joined: 20 Sep 05
Posts: 29
Credit: 870,485
RAC: 117
Message 32238 - Posted: 5 Sep 2017, 0:09:55 UTC - in response to Message 31886.  

No RESULTS accepted from Linux Kernel 4.8.*

No RESULTS accepted from Linux Kernel 4.10.* either.....


Ah! You will accept results from Kernel 4.10.*!
It is just that merely updating the kernel from 4.8.* to 4.10.* is not enough;
you have to detach and re-attach.
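
For readers unsure what a pattern like 4.8.* covers, here is a purely illustrative sketch using shell-style matching. The function name and the kernel release strings are made up, and how the project server actually applies the restriction is not stated in this thread.

```python
from fnmatch import fnmatchcase


def kernel_matches(release, pattern):
    """True if a kernel release string matches a shell-style pattern."""
    return fnmatchcase(release, pattern)


# Example release strings, for illustration only.
for release in ["4.8.0-58-generic", "4.10.0-28-generic", "4.4.0-83-generic"]:
    print(release, kernel_matches(release, "4.8.*"))
```

Only the first string matches 4.8.*; a 4.10 kernel does not, because the pattern requires the literal prefix "4.8.".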
ID: 32238



©2019 CERN