Message boards : Theory Application : Error 206 EXIT_INIT_FAILURE
Message board moderation

To post messages, you must log in.

AuthorMessage
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2386
Credit: 222,957,255
RAC: 136,927
Message 30263 - Posted: 10 May 2017, 6:03:16 UTC

During the last days my hosts had repeatedly problems getting jobs from inside Theory VMs.
After the project's rollback to version 262.70 I reset my local client.

After the reset CMS and LHCb run as expected.
ATLAS had a download problem for a few minutes this morning on one of my hosts that was resolved automatically.


The problems regarding Theory persist.
WUs start and pass the HTCondor ping but get no jobs and therefore end up in a "206 (0x000000CE) EXIT_INIT_FAILURE":
https://lhcathome.cern.ch/lhcathome/result.php?resultid=138397873
https://lhcathome.cern.ch/lhcathome/result.php?resultid=138397801
https://lhcathome.cern.ch/lhcathome/result.php?resultid=138397560

Be so kind as to check whether there is a problem with the VM or with the server setup.
ID: 30263 · Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer

Send message
Joined: 20 Jun 14
Posts: 372
Credit: 238,712
RAC: 0
Message 30269 - Posted: 10 May 2017, 8:35:05 UTC - in response to Message 30263.  

Thanks for the report. The job queue was not being filled fast enough due to a blockage. This has been cleared and jobs should be flowing again.
ID: 30269 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2386
Credit: 222,957,255
RAC: 136,927
Message 30271 - Posted: 10 May 2017, 9:56:44 UTC - in response to Message 30269.  

Thank you, but unfortunately there is no change from my perspective.
Theory WU failed:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=138415490

The following ATLAS WU started as expected.
ID: 30271 · Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer

Send message
Joined: 20 Jun 14
Posts: 372
Credit: 238,712
RAC: 0
Message 30274 - Posted: 10 May 2017, 15:15:20 UTC - in response to Message 30271.  

There was another issue that caused the blockage in the first place.
ID: 30274 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 798
Credit: 644,773,333
RAC: 231,805
Message 30293 - Posted: 11 May 2017, 17:28:41 UTC

I think its come back?
ID: 30293 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2386
Credit: 222,957,255
RAC: 136,927
Message 30295 - Posted: 11 May 2017, 17:40:49 UTC
Last modified: 11 May 2017, 17:50:57 UTC

The issues regarding Theory's task distribution seem to be not yet solved.

One of my WUs had an unusual short runtime and the following WUs did not get a task.
As a result there were again EXIT_INIT_FAILUREs.

short WU:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=138911237

following WUs:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=139075211
https://lhcathome.cern.ch/lhcathome/result.php?resultid=139094803

I'll uncheck Theory until the errors are sorted out.

<edit>Got an LHCb now that started without problems.</edit>
ID: 30295 · Report as offensive     Reply Quote
Rasputin42

Send message
Joined: 26 Dec 09
Posts: 10
Credit: 1,192,862
RAC: 0
Message 30296 - Posted: 11 May 2017, 17:47:52 UTC - in response to Message 30295.  

+1
ID: 30296 · Report as offensive     Reply Quote
Profile Laurence
Project administrator
Project developer

Send message
Joined: 20 Jun 14
Posts: 372
Credit: 238,712
RAC: 0
Message 30299 - Posted: 12 May 2017, 7:15:58 UTC - in response to Message 30296.  

Unblocking.
ID: 30299 · Report as offensive     Reply Quote

Message boards : Theory Application : Error 206 EXIT_INIT_FAILURE


©2024 CERN