Message boards : Number crunching : VM Job Unmanageable?

mlcasmey

Joined: 30 Mar 17
Posts: 8
Credit: 974,885
RAC: 0
Message 30166 - Posted: 2 May 2017, 21:59:33 UTC

I keep getting a "Postponed: VM job unmanageable, restart later" message on the Theory Simulation tasks that one of my machines is running. If possible, I'd rather not change my preferences to exclude this task type just because one machine is having problems. Any ideas?

Mike.
ID: 30166
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 625
Credit: 392,375,326
RAC: 163,351
Message 30171 - Posted: 3 May 2017, 6:14:23 UTC

When I keep the CPU load at 50% on a machine, the error seems to occur less often.
ID: 30171
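
For anyone who would rather set the 50% limit on a single machine than in the web preferences, here is a minimal sketch. It assumes the BOINC client reads a local `global_prefs_override.xml` file from its data directory and that `max_ncpus_pct` ("use at most N% of the CPUs") is the setting meant above. The post does not say whether it refers to this or to the separate CPU-time limit, so treat the choice as an assumption. The function name `build_cpu_override` is made up for illustration, and the script only prints the XML; it does not touch any BOINC files.

```python
import xml.etree.ElementTree as ET


def build_cpu_override(max_ncpus_pct=50.0):
    """Return the text of a preferences override limiting the share of CPUs used.

    Assumption: <max_ncpus_pct> inside <global_preferences> is the element
    the client reads for "use at most N% of the CPUs".
    """
    root = ET.Element("global_preferences")
    ET.SubElement(root, "max_ncpus_pct").text = f"{max_ncpus_pct:.1f}"
    return ET.tostring(root, encoding="unicode")


# Print the fragment so it can be reviewed before saving it anywhere.
print(build_cpu_override(50.0))
```

If the output looks right, it could be saved as `global_prefs_override.xml` in the BOINC data directory, after which the client has to re-read its local preferences (or be restarted). Check for an existing override file first, since this fragment would replace any other local settings.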
mlcasmey

Joined: 30 Mar 17
Posts: 8
Credit: 974,885
RAC: 0
Message 30186 - Posted: 4 May 2017, 1:01:19 UTC - in response to Message 30171.  

Thanks. I'll try lowering my CPU usage to 50% in my preferences.

Mike.

It seems like the VM causes all kinds of problems. Is there a reason why LHC chooses to create tasks that require it?
ID: 30186
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 625
Credit: 392,375,326
RAC: 163,351
Message 30188 - Posted: 4 May 2017, 5:55:52 UTC

The scientific platform at CERN is Linux, and they didn't want to drain resources from the science team to write applications for all platforms. With the VM they can just use the applications they already have.
ID: 30188
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 750
Credit: 5,688,626
RAC: 0
Message 30196 - Posted: 4 May 2017, 9:12:50 UTC - in response to Message 30188.  

Yes. There is overhead, but overall it's been judged that this is outweighed by the simplification of the development and maintenance of the software.
ID: 30196
Lluis

Joined: 17 Sep 04
Posts: 2
Credit: 562,103
RAC: 0
Message 30211 - Posted: 4 May 2017, 16:18:30 UTC

I just updated my BOINC manager to 7.6.33 and VM to 5.0.18 (automatic installation) and I keep getting a "VM Hypervisor failed to enter an online state in a timely fashion" in all jobs with VM, even though I've changed my preferences (a little bit randomly!).
ID: 30211
Lluis

Joined: 17 Sep 04
Posts: 2
Credit: 562,103
RAC: 0
Message 30212 - Posted: 4 May 2017, 16:19:26 UTC
Last modified: 4 May 2017, 16:21:14 UTC

Any help welcome !!!
ID: 30212
keputnam

Joined: 27 Sep 04
Posts: 83
Credit: 2,506,880
RAC: 1,205
Message 30213 - Posted: 4 May 2017, 16:22:39 UTC - in response to Message 30166.  

If all else fails, you could use a different "location" for the one troublesome machine, one that excludes that type of work in its preferences.
ID: 30213
maeax

Joined: 2 May 07
Posts: 1096
Credit: 37,087,481
RAC: 17,992
Message 30215 - Posted: 4 May 2017, 16:56:38 UTC - in response to Message 30211.  
Last modified: 4 May 2017, 16:59:31 UTC

VM to 5.0.18 (automatic installation).


Take a look at virtualbox.org. The current version is 5.1.22.
5.0.18 is too old. Don't forget the Extension Pack!
ID: 30215
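
A quick way to check whether an installed VirtualBox is older than the 5.1.22 mentioned above is to compare version numbers. The following is only a sketch: the helper names `parse_vbox_version` and `is_older_than` are invented for illustration, and it assumes `VBoxManage --version` prints something like `5.1.22r115126` and that `VBoxManage` is on the PATH (on Windows it often is not, in which case the script just says so).

```python
import re
import shutil
import subprocess


def parse_vbox_version(text):
    """Extract (major, minor, patch) from a string such as '5.1.22r115126'."""
    match = re.match(r"\s*(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised version string: {text!r}")
    return tuple(int(part) for part in match.groups())


def is_older_than(installed, reference):
    """True if the installed version sorts before the reference version."""
    return parse_vbox_version(installed) < parse_vbox_version(reference)


if __name__ == "__main__":
    exe = shutil.which("VBoxManage")  # assumption: the tool is on the PATH
    if exe is None:
        print("VBoxManage not found on PATH")
    else:
        result = subprocess.run([exe, "--version"], capture_output=True, text=True)
        reported = result.stdout.strip()
        try:
            print(reported, "is older than 5.1.22:", is_older_than(reported, "5.1.22"))
        except ValueError as err:
            print("Could not interpret the reported version:", err)
```

Comparing tuples of integers rather than raw strings avoids mistakes such as treating "5.0.9" as newer than "5.0.18".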
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 750
Credit: 5,688,626
RAC: 0
Message 30220 - Posted: 5 May 2017, 0:05:18 UTC - in response to Message 30213.  
Last modified: 5 May 2017, 0:06:13 UTC

If all else fails, you could use a different "location" for the one troublesome machine, one that excludes that type of work in its preferences.

Yes, don't forget you have four different locations you can set your machines to, according to which projects you want them to run: home, work, school, and default. That's almost enough for anyone... :-)
ID: 30220
mlcasmey

Joined: 30 Mar 17
Posts: 8
Credit: 974,885
RAC: 0
Message 30221 - Posted: 5 May 2017, 1:20:51 UTC

Might sound like a dumb question, but once I have the latest version of VM loaded do I need to go into the program and set up a virtual machine and give it memory.....

Mike.
ID: 30221
mlcasmey

Joined: 30 Mar 17
Posts: 8
Credit: 974,885
RAC: 0
Message 30222 - Posted: 5 May 2017, 1:24:07 UTC - in response to Message 30221.  

All I have when I open up the software is a whole list of error messages saying BOINC..... inaccessible, with the following:

Runtime error opening 'C:\ProgramData\BOINC\slots\2\boinc_b57df568436cbc9e\boinc_b57df568436cbc9e.vbox' for reading: -103(Path not found.).
F:\tinderbox\win-5.1\src\VBox\Main\src-server\MachineImpl.cpp[745] (long __cdecl Machine::i_registeredInit(void)).
Result Code:
E_FAIL (0x80004005)
Component:
MachineWrap
Interface:
IMachine {b2547866-a0a1-4391-8b86-6952d82efaa0}
ID: 30222
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 750
Credit: 5,688,626
RAC: 0
Message 30238 - Posted: 5 May 2017, 20:39:19 UTC - in response to Message 30221.  

Might sound like a dumb question, but once I have the latest version of VM loaded do I need to go into the program and set up a virtual machine and give it memory.....

Mike.

No, the VM image is downloaded from the project server.
ID: 30238
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 750
Credit: 5,688,626
RAC: 0
Message 30240 - Posted: 5 May 2017, 20:42:13 UTC - in response to Message 30222.  

All I have when I open up the software is a whole list of error messages saying BOINC..... inaccessible, with the following:

Runtime error opening 'C:\ProgramData\BOINC\slots\2\boinc_b57df568436cbc9e\boinc_b57df568436cbc9e.vbox' for reading: -103(Path not found.).
F:\tinderbox\win-5.1\src\VBox\Main\src-server\MachineImpl.cpp[745] (long __cdecl Machine::i_registeredInit(void)).
Result Code:
E_FAIL (0x80004005)
Component:
MachineWrap
Interface:
IMachine {b2547866-a0a1-4391-8b86-6952d82efaa0}

It sounds like your BOINC has got confused, and is looking for an old task that has been deleted somehow. At this point it's probably best to do a project reset.
ID: 30240
Terrible T

Joined: 1 Nov 05
Posts: 8
Credit: 596,413
RAC: 0
Message 30445 - Posted: 21 May 2017, 11:27:18 UTC - in response to Message 30240.  

Had the same issue. Seeing the 'F:\tinderbox\.......' made me kind of paranoid, as I don't have an F drive. (And I don't WannaCry..)
Did a project reset and removed/reinstalled the VM software, no luck.

Finally found a mention somewhere that the following file might be missing or corrupt:
C:\Users\USERNAME\.VirtualBox\VirtualBox.xml

The VirtualBox.xml file was empty. I replaced it with VirtualBox.xml-prev and
was back up and running.
Looks to me as if a scientist put the wrong VM data file onto the project.
ID: 30445
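
The fix described in the last post (replacing an empty VirtualBox.xml with VirtualBox.xml-prev) could be scripted roughly as follows. This is a sketch under assumptions, not a tested recovery tool: the function name `restore_virtualbox_xml` and the `.broken` backup suffix are made up here, and it presumably should only be run with VirtualBox and BOINC closed. It only acts when VirtualBox.xml is missing or empty and a non-empty VirtualBox.xml-prev exists.

```python
import shutil
from pathlib import Path


def restore_virtualbox_xml(config_dir):
    """Restore a missing or empty VirtualBox.xml from VirtualBox.xml-prev.

    Returns True if the file was restored, False if nothing was changed.
    """
    config_dir = Path(config_dir)
    current = config_dir / "VirtualBox.xml"
    previous = config_dir / "VirtualBox.xml-prev"

    # Leave a non-empty configuration file alone.
    if current.exists() and current.stat().st_size > 0:
        return False

    # Without a usable previous copy there is nothing to restore from.
    if not previous.exists() or previous.stat().st_size == 0:
        return False

    # Keep the empty file for reference (hypothetical backup name).
    if current.exists():
        shutil.copy2(current, config_dir / "VirtualBox.xml.broken")

    shutil.copy2(previous, current)
    return True
```

On the setup described in the post, the directory would be `C:\Users\USERNAME\.VirtualBox`, so a call might look like `restore_virtualbox_xml(Path.home() / ".VirtualBox")`. If BOINC runs under a different account, the directory will be that account's, which is an assumption to verify first.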

©2021 CERN