Message boards : ATLAS application : ATLAS vbox v2.01
JZD

Joined: 31 Dec 11
Posts: 1
Credit: 6,421,700
RAC: 3,869
Message 46951 - Posted: 28 Jun 2022, 6:51:59 UTC

Hi, the first task went well. Other tasks refuse to continue, with the status "Postponed: VM environment needs to be cleaned up".
https://lhcathome.cern.ch/lhcathome/result.php?resultid=358665589 OK.
https://lhcathome.cern.ch/lhcathome/result.php?resultid=358671171 Error. FATAL: Could not read from the boot medium! System halted. Canceled via GUI.
ID: 46951
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,193,023
RAC: 103,440
Message 46952 - Posted: 28 Jun 2022, 6:59:31 UTC - in response to Message 46950.  

Guess you highlighted the wrong line.
The real issue is this:
Host system reported disk full.

Host?
ID: 46952
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,193,023
RAC: 103,440
Message 46953 - Posted: 28 Jun 2022, 7:02:59 UTC - in response to Message 46951.  

JZD wrote:
Hi, the first task went well. Other tasks refuse to continue - Postponed:VM environment needs to be clean up.
https://lhcathome.cern.ch/lhcathome/result.php?resultid=358665589 OK.
https://lhcathome.cern.ch/lhcathome/result.php?resultid=358671171 Error. FATAL: Could not read from the boot medium! System halted. Canceled via gui.

Saw the same yesterday. Stopped ATLAS for me on Win11 Pro.
ID: 46953
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2386
Credit: 223,043,819
RAC: 136,841
Message 46954 - Posted: 28 Jun 2022, 7:17:06 UTC - in response to Message 46951.  

It looks like your vdi file is still attached, in the VirtualBox media manager, to a VM that doesn't exist any more.
You can:
- ensure no ATLAS task is running
- stop BOINC
- open the VirtualBox media manager and remove the disk entry "ATLAS_vbox_2.01_image.vdi" (but keep the image file!)
- restart BOINC
- resume ATLAS tasks
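
The media manager step above can also be checked from the command line. Below is a minimal Python sketch, assuming `VBoxManage list hdds` prints one `UUID:` and one `Location:` field per registered disk (that layout, and all helper names, are assumptions for illustration). It only locates the entry and builds a `VBoxManage closemedium disk <uuid>` command; without the `--delete` option, that command unregisters the entry and leaves the image file on disk.

```python
import subprocess

IMAGE_NAME = "ATLAS_vbox_2.01_image.vdi"  # disk entry named in the post


def list_hdds():
    """Return the raw text of `VBoxManage list hdds` (VBoxManage must be on PATH)."""
    result = subprocess.run(
        ["VBoxManage", "list", "hdds"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def find_registered_disks(hdds_output, image_name=IMAGE_NAME):
    """Return UUIDs of registered disks whose Location ends with image_name.

    Assumes each disk block contains a 'UUID:' line followed later by a
    'Location:' line; 'Parent UUID:' lines are ignored.
    """
    uuids = []
    current_uuid = None
    for line in hdds_output.splitlines():
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if key == "UUID":
            current_uuid = value
        elif key == "Location" and current_uuid and value.endswith(image_name):
            uuids.append(current_uuid)
    return uuids


def closemedium_commands(uuids):
    """Build unregister commands. No --delete, so the image file is kept."""
    return [["VBoxManage", "closemedium", "disk", uuid] for uuid in uuids]
```

With BOINC stopped, `closemedium_commands(find_registered_disks(list_hdds()))` gives the commands to review and run by hand. Nothing in the sketch executes them automatically.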


You may periodically check the "load average" on the host using "top".
What are the maximum values when ATLAS has set up all threads?
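
As an alternative to watching "top", the same three load averages can be read programmatically. This is a minimal sketch, assuming a Linux or other Unix host (Python's `os.getloadavg()` is not available on Windows); the helper names are hypothetical. Dividing by the core count gives a rough idea of whether the host is oversubscribed once ATLAS has started all its threads.

```python
import os


def load_per_core(load_avg, cores):
    """Scale the 1, 5 and 15 minute load averages by the number of cores."""
    return tuple(round(value / cores, 2) for value in load_avg)


def sample_load():
    """Read the current load averages and core count from the host (Unix only)."""
    cores = os.cpu_count() or 1
    load_avg = os.getloadavg()
    return load_avg, load_per_core(load_avg, cores)
```

Values per core that stay well above 1.0 suggest more runnable threads than cores.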
ID: 46954
David Cameron
Project administrator
Project developer
Project scientist
Joined: 13 May 14
Posts: 387
Credit: 15,314,184
RAC: 0
Message 46957 - Posted: 28 Jun 2022, 9:39:24 UTC

It seems this version is causing many failures, so I have reverted back to v2.00 while we debug the problems.
ID: 46957
Erich56
Joined: 18 Dec 15
Posts: 1686
Credit: 100,486,314
RAC: 104,413
Message 46958 - Posted: 28 Jun 2022, 10:21:56 UTC - in response to Message 46957.  

David wrote:
It seems this version is causing many failures...
Hm, strange. It worked well here on all of my machines (Windows).
ID: 46958
Crystal Pellet
Volunteer moderator
Volunteer tester
Joined: 14 Jan 10
Posts: 1268
Credit: 8,433,416
RAC: 3,056
Message 46959 - Posted: 28 Jun 2022, 10:28:11 UTC - in response to Message 46949.  

This can be on your side as well as on the Cloudflare/CERN side but since many other computers are running fine it's more likely the issue is on your side.

If you run a (Linux) CVMFS client inside your LAN you may manually run "cvmfs_config probe" from that client.
I get all six Probings... OK
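
For anyone who wants to check the probe result in a script, here is a minimal Python sketch. It assumes `cvmfs_config probe` prints one line per repository of the form `Probing /cvmfs/<repo>... OK` (or a different status on failure); that line format and the function name are assumptions, and the repository names in any sample are placeholders.

```python
import re

# One probe line: "Probing <mount point>... <status>"
PROBE_LINE = re.compile(r"^Probing (\S+?)\.\.\. (.+)$")


def parse_probe(output):
    """Map each probed repository path to True if its status is 'OK'."""
    results = {}
    for line in output.splitlines():
        match = PROBE_LINE.match(line.strip())
        if match:
            repo, status = match.groups()
            results[repo] = status.strip() == "OK"
    return results
```

If every value in the returned mapping is True, the client reached all configured repositories.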
ID: 46959
David Cameron
Project administrator
Project developer
Project scientist
Joined: 13 May 14
Posts: 387
Credit: 15,314,184
RAC: 0
Message 46960 - Posted: 28 Jun 2022, 11:24:35 UTC - in response to Message 46958.  

It seems this version is causing many failures...
hm, strange. Here it worked well on all of my machines (Windows).


Yes, it seems to work on some machines and not others. It all looked OK when we tested it on the dev project too.

But many hosts fail immediately with an error like this:
2022-06-28 10:54:08 (153184): Error in start VM for VM: -1073741819
Command:
VBoxManage -q startvm "boinc_bb7752aa79a5ff69" --type headless
Output:
Waiting for VM "boinc_bb7752aa79a5ff69" to power on...
VBoxManage.exe: error: The virtual machine 'boinc_bb7752aa79a5ff69' has terminated unexpectedly during startup with exit code -1073741819 (0xc0000005)
VBoxManage.exe: error: Details: code E_FAIL (0x80004005), component MachineWrap, interface IMachine

2022-06-28 10:54:08 (153184): VM failed to start.
2022-06-28 10:54:08 (153184): Could not start 
2022-06-28 10:54:08 (153184): ERROR: VM failed to start

Because the tasks fail straight away, these hosts rapidly fail many tasks in succession, so we were seeing a failure rate of around 98% for the new version.
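
A note on reading the log above: the negative exit code and the hex value in parentheses are the same 32-bit number, printed once as signed and once as unsigned. The short Python sketch below shows the conversion; 0xC0000005 is the Windows NTSTATUS value STATUS_ACCESS_VIOLATION. The helper names and the one-entry lookup table are illustrative only.

```python
# Known NTSTATUS values for this example; deliberately not a complete table.
KNOWN_STATUS = {0xC0000005: "STATUS_ACCESS_VIOLATION"}


def to_unsigned32(exit_code):
    """Reinterpret a signed 32-bit exit code as its unsigned value."""
    return exit_code & 0xFFFFFFFF


def describe(exit_code):
    """Format an exit code the way VBoxManage shows it, plus a name if known."""
    code = to_unsigned32(exit_code)
    name = KNOWN_STATUS.get(code, "unknown")
    return f"0x{code:08x} ({name})"


print(describe(-1073741819))  # prints "0xc0000005 (STATUS_ACCESS_VIOLATION)"
```

This only decodes the number; it says that the VM process hit an access violation during startup, not why.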
ID: 46960
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2386
Credit: 223,043,819
RAC: 136,841
Message 46961 - Posted: 28 Jun 2022, 12:27:57 UTC - in response to Message 46960.  

A list of measures that may help to solve "E_FAIL (0x80004005)" on Windows:
https://www.technewstoday.com/result-code-e_fail-0x80004005/
ID: 46961
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,193,023
RAC: 103,440
Message 46962 - Posted: 28 Jun 2022, 13:24:58 UTC - in response to Message 46957.  

David wrote:
It seems this version is causing many failures, so I have reverted back to v2.00 while we debug the problems.

+1
ID: 46962
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,193,023
RAC: 103,440
Message 46963 - Posted: 28 Jun 2022, 14:43:09 UTC - in response to Message 46962.  

Thank you, David and team.
The first two ATLAS v2.00 tasks are running on Win11 Pro!
ID: 46963
Erich56
Joined: 18 Dec 15
Posts: 1686
Credit: 100,486,314
RAC: 104,413
Message 46983 - Posted: 7 Jul 2022, 12:39:47 UTC - in response to Message 46957.  

David wrote:
It seems this version is causing many failures, so I have reverted back to v2.00 while we debug the problems.
David, any idea when the problems will be straightened out and the new version will be back in place?
ID: 46983
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,193,023
RAC: 103,440
Message 46984 - Posted: 7 Jul 2022, 13:13:24 UTC - in response to Message 46983.  

We need more investigation time.
At the moment we have ATLAS version 2.00.
ID: 46984