Message boards :
ATLAS application :
"No storage device attached ..."
Author | Message |
---|---|
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
There are always 32 tasks (6 active and 26 waiting). I have no idea why one or two tasks per day show this phenomenon. |
Send message Joined: 14 Jan 10 Posts: 1273 Credit: 8,480,147 RAC: 2,155 |
I started 5 ATLAS tasks at almost the same time and got one with your phenomenon: it was checking CVMFS, but got no response. I revived the task https://lhcathome.cern.ch/lhcathome/result.php?resultid=366977052 by using the method mentioned in the link from my previous post. |
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
This is a good way to correct it, Crystal. But with hundreds of tasks per day on those Threadrippers? No, thank you. I also have no Squid, because of this phenomenon. |
Send message Joined: 18 Dec 15 Posts: 1687 Credit: 103,041,651 RAC: 126,644 |
Last night, I had another case: ERROR [COM]: aRC=VBOX_E_OBJECT_NOT_FOUND (0x80bb0001) aIID={85632c68-b5bb-4316-a900-5eb28d3413df} aComponent={SessionMachine} aText={No storage device attached to device slot 0 on port 2 of controller 'Hard Disk Controller'}, preserve=false aResultDetail=0 The CPU was running for only 1 min 37 secs, and unfortunately I found out only this morning, after a total task runtime of almost 10 hours :-( https://lhcathome.cern.ch/lhcathome/result.php?resultid=367034222 Again, this happened on the host where a defective SSD was replaced about 1 month ago. This type of failure happens only on this host, not on any other one, but only once in a while. Hence, I increasingly suspect that there is some problem with the new SSD. |
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
I also had a task with a 10-hour runtime last night. There is a test in -dev; maybe this will fix it. Computer ID 10797673. Run time 10 hours 8 min 47 sec. CPU time 14 sec. |
Send message Joined: 18 Dec 15 Posts: 1687 Credit: 103,041,651 RAC: 126,644 |
"last night, I had another case:" Same problem a few minutes ago: https://lhcathome.cern.ch/lhcathome/result.php?resultid=367052136 The task directly before this one finished okay. |
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
We have to wait and hope for the new version for Windows. |
Send message Joined: 18 Dec 15 Posts: 1687 Credit: 103,041,651 RAC: 126,644 |
this morning, I detected the same problem with a task on another host: ERROR [COM]: aRC=VBOX_E_OBJECT_NOT_FOUND (0x80bb0001) aIID={85632c68-b5bb-4316-a900-5eb28d3413df} aComponent={SessionMachine} aText={No storage device attached to device slot 0 on port 2 of controller 'Hard Disk Controller'}, preserve=false aResultDetail=0 https://lhcathome.cern.ch/lhcathome/result.php?resultid=367058721 So, I guess I can revise my assumption that this problem has to do with a replaced SSD on the other host. No idea though what the problem is caused by :-( |
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
No, Erich, this is the CVMFS conflict, and we are waiting for a solution in production. One Threadripper crashed yesterday evening with a PCI bus error (Lenovo). I have had problems with the new Windows 22H2 for 16 days now, so you are not the only one with problems. |
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
Run time 7 hours 25 min 47 sec. CPU time 31 sec. 14:13:31.742187 ERROR [COM]: aRC=E_FAIL (0x80004005) aIID={85632c68-b5bb-4316-a900-5eb28d3413df} aComponent={SessionMachine} aText={This machine does not have any snapshots}, preserve=false aResultDetail=0 14:13:32.241380 Saving settings file "C:\ProgramData\BOINC\slots\3\boinc_b95149f276154f28\boinc_b95149f276154f28.vbox" with version "1.16-windows" 14:13:32.855578 ERROR [COM]: aRC=VBOX_E_OBJECT_NOT_FOUND (0x80bb0001) aIID={d0a0163f-e254-4e5b-a1f2-011cf991c38d} aComponent={VirtualBoxWrap} aText={Could not find a registered machine named 'boinc_f42e141ac575b363'}, preserve=false aResultDetail=0 14:13:33.286633 Saving settings file "C:\Users\mae_a\.VirtualBox\VirtualBox.xml" with version "1.12-windows" |
Send message Joined: 18 Dec 15 Posts: 1687 Credit: 103,041,651 RAC: 126,644 |
this morning, I discovered the same problem VBoxManage.exe: error: Could not find a registered machine named 'boinc_f63b71f1735a8cad' on a different host than the one where the problem occurred many times in the recent past: https://lhcathome.cern.ch/lhcathome/result.php?resultid=368247704 this is really annoying since the task does not stop, but keeps running ... running ... running. More than 13 hours in this case, with CPU usage about 2 minutes :-( |
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
This is the CVMFS conflict, and we are waiting for a solution in production. You can reproduce it by starting an ATLAS task on Windows while CVMFS is not ready (for example, by disconnecting the LAN cable). |
Send message Joined: 2 May 07 Posts: 2090 Credit: 158,817,878 RAC: 127,005 |
This morning, after the change to Squid 5.5, I saw one more of these tasks. Computer ID 10795955. Run time 1 hour 8 min 31 sec. CPU time 7 sec. I have disconnected Squid. |
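
The posts above share one symptom: the task's run time keeps growing for hours while its CPU time stays at seconds, next to an `ERROR [COM]` line in the VirtualBox log. A rough Python sketch of how one might spot such tasks early is below. The helper names (`parse_com_error`, `looks_stalled`) and both thresholds are assumptions made for illustration; they are not part of BOINC, vboxwrapper, or VirtualBox, and the regular expression is only fitted to the log lines quoted in this thread.

```python
import re
from typing import Optional

# Pattern fitted to the "ERROR [COM]" lines quoted in this thread.
# Other VirtualBox log lines may be laid out differently.
COM_ERROR = re.compile(
    r"ERROR \[COM\]: aRC=(?P<code>\w+) \((?P<hex>0x[0-9a-fA-F]+)\)"
    r".*?aComponent=\{(?P<component>[^}]*)\}"
    r"\s+aText=\{(?P<text>.*?)\}, preserve="
)


def parse_com_error(line: str) -> Optional[dict]:
    """Return the error code, hex value, component and text, or None."""
    match = COM_ERROR.search(line)
    return match.groupdict() if match else None


def looks_stalled(elapsed_s: float, cpu_s: float,
                  min_elapsed_s: float = 1800,
                  max_cpu_ratio: float = 0.01) -> bool:
    """Flag a task whose CPU time is a tiny fraction of its run time.

    Both thresholds are hypothetical defaults, not project settings.
    """
    if elapsed_s < min_elapsed_s:
        return False  # too early to judge
    return cpu_s / elapsed_s < max_cpu_ratio


if __name__ == "__main__":
    sample = (
        "ERROR [COM]: aRC=VBOX_E_OBJECT_NOT_FOUND (0x80bb0001) "
        "aIID={85632c68-b5bb-4316-a900-5eb28d3413df} "
        "aComponent={SessionMachine} "
        "aText={No storage device attached to device slot 0 on port 2 "
        "of controller 'Hard Disk Controller'}, preserve=false "
        "aResultDetail=0"
    )
    print(parse_com_error(sample))
    # 10 h 8 min 47 s of run time with 14 s of CPU time, as reported above
    print(looks_stalled(10 * 3600 + 8 * 60 + 47, 14))
```

Comparing CPU time with run time, rather than matching only the error text, would also catch the "Could not find a registered machine" variant, since both failures leave the task running without using the CPU.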
©2024 CERN