Message boards :
ATLAS application :
Repeated computation errors
Message board moderation
Author | Message |
---|---|
Send message Joined: 27 Sep 04 Posts: 95 Credit: 6,035,159 RAC: 6,968 ![]() ![]() ![]() |
Got a new computer recently, and for a few days, things were great Then I started getting Computation Errors on every job I run Last time this happened the consensus was that I had lousy internet (failure to connect to the LHC servers) I have since upgraded my connection from 25MB to 150MB (I can post results from several speed test sites if it will help) Running BOINC 7.16.11 VBox 6.1.26 (+ Extension Pack) Any assistance gratefully accepted |
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 ![]() ![]() |
It looks like you are using drive E: for BOINC, at least the data directory. I have not tried that, but I expect that BOINC and especially VirtualBox are best located on the OS drive. |
Send message Joined: 27 Sep 04 Posts: 95 Credit: 6,035,159 RAC: 6,968 ![]() ![]() ![]() |
I have used E: as my BOINC Data Drive for over 8 years Started when my system HD was a little undersized No problems at all BOINC and VBOX executables and VBOX VMs are all on C: |
Send message Joined: 2 May 07 Posts: 1729 Credit: 130,941,182 RAC: 283,299 ![]() ![]() ![]() |
Folder for Boinc-Data on a other drive is no problem. For me, Program folder and Data folder are on the same drive. Saw this: 00:00:01.037605 ExtPack: Created cloud provider 'OCI' (hrc=ERROR_SUCCESS) Have no idea. Network is normaly no problem. |
![]() Send message Joined: 15 Jun 08 Posts: 2150 Credit: 175,937,568 RAC: 110,627 ![]() ![]() ![]() |
There may be a couple of issues. 1. driver version mismatch Your logfiles mention this a couple of times: 00:00:00.793696 Support driver version mismatch: DriverVersion=0x290001 ClientVersion=0x300000 rc=VERR_VERSION_MISMATCH It usually points out that VirtualBox is not correctly installed. You may have used version A and upgraded to version B but it looks like there are files/keys from version A left on your computer. The extension pack version has also to be in sync with the main version. You may completely remove VirtualBox and clean all related keys. Then do a fresh install and reboot. 2. Pointers to Oracle Cloud (OCI) BOINC (more precise: vboxwrapper) runs the VMs locally. It does not configure any OCI related options. Nonetheless, your logfiles do show OCI related entries (although incomplete!): 00:00:00.726827 server 1: 68.105.28.11 00:00:00.726828 server 2: 68.105.29.11 00:00:00.726828 server 3: 68.105.28.12 00:00:00.726829 no domain set 00:00:00.726829 no search string entries 00:00:00.793696 Support driver version mismatch: DriverVersion=0x290001 ClientVersion=0x300000 rc=VERR_VERSION_MISMATCH 00:00:00.851187 VD: VDInit finished with VINF_SUCCESS 00:00:00.866853 OCI: Local config file 'C:\Users\xxx\.VirtualBox\oci_config' does not exist 00:00:00.867039 OCI: Original config file 'C:\Users\xxx\.oci\config' does not exist 00:00:00.867040 OCI: Reading profiles finished with status ERROR_SUCCESS 00:00:00.867057 ExtPack: Created cloud provider 'OCI' (hrc=ERROR_SUCCESS) Did you fiddle around with any OCI settings? If you need it you should exactly know what to do, otherwise leave this options untouched. See: https://www.virtualbox.org/manual/ch01.html#cloud-integration 3. VM tweaking Some lines from your log: 2021-12-05 15:55:12 (17108): Setting Memory Size for VM. (8096MB) 2021-12-05 15:55:13 (17108): Setting CPU Count for VM. (5) This VM was configured to use 5 cores. The standard RAM setting would then be 7500 MB. There's no need to configure more than that. In the past ATLAS VMs running in a 1-core setup (3900 MB) sometimes suffered from not enough RAM while the EVNT file was extracted. This has never be seen with VMs using 3 or more cores. 4. Using drive "E:" As long as the disk (whatever technology) as a whole is fast enough to process all I/O requests without timeouts it doesn't matter where the data is written to. IIRC in the past there were other BOINC projects expecting everything to be on just 1 filesystem but this is not an issue here. |
Send message Joined: 2 May 07 Posts: 1729 Credit: 130,941,182 RAC: 283,299 ![]() ![]() ![]() |
IIRC in the past there were other BOINC projects expecting everything to be on just 1 filesystem but this is not an issue here. Do you have tested it out for Win 7,8,8.1,10 and 11? |
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 ![]() ![]() |
The drive may not matter, but if you don't install/uninstall correctly, it could go to the wrong drive. It is simpler on a single drive. |
![]() Send message Joined: 15 Jun 08 Posts: 2150 Credit: 175,937,568 RAC: 110,627 ![]() ![]() ![]() |
Found an older post (3 years ago) where I mentioned primegrid being the project that did not work when the slots folder was on a different filesystem: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4855&postid=37068 As far as I remember this was a hardwired primegrid issue and explained in their forum. LHC@home works fine for years with slots being mounted on a tmpfs filesystem. I didn't test Windows but I know that other volunteers run it with slots being on a ramdisk - also for years. From the OS perspective tmpfs/ramdisk are separate filesystems. Hence, for LHC@home it's not a basic issue. Just ensure there's enough physical RAM available. Jim1348 wrote: It is simpler on a single drive. +1 The golden rule: KISS (Keep It Simple and Stupid) |
Send message Joined: 18 Dec 15 Posts: 1562 Credit: 58,216,399 RAC: 50,997 ![]() ![]() ![]() |
Found an older post (3 years ago) where I mentioned primegrid being the project that did not work when the slots folder was on a different filesystem:interesting to read. I remember when years ago, on one of my systems I installed a RAMdisk and put the slots folder there, it did not work. Windows, not Linux. |
Send message Joined: 2 May 07 Posts: 1729 Credit: 130,941,182 RAC: 283,299 ![]() ![]() ![]() |
+1 |
![]() Send message Joined: 15 Jun 08 Posts: 2150 Credit: 175,937,568 RAC: 110,627 ![]() ![]() ![]() |
... years ago, on one of my systems I installed a RAMdisk and put the slots folder there, it did not work. Windows, not Linux. You may try this tool: https://sourceforge.net/projects/imdisk-toolkit/ it supports: - dynamic ramdisk size - use of a folder (e.g. somewhere\slots) instead of a drive letter as mountpoint - automatic syncing (backup/restore) during shutdown/reboot |
Send message Joined: 27 Sep 04 Posts: 95 Credit: 6,035,159 RAC: 6,968 ![]() ![]() ![]() |
Thanks for the responses No, I have not done any tweaking of VBox I have uninstalled VBox, used ADAware uninstall cleanup, booted reinstalled VBox and the extension pack and rebooted again Now waiting for the job queue to clear out to where the Scheduler requests another ATLAS job |
Send message Joined: 27 Sep 04 Posts: 95 Credit: 6,035,159 RAC: 6,968 ![]() ![]() ![]() |
Thanks, guys The uninstall/re-install seems to have cured the problem I'm about 10 minutes from completing a WU Thinking back, I did upgrade VBox to 6.1.30 Apparently it didn't clean up the previously installed version very well |
©2023 CERN