1) Questions and Answers : Getting started : Computation Error, Mac (Message 51568)
Posted 18 Feb 2025 by rob
Post:
With your computers hidden it is difficult for people to try and help you.
2) Message boards : Number crunching : what does "timed out" status mean? (Message 51491)
Posted 2 Feb 2025 by rob
Post:
The deadline is, for most projects, the time by which the task should be completed, returned and reported to the project. Some projects do give a bit of leeway, but not all.

[edit to add]
Both your recent tasks that failed due to exceeding the deadline were sent out to others within a few minutes of the deadline; one has already been completed, returned and validated.
3) Message boards : Number crunching : what does "timed out" status mean? (Message 51487)
Posted 1 Feb 2025 by rob
Post:
Exactly as you surmise - your version of that task is dead, and so you might as well abort it.
This message appears when the task has been on your computer beyond its deadline. One way to reduce the possibility of this in the future is to have a small cache, but even this is not guaranteed to work if your PC is slow or only runs BOINC for very few hours a day.
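As a rough illustration of the cache point, here is a minimal sketch of the arithmetic. The function name and the numbers are made up for the example, and it ignores everything else BOINC takes into account (other projects, resource share, task priority), so treat it as a back-of-envelope check only.

```python
def can_finish_before_deadline(queued_cpu_hours, hours_per_day, days_to_deadline, cores=1):
    """Return True if the queued work fits into the compute time left before the deadline.

    All arguments are hypothetical inputs supplied by the user; nothing here is
    read from BOINC itself.
    """
    available_hours = hours_per_day * days_to_deadline * cores
    return queued_cpu_hours <= available_hours


# Hypothetical example: 40 CPU-hours of cached work and a 10-day deadline.
print(can_finish_before_deadline(40, hours_per_day=2, days_to_deadline=10))  # only 20 hours available
print(can_finish_before_deadline(40, hours_per_day=6, days_to_deadline=10))  # 60 hours available
```

With a smaller cache the first argument shrinks, which is why a small cache helps on a PC that only runs BOINC for a few hours a day.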
4) Message boards : ATLAS application : All tasks failing (Message 51131)
Posted 24 Nov 2024 by rob
Post:
I had four "good" tasks on the 22nd Nov, since then all (16) have failed with "validate error" as the headline. Lots of strange messages:
2024-11-24 13:37:11 (7136): Guest Log: *** Starting ATLAS job. (PandaID=6416690328 taskID=42161013) ***
2024-11-24 13:39:31 (7136): Guest Log: *** Job finished ***
2024-11-24 13:39:31 (7136): Guest Log: *** The last 20 lines of the pilot log: ***
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:22,732 | INFO | generated guid for lfn=HITS.42161013._131760.pool.root.1: 45DB498D-73E2-4806-8741-CB186C50CDEB
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:22,732 | WARNING | aborting payload error diagnosis since an error has already been set: [127, 1187]
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:23,775 | INFO | [payload] execute_payloads thread has finished
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:24,235 | INFO | only monitor.control thread still running - safe to abort: ['<_MainThread(MainThread, started 140077397043008)>', '<ExcThread(monitor, started 140077103560448)>']
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:24,490 | WARNING | job_aborted has been set - aborting pilot monitoring
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:24,490 | INFO | [monitor] control thread has ended
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,240 | INFO | all workflow threads have been joined
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,240 | INFO | end of generic workflow (traces error code: 0)
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,241 | INFO | traces error code: 0
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,241 | INFO | pilot has finished (exit code=0, shell exit code=0)
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,299 [wrapper] ==== pilot stdout END ====
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,303 [wrapper] ==== wrapper stdout RESUME ====
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,306 [wrapper] pilotpid: 5928
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,309 [wrapper] Pilot exit status: 0
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,417 [wrapper] pandaids: 6416690328
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,442 [wrapper] cleanup supervisor_pilot 5934 5929
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,445 [wrapper] Test setup, not cleaning
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,450 [wrapper] apfmon messages muted
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,453 [wrapper] ==== wrapper stdout END ====
2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:29,456 [wrapper] ==== wrapper stderr END ====


Then after a few more lines I get:


2024-11-24 13:39:31 (7136): Guest Log: -rw-r--r--. 1 atlas atlas 10776 Nov 24 13:39 runtime_log.err
2024-11-24 13:39:31 (7136): Guest Log: -rw-------. 1 atlas atlas 636 Nov 24 13:39 4ILLDma7QZ6nsSi4ap6QjLDmwznN0nGgGQJmq4hLDmSMhKDm50VHnm.diag
2024-11-24 13:39:31 (7136): Guest Log: Looking for outputfile HITS.42161013._131760.pool.root.1
2024-11-24 13:39:31 (7136): Guest Log: No HITS file was produced
2024-11-24 13:39:31 (7136): Guest Log: Successfully finished the ATLAS job!
2024-11-24 13:39:31 (7136): Guest Log: Copying the results back to the shared directory!
2024-11-24 13:39:31 (7136): Guest Log: *** Contents of shared directory: ***
2024-11-24 13:39:32 (7136): Guest Log: total 269908
2024-11-24 13:39:32 (7136): Guest Log: -rwxrwxrwx. 1 root root 275766805 Nov 24 13:36 ATLAS.root_0
2024-11-24 13:39:32 (7136): Guest Log: -rwxrwxrwx. 1 root root 9433 Nov 24 13:36 init_data.xml
2024-11-24 13:39:32 (7136): Guest Log: -rwxrwxrwx. 1 root root 499895 Nov 24 12:42 input.tar.gz
2024-11-24 13:39:32 (7136): Guest Log: -rwxrwxrwx. 1 root root 81920 Nov 24 2024 result.tar.gz
2024-11-24 13:39:32 (7136): Guest Log: -rwxrwxrwx. 1 root root 17569 Nov 24 12:42 start_atlas.sh
2024-11-24 13:39:32 (7136): Guest Log: *** Success! Shutting down the machine. ***
2024-11-24 13:39:32 (7136): VM Completion File Detected.
2024-11-24 13:39:32 (7136): Powering off VM.
2024-11-24 13:39:32 (7136): Successfully stopped VM.


and the VM stops in an orderly manner.

(Meanwhile CMS tasks are happily running on the same computer)
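For anyone wading through a lot of these logs, a small sketch like the one below might save some scrolling. It only looks for two strings that appear in the log quoted above ("No HITS file was produced" and the "error has already been set" list); the function name is made up, and it makes no attempt to interpret what the error numbers mean.

```python
import re


def summarise_atlas_log(text):
    """Report whether the HITS file is missing and which pilot error codes were logged."""
    no_hits = "No HITS file was produced" in text
    # Matches e.g. "an error has already been set: [127, 1187]"
    match = re.search(r"an error has already been set: \[([\d, ]+)\]", text)
    codes = [int(c) for c in match.group(1).split(",")] if match else []
    return {"no_hits": no_hits, "error_codes": codes}


# Two lines copied from the log above, used as sample input.
sample = (
    "2024-11-24 13:39:31 (7136): Guest Log: 2024-11-24 13:39:22,732 | WARNING | "
    "aborting payload error diagnosis since an error has already been set: [127, 1187]\n"
    "2024-11-24 13:39:31 (7136): Guest Log: No HITS file was produced\n"
)
print(summarise_atlas_log(sample))
```

Note that the same log also prints "Successfully finished the ATLAS job!", so a check for the missing HITS file is a more useful signal than the success message.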
5) Message boards : ATLAS application : Last days a lot of validate errors or No Hits file produced (Message 51042)
Posted 10 Nov 2024 by rob
Post:
No need to disconnect from the project; simply select "no new tasks" and abort any tasks you have. Then sit back and wait until the project staff announce that the problem is solved.
6) Questions and Answers : Windows : I would really like to have some help here this is frusterating adn depressing and is driving me to depression (Message 50745)
Posted 8 Oct 2024 by rob
Post:
We don't have access to your computer names which makes it difficult to work out which of your computers you are talking about.
That said, if you follow this link https://lhcathome.cern.ch/lhcathome/hosts_user.php you will see your list of computers and be able to work out which is which. After selecting the computer you want to look at you will see there are a few filters; select the "error" one and you will only see tasks that ended with an error. Then it's simply a case of selecting a task in the list (I'd suggest the most recent one): near the top of the information is the error number you are looking for, and scrolling down the page shows all the detail of the error (mostly groups of repeating lines which may mean something to others).
7) Questions and Answers : Windows : I would really like to have some help here this is frusterating adn depressing and is driving me to depression (Message 50741)
Posted 7 Oct 2024 by rob
Post:
Make sure you don't have the Windows own "version" of virtualisation installed. Crudely there are two "types" of virtualisation, and they don't always work well together.
It's a bit of a pain to make sure that the Windows one (Hyper-V) is not installed - it sometimes gets installed without you doing anything :-( The best thing I can suggest is for you to do a web search for "removing Hyper-V from Windows x" and follow the instructions step by step (it may take several re-boots).
8) Questions and Answers : Windows : you need Virtualbox installed and virtualbox extension pack or your tasks will be "error" (Message 50452)
Posted 25 Jun 2024 by rob
Post:
I've sat around watching the debate about the need or otherwise for the extension pack under Windows.

The short answer is yes or no, depending on you wanting to spend time watching tasks run or not. Simply put, if you want to keep a close eye on tasks as they run then you do need the extension pack, but if you don't want to watch tasks running then you don't need the extension pack.
Evidence?
These few lines from a recent successful task (https://lhcathome.cern.ch/lhcathome/result.php?resultid=411995370)

2024-06-25 08:06:37 (36756): Enabling remote desktop for VM.
2024-06-25 08:06:38 (36756): Required extension pack not installed, remote desktop not enabled.


The next couple of lines show that communication to the shared disk space is OK:
2024-06-25 08:06:38 (36756): Enabling shared directory for VM.
2024-06-25 08:06:38 (36756): Starting VM using VBoxManage interface. (boinc_01210e331ce40d44, slot#0)
2024-06-25 08:06:44 (36756): Successfully started VM. (PID = '21560')
2024-06-25 08:06:44 (36756): Reporting VM Process ID to BOINC.
2024-06-25 08:06:44 (36756): Guest Log: BIOS: VirtualBox 7.0.14
9) Message boards : ATLAS application : Download failures (Message 50258)
Posted 27 May 2024 by rob
Post:
Two today:
BipKDmO9fV5n9Rq4apOajLDm4fhM0noT9bVo2ijZDmF6FKDmBB9tyn received at 10:31
https://lhcathome.cern.ch/lhcathome/result.php?resultid=411404266

zaBMDmcviV5n9Rq4apOajLDm4fhM0noT9bVo2ijZDmABGKDm8fILCn received at 14:51
https://lhcathome.cern.ch/lhcathome/result.php?resultid=411405886

Both are sitting in the queue ready to start
10) Message boards : ATLAS application : Download failures (Message 50239)
Posted 24 May 2024 by rob
Post:
No problems downloading here, but then I'm only doing one or two downloads at a time
11) Questions and Answers : Windows : LHC not downloading work on new Windows 11 PC (Message 49929)
Posted 10 Apr 2024 by rob
Post:
Quite simple, NOT WSL, but the VM that is bundled with BOINC when downloaded from the BOINC site.
12) Questions and Answers : Windows : LHC not downloading work on new Windows 11 PC (Message 49924)
Posted 10 Apr 2024 by rob
Post:
The AMD Ryzen 9 7900X has virtualization support, but this is turned OFF by default, and so you get the report "virtualization not supported". Check in the BIOS whether virtualization is enabled and, if it is not, turn it ON. After rebooting that error message should disappear; however, other messages may then appear, which may take a fair bit of work to clear. Most common is trying to run the "BOINC supplied" form of virtual machine on a computer that has (or has had) the MS version of Linux installed; the two just aren't compatible with each other despite having very similar names....
13) Message boards : CMS Application : Does anyone else never restart your machine running CMS? (Message 49793)
Posted 20 Mar 2024 by rob
Post:
I had the same problem. There is a work-around that appears to work:
Suspend the project before shutting down BOINC, then I can safely turn the computer off. This forces the Virtual Machine to first stop processing the task, and then saves the Virtual Machine so it can restart at a later date or time.

(And I agree with you about wanting native tasks rather than VM-based tasks - life would be a little easier).
14) Message boards : Theory Application : New native version v300.08 (Message 49419)
Posted 6 Feb 2024 by rob
Post:
There is something strange going on with the initial estimated run time vs. the actual run time of some, if not all of these tasks:
On my PC:
Initial estimated run time = 61.5 minutes
During the first 60 minutes of running the elapsed time increments at about 1 second per second of clock time, and continues at this rate - only a couple of seconds out after an hour.
However the remaining time only drops by about 15 seconds to 60.25 minutes.
At 61.5 elapsed minutes the remaining time jumps to 9 days 23 hours and 46 minutes.

This has the effect that my computer downloads a number of tasks that are initially predicted to be finished by the deadline, but at the "1 hour" adjustment the majority will not even be started, never mind finished, by the deadline (10 days). This is potentially a rather unproductive waste of bandwidth, not to mention frustration for me. It will be interesting to see what the actual run time of these tasks is.
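To put numbers on that, here is a minimal sketch using only the figures quoted above (61.5 minutes initial estimate, 9 days 23 hours 46 minutes remaining after the jump, 10-day deadline). The variable and function names are made up, and the "revised" figure is just the client's estimate at that moment, not a measured run time.

```python
MIN_PER_DAY = 24 * 60

initial_estimate_min = 61.5                               # estimate shown when the task starts
elapsed_at_jump_min = 61.5                                # elapsed time when the estimate jumps
remaining_after_jump_min = 9 * MIN_PER_DAY + 23 * 60 + 46  # 9 days 23 hours 46 minutes
deadline_min = 10 * MIN_PER_DAY                           # 10-day deadline

# Total run time implied by the revised estimate.
revised_total_min = elapsed_at_jump_min + remaining_after_jump_min


def tasks_per_core(estimate_min, window_min=deadline_min):
    """How many tasks of the given estimated length fit back to back on one core before the deadline."""
    return int(window_min // estimate_min)


print("revised total (minutes):", revised_total_min)
print("tasks per core, initial estimate:", tasks_per_core(initial_estimate_min))
print("tasks per core, revised estimate:", tasks_per_core(revised_total_min))
```

On these figures the revised total is 14447.5 minutes, which is 47.5 minutes longer than the 10-day deadline (14400 minutes), so under the revised estimate not even a single task fits, whereas the initial estimate suggests a couple of hundred per core.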
15) Message boards : ATLAS application : No Tasks from LHC@home (Message 49376)
Posted 3 Feb 2024 by rob
Post:
Well......
WSL uses "type 1" virtualization, but that is totally incompatible with the "type 2" virtualization used by the Oracle system deployed by LHC. As far as I can gather it is not possible to have both "type 1" and "type 2" virtualizations running on the same PC. It may be possible to do some form of dual boot allowing you to boot into either a "type 1" environment or a "type 2" environment, but I don't think either configuration would allow the other to run at the same time.
16) Message boards : ATLAS application : No Tasks from LHC@home (Message 49354)
Posted 2 Feb 2024 by rob
Post:
You have the same set of issues I had. Something (Microsoft??) decided to sneak in an update which launched Hyper-V, and Hyper-V prevents Oracle VirtualBox from running. The not so simple solution is to unpick Hyper-V from your system (you need admin privileges); it can take a few re-boots to get rid of the last vestiges. Search for "removing Hyper-V from a computer" and allow an hour or more.
17) Message boards : ATLAS application : No Tasks from LHC@home (Message 49338)
Posted 1 Feb 2024 by rob
Post:
"Just turn OFF"....
40 minutes later, several re-boots and I'm seeing the first LHC ATLAS simulations downloading, along with the expected(?) ATLAS v-box image (all 1.7GB of it).
OK, sit back and wait for the next fun instalment, but for now I think I've earned a drink.
18) Message boards : ATLAS application : No Tasks from LHC@home (Message 49336)
Posted 1 Feb 2024 by rob
Post:
So, if I turn OFF hyper-v, and make sure VirtualBox is on I should be able to get the VB running, and thus get some LHC tasks (if any are available)?
19) Message boards : ATLAS application : No Tasks from LHC@home (Message 49333)
Posted 1 Feb 2024 by rob
Post:
VM was working until a couple of weeks ago, so this appears to be a "Windows" issue. Now trying to get a "clean" copy of VM to see whether that has been corrupted, as reinstalling BOINC didn't give me the expected messages about installing the VM version, despite downloading the BOINC + VM installer that says it includes VB 7.0.6......
20) Message boards : ATLAS application : No Tasks from LHC@home (Message 49331)
Posted 1 Feb 2024 by rob
Post:
Another try - force start of VM, first few lines of the log:
01/02/2024 14:17:35 | | Starting BOINC client version 7.24.1 for windows_x86_64
01/02/2024 14:17:35 | | log flags: file_xfer, sched_ops, task, http_xfer_debug, sched_op_debug
01/02/2024 14:17:35 | | Libraries: libcurl/8.2.1-DEV Schannel zlib/1.2.13
01/02/2024 14:17:35 | | Data directory: C:\ProgramData\BOINC
01/02/2024 14:17:35 | | Running under account rob
01/02/2024 14:17:35 | | CUDA: NVIDIA GPU 0: NVIDIA GeForce GTX 1070 Ti (driver version 511.65, CUDA version 11.6, compute capability 6.1, 8192MB, 8192MB available, 8186 GFLOPS peak)
01/02/2024 14:17:35 | | CUDA: NVIDIA GPU 1: NVIDIA GeForce GTX 1070 Ti (driver version 511.65, CUDA version 11.6, compute capability 6.1, 8192MB, 8192MB available, 8186 GFLOPS peak)
01/02/2024 14:17:35 | | OpenCL: NVIDIA GPU 0: NVIDIA GeForce GTX 1070 Ti (driver version 511.65, device version OpenCL 3.0 CUDA, 8192MB, 8192MB available, 8186 GFLOPS peak)
01/02/2024 14:17:35 | | OpenCL: NVIDIA GPU 1: NVIDIA GeForce GTX 1070 Ti (driver version 511.65, device version OpenCL 3.0 CUDA, 8192MB, 8192MB available, 8186 GFLOPS peak)
01/02/2024 14:17:35 | | Windows processor group 0: 16 processors
01/02/2024 14:17:35 | | Host name: gaw-win-64
01/02/2024 14:17:35 | | Processor: 16 AuthenticAMD AMD Ryzen 7 3700X 8-Core Processor [Family 23 Model 113 Stepping 0]
01/02/2024 14:17:35 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 htt pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 sse4a osvw wdt topx page1gb rdtscp fsgsbase bmi1 smep bmi2
01/02/2024 14:17:35 | | OS: Microsoft Windows 10: Professional x64 Edition, (10.00.19044.00)
01/02/2024 14:17:35 | | Memory: 31.93 GB physical, 55.82 GB virtual
01/02/2024 14:17:35 | | Disk: 930.76 GB total, 605.05 GB free
01/02/2024 14:17:35 | | Local time is UTC +0 hours
01/02/2024 14:17:35 | | No WSL found.
01/02/2024 14:17:35 | | VirtualBox version: 7.0.6
01/02/2024 14:17:35 | climateprediction.net | Found app_config.xml


Then the LHC "start":
01/02/2024 14:17:35 | LHC@home | [sched_op] Starting scheduler request
01/02/2024 14:17:35 | LHC@home | Sending scheduler request: To fetch work.
01/02/2024 14:17:35 | LHC@home | Requesting new tasks for CPU
01/02/2024 14:17:35 | LHC@home | [sched_op] CPU work request: 176256.00 seconds; 0.00 devices
01/02/2024 14:17:35 | LHC@home | [sched_op] NVIDIA GPU work request: 0.00 seconds; 0.00 devices
01/02/2024 14:17:36 | | [http_xfer] [ID#0] HTTP: wrote 330 bytes
01/02/2024 14:17:36 | | [http_xfer] [ID#1] HTTP: wrote 3115 bytes
01/02/2024 14:17:36 | LHC@home | Scheduler request completed: got 0 new tasks
01/02/2024 14:17:36 | LHC@home | [sched_op] Server version 721
01/02/2024 14:17:36 | LHC@home | No tasks sent
01/02/2024 14:17:36 | LHC@home | Tasks for AMD/ATI GPU are available, but your preferences are set to not accept them
01/02/2024 14:17:36 | LHC@home | Project requested delay of 6 seconds
01/02/2024 14:17:36 | LHC@home | [sched_op] Deferring communication for 00:00:06
01/02/2024 14:17:36 | LHC@home | [sched_op] Reason: requested by project
01/02/2024 14:17:37 | | [http_xfer] [ID#0] HTTP: wrote 2942 bytes
01/02/2024 14:17:38 | | [http_xfer] [ID#0] HTTP: wrote 344 bytes
01/02/2024 14:17:39 | | [http_xfer] [ID#0] HTTP: wrote 3895 bytes
01/02/2024 14:17:40 | | [http_xfer] [ID#0] HTTP: wrote 2893 bytes
01/02/2024 14:17:42 | | [http_xfer] [ID#0] HTTP: wrote 3947 bytes
01/02/2024 14:17:43 | | [http_xfer] [ID#0] HTTP: wrote 1300 bytes

No tasks delivered.
Let's try reinstalling BOINC & VB (again....)
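For comparing attempts like this one, a short sketch along the following lines could pull the two interesting numbers out of the event log: how much CPU work was requested and how many tasks came back. The function name is made up, and the patterns are based only on the wording of the lines quoted above, so other client versions may word things differently.

```python
import re


def summarise_scheduler_reply(log_text):
    """Extract the CPU work request (in hours) and the number of tasks received from a BOINC event log."""
    requested = re.search(r"CPU work request: ([\d.]+) seconds", log_text)
    received = re.search(r"got (\d+) new tasks", log_text)
    return {
        "requested_hours": float(requested.group(1)) / 3600 if requested else None,
        "tasks_received": int(received.group(1)) if received else None,
    }


# Two lines copied from the log above, used as sample input.
sample_log = (
    "01/02/2024 14:17:35 | LHC@home | [sched_op] CPU work request: 176256.00 seconds; 0.00 devices\n"
    "01/02/2024 14:17:36 | LHC@home | Scheduler request completed: got 0 new tasks\n"
)
summary = summarise_scheduler_reply(sample_log)
print(f"requested about {summary['requested_hours']:.0f} hours, received {summary['tasks_received']} tasks")
```

In this case the client asked for roughly 49 hours of CPU work and got nothing, which matches the "No tasks sent" line in the log.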



