1) Message boards : Cafe LHC : Good Morning December 25th 2024 (Message 51326)
Posted 28 days ago by CloverField
Post:
Merry Christmas!
2) Message boards : News : Seasons greetings (Message 51312)
Posted 20 Dec 2024 by CloverField
Post:
Merry Christmas/ Happy holidays to you as well!
3) Message boards : ATLAS application : Last days a lot of validate errors or No Hits file produced (Message 51272)
Posted 10 Dec 2024 by CloverField
Post:
... I'm running virtual box 7.1.4 ...

This version runs fine.

The reason why it sometimes fails is a race condition when you had no ATLAS tasks and then start at least 2 of them concurrently.
If you are lucky the timings do not cause the race condition and everything works fine.
Otherwise the media registry gets corrupted and stays corrupted until you manually clean it.

Vboxwrapper 26208 includes a patch that avoids the race condition.
CMS/Theory use a beta version that already includes that patch:
https://github.com/BOINC/boinc/pull/5571

Thanks for the explanation. I'll limit ATLAS to one task at a time until it gets the patch.
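
In case it helps anyone else, here is a rough sketch of one way to hold ATLAS at a single task. BOINC reads an app_config.xml file from the project directory, and its max_concurrent element caps how many tasks of one app run at once. The app name "ATLAS" and the helper name build_app_config are assumptions for illustration only; check the real app name in client_state.xml before using the output.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name: str, max_concurrent: int) -> str:
    """Return the text of a BOINC app_config.xml limiting one app's concurrency."""
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    # Assumption: app_name must match the app name listed in client_state.xml.
    ET.SubElement(app, "name").text = app_name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    ET.indent(root)  # pretty-print; requires Python 3.9 or newer
    return ET.tostring(root, encoding="unicode")


# "ATLAS" is an assumed app name, 1 is the limit mentioned in the post above.
print(build_app_config("ATLAS", 1))
```

Save the printed text as app_config.xml in the project's directory, then have the client re-read its config files (or restart it) so the limit takes effect.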
4) Message boards : ATLAS application : Last days a lot of validate errors or No Hits file produced (Message 51270)
Posted 10 Dec 2024 by CloverField
Post:
Has anyone else been able to run more than one ATLAS task at a time? The instant I do, I end up with the yellow hard drive triangle in VirtualBox and all my other ATLAS tasks fail with computation errors until I clean it up.
On several of my hosts I run more than one ATLAS task at a time, so I have no idea what exactly the problem might be on your host, but it looks like a misconfiguration of VirtualBox. Which version are you running? Maybe an update to a newer one would help.


So I went and updated VirtualBox a couple of weeks ago to see if it would fix the issue. I'm running VirtualBox 7.1.4. I'm just baffled, because it would run multiple tasks happily until about a month ago.
5) Message boards : ATLAS application : Last days a lot of validate errors or No Hits file produced (Message 51266)
Posted 10 Dec 2024 by CloverField
Post:
Has anyone else been able to run more than one ATLAS task at a time? The instant I do, I end up with the yellow hard drive triangle in VirtualBox and all my other ATLAS tasks fail with computation errors until I clean it up.
6) Message boards : Number crunching : New virtualbox wrapper (Message 51144)
Posted 26 Nov 2024 by CloverField
Post:
Oh nice, hopefully this fixes the yellow hard drive warnings that we have to clear out.
7) Message boards : Number crunching : VirtualBox {/\}_ 7.1.4 _{/\} (released October 15 2024) (Message 51064)
Posted 15 Nov 2024 by CloverField
Post:
So I just installed this, but it looks like I lost the button that shows the VirtualBox console for my tasks. Did this happen to either of you?
8) Message boards : Theory Application : High disk reads (Message 50930)
Posted 26 Oct 2024 by CloverField
Post:
How much more RAM do you think they need? I'm running 1024 MB as suggested in the thread.
Do we think maybe 1280 MB might be enough?

With the amount of RAM you have, I would suggest 1536MB.
On one of the two hosts which are currently running Herwig7, I set the RAM to 1024MB. Console_3 shows about 48MB swapping.
On the other host, on which 13 Herwigs are currently running, I set the RAM to 1536MB. Within the first 4-5 days of runtime, there was no swapping. I have now checked again and see values between 4 and 18MB. All tasks are still in the integration phase. I am curious what the swapping will look like after that. So for future Herwigs, I might end up with an even higher RAM setting than 1536MB (with 128GB I've plenty of RAM available, anyway).

Welp 2048 it is then.
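
For a sense of what each setting costs in total, here is a back-of-the-envelope sketch. It assumes the per-VM settings discussed in this thread (1024, 1536 and 2048 MB) and borrows the 13 concurrent tasks and the 128 GB host from the quoted reply; the function name is made up for illustration, and the numbers ignore any overhead outside the VM's configured RAM.

```python
def total_ram_gib(tasks: int, mb_per_task: int) -> float:
    """Total RAM assigned to the VMs, in GiB, treating the VM setting as MiB."""
    return tasks * mb_per_task / 1024


TASKS = 13          # concurrent Herwig tasks mentioned in the quoted reply
HOST_RAM_GIB = 128  # host RAM mentioned in the quoted reply

for mb in (1024, 1536, 2048):
    used = total_ram_gib(TASKS, mb)
    share = 100 * used / HOST_RAM_GIB
    print(f"{mb} MB x {TASKS} tasks = {used:.1f} GiB ({share:.0f}% of host RAM)")
```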
9) Message boards : Theory Application : High disk reads (Message 50924)
Posted 26 Oct 2024 by CloverField
Post:
How much more RAM do you think they need? I'm running 1024 MB as suggested in the thread.
Do we think maybe 1280 MB might be enough?
10) Message boards : News : Downtime 1 October (Message 50673)
Posted 1 Oct 2024 by CloverField
Post:
Should we stop crunching until the dbs come back up?
11) Message boards : ATLAS application : 6,09 GByte Downloadfile (Message 50636)
Posted 26 Sep 2024 by CloverField
Post:
They have all been failing for me. All of them are returning EXIT_DISK_LIMIT_EXCEEDED.
12) Message boards : ATLAS application : Download failures (Message 50274)
Posted 28 May 2024 by CloverField
Post:
Do you think I should just reset LHC@home? All of mine are still failing. I do have a Squid cache, so maybe that's the source of my issues?

ATLAS EVNT files are very large and Squid does not cache them.
The decision which file needs to be downloaded is made by the BOINC client based on the WU data it gets from the server.

Hence, I don't think your Squid is the problem.
Nonetheless, feel free to purge its cache - it will rebuild automatically.

A project reset looks more promising (no 100% guarantee).
Finish all tasks in your work buffer first.


Yeah, the Squid purge didn't work. I'll set no new tasks and go for the project reset.

And no dice for me. I'll just run Theory until all my ATLAS tasks expire.
13) Message boards : ATLAS application : Download failures (Message 50262)
Posted 27 May 2024 by CloverField
Post:
Do you think I should just reset LHC@home? All of mine are still failing. I do have a Squid cache, so maybe that's the source of my issues?

ATLAS EVNT files are very large and Squid does not cache them.
The decision which file needs to be downloaded is made by the BOINC client based on the WU data it gets from the server.

Hence, I don't think your Squid is the problem.
Nonetheless, feel free to purge its cache - it will rebuild automatically.

A project reset looks more promising (no 100% guarantee).
Finish all tasks in your work buffer first.


Yeah, the Squid purge didn't work. I'll set no new tasks and go for the project reset.
14) Message boards : ATLAS application : Download failures (Message 50259)
Posted 27 May 2024 by CloverField
Post:
Do you think I should just reset LHC@home? All of mine are still failing. I do have a Squid cache, so maybe that's the source of my issues?
15) Message boards : ATLAS application : Download failures (Message 50256)
Posted 27 May 2024 by CloverField
Post:
Has anyone actually had a successful task download for ATLAS since this issue started? I've been running Theory since it started, but I try to give ATLAS a go once a day and all of my downloads continue to fail.
16) Message boards : ATLAS application : 2000 Events Threadripper 3995WX (Message 49619)
Posted 23 Feb 2024 by CloverField
Post:
Any reason why the tasks suddenly jumped up to 6 hours? They used to take about 40 minutes to 2 hours.
17) Message boards : News : Seasons greetings (Message 49065)
Posted 23 Dec 2023 by CloverField
Post:
Merry Christmas/ happy holidays everyone!
18) Message boards : Theory Application : Stuck WU: Waiting for the delivery of SIGUSR1 (Message 48118)
Posted 19 May 2023 by CloverField
Post:
Got about 4 of these last night.
19) Message boards : Theory Application : Stuck WU: Waiting for the delivery of SIGUSR1 (Message 47787)
Posted 25 Feb 2023 by CloverField
Post:
These continue to happen; I get about 5-10 a week. Is there any way we could get some retry logic in the startup, like ATLAS and CMS have, so I don't have to make checking for stuck tasks part of my morning routine?
20) Message boards : Theory Application : Stuck WU: Waiting for the delivery of SIGUSR1 (Message 47759)
Posted 8 Feb 2023 by CloverField
Post:
2023-02-07 15:05:56 (46556): Adding network bandwidth throttle group to VM. (Defaulting to 1024GB)
.
.
.
2023-02-07 15:06:02 (46556): Setting network throttle for VM. (5120KB)

It looks like you tweaked your network bandwidth settings for the VMs (or BOINC as a whole).
This makes no sense, since the limit applies only to outgoing traffic (from the VM's perspective), but it may affect the connection timing.

You may leave those settings unlimited or at default values.


I'll try changing those, but I ended up turning them on because I would get latency spikes in my network when tasks uploaded.
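
For anyone wanting to check which throttle values their own VMs picked up, here is a minimal sketch that pulls them out of log text. It assumes the log lines look exactly like the two quoted above (timestamp, process ID in parentheses, then the message); the regular expressions and the function name are my own for illustration and are not part of vboxwrapper.

```python
import re

# Assumed line layout, taken from the two log lines quoted in this post.
LINE_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) \((?P<pid>\d+)\): (?P<msg>.*)$"
)
# Matches e.g. "(Defaulting to 1024GB)" or "(5120KB)" in a throttle message.
THROTTLE_RE = re.compile(
    r"throttle.*\((?:Defaulting to )?(?P<value>\d+)(?P<unit>KB|GB)\)"
)

SAMPLE = """\
2023-02-07 15:05:56 (46556): Adding network bandwidth throttle group to VM. (Defaulting to 1024GB)
.
.
.
2023-02-07 15:06:02 (46556): Setting network throttle for VM. (5120KB)
"""


def find_throttle_settings(log_text: str) -> list[tuple[str, int, str]]:
    """Return (timestamp, value, unit) for every throttle-related log line."""
    results = []
    for line in log_text.splitlines():
        line_match = LINE_RE.match(line.strip())
        if not line_match:
            continue  # skip elided or unrelated lines
        throttle = THROTTLE_RE.search(line_match.group("msg"))
        if throttle:
            results.append(
                (
                    line_match.group("ts"),
                    int(throttle.group("value")),
                    throttle.group("unit"),
                )
            )
    return results


for ts, value, unit in find_throttle_settings(SAMPLE):
    print(f"{ts}: throttle {value} {unit}")
```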



©2025 CERN