Message boards : Theory Application : Feedback on the Theory docker app
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · Next

AuthorMessage
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 82
Credit: 2,606,406
RAC: 10,152
Message 53252 - Posted: 24 Mar 2026, 5:49:43 UTC

Unexpectedly got a resend on a rig that wasn't properly set up as far as I know. It errored out after starting. Is there anything I could do to successfully run any future task I might grab?
ID: 53252 · Report as offensive     Reply Quote
AndreyOR

Send message
Joined: 8 Dec 19
Posts: 44
Credit: 8,846,145
RAC: 4,516
Message 53253 - Posted: 24 Mar 2026, 7:37:48 UTC - in response to Message 53250.  

In reply to Mr P Hucker's message of 23 Mar 2026:
In reply to rob's message of 23 Mar 2026:
There may be an issue when you request work from all the sub-projects. I had the same problem until I turned them all off except Theory. Whereupon I got a task that said it would run in a matter of an hour or so, but in reality is showing 100% completion (BOINC progress), is actually at about integrate 340 of 760 and just under two days, so the back of my envelope suggests another 2 days at least to completion.

This is the last work I'll be having from LHC until such time as they can clearly demonstrate that the initial estimates are about right.....

(Windows 10, vbox64:
https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10787143

I'd try that, but Theory is empty.

I'd seen it recommended that "Run native if available? (Not recommended for Windows)" option in preferences be deselected when using Windows. I have it this way and haven't had problems getting tasks. Try it out, when tasks show up again. It seems to take at least a couple of days for a new batch to show up once the old one is drained.
ID: 53253 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53259 - Posted: 24 Mar 2026, 11:37:27 UTC - in response to Message 53253.  

In reply to AndreyOR's message of 24 Mar 2026:
I'd seen it recommended that "Run native if available? (Not recommended for Windows)" option in preferences be deselected when using Windows. I have it this way and haven't had problems getting tasks. Try it out, when tasks show up again. It seems to take at least a couple of days for a new batch to show up once the old one is drained.
I'll try that, but I thought it had to be selected to tell the server to give me Buddha instead of VB?
ID: 53259 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53262 - Posted: 24 Mar 2026, 14:01:49 UTC

SOLVED!!!

Boinc, since it's written by the most incompetant programmer known to mankind, had set this flag when I upgraded to the latest "including docker" Boinc:

<dont_use_docker>1</dont_use_docker>

This is in cc_config.xml and needs changed to 0.
ID: 53262 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53264 - Posted: 24 Mar 2026, 14:17:00 UTC

Whoops! Can anybody interpret this?


<core_client_version>8.2.8</core_client_version>
<![CDATA[
<message>
Incorrect function.
(0x1) - exit code 1 (0x1)</message>
<stderr_txt>
docker_wrapper 18 starting
docker_wrapper config:
workdir: /boinc_slot_dir
use GPU: no
create args: --cap-add=SYS_ADMIN --device /dev/fuse
verbose: 1
Using WSL distro boinc-buda-runner
Using podman
running docker command: ps --all --filter "name=^boinc__lhcathome.cern.ch_lhcathome__theory_2922-4787634-715_1$" --format "{{.Names}}|{{.Status}}"
program: podman
command output:
EOM
creating container boinc__lhcathome.cern.ch_lhcathome__theory_2922-4787634-715_1
running docker command: images
program: podman
command output:
REPOSITORY TAG IMAGE ID CREATED SIZE
EOM
building image
running docker command: build "." -t boinc__lhcathome.cern.ch_lhcathome__theory_2922-4787634-715 -f Dockerfile
program: podman
read_from_pipe() error: timeout
build_image() failed: -182

</stderr_txt>
]]>
ID: 53264 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53265 - Posted: 24 Mar 2026, 14:43:55 UTC
Last modified: 24 Mar 2026, 14:51:18 UTC

Trying to paste some instructions but keep getting forum database errors!

Can't create post. Failed to add post to DB.

Fine..... adopt, adapt, and improve, motto of the knights of the round table. Let's try an image.
ID: 53265 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53266 - Posted: 24 Mar 2026, 14:51:27 UTC
Last modified: 24 Mar 2026, 14:51:38 UTC

ID: 53266 · Report as offensive     Reply Quote
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 82
Credit: 2,606,406
RAC: 10,152
Message 53267 - Posted: 24 Mar 2026, 16:25:10 UTC

I managed to somehow get LHC to run vbox on an old laptop. Seems like Theory Simulation v301.00 (vbox64_theory) tasks run fine while Theory Simulation v302.10 (docker) ones crash after about 30 minutes of doing nothing (insignificant CPU load). Is there a way to only get vbox64 tasks? I'm not sure I can get Docker to run properly. Been struggling with the help of Gemini today, but I might just remain with ATLAS if all else fails.
ID: 53267 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53268 - Posted: 24 Mar 2026, 16:29:04 UTC - in response to Message 53267.  
Last modified: 24 Mar 2026, 16:29:55 UTC

In reply to Garrulus glandarius's message of 24 Mar 2026:
I managed to somehow get LHC to run vbox on an old laptop. Seems like Theory Simulation v301.00 (vbox64_theory) tasks run fine while Theory Simulation v302.10 (docker) ones crash after about 30 minutes of doing nothing (insignificant CPU load). Is there a way to only get vbox64 tasks? I'm not sure I can get Docker to run properly. Been struggling with the help of Gemini today, but I might just remain with ATLAS if all else fails.

Reverse how I fixed mine. In cc_config.xml, put <dont_use_docker>1</dont_use_docker>

As for getting it to work, how fast is your internet? Mine is 34Mbps, and it looks like it got impatient the first time. If you let it fail the tasks, it will eventually download enough stuff.
ID: 53268 · Report as offensive     Reply Quote
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 82
Credit: 2,606,406
RAC: 10,152
Message 53269 - Posted: 24 Mar 2026, 16:32:32 UTC - in response to Message 53268.  
Last modified: 24 Mar 2026, 16:48:11 UTC

In reply to Mr P Hucker's message of 24 Mar 2026:
In reply to Garrulus glandarius's message of 24 Mar 2026:
I managed to somehow get LHC to run vbox on an old laptop. Seems like Theory Simulation v301.00 (vbox64_theory) tasks run fine while Theory Simulation v302.10 (docker) ones crash after about 30 minutes of doing nothing (insignificant CPU load). Is there a way to only get vbox64 tasks? I'm not sure I can get Docker to run properly. Been struggling with the help of Gemini today, but I might just remain with ATLAS if all else fails.

Reverse how I fixed mine. In cc_config.xml, put <dont_use_docker>1</dont_use_docker>

As for getting it to work, how fast is your internet? Mine is 34Mbps, and it looks like it got impatient the first time. If you let it fail the tasks, it will eventually download enough stuff.


I did the config trick recommended by Gemini, but then I kept getting red warnings that docker is not allowed and nothing downloaded. My net is slower but nowhere near as slow as some people around here (BOINC in general) complain.

Edit: weird, seems to have worked the second time around.

Re-edit: welp, it just downloaded another docker one... Running 30 minutes for no reason is pretty expensive time-wise, especially it if keeps getting many docker tasks.

Re-re-edit: Ok, this is now total black magic. It seems that the docker task is running correctly now. I can acees the running log file and it's advancing with the events.
ID: 53269 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53271 - Posted: 24 Mar 2026, 16:51:46 UTC - in response to Message 53269.  
Last modified: 24 Mar 2026, 16:55:42 UTC

Have you downloaded the latest docker auto-setup? This sets up everything correctly itself.

https://github.com/BOINC/boinc-buda-runner-wsl-installer/releases?hl=en-GB-u-fw-mon-ms-uksystem-mu-celsius

It could be yours was being impatient like mine, there is a lot of data to download. If your internet is as slow as my 34Mbit, that may have been the problem initially.

Also make sure all these are ticked (in "turn windows features on or off"):
Hyper-V
Virtual Machine Platform
Windows Hypervisor platform
Windows Subsystem for Linux
ID: 53271 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 946
Credit: 783,874,658
RAC: 168,848
Message 53274 - Posted: 24 Mar 2026, 17:31:05 UTC - in response to Message 53249.  

If you remove VirtualBox then you can use buda to do the docker/podman task for Theory. I set this up on a spare windows computer to test but my main windows comptuer I kept on VirtualBox for now.
ID: 53274 · Report as offensive     Reply Quote
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 82
Credit: 2,606,406
RAC: 10,152
Message 53275 - Posted: 24 Mar 2026, 17:42:39 UTC
Last modified: 24 Mar 2026, 18:05:39 UTC

Oops, looks like I forgot to mention I was talking about a Linux system, sorry guys. Since then I had 2 vbox tasks fail and the ones that are running seem to have restarted from scratch at a much slower pace... I have no idea what's going on. Guess I'll just let them run and see what works (if something finally works, lol)

Edit: I'm also very prone to being impatient so I aborted everything, restarted the rig and got a fresh batch of 8 docker tasks. The first 4 seem to be running fine now, one is relatively fast, 2 average and one very slow. All 4 CPUs got maxed out once the events started rolling so I'm trying to be hopeful. Not going to touch anything anymore.
ID: 53275 · Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 468
Credit: 15,268,630
RAC: 10,010
Message 53276 - Posted: 24 Mar 2026, 17:46:58 UTC - in response to Message 53274.  
Last modified: 24 Mar 2026, 17:47:15 UTC

In reply to Toby Broom's message of 24 Mar 2026:
If you remove VirtualBox then you can use buda to do the docker/podman task for Theory. I set this up on a spare windows computer to test but my main windows comptuer I kept on VirtualBox for now.
I have both installed. I think LHC is refusing to give me ATLAS/CMS because they would run slower, but it allows vbox from other projects (if there are any!)
ID: 53276 · Report as offensive     Reply Quote
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 82
Credit: 2,606,406
RAC: 10,152
Message 53277 - Posted: 24 Mar 2026, 19:16:37 UTC

It ran fine, until the end, when it crashed and has no stderr...
ID: 53277 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 946
Credit: 783,874,658
RAC: 168,848
Message 53280 - Posted: 25 Mar 2026, 7:59:11 UTC - in response to Message 53277.  

did you set up cmvfs and move podman to use cgroup's?
ID: 53280 · Report as offensive     Reply Quote
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 82
Credit: 2,606,406
RAC: 10,152
Message 53281 - Posted: 25 Mar 2026, 11:12:24 UTC - in response to Message 53280.  

In reply to Toby Broom's message of 25 Mar 2026:
did you set up cmvfs and move podman to use cgroup's?

I think so. Gemini gave me some commands for setting up cmvfs and some other settings. After each crash/error, I gave the error message to the AI and it came up with apparently good solutions. Got everything working until the last step apparently (with docker). Somehow a couple of vbox tasks did finish correctly earlier, but I have no idea how to replicate those settings. I wanted to somehow filter out getting docker tasks, but it doesn't work.
ID: 53281 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 946
Credit: 783,874,658
RAC: 168,848
Message 53290 - Posted: 25 Mar 2026, 21:39:03 UTC - in response to Message 53281.  
Last modified: 26 Mar 2026, 8:05:03 UTC

The one you aborted seemed to be working:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=434108926

the only thing in the logs was

Also add "CVMFS_USE_CDN=yes" to "/etc/cvmfs/default.local".

You might want to setup proxy on to reduce the load on the CERN servers:

https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5473
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5474


Your VBox ones vaildated so its seem like all is good.

Does cc_config.xml, put <dont_use_docker>1</dont_use_docker> not work?
ID: 53290 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2753
Credit: 303,792,433
RAC: 108,655
Message 53294 - Posted: 26 Mar 2026, 7:11:28 UTC - in response to Message 53290.  

In reply to Toby Broom's message of 25 Mar 2026:
... then the above comment about CDN would be not needed ...

This is wrong!

"CVMFS_USE_CDN=yes" must always be set!
Exceptions are possible for clients running inside CERN's own network or inside networks registered as CERN tier-n partners.
Most (nearly all) clients here do NOT meet that partner relationship.


In addition a local proxy (Squid) is recommended (better: it is a MUST!) for all LANs running more than 5 clients concurrently.
Typical examples:
- a multicore computer running more than 5 vbox VMs concurrently
- a multicore computer running more than 5 docker containers concurrently using CVMFS inside each container
- a mix of the previous 2 exceeding 5 concurrently running CVMFS clients
- multiple computers inside the same LAN running more than 5 VMs/containers (better: CVMFS clients) in total


What must NEVER happen:
A setup without a local proxy together with "CVMFS_USE_CDN=no".
This will cause alarms in CERN's monitoring systems and may even cause offending computers to be blocked.
ID: 53294 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 946
Credit: 783,874,658
RAC: 168,848
Message 53296 - Posted: 26 Mar 2026, 8:04:53 UTC - in response to Message 53294.  

Thanks for correction, sorry.
ID: 53296 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · Next

Message boards : Theory Application : Feedback on the Theory docker app


©2026 CERN