Message boards : Theory Application : Theory in containers
Message board moderation
| Author | Message |
|---|---|
LaurenceSend message Joined: 20 Jun 14 Posts: 420 Credit: 240,048 RAC: 118 |
A new version of the Theory app which runs in containers in now available as a beta. In order to run this you will need BOINC client v8.2 or newer. Podman should also be available on your system. The documentation for this can be found on the BOINC wiki. |
|
Send message Joined: 30 Sep 21 Posts: 2 Credit: 9,032,345 RAC: 23,533 |
Hi, I have received a bunch of these workunits and they all seem to fail within seconds. Normally i get native ATLAS workunits and that seems to work fine; eg CVMFS and podman seem to be working as intended. Is there a setting i have to change on my end to not have these workunits fail? The gist of the errorlogs: time="2026-01-28T09:59:58+01:00" level=warning msg="The cgroupv2 manager is set to systemd but there is no systemd user session available" time="2026-01-28T09:59:58+01:00" level=warning msg="For using systemd, you may need to login using an user session" time="2026-01-28T09:59:58+01:00" level=warning msg="Alternatively, you can enable lingering with: `loginctl enable-linger 126` (possibly as root)" time="2026-01-28T09:59:58+01:00" level=warning msg="Falling back to --cgroup-manager=cgroupfs" time="2026-01-28T09:59:58+01:00" level=warning msg="The cgroupv2 manager is set to systemd but there is no systemd user session available" time="2026-01-28T09:59:58+01:00" level=warning msg="For using systemd, you may need to login using an user session" time="2026-01-28T09:59:58+01:00" level=warning msg="Alternatively, you can enable lingering with: `loginctl enable-linger 126` (possibly as root)" time="2026-01-28T09:59:58+01:00" level=warning msg="Falling back to --cgroup-manager=cgroupfs" For now i have disabled getting tasks to not flood you with broken workunits. Kind regards |
|
Send message Joined: 4 Mar 17 Posts: 37 Credit: 12,435,212 RAC: 6,778 |
On my device(Arch linux with docker) do they run longer but they make the finish steps and boinc count them as computation failure ********* Total number of errors, excluding junctions = 0 ************* ********* Total number of errors, including junctions = 0 ************* ********* Total number of warnings = 0 ************* ********* Fraction of events that fail fragmentation cuts = 0.00000 ********* Generator run finished successfully INFO: rivet analysis finished: numEvents=100000 crossSection=23.5899 --- the last line of the log data: REF_ALEPH_2004_I636645_d91-x01-y01.dat -> /scratch/dat/ALEPH_2004_I636645-ee-189/zhad-C-aleph1-d91-x01-y01/ALEPH_2004_I636645.dat https://lhcathome.cern.ch/lhcathome/result.php?resultid=432008470 https://lhcathome.cern.ch/lhcathome/result.php?resultid=432008470 |
LaurenceSend message Joined: 20 Jun 14 Posts: 420 Credit: 240,048 RAC: 118 |
For Linux, you might have to enable linger for the boinc user. sudo usermod --add-subuids 100000-165535 --add-subgids 100000-165535 boinc sudo loginctl enable-linger boinc cat /var/lib/boinc/.config/containers/containers.conf [engine] cgroup_manager = "cgroupfs" |
|
Send message Joined: 14 Jan 10 Posts: 1491 Credit: 9,985,849 RAC: 991 |
In reply to Schizm's message of 28 Jan 2026: Hi,I'm a layman on Linux and containers, but I think you have to set in containers.conf "cgroup-manager=cgroupfs" |
LaurenceSend message Joined: 20 Jun 14 Posts: 420 Credit: 240,048 RAC: 118 |
I can't see any output from the job. This is strange since even if the job failed, we should still see some output from the client or docker wrapper. |
|
Send message Joined: 4 Mar 17 Posts: 37 Credit: 12,435,212 RAC: 6,778 |
Here are some logs of this task https://lhcathome.cern.ch/lhcathome/result.php?resultid=431994641 https://pastebin.com/jGVBsSBT stderr.txt https://pastebin.com/m7YVeS1G runRivet.log Once the current tasks have finished i will try to switch to podman if that works better than docker. |
|
Send message Joined: 14 Jan 10 Posts: 1491 Credit: 9,985,849 RAC: 991 |
@Laurence: Why are you running every 10 seconds these 2 commands: ps --all -f and stats --no-stream --format "{{.CPUPerc}} {{.MemUsage}}" Do you consider to reduce this? |
|
Send message Joined: 4 Mar 17 Posts: 37 Credit: 12,435,212 RAC: 6,778 |
In reply to Toggleton's message of 28 Jan 2026: Once the current tasks have finished i will try to switch to podman if that works better than docker Not sure why docker did not work but Podman with linger enabled works fine with multiple tasks finished successfully. |
LaurenceSend message Joined: 20 Jun 14 Posts: 420 Credit: 240,048 RAC: 118 |
In reply to Crystal Pellet's message of 28 Jan 2026: @Laurence: Why are you running every 10 seconds these 2 commands: Not sure, this in in the upstream code. Feel free to post the question in the issue tracker. |
LaurenceSend message Joined: 20 Jun 14 Posts: 420 Credit: 240,048 RAC: 118 |
In reply to Toggleton's message of 28 Jan 2026: In reply to Toggleton's message of 28 Jan 2026: Great! |
|
Send message Joined: 14 Jan 10 Posts: 1491 Credit: 9,985,849 RAC: 991 |
In reply to Laurence's message of 28 Jan 2026: In reply to Crystal Pellet's message of 28 Jan 2026:Thanks Laurence. It suppose to be a check every 10 seconds, whether the job has exited. |
|
Send message Joined: 7 May 08 Posts: 266 Credit: 2,118,600 RAC: 2,026 |
Up to now, no problems with my Win11 64bit |
LaurenceSend message Joined: 20 Jun 14 Posts: 420 Credit: 240,048 RAC: 118 |
In reply to Crystal Pellet's message of 28 Jan 2026: In reply to Laurence's message of 28 Jan 2026: I think I can turn the logging verbosity down to remove this from the logs. For now it is good to see more details just in case we have any issues. |
|
Send message Joined: 30 Sep 21 Posts: 2 Credit: 9,032,345 RAC: 23,533 |
Enabled linger locally, this did not change anything. Subuids and subgids were set for both boinc group and local user. Using cgroupfs instead of systemd only seems to hide docker (and podman) from the boinc client (for all projects); as far as i know cgroupv2 was needed for rootless containers. When systemd is used LHC detects podman and world community grid detects docker. Boinc client used on this machine: 8.2.8 podman version 3.4.4 Docker version 28.2.2, build 28.2.2-0ubuntu1~22.04.1 Running docker's hello-world results in this: Hello from Docker!
This message shows that your installation appears to be working correctly.
To generate this message, Docker took the following steps:
1. The Docker client contacted the Docker daemon.
2. The Docker daemon pulled the "hello-world" image from the Docker Hub.
(amd64)
3. The Docker daemon created a new container from that image which runs the
executable that produces the output you are currently reading.
4. The Docker daemon streamed that output to the Docker client, which sent it
to your terminal.How the running processes for podman and docker look: schizm@Enceladus:~$ ps -ef|grep podman boinc 1337 1243 0 13:15 ? 00:00:00 /usr/bin/podman boinc 2510 1 0 13:15 ? 00:00:00 /usr/bin/slirp4netns --disable-host-loopback --mtu=65520 --enable-sandbox --enable-seccomp -c -e 3 -r 4 --netns-type=path /tmp/podman-run-126/netns/cni-c2ec05af-66c0-e5de-7d6e-27e58d346990 tap0 schizm 2888 2796 0 13:15 ? 00:00:00 /usr/bin/podman schizm 143078 3676 0 16:29 pts/0 00:00:00 grep --color=auto podman schizm@Enceladus:~$ ps -ef|grep docker root 1631 1 0 13:15 ? 00:00:01 /usr/bin/dockerd -H fd:// --containerd=/run/containerd/containerd.sock schizm 143090 3676 0 16:30 pts/0 00:00:00 grep --color=auto docker Another note to add: There was an issue with virtualbox and podman running simultaneously, could something similar be the case here? Can not test my latest changes for theory since i'm getting atlas tasks again instead, but i might have to run the docker daemon under another uid as a fix. Will try to update here after i get new theory tasks. |
|
Send message Joined: 27 Sep 08 Posts: 898 Credit: 770,607,830 RAC: 175,526 |
I didn't have issues to run poadman and virtualbox at the same time before. just got a few sucessfuls running both. You don't need both docker and podman, I chose podman |
|
Send message Joined: 7 May 08 Posts: 266 Credit: 2,118,600 RAC: 2,026 |
In reply to [VENETO] boboviz's message of 28 Jan 2026: Up to now, no problems with my Win11 64bit Mmm, some errors, like 432010091 <message> |
©2026 CERN