Message boards :
ATLAS application :
atlas error
Message board moderation
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Send message Joined: 3 Nov 12 Posts: 68 Credit: 150,309,918 RAC: 119,683 ![]() ![]() ![]() |
I got same invalid results for a bunch of my tasks too. (Example) These are coming more and more now. Dozens of them. It^s incredible ... |
Send message Joined: 3 Nov 12 Posts: 68 Credit: 150,309,918 RAC: 119,683 ![]() ![]() ![]() |
If you use the suggested CVMFS configuration supplementary repositories are mounted automatically when an app requires them. looks like this is true for automounter "autofs" but it's not for automounter "systemd". switched to systemd since autofs is no longer available as a binary on manjaro linux. any hints for configuring systemd? |
Send message Joined: 14 Sep 08 Posts: 52 Credit: 66,850,956 RAC: 23,733 ![]() ![]() ![]() |
I just realized I wasted thousands of tasks and 1.5TB of the project bandwidth in past 20 hours... Oops and very sorry for that and I have paused all work fetch now. My setup was a bit weird, carried over from Arch when I ran the cvmfs container with configs copied over from an Ubuntu VM after installing the deb package there. Guess I can install the proper official packages now that I've switched back to Ubuntu. Hopefully that would make sure I always have the recommended configuration from now on. Is this the latest recommended configuration? (Edit: Guess yes. I have seen the task successfully find the nightlies repo afterwards.) Related note: I wonder if it's possible to have the task fail, instead of showing validation error after uploading the result for basic setup issues, like the missing repo error here? That way, BOINC client would automatically back off, instead of keeping fetching and uploading invalid results. I have monitoring for failed jobs on client side too. However, a successfully uploaded result marked as invalid requires me to check the website periodically. I noticed this today only from my bandwidth monitoring... |
Send message Joined: 18 Dec 15 Posts: 1843 Credit: 126,561,259 RAC: 128,095 ![]() ![]() ![]() |
since yesterday, I have had a few tasks on different PCs which successfully produced a HITS file and were uploaded okay, but ended up with "confirmation error" ("Bestätigungsfehler") - example see here: https://lhcathome.cern.ch/lhcathome/result.php?resultid=411294133 any idea what's behind this ? |
Send message Joined: 3 Nov 12 Posts: 68 Credit: 150,309,918 RAC: 119,683 ![]() ![]() ![]() |
since yesterday, I have had a few tasks on different PCs which successfully produced a HITS file and were uploaded okay, but ended up with "confirmation error" ("Bestätigungsfehler") - example see here: Same here "Atlas native" https://lhcathome.cern.ch/lhcathome/result.php?resultid=411222337 No errors but invalid :-( |
Send message Joined: 2 May 07 Posts: 2260 Credit: 175,581,097 RAC: 11,545 ![]() ![]() ![]() |
Seeing the same, hoping Cern-IT give us the creditpoints, because the Atlas-Tasks are confirmed and validated. This are days of running time for me. |
Send message Joined: 18 Dec 15 Posts: 1843 Credit: 126,561,259 RAC: 128,095 ![]() ![]() ![]() |
last night, I had at least two tasks which failed after about 5 minutes: https://lhcathome.cern.ch/lhcathome/result.php?resultid=411314062 Edit: I now have found several other such tasks on various other computers within my network. |
![]() Send message Joined: 15 Jul 05 Posts: 250 Credit: 5,974,599 RAC: 0 ![]() ![]() |
Sorry for these validation errors. We had a problem with the ATLAS validator yesterday on our new server. We have now set the relevant results to re-validate in the database. Hopefully you should get credit soon, unless something else prevents it. |
Send message Joined: 18 Dec 15 Posts: 1843 Credit: 126,561,259 RAC: 128,095 ![]() ![]() ![]() |
Sorry for these validation errors. We had a problem with the ATLAS validator yesterday on our new server. We have now set the relevant results to re-validate in the database. Hopefully you should get credit soon, unless something else prevents it.I just noticed that the status of the tasks in question has changed from "invalid" to "unknown" - and still no credits ... why "unknown" ? |
Send message Joined: 2 May 07 Posts: 2260 Credit: 175,581,097 RAC: 11,545 ![]() ![]() ![]() |
Ok, Nils, the Atlas-Tasks are away from Boinc for me. They are now shown as running. But the Timestamp is from yesterday. https://lhcathome.cern.ch/lhcathome/results.php?userid=75468 Edit: now download failed. |
Send message Joined: 2 May 07 Posts: 2260 Credit: 175,581,097 RAC: 11,545 ![]() ![]() ![]() |
Now, waiting for Confirmation, finished successful. |
Send message Joined: 2 May 07 Posts: 2260 Credit: 175,581,097 RAC: 11,545 ![]() ![]() ![]() |
Creditpoints completed, Thank you. |
Send message Joined: 3 Nov 12 Posts: 68 Credit: 150,309,918 RAC: 119,683 ![]() ![]() ![]() |
Got two newer WU's with invalid result. They looks o.k. for me. So what's wrong? https://lhcathome.cern.ch/lhcathome/result.php?resultid=411314039 https://lhcathome.cern.ch/lhcathome/result.php?resultid=411258905 |
![]() Send message Joined: 15 Jun 08 Posts: 2606 Credit: 262,432,412 RAC: 136,862 ![]() ![]() |
Looks like something got messed on the new server. This result has been received before it has been sent: https://lhcathome.cern.ch/lhcathome/result.php?resultid=411215779 Sent 23 May 2024, 9:09:38 UTC Received 22 May 2024, 7:13:56 UTC |
Send message Joined: 2 May 07 Posts: 2260 Credit: 175,581,097 RAC: 11,545 ![]() ![]() ![]() |
Got two newer WU's with invalid result. Now 1.950 instead of 400 Events. |
Send message Joined: 27 Apr 24 Posts: 13 Credit: 1,065,939 RAC: 1,012 ![]() ![]() |
Since I updated to Ubuntu 24.04 LTS, my apptainer for Atlas isn't working. I have purged, and reinstalled my apptainer application, to no effect. I have no idea what is wrong, or how to fix it. [2024-05-25 09:25:59] Using apptainer image /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 [2024-05-25 09:25:59] Checking for apptainer binary... [2024-05-25 09:25:59] Using apptainer found in PATH at /usr/bin/apptainer [2024-05-25 09:25:59] Running /usr/bin/apptainer --version [2024-05-25 09:25:59] apptainer version 1.3.1 [2024-05-25 09:25:59] Checking apptainer works with /usr/bin/apptainer exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/containers/fs/singularity/x86_64-centos7 hostname [2024-05-25 09:25:59] apptainer isnt working: [91mERROR : Could not write info to setgroups: Permission denied [2024-05-25 09:25:59] [0m[91mERROR : Error while waiting event for user namespace mappings: no event received [2024-05-25 09:25:59] [0m 09:36:00 (11112): run_atlas exited; CPU time 0.127016 09:36:00 (11112): app exit status: 0x1 09:36:00 (11112): called boinc_finish(195) |
Send message Joined: 2 May 07 Posts: 2260 Credit: 175,581,097 RAC: 11,545 ![]() ![]() ![]() |
[2024-05-25 09:25:59] apptainer isnt working: [91mERROR : Could not write info to setgroups: Permission denied You can search in folder number crunching for this setting. |
Send message Joined: 27 Apr 24 Posts: 13 Credit: 1,065,939 RAC: 1,012 ![]() ![]() |
You can search in folder number crunching for this setting. I don't understand what your answer means. I can see Permission denied, but I don't know how or where to fix this. At the moment, I am limited to only using Virtualbox for Atlas or Theory work units. |
Send message Joined: 2 May 07 Posts: 2260 Credit: 175,581,097 RAC: 11,545 ![]() ![]() ![]() |
|
Send message Joined: 14 Sep 08 Posts: 52 Credit: 66,850,956 RAC: 23,733 ![]() ![]() ![]() |
Well you didn't search yourself. :-) @M0CZY's message above is the only one on this forum that showed the exact error messages. I ran into the same issue after upgrading to Ubuntu 24.04. Example failed task: https://lhcathome.cern.ch/lhcathome/result.php?resultid=411375194 The fix is to follow the workaround in this report. Execute the two commands as root. echo "kernel.apparmor_restrict_unprivileged_userns = 0" >/etc/sysctl.d/99-userns.conf sysctl --system Then the failed apptainer command in the error log should now print the host name when executed with normal user privilege. Note that this effectively reverts the tightened user namespace setting in Ubuntu 24.04. For anyone who knows apparmor configs better (not me), there will likely be a more restricted approach to only give apptainer the permission. |
©2025 CERN