ATLAS native v2.91
Author | Message |
---|---|
Joined: 13 May 14 Posts: 387 Credit: 15,314,184 RAC: 0 |
ATLAS native 2.91 was just released. It contains the improvements from v2.90, with the problem with read-only tmp dirs fixed. Please let us know if you see any problems! |
Joined: 15 Jun 08 Posts: 2413 Credit: 226,503,014 RAC: 131,868 |
First tasks are running with a local apptainer (from the Linux vendor). So far there are no unexpected issues. |
Joined: 15 Jun 08 Posts: 2413 Credit: 226,503,014 RAC: 131,868 |
Since this ATLAS version came out, the average load values have been increasing slowly but steadily. The more cores are given to ATLAS, the steeper the increase. It looks like the cleanup at the end of a task doesn't work and some processes keep running. This needs investigation. The bad thing is that it will sooner or later lead to a crash. After a reboot, fresh ATLAS tasks create the following subfolders in the global tmp folders: hsperfdata_<boinc_username> in /tmp and .cricinfo_<boinc_userid> in /var/tmp. |
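To check a host for the subfolders named in the post above, something like the following could be used. It is only a sketch: the folder patterns are taken from the post, while the function name `find_task_dirs` and the `EXPECTED` mapping are made up for illustration.

```python
from pathlib import Path

# Patterns taken from the post above; pairing each with its tmp root
# in a dict is an assumption made for this sketch.
EXPECTED = {
    "/tmp": "hsperfdata_*",
    "/var/tmp": ".cricinfo_*",
}


def find_task_dirs(expected=EXPECTED):
    """Return the existing directories that match each pattern, sorted by path."""
    found = []
    for root, pattern in expected.items():
        base = Path(root)
        if not base.is_dir():
            continue  # this tmp root does not exist on the host
        # Keep directories only; plain files with a matching name are ignored.
        found.extend(p for p in base.glob(pattern) if p.is_dir())
    return sorted(found)


if __name__ == "__main__":
    for path in find_task_dirs():
        print(path)
```

Running it once before and once after a task finishes shows whether the folders are removed during cleanup.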
Joined: 13 May 14 Posts: 387 Credit: 15,314,184 RAC: 0 |
Can you see what processes are left running once a task finishes? Unfortunately, some parts of ATLAS tasks are still hard-coded to use /var/tmp; we are working on fixing this. But I think /tmp/hsperfdata is something related to Java(?) and not coming from ATLAS tasks. |
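One way to answer the question above on a Linux host is to list every process still owned by the BOINC account after a task has finished. The sketch below reads the `/proc` filesystem directly. The function name `processes_of_uid` is hypothetical, and it assumes that BOINC tasks run under a dedicated user ID.

```python
import os


def processes_of_uid(uid, proc_root="/proc"):
    """Return sorted (pid, command line) pairs for processes owned by uid."""
    result = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # only numeric entries are process directories
        proc_dir = os.path.join(proc_root, entry)
        try:
            if os.stat(proc_dir).st_uid != uid:
                continue
            with open(os.path.join(proc_dir, "cmdline"), "rb") as f:
                raw = f.read()
        except OSError:
            continue  # the process exited meanwhile or is not readable
        # Arguments in cmdline are separated by NUL bytes.
        cmd = raw.replace(b"\0", b" ").decode(errors="replace").strip()
        result.append((int(entry), cmd))
    return sorted(result)


if __name__ == "__main__":
    # Lists the current user's processes; pass the BOINC account's uid instead.
    for pid, cmd in processes_of_uid(os.geteuid()):
        print(pid, cmd)
```

The uid of the BOINC account can be looked up with `pwd.getpwnam("boinc").pw_uid`, where `boinc` is an assumed account name that differs between installations. Anything that still shows up well after the task has ended is a candidate for the leftover processes described above.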
Joined: 15 Jun 08 Posts: 2413 Credit: 226,503,014 RAC: 131,868 |
Just started an ATLAS task on a BOINC test instance. My proxy log shows this entry, requested by that ATLAS task: [06/Sep/2022:12:02:08 +0200] "CONNECT atlas-cric.cern.ch:443 HTTP/1.0" 200 1797516 "-" "-" TCP_TUNNEL:HIER_DIRECT Quote: "But I think /tmp/hsperfdata is something related to Java(?) and not coming from ATLAS tasks." Right, it's from Java, but it is created during the start of ATLAS. I will let the task finish to see what remains. |
Joined: 2 May 07 Posts: 2101 Credit: 159,818,488 RAC: 127,549 |
I have one test running in a CentOS9-VM: https://lhcathome.cern.ch/lhcathome/results.php?hostid=10813499 and I see no entries in /tmp or /var/tmp. |
Joined: 15 Jun 08 Posts: 2413 Credit: 226,503,014 RAC: 131,868 |
Quote: "Just started an ATLAS task on a BOINC test instance." The test task finished without leaving anything weird. Let's see what happens overnight when the machines are back on full load. |
Joined: 2 Sep 04 Posts: 453 Credit: 193,569,815 RAC: 9,173 |
Since today around 12:00 UTC, I see many (not all) ATLAS native tasks fail after 600 seconds of runtime. You can check here: https://lhcathome.cern.ch/lhcathome/results.php?userid=555&offset=0&show_names=0&state=6&appid= Supporting BOINC, a great concept! |
©2024 CERN