Message boards :
ATLAS application :
All tasks failing
Message board moderation
Author | Message |
---|---|
Send message Joined: 4 Sep 22 Posts: 92 Credit: 16,008,656 RAC: 5,452 |
Since about 23:30 25 Sept, I have had only one successful task run to completion. All the others have been failing with this in the stderr_txt: 2024-09-25 20:01:36 (15434): Command: VBoxManage -q storageattach "boinc_674f437b0a9c5e28" --storagectl "Hard Disk Controller" --port 0 --device 0 --type hdd --mtype multiattach --medium "/var/lib/boinc/projects/lhcathome.cern.ch_lhcathome/ATLAS_vbox_3.01_image.vdi" Exit Code: -2135228409 Output: VBoxManage: error: Cannot attach medium '/var/lib/boinc/projects/lhcathome.cern.ch_lhcathome/ATLAS_vbox_3.01_image.vdi': the media type 'MultiAttach' can only be attached to machines that were created with VirtualBox 4.0 or later VBoxManage: error: Details: code VBOX_E_INVALID_OBJECT_STATE (0x80bb0007), component SessionMachine, interface IMachine, callee nsISupports VBoxManage: error: Context: "AttachDevice(Bstr(pszCtl).raw(), port, device, DeviceType_HardDisk, pMedium2Mount)" at line 785 of file VBoxManageStorageController.cpp 2024-09-25 20:01:36 (15434): Command: VBoxManage -q closemedium "/var/lib/boinc/projects/lhcathome.cern.ch_lhcathome/ATLAS_vbox_3.01_image.vdi" Exit Code: 0 Output: |
Send message Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 573 |
Did you have a look with VirtualBox Manager - Tools - Media, whether you maybe have child media with exclamation marks. |
Send message Joined: 4 Sep 22 Posts: 92 Credit: 16,008,656 RAC: 5,452 |
Did you have a look with VirtualBox Manager - Tools - Media, whether you maybe have child media with exclamation marks. None |
Send message Joined: 4 Mar 17 Posts: 25 Credit: 10,262,043 RAC: 574 |
Looking at your tasks all fail with Exit status 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED and have 12GB Peak disk usage. So it is not your fault, you just have gotten a lot of the 6.09GB task files that fail for everyone. https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6214 Not sure if that 6GB tasks are still sent. Have not gotten a big one the last hours. |
Send message Joined: 4 Sep 22 Posts: 92 Credit: 16,008,656 RAC: 5,452 |
Not sure if that 6GB tasks are still sent. Have not gotten a big one the last hours. That's probably because the queue is empty right now. |
Send message Joined: 18 Dec 15 Posts: 1823 Credit: 119,020,715 RAC: 16,704 |
what's happening with ATLAS ? tons of tasks since last night erroring out after a few minutes, see: https://lhcathome.cern.ch/lhcathome/result.php?resultid=416867873 BTW: the download files per task are 1,53GB large (!!!) P.S.: I just looked up the tasks list of a few other volunteers - same problem there. So at least there's nothing wrong with my hosts. |
Send message Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 573 |
From the Guest Log: "pilotErrorDiag": "Failed to execute payload:/bin/bash: Sim_tf.py: command not found\n No idea whether someone at ATLAS can fix that. |
Send message Joined: 7 Aug 14 Posts: 27 Credit: 10,000,233 RAC: 131 |
From the Guest Log: "pilotErrorDiag": "Failed to execute payload:/bin/bash: Sim_tf.py: command not found\n Same error as https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6248 Who knows, might even be the same work sent out again ! |
Send message Joined: 24 May 23 Posts: 43 Credit: 2,624,143 RAC: 3,726 |
Same error as https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6248 Much disappointing. And a bit surprising, to me, since we're speaking of the CERN. However I don't know how much useful can actually be the work processed on our PCs, and so how much effort it deserves beside CERN "regular" work. It's a matter of fact that, among the few distributed computing projects still alive (a small fraction of those running not so many years ago), this one looks more prone to periodical issues (in my limited experience, at least), which I find... unexpected. -- Bye, Lem |
Send message Joined: 17 Sep 04 Posts: 105 Credit: 32,824,862 RAC: 40 |
[quote]Same error as [url]https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6248 I certainly get the feeling that this project is not a particularly high priority. Regards, Bob P. |
Send message Joined: 18 Dec 15 Posts: 1823 Credit: 119,020,715 RAC: 16,704 |
I certainly get the feeling that this project is not a particularly high priority.that's exactly what I think, unfortunately :-( |
Send message Joined: 7 Aug 11 Posts: 105 Credit: 25,221,969 RAC: 7,833 |
Come on folks, really?? cvmfs_config probe returns normally so it's not my local machine or network, the data simply isn't there. [2024-11-23 19:24:26] "exeErrorCode": 0, [2024-11-23 19:24:26] "exeErrorDiag": "", [2024-11-23 19:24:26] "pilotErrorCode": 1305, [2024-11-23 19:24:26] "pilotErrorDiag": "Failed to execute payload:/bin/bash: Sim_tf.py: command not found\n", So what's new up there since the last time the Altas project had viable work? New work experience kids? New hardware that hasn't been configured? Someone got switched to decaf as a prank? |
Send message Joined: 4 Mar 11 Posts: 29 Credit: 3,848,900 RAC: 7 |
I had four "good" tasks on the 22nd Nov, since then all (16) have failed with "validate error" as the headline. Lots of strange messages: 2024-11-24 13:37:11 (7136): Guest Log: *** Starting ATLAS job. (PandaID=6416690328 taskID=42161013) *** Then after a few more lines I get:
and the VM stops in an orderly manner. (Meanwhile CMS tasks are happily running on the same computer) |
Send message Joined: 18 Dec 15 Posts: 1823 Credit: 119,020,715 RAC: 16,704 |
(Meanwhile CMS tasks are happily running on the same computer)CMS tasks are running, but for sure not "happily". They do NOT download jobs right at the beginning, that's why there is no CPU usage and the task finishes after about half an hour. You even get a few credit points, but the result of the task is of NO VALUE to the science. Normally, there is a mechanism which stops the distribution of tasks as soon as there are no jobs available. However, now this does not seem to work, and already for 5 days many volunteers download and process these CMS tasks - for nothing, unfortunately. And, even worse, obviously no one at the receiving point of these useless tasks has noticed this so far and stopped this nonsense :-( |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 206 |
Atlas-Tasks with Creditpoints, but no running Job inside of an Intel-Board. https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10593998 |
Send message Joined: 7 Aug 11 Posts: 105 Credit: 25,221,969 RAC: 7,833 |
Have we kicked the Trisolarians out of the system yet? Or is it the work experience kid still configuring things? |
Send message Joined: 23 Dec 19 Posts: 18 Credit: 43,744,045 RAC: 11,832 |
All Atlas and CMS jobs are still failing. Atlas error log 2024-11-28 16:37:20 (938304): Guest Log: *** Error codes and diagnostics *** 2024-11-28 16:37:20 (938304): Guest Log: "exeErrorCode": 0, 2024-11-28 16:37:20 (938304): Guest Log: "exeErrorDiag": "", 2024-11-28 16:37:20 (938304): Guest Log: "pilotErrorCode": 1305, 2024-11-28 16:37:20 (938304): Guest Log: "pilotErrorDiag": "Failed to execute payload:/bin/bash: Sim_tf.py: command not found\n", And on CMS, as others have informed, VM starts but nothing gets executed. This is frustrating. |
Send message Joined: 28 Dec 08 Posts: 341 Credit: 4,865,275 RAC: 58 |
CMS task with just 2 seconds: https://lhcathome.cern.ch/lhcathome/result.php?resultid=417360799 |
©2025 CERN