Message boards : Theory Application : 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED - how come?
Message board moderation
Previous · 1 . . . 3 · 4 · 5 · 6
| Author | Message |
|---|---|
|
Send message Joined: 19 Feb 08 Posts: 708 Credit: 4,336,250 RAC: 0 |
They are spoiling my Science United statistics. Tullio |
|
Send message Joined: 14 Jan 10 Posts: 1530 Credit: 10,031,459 RAC: 1,323 |
Three tasks in this workunit: https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=130540238 1st EXIT_DISK_LIMIT_EXCEEDED https://lhcathome.cern.ch/lhcathome/result.php?resultid=259086785 2nd aborted by client https://lhcathome.cern.ch/lhcathome/result.php?resultid=259092479 3rd Valid cause I increased the rsc_disk_bound from 2000000000 to 8000000000 in my client_state.xml Could an admin increase that setting for rsc_disk_bound server-side? It will increase the number of successes and avoid wasting CPU-cycles. |
LaurenceSend message Joined: 20 Jun 14 Posts: 431 Credit: 253,474 RAC: 206 |
Three tasks in this workunit: https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=130540238 Done |
|
Send message Joined: 14 Jan 10 Posts: 1530 Credit: 10,031,459 RAC: 1,323 |
Thanks, Laurence.Could an admin increase that setting for rsc_disk_bound server-side? It will increase the number of successes and avoid wasting CPU-cycles.Done I just got a generator madgraph5amc again and after 20 minutes the image.vdi is already 3,056,599,040 bytes. |
|
Send message Joined: 18 Dec 15 Posts: 1958 Credit: 158,891,804 RAC: 47,490 |
|
|
Send message Joined: 14 Jan 10 Posts: 1530 Credit: 10,031,459 RAC: 1,323 |
the next one:This time the forced stop was useful to avoid extreme disk usage caused by a faulty task. The resend had the same problem (also over 17GB) and so will do the 3rd (last) try. https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=130930529 Edit: The 3rd try failed for another reason: lock by another process. |
|
Send message Joined: 18 Dec 15 Posts: 1958 Credit: 158,891,804 RAC: 47,490 |
(also over 17GB)which is nice for hosts with a SSD :-( |
|
Send message Joined: 18 Dec 15 Posts: 1958 Credit: 158,891,804 RAC: 47,490 |
Now I had another one, which failed after 5 hrs 30 min: https://lhcathome.cern.ch/lhcathome/result.php?resultid=268226030 and I thought that this kind of problem was solved |
|
Send message Joined: 14 Jan 10 Posts: 1530 Credit: 10,031,459 RAC: 1,323 |
Now I had another one, which failed after 5 hrs 30 min:pp jets 7000 170,-,2960 - sherpa 1.4.1 default: is one of the jobs I don't even start. |
|
Send message Joined: 18 Dec 15 Posts: 1958 Credit: 158,891,804 RAC: 47,490 |
pp jets 7000 170,-,2960 - sherpa 1.4.1 default[/i]: is one of the jobs I don't even start.hm, obviously I had overlooked it in your list, or it got started at a time I was not there, ... |
|
Send message Joined: 14 Jan 10 Posts: 1530 Credit: 10,031,459 RAC: 1,323 |
This workunit containing the job "boinc pp z1j 8000 - - sherpa 2.2.8 default 100000 500" starves because of EXIT_DISK_LIMIT_EXCEEDED https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=238981772 The 20 hours running docker task with peak disk usage of 7.48 GB contains: got abort request from client running docker command: kill boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1 program: podman command output: boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1 EOM . . . stderr end running docker command: container rm boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1 program: podman command output: boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1 EOM running docker command: image rm boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500 program: podman command output: Untagged: localhost/boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500:latest Deleted: 79376ef46d917bc296637af0b05b32bfd9343f28f7fbcb0b1de6c1c506d72d39 Deleted: 9259743e983aaef18ae52b23e457320fa4e849e4e352edb3ad1bd4eece38cec6 Deleted: a42a951158504bf9a4debe713e6aa7365d4651bd8f02aa676adef32a66e324da Deleted: e06ed2ad55322792d5d90223aced1f2c12f101443f88bc2e586a122413c7ebe0 Deleted: c4f9331961caded74fc715fe1b0e5a576df596340e8a2d50385e0fbdc1cd9ea6 I aborted mine. |
©2026 CERN