Message boards : Theory Application : 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED - how come?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6

AuthorMessage
tullio

Send message
Joined: 19 Feb 08
Posts: 708
Credit: 4,336,250
RAC: 0
Message 41240 - Posted: 10 Jan 2020, 18:25:29 UTC

They are spoiling my Science United statistics.
Tullio
ID: 41240 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1530
Credit: 10,031,459
RAC: 1,323
Message 41243 - Posted: 13 Jan 2020, 8:05:16 UTC

Three tasks in this workunit: https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=130540238
1st EXIT_DISK_LIMIT_EXCEEDED https://lhcathome.cern.ch/lhcathome/result.php?resultid=259086785
2nd aborted by client https://lhcathome.cern.ch/lhcathome/result.php?resultid=259092479
3rd Valid cause I increased the rsc_disk_bound from 2000000000 to 8000000000 in my client_state.xml

Could an admin increase that setting for rsc_disk_bound server-side? It will increase the number of successes and avoid wasting CPU-cycles.
ID: 41243 · Report as offensive     Reply Quote
ProfileLaurence
Project administrator
Project developer

Send message
Joined: 20 Jun 14
Posts: 431
Credit: 253,474
RAC: 206
Message 41248 - Posted: 14 Jan 2020, 9:15:24 UTC - in response to Message 41243.  

Three tasks in this workunit: https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=130540238
1st EXIT_DISK_LIMIT_EXCEEDED https://lhcathome.cern.ch/lhcathome/result.php?resultid=259086785
2nd aborted by client https://lhcathome.cern.ch/lhcathome/result.php?resultid=259092479
3rd Valid cause I increased the rsc_disk_bound from 2000000000 to 8000000000 in my client_state.xml

Could an admin increase that setting for rsc_disk_bound server-side? It will increase the number of successes and avoid wasting CPU-cycles.

Done
ID: 41248 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1530
Credit: 10,031,459
RAC: 1,323
Message 41252 - Posted: 14 Jan 2020, 12:36:47 UTC - in response to Message 41248.  
Last modified: 14 Jan 2020, 13:04:31 UTC

Could an admin increase that setting for rsc_disk_bound server-side? It will increase the number of successes and avoid wasting CPU-cycles.
Done
Thanks, Laurence.

I just got a generator madgraph5amc again and after 20 minutes the image.vdi is already 3,056,599,040 bytes.
ID: 41252 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1959
Credit: 158,896,775
RAC: 47,344
Message 41324 - Posted: 22 Jan 2020, 6:26:11 UTC - in response to Message 41252.  

ID: 41324 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1530
Credit: 10,031,459
RAC: 1,323
Message 41325 - Posted: 22 Jan 2020, 7:27:28 UTC - in response to Message 41324.  
Last modified: 22 Jan 2020, 11:04:21 UTC

the next one:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=259680436
This time the forced stop was useful to avoid extreme disk usage caused by a faulty task.
The resend had the same problem (also over 17GB) and so will do the 3rd (last) try.
https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=130930529

Edit: The 3rd try failed for another reason: lock by another process.
ID: 41325 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1959
Credit: 158,896,775
RAC: 47,344
Message 41326 - Posted: 22 Jan 2020, 10:03:19 UTC - in response to Message 41325.  

(also over 17GB)
which is nice for hosts with a SSD :-(
ID: 41326 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1959
Credit: 158,896,775
RAC: 47,344
Message 41938 - Posted: 18 Mar 2020, 5:55:44 UTC

Now I had another one, which failed after 5 hrs 30 min:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=268226030

and I thought that this kind of problem was solved
ID: 41938 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1530
Credit: 10,031,459
RAC: 1,323
Message 41943 - Posted: 18 Mar 2020, 8:32:19 UTC - in response to Message 41938.  

Now I had another one, which failed after 5 hrs 30 min:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=268226030
pp jets 7000 170,-,2960 - sherpa 1.4.1 default: is one of the jobs I don't even start.
ID: 41943 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1959
Credit: 158,896,775
RAC: 47,344
Message 41948 - Posted: 18 Mar 2020, 13:41:59 UTC - in response to Message 41943.  

pp jets 7000 170,-,2960 - sherpa 1.4.1 default[/i]: is one of the jobs I don't even start.
hm, obviously I had overlooked it in your list, or it got started at a time I was not there, ...
ID: 41948 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1530
Credit: 10,031,459
RAC: 1,323
Message 52996 - Posted: 8 Feb 2026, 9:02:35 UTC - in response to Message 41948.  

This workunit containing the job "boinc pp z1j 8000 - - sherpa 2.2.8 default 100000 500" starves because of EXIT_DISK_LIMIT_EXCEEDED
https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=238981772
The 20 hours running docker task with peak disk usage of 7.48 GB contains:

got abort request from client
running docker command: kill boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1
program: podman
command output:
boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1
EOM
.
.
.
stderr end
running docker command: container rm boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1
program: podman
command output:
boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500_1
EOM
running docker command: image rm boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500
program: podman
command output:
Untagged: localhost/boinc__lhcathome.cern.ch_lhcathome__theory_2922-4899195-500:latest
Deleted: 79376ef46d917bc296637af0b05b32bfd9343f28f7fbcb0b1de6c1c506d72d39
Deleted: 9259743e983aaef18ae52b23e457320fa4e849e4e352edb3ad1bd4eece38cec6
Deleted: a42a951158504bf9a4debe713e6aa7365d4651bd8f02aa676adef32a66e324da
Deleted: e06ed2ad55322792d5d90223aced1f2c12f101443f88bc2e586a122413c7ebe0
Deleted: c4f9331961caded74fc715fe1b0e5a576df596340e8a2d50385e0fbdc1cd9ea6


I aborted mine.
ID: 52996 · Report as offensive     Reply Quote
Previous · 1 . . . 3 · 4 · 5 · 6

Message boards : Theory Application : 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED - how come?


©2026 CERN