Message boards : ATLAS application : 6,09 GByte Downloadfile
Message board moderation

To post messages, you must log in.

AuthorMessage
maeax

Send message
Joined: 2 May 07
Posts: 2262
Credit: 175,581,097
RAC: 652
Message 50633 - Posted: 25 Sep 2024, 18:51:36 UTC

Every Atlas-Task in Win11pro have 6.09 GByte file!
ID: 50633 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 28 Sep 04
Posts: 759
Credit: 53,345,394
RAC: 40,796
Message 50635 - Posted: 25 Sep 2024, 21:24:30 UTC

I got a few of those as well. None of them running yet.
Most of the tasks are between 400 - 500 MB though.
ID: 50635 · Report as offensive     Reply Quote
CloverField

Send message
Joined: 17 Oct 06
Posts: 91
Credit: 58,284,561
RAC: 8,054
Message 50636 - Posted: 26 Sep 2024, 0:49:28 UTC

They have all been failing for me. All of them are returning EXIT_DISK_LIMIT_EXCEEDED.
ID: 50636 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 28 Sep 04
Posts: 759
Credit: 53,345,394
RAC: 40,796
Message 50638 - Posted: 26 Sep 2024, 5:53:28 UTC - in response to Message 50636.  

They have all been failing for me. All of them are returning EXIT_DISK_LIMIT_EXCEEDED.

Failing here also with same error.
ID: 50638 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2262
Credit: 175,581,097
RAC: 652
Message 50639 - Posted: 26 Sep 2024, 7:11:08 UTC

100 Tasks with this Error's overnight.
Every download 6 GByte => 600 GByte for nothing, OMG.
ID: 50639 · Report as offensive     Reply Quote
Lem Novantotto

Send message
Joined: 24 May 23
Posts: 48
Credit: 4,120,767
RAC: 833
Message 50640 - Posted: 26 Sep 2024, 7:28:39 UTC - in response to Message 50638.  

They have all been failing for me. All of them are returning EXIT_DISK_LIMIT_EXCEEDED.

Failing here also with same error.

Linux native, idem.

Bye.
ID: 50640 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1444
Credit: 9,704,984
RAC: 918
Message 50642 - Posted: 26 Sep 2024, 11:52:01 UTC - in response to Message 50640.  

All those monster tasks will fail by the start, cause the files needed are exceeding the 9.3GB allowed to be in one slot directory.
Fortunately they do not notice this disk exceeding at the end of the task.
I will give 4 tasks (two by two) a try with 10 threads after one 'normal' task now running has finished and allowing those monsters more diskspace.
ID: 50642 · Report as offensive     Reply Quote
Boone

Send message
Joined: 22 Sep 08
Posts: 2
Credit: 10,142,525
RAC: 3,271
Message 50643 - Posted: 26 Sep 2024, 12:12:00 UTC - in response to Message 50642.  

Same here since 12 hours. I run Atlas native with linux mint.
The big ones crashing with this error message:
LHC@home 26.09.2024 13:43:44 Aborting task U85LDmlK0D6nsSi4ap6QjLDmwznN0nGgGQJmDC4LDmbblKDme3lgMm_0: exceeded disk limit: 12452.66MB > 9536.74MB

There are some normal with about 500MB an then these mega-Workunits:

~$ sudo du /var/lib/boinc-client/slots/0 -hs
455M /var/lib/boinc-client/slots/0

:~$ sudo du /var/lib/boinc-client/slots/2 -hs
6,1G /var/lib/boinc-client/slots/2

Only the big ones crashing, I understand i can't do anything about it, right?

I will pause Atlas on my client until this is solved.
ID: 50643 · Report as offensive     Reply Quote
Profile Yeti
Volunteer moderator
Avatar

Send message
Joined: 2 Sep 04
Posts: 455
Credit: 209,773,883
RAC: 14,582
Message 50644 - Posted: 26 Sep 2024, 12:26:41 UTC

These 6 GB-WUs have crashed 4 Hosts of mine ==> ATLAS is paused on all machines until this is solved


Supporting BOINC, a great concept !
ID: 50644 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1444
Credit: 9,704,984
RAC: 918
Message 50645 - Posted: 26 Sep 2024, 14:34:11 UTC - in response to Message 50642.  

All those monster tasks will fail by the start, cause the files needed are exceeding the 9.3GB allowed to be in one slot directory.
Fortunately they do not notice this disk exceeding at the end of the task.
I will give 4 tasks (two by two) a try with 10 threads after one 'normal' task now running has finished and allowing those monsters more diskspace.
Just for testing, I started 2 of these tasks with the 6.1GB root-file.
After the input files are copied to the slot-directory and the VM is created, the slot contains 13.490.000.000 bytes and increasing, where max 10.000.000.000 is allowed.
The tasks have the same number of events to process: 400 and processing now after I made an adjustment to BOINC.
Those tasks should not be sent to volunteers.
ID: 50645 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2262
Credit: 175,581,097
RAC: 652
Message 50646 - Posted: 26 Sep 2024, 14:42:46 UTC - in response to Message 50645.  

Those tasks should not be sent to volunteers.

We hoping Cern-IT have someone to stop this 6 GByte Atlas-Tasks.
We need an answer from there, to do the work without this Tasks!
ID: 50646 · Report as offensive     Reply Quote
kotenok2000
Avatar

Send message
Joined: 21 Feb 11
Posts: 82
Credit: 577,297
RAC: 18
Message 50647 - Posted: 26 Sep 2024, 14:51:31 UTC

What adjustment?
ID: 50647 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1444
Credit: 9,704,984
RAC: 918
Message 50648 - Posted: 26 Sep 2024, 15:52:52 UTC - in response to Message 50647.  

What adjustment?
rsc_disk_bound in client_state.xml for every (new) workunit.

First two tasks returned successfully:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=414531235
https://lhcathome.cern.ch/lhcathome/result.php?resultid=414531094

Look for the size of ATLAS.root_0 in the results.
Next two are running.
ID: 50648 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2262
Credit: 175,581,097
RAC: 652
Message 50649 - Posted: 26 Sep 2024, 16:31:43 UTC - in response to Message 50648.  

Crystal,
it's ok to find this <rsc_disk_bound>xxxx</rsc_disk_bound>,
but, this is no solution, but a problem for all other user.
ID: 50649 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1444
Credit: 9,704,984
RAC: 918
Message 50650 - Posted: 26 Sep 2024, 17:30:23 UTC - in response to Message 50649.  

Crystal,
it's ok to find this <rsc_disk_bound>xxxx</rsc_disk_bound>,
but, this is no solution, but a problem for all other user.
As I wrote: "Just for testing"
ID: 50650 · Report as offensive     Reply Quote

Message boards : ATLAS application : 6,09 GByte Downloadfile


©2025 CERN