Message boards : CMS Application : No resends on failures
Message board moderation

To post messages, you must log in.

AuthorMessage
AnandBhat

Send message
Joined: 14 Feb 22
Posts: 3
Credit: 873,502
RAC: 471
Message 49846 - Posted: 26 Mar 2024, 1:45:30 UTC

A few of my CMS tasks failed due to my host running out of memory (I've limited the number of concurrent tasks after noticing this). However, I see such failed workunits have not been issued to anyone else. Is there a reason why the project doesn't resend these?

E.g., https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=220992952
max # of error/total/success tasks	  1, 1, 1
errors                                    Too many total results
ID: 49846 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2418
Credit: 226,702,425
RAC: 130,770
Message 49847 - Posted: 26 Mar 2024, 7:03:57 UTC - in response to Message 49846.  

Is there a reason why the project doesn't resend these?

Yes.
At the project server the resend quota is intentionally set to 1 for vbox apps.
This is due to the fact that BOINC tasks are only "envelopes" launching a VM.
The scientific jobs are then done inside the VM.
ID: 49847 · Report as offensive     Reply Quote
AnandBhat

Send message
Joined: 14 Feb 22
Posts: 3
Credit: 873,502
RAC: 471
Message 49848 - Posted: 26 Mar 2024, 8:28:05 UTC - in response to Message 49847.  

Got it, thanks!
ID: 49848 · Report as offensive     Reply Quote

Message boards : CMS Application : No resends on failures


©2024 CERN