Message boards : LHCb Application : 4 x 'Error while computing' after almost 1 hour CPU time.
Simplex0

Joined: 26 Aug 05
Posts: 68
Credit: 545,660
RAC: 0
Message 35532 - Posted: 16 Jun 2018, 6:31:48 UTC
Last modified: 16 Jun 2018, 6:36:16 UTC

I had 4 LHCb tasks shut down and report 'Error while computing' after almost 1 hour of CPU time.
Here is a link to the stderr output of one of them:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=198762484

I have aborted all LHCb work units on this computer for now, but I have 1 work unit on another computer that has been running for 13 hours and has reached 74%.

I wonder why this 'Error while computing' occurred; I have run other LHCb work units before and they worked just fine.
Simplex0

Joined: 26 Aug 05
Posts: 68
Credit: 545,660
RAC: 0
Message 35536 - Posted: 16 Jun 2018, 12:01:04 UTC - in response to Message 35532.  
Last modified: 16 Jun 2018, 12:15:13 UTC

https://lhcathome.cern.ch/lhcathome/result.php?resultid=198762484

When I check work unit 97239767, it says "errors Too many total results".
Does this mean that this wu failed on too many computers?
Crystal Pellet
Volunteer moderator
Volunteer tester

Joined: 14 Jan 10
Posts: 932
Credit: 6,284,444
RAC: 719
Message 35539 - Posted: 16 Jun 2018, 16:51:21 UTC - in response to Message 35536.  

Does this mean that this wu failed on too many computers?

No.
The BOINC task is sent only once, because the real LHCb job is done inside the VM.
So no BOINC task resend is needed.
You can see that the maximum number of total/error/success tasks is 1.

©2020 CERN