Message boards : Number crunching : Wrong exit code?
Message board moderation

To post messages, you must log in.

AuthorMessage
Phil
Avatar

Send message
Joined: 26 Jul 05
Posts: 63
Credit: 4,083,755
RAC: 0
Message 29983 - Posted: 20 Apr 2017, 17:48:01 UTC

If a user runs a sub-project job that runs the VM correctly, but CONDOR fails to find and run a single job, that is recorded by BOINC as an error.
This is wrong as far as the user and BOINC is concerned - the user has run the VM correctly and it was not his fault that no work for CERN was completed.
If these jobs were recorded as a success rather than an error they would show on the statistics pages as the true current status of the sub-project and BOINC users could see which sub-projects to support.
ID: 29983 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1006
Credit: 6,272,128
RAC: 397
Message 29988 - Posted: 20 Apr 2017, 20:50:55 UTC - in response to Message 29983.  

If a user runs a sub-project job that runs the VM correctly, but CONDOR fails to find and run a single job, that is recorded by BOINC as an error.
This is wrong as far as the user and BOINC is concerned - the user has run the VM correctly and it was not his fault that no work for CERN was completed.
If these jobs were recorded as a success rather than an error they would show on the statistics pages as the true current status of the sub-project and BOINC users could see which sub-projects to support.

I'm trying to get this resolved -- I like it as little as you. Some concrete suggestions on how to deal with it would be appreciated, especially from people familiar with the client-BOINC interface. Ideally I'd like to see a backoff of some sort (and hopefully not an "Error while computing" code). Any ideas?
ID: 29988 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1280
Credit: 8,489,126
RAC: 1,872
Message 29991 - Posted: 21 Apr 2017, 11:12:09 UTC - in response to Message 29988.  

I'm trying to get this resolved -- I like it as little as you. Some concrete suggestions on how to deal with it would be appreciated, especially from people familiar with the client-BOINC interface. Ideally I'd like to see a backoff of some sort (and hopefully not an "Error while computing" code). Any ideas?

It can be even worse and I've seen that in the past sometimes with my tasks and tasks of other users, that even when several jobs were already successfully done by the VM, the task ended into an error, just because there were no jobs available anymore or there could not made a connection at that moment. That was a long sentence. Now a shorter one:

The solution is quite simple.

In those cases just write the shutdown file into the shared folder with a valid exit code and BOINC will handle the task successfully.
ID: 29991 · Report as offensive     Reply Quote

Message boards : Number crunching : Wrong exit code?


©2024 CERN