Message boards :
ATLAS application :
Pilot has decided to kill looping job from restored VM
Message board moderation
Author | Message |
---|---|
Send message Joined: 14 Jan 10 Posts: 1168 Credit: 7,217,203 RAC: 2,072 ![]() ![]() ![]() |
I suspended an ATLAS-VM overnight and restored it this morning. It ended very quickly, BOINC validated OK, but there was no valid ATLAS HITS-file uploaded. https://lhcathome.cern.ch/lhcathome/result.php?resultid=158750724 It seems that the server connected to the VM decided to kill the VM-job: Pilot has decided to kill looping job 3637921913 at 2017-10-07T07:23:41+-100 Why?? |
Send message Joined: 18 Dec 15 Posts: 1571 Credit: 66,592,227 RAC: 165,859 ![]() ![]() ![]() |
Pilot has decided to kill looping job 3637921913 at 2017-10-07T07:23:41+-100[/i] the interesting part of this is that the notice talks about a "looping job". So, the question may be: is a suspended task considered to run in a loop and therefore terminaded after some time? |
©2023 CERN