1) Message boards : LHCb Application : LHCb/other tasks failing after putting computer into hibernation state? (Message 36450)
Posted 15 Aug 2018 by ChristianVirtual
Post:

You just got the typical error for VB tasks, which happens once in a while (depending on how many you are running):

Guest Log: [ERROR] Condor exited after 111913s without running a job.

The main thing is to check in the VB Manager that the VMs are saved and suspended before you switch to other tasks (the same applies if you have to reboot for any reason).

Thanks for the quick response. Any suggestion on the sequence of steps?
1) first suspend the VM, then suspend the WU in BOINC Manager
2) first suspend the WU, then suspend the VM
3) doesn't matter, just suspend both the WU and its VM within a short timeframe
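
To make options 1) and 2) concrete, here is a minimal sketch that only builds the two command sequences and prints them by default. It assumes the `boinccmd` and `VBoxManage` command-line tools are on the PATH; the task name and VM name are hypothetical placeholders, and the sketch does not say which order is correct, or whether saving the VM by hand is needed at all.

```python
import subprocess

# Project URL as used in the result links in this thread.
PROJECT_URL = "https://lhcathome.cern.ch/lhcathome/"


def suspend_commands(project_url, task_name, vm_name, wu_first=True):
    """Return the two suspend commands in the requested order.

    wu_first=True  -> option 2) suspend the WU, then save the VM
    wu_first=False -> option 1) save the VM, then suspend the WU
    """
    # boinccmd --task <project_url> <task_name> suspend
    wu_cmd = ["boinccmd", "--task", project_url, task_name, "suspend"]
    # VBoxManage controlvm <vm_name> savestate
    vm_cmd = ["VBoxManage", "controlvm", vm_name, "savestate"]
    return [wu_cmd, vm_cmd] if wu_first else [vm_cmd, wu_cmd]


def run(commands, dry_run=True):
    """Print the commands, or execute them when dry_run is False."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # "example_task_name" and "example_vm_name" are placeholders, not real IDs.
    run(suspend_commands(PROJECT_URL, "example_task_name", "example_vm_name"))
```

With `dry_run=True` nothing is touched, so the sequence can be checked before trying it on a real WU.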
TIA
2) Message boards : LHCb Application : LHCb/other tasks failing after putting computer into hibernation state? (Message 36437)
Posted 15 Aug 2018 by ChristianVirtual
Post:
Sorry for resurrecting this thread, but I had the same issue:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=204084499

I paused a number of WUs to finish some tasks from a different project, and when I returned the WUs failed, dumping the work already done.
Is there a "correct" way to hold VM-based WUs? If not, then next time I can directly dump the WU whenever I need to change the processing sequence on my client.
3) Message boards : CMS Application : CMS computation error (Message 36362)
Posted 9 Aug 2018 by ChristianVirtual
Post:
I also got CMS assigned, but inside the log it says no work available ...

https://lhcathome.cern.ch/lhcathome/result.php?resultid=204082395

Seems like a waste of overhead if I get a WU and crunch for 1000 seconds only to find out I actually have no work ... confusing ...


