Message boards :
Number crunching :
power glitch causes signal 11 error?
Message board moderation
Author | Message |
---|---|
Send message Joined: 10 Aug 07 Posts: 54 Credit: 813,704 RAC: 116 |
Greetings. I had a power outage and I *thought* my UPS would handle it. But several LHC and a WCG task errored out with signal 11 - segmentation fault. (after the restart) I had done a suspend-all and shutdown before the battery reached 50%. Is this because the glitch a bad checkpoint file??? PCs on a different UPS (Both linux and windows vista) had no effect. I realize the the WUs will get recycled to the next user - but I wonder if there is anything else I can do? other stuff filesystems ext2 /var/lib and swap mounted on a sdd / and /home on a HDD Thanks!! Jay -- (edit) Perhaps I should have posted this to the BOINC forum. |
Send message Joined: 21 Jun 10 Posts: 40 Credit: 10,608,629 RAC: 10,627 |
Yep, that's a known problem with Linux. I've had my Ubuntu machines get that error when there was an internet connection problem. Windows machines would continue to work just fine. The BOINC people know about it and the WCG team knows about it. AFAIK, we are still waiting for a fix. |
Send message Joined: 12 Jul 11 Posts: 857 Credit: 1,619,050 RAC: 0 |
Thanks for the feedback jay; LHC (SixTrack) should restart OK! However I am wondering a bit about Checkpoint/Restart for other reasons..... I'll keep you posted. Nothing you can do. Eric. |
©2024 CERN