Condor Problems
Author | Message |
---|---|
Joined: 30 May 08 Posts: 93 Credit: 5,160,246 RAC: 0 |
Sorry if starting a new thread is redundant, but I thought this discussion should be in a more general location. I'm seeing a lot of these errors right now (for me, mostly Theory but also LHCb): 2016-12-28 09:09:18 (19627): VM Completion Message: Could not connect to Condor server on port 9618. A recent example is Task 110836548. |
Joined: 24 Oct 04 Posts: 1169 Credit: 54,226,370 RAC: 56,762 |
Yes, it is doing the same thing over at vLHC (copy of my post there seconds ago): OK, what is going on here right now? https://lhcathome.cern.ch/vLHCathome/result.php?resultid=7030843 I just got 2 of those in a row. They act like they are connecting fast and then stop all of a sudden, because the Condor is flying around dropping bread on things again. Maybe I will try one at LHC and see if it is doing this too. Edit: no luck, it is doing the same thing at LHC: https://lhcathome.cern.ch/lhcathome/result.php?resultid=110832690 Volunteer Mad Scientist For Life |
Joined: 20 Jun 14 Posts: 380 Credit: 238,712 RAC: 0 |
Blocked by external firewall. Hopefully fixed but will not know until the configuration has been refreshed (up to 4 hours). Thanks for the alert. |
Joined: 30 May 08 Posts: 93 Credit: 5,160,246 RAC: 0 |
"Hopefully fixed but will not know until the configuration has been refreshed (up to 4 hours)..." Thanks, Laurence. I have a Theory task and an LHCb task that are 11 and 15 hours into their work, respectively, and I am wondering if I should suspend them for now. Is it possible that work will be lost if those tasks try to get another job or finish up while the connectivity issues continue? |
Joined: 24 Oct 04 Posts: 1169 Credit: 54,226,370 RAC: 56,762 |
The ones running should be ok. The worst that usually happens is that you have to wait to send them in, but that should not happen either. I have lots of them running and will leave them running, and I will check to see if this Condor problem is fixed (it makes no sense that it just starts happening when things have been running well lately). I see it is also happening over at vLHC-dev. |
Joined: 30 May 08 Posts: 93 Credit: 5,160,246 RAC: 0 |
"The ones running should be ok..." Okay, great. I'll let them go, then. |
Joined: 1 Sep 04 Posts: 140 Credit: 2,579 RAC: 0 |
Well done Laurence! Condor seems happy again. Happy holidays to all! Ben |
Joined: 24 Oct 04 Posts: 1169 Credit: 54,226,370 RAC: 56,762 |
"The ones running should be ok..." I found out for sure and have sent in a few finished tasks here and vLHC. I just started a new task and we are back to normal again, so it is time to start up a batch of new tasks here again. Volunteer Mad Scientist For Life |
Joined: 30 May 08 Posts: 93 Credit: 5,160,246 RAC: 0 |
MAGIC Quantum Mechanic wrote: "I found out for sure and have sent in a few finished tasks here and vLHC" Yes, indeed! Ben Segal wrote: "Well done Laurence! Condor seems happy again." +1 and thanks for the quick response. |
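
For anyone who wants to check from their own host whether the port named in the error message (9618) is reachable, here is a minimal sketch. It only attempts a plain TCP connection and does not speak the Condor protocol, so a successful connect from the host is a hint, not proof, that the VM can reach the server. The function name `can_connect` is made up for illustration, and the thread does not name the Condor server's host, so you would have to supply that yourself.

```python
import socket

# Port taken from the error message quoted in this thread.
CONDOR_PORT = 9618


def can_connect(host: str, port: int = CONDOR_PORT, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to (host, port) opens within `timeout` seconds.

    Hypothetical helper: this is not part of BOINC, the VM wrapper, or Condor.
    """
    try:
        # create_connection resolves the host name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Refused connections, timeouts, and DNS failures are all OSError subclasses.
        return False
```

A call such as `can_connect("condor.example.org")` (placeholder host name, not the real server) returns `False` while a firewall drops or rejects the connection, and `True` once the port is open again.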
©2024 CERN