Message boards : CMS Application : CMS Tasks Failing
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 19 · 20 · 21 · 22

AuthorMessage
Erich56

Send message
Joined: 18 Dec 15
Posts: 1811
Credit: 118,317,329
RAC: 27,294
Message 44365 - Posted: 22 Feb 2021, 9:25:27 UTC

I noticed only now that since last night, all CMS tasks fail after a few minutes with

"-152 (0xFFFFFF68) ERR_NETOPEN"
2021-02-22 08:45:28 (5768): Guest Log: [ERROR] Could not connect to Condor server on port 9618
2021-02-22 08:45:28 (5768): Guest Log: [INFO] Shutting Down.

What's the problem ?
ID: 44365 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2530
Credit: 253,722,201
RAC: 51,175
Message 44366 - Posted: 22 Feb 2021, 9:36:01 UTC - in response to Message 30508.  
Last modified: 22 Feb 2021, 9:39:42 UTC

A Condor service does not respond.
It's already under investigation.

ATM all tasks will fail until this is fixed.

<edit>
Sorry replied to an old post (but the same topic).
The correct link would be this:
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4159&postid=44365
</edit>
ID: 44366 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1060
Credit: 7,737,452
RAC: 2,207
Message 44367 - Posted: 22 Feb 2021, 10:24:09 UTC - in response to Message 44366.  

We're trying to get the service restarted. Unfortunately, Laurence is on holiday and I don't have access to the Condor server.
ID: 44367 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1060
Credit: 7,737,452
RAC: 2,207
Message 44368 - Posted: 22 Feb 2021, 12:31:54 UTC - in response to Message 44367.  

OK, we're running again. Apparently a networking problem internal to CMS's VM pool.
ID: 44368 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1811
Credit: 118,317,329
RAC: 27,294
Message 44778 - Posted: 21 Apr 2021, 14:32:48 UTC - in response to Message 44368.  

for the past hours, CMS tasks are failing, and no new ones are available (which is obviously the automatic stop).
ID: 44778 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 846
Credit: 691,137,107
RAC: 110,278
Message 44779 - Posted: 21 Apr 2021, 18:27:00 UTC - in response to Message 44778.  

Same here
ID: 44779 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1060
Credit: 7,737,452
RAC: 2,207
Message 44788 - Posted: 22 Apr 2021, 14:15:25 UTC

We're up again after a WMAgent upgrade,
ID: 44788 · Report as offensive     Reply Quote
Previous · 1 . . . 19 · 20 · 21 · 22

Message boards : CMS Application : CMS Tasks Failing


©2024 CERN