Message boards : CMS Application : EXIT_NO_SUB_TASKS

ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43574 - Posted: 6 Nov 2020, 13:39:19 UTC - in response to Message 43571.  

Not sure why. We have jobs available, but there's a dip in the graphs. Possibly a network glitch? Things seem normal again now.
ID: 43574
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43736 - Posted: 29 Nov 2020, 12:26:27 UTC

I'm not sure what's going on at the moment. I have a new batch of jobs in the "assigned" state, while the last batch is in "running" with only about 200 jobs left. We should be starting to run jobs from the new batch, but that's not happening yet.
Meanwhile, I may not keep my eyes closely on the ball. I'm in a certain amount of pain, with a two-inch wound on my right arm, closed with ten stitches, where a surgeon removed a half-inch diameter skin cancer last week. Apologies if I'm slow to respond in the next few days...
ID: 43736
Erich56

Joined: 18 Dec 15
Posts: 1516
Credit: 46,824,301
RAC: 61,300
Message 43737 - Posted: 29 Nov 2020, 13:36:48 UTC - in response to Message 43736.  

Ivan, I wish you a quick recovery! All will be good!
ID: 43737
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43738 - Posted: 29 Nov 2020, 14:19:28 UTC - in response to Message 43737.  

Ivan, I wish you a quick recovery! All will be good!

Thanks, Erich, I really hope so.
ID: 43738
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43739 - Posted: 29 Nov 2020, 14:21:05 UTC

Meanwhile, jobs still aren't starting...
ID: 43739
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Joined: 15 Jun 08
Posts: 2029
Credit: 149,001,314
RAC: 121,178
Message 43740 - Posted: 29 Nov 2020, 15:01:37 UTC - in response to Message 43736.  

All best wishes for you, Ivan.
ID: 43740
NOGOOD

Joined: 18 Nov 17
Posts: 118
Credit: 41,026,294
RAC: 7,776
Message 43741 - Posted: 29 Nov 2020, 15:06:57 UTC

All best wishes, Ivan!
ID: 43741
Ben Segal
Volunteer moderator
Project administrator

Joined: 1 Sep 04
Posts: 131
Credit: 2,579
RAC: 0
Message 43742 - Posted: 29 Nov 2020, 16:03:55 UTC - in response to Message 43736.  

Recover fast Ivan. Take care!
ID: 43742
m

Joined: 6 Sep 08
Posts: 112
Credit: 8,666,023
RAC: 820
Message 43750 - Posted: 29 Nov 2020, 22:31:37 UTC
Last modified: 29 Nov 2020, 22:32:22 UTC

Very best wishes for complete and speedy recovery, Ivan. Meanwhile - take things easy for a bit.
jp
ID: 43750
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43762 - Posted: 1 Dec 2020, 16:48:27 UTC

It turns out that the problems at the weekend were due to certificates expiring. ...again...
A planned database intervention yesterday apparently compounded the problem. We're up again now.
ID: 43762
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43845 - Posted: 11 Dec 2020, 9:12:28 UTC

We want to make another update to the WMAgent codes on Monday, so I'll be running down the queues over the weekend. Prepare for a short outage of jobs starting some time Sunday night (if I get my timings right...).
ID: 43845
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Joined: 15 Jun 08
Posts: 2029
Credit: 149,001,314
RAC: 121,178
Message 43872 - Posted: 12 Dec 2020, 8:41:24 UTC - in response to Message 43845.  

Subtask queue is already empty.
Switched mine to ATLAS.
ID: 43872
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43876 - Posted: 12 Dec 2020, 14:51:47 UTC - in response to Message 43872.  

Subtask queue is already empty.
Switched mine to ATLAS.

Yes, there's an Oracle database problem. CERN is working on it.
ID: 43876
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43883 - Posted: 12 Dec 2020, 20:31:55 UTC - in response to Message 43876.  
Last modified: 12 Dec 2020, 20:56:15 UTC

Subtask queue is already empty.
Switched mine to ATLAS.

Yes, there's an Oracle database problem. CERN is working on it.

It looks like jobs have just started flowing again. Digits cruciate...

[Update] It's not entirely clear to me if this was an Oracle or a WMAgent problem. Alan has restarted all the WMAgent components and, so far so good... [/Update]
ID: 43883
Greger

Joined: 9 Jan 15
Posts: 151
Credit: 431,596,822
RAC: 0
Message 43884 - Posted: 12 Dec 2020, 22:14:00 UTC

Would this update to the WMAgent codes deal with the IPv6 setup issue that we have?
ID: 43884
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 883
Credit: 5,852,906
RAC: 65
Message 43886 - Posted: 13 Dec 2020, 13:06:41 UTC - in response to Message 43884.  

Would this update to the WMAgent codes deal with the IPv6 setup issue that we have?

No, I don't think so. The IPv6 problems seem to be at the Volunteers' machines, not at CERN.
ID: 43886
Erich56

Joined: 18 Dec 15
Posts: 1516
Credit: 46,824,301
RAC: 61,300
Message 43893 - Posted: 14 Dec 2020, 10:00:22 UTC - in response to Message 43845.  

We want to make another update to the WMAgent codes on Monday, ...

A question just out of curiosity: why do they fiddle around with WMAgent so frequently?
ID: 43893
maeax

Joined: 2 May 07
Posts: 1590
Credit: 67,745,306
RAC: 238,257
Message 43894 - Posted: 14 Dec 2020, 10:21:05 UTC - in response to Message 43893.  

We want to make another update to the WMAgent codes on Monday, ...
A question just out of curiosity: why do they fiddle around with WMAgent so frequently?

Why do we get so many updates from the OS? Just for fun?
ID: 43894
tullio

Joined: 19 Feb 08
Posts: 704
Credit: 4,274,121
RAC: 657
Message 43898 - Posted: 14 Dec 2020, 11:43:34 UTC

I am getting almost daily updates on my SuSE Linux Tumbleweed, a virtual machine on a Windows 10 host. Its kernel today is 5.9.12 and they hint at a 5.10 kernel before Christmas. At every kernel update I have to reboot it.
Tullio
ID: 43898
Erich56

Joined: 18 Dec 15
Posts: 1516
Credit: 46,824,301
RAC: 61,300
Message 43907 - Posted: 14 Dec 2020, 19:06:04 UTC - in response to Message 43845.  

We want to make another update to the WMAgent codes on Monday

Ivan, is there a problem with the update? There are still no tasks available.
ID: 43907

©2022 CERN