Message boards : CMS Application : Upcoming WMAgent/CouchDB update - jobs will drain Sunday night
Message board moderation

To post messages, you must log in.

AuthorMessage
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 881
Credit: 5,852,630
RAC: 124
Message 46903 - Posted: 17 Jun 2022, 12:58:45 UTC

CMS wants to upgrade WMAgent to bring in a new version of CouchDB. I've just submitted a new workflow which should start draining around midnight Sunday (European). Please be ready to set NoNewTasks late Sunday to avoid any problems. Hopefully we'll be up again by Monday night.
ID: 46903 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 881
Credit: 5,852,630
RAC: 124
Message 46904 - Posted: 17 Jun 2022, 15:25:54 UTC - in response to Message 46903.  

CERN IT also wants to recreate the VM that we run the agent on, to move it to a new Hypervisor. So the downtime may be a bit longer than I anticipated, but hopefully just a few hours.
ID: 46904 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 15 Jun 08
Posts: 2012
Credit: 147,536,639
RAC: 113,955
Message 46914 - Posted: 18 Jun 2022, 20:07:48 UTC - in response to Message 46904.  

CERN IT also wants to recreate the VM that we run the agent on...

If the WMAgent server gets a new name and/or a new IP the network tests called by bootstrap-cms need to be revised.
When the old name can't be contacted any more all CMS tasks will fail.
ID: 46914 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 881
Credit: 5,852,630
RAC: 124
Message 46915 - Posted: 20 Jun 2022, 9:40:13 UTC - in response to Message 46914.  

CERN IT also wants to recreate the VM that we run the agent on...

If the WMAgent server gets a new name and/or a new IP the network tests called by bootstrap-cms need to be revised.
When the old name can't be contacted any more all CMS tasks will fail.

Noted. I've no idea if that will happen, but thanks for the heads-up.
ID: 46915 · Report as offensive     Reply Quote
[VENETO] boboviz
Avatar

Send message
Joined: 7 May 08
Posts: 134
Credit: 1,393,604
RAC: 625
Message 46916 - Posted: 20 Jun 2022, 20:02:14 UTC - in response to Message 46904.  

CERN IT also wants to recreate the VM that we run the agent on, to move it to a new Hypervisor.


I don't understand. Are they abandon VirtualBox on boinc for a new hypervisor?
ID: 46916 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 1581
Credit: 64,833,034
RAC: 227,079
Message 46969 - Posted: 2 Jul 2022, 15:37:11 UTC

Master.log shows many of this lines:
07/02/22 17:31:16 (pid:16879) CONFIGURATION PROBLEM: Failed to insert ClassAd attribute STARTD_PARTITIONABLE_SLOT_ATTRS = MemoryUsage,ProportionalSetSizeKb. The most common reason for this is that you forgot to quote a string value in the list of attributes being added to the MASTER ad.
07/02/22 17:31:16 (pid:16879) CONFIGURATION PROBLEM: Failed to insert ClassAd attribute GLIDEIN_Resource_Slots = Iotokens,80,,type=main. The most common reason for this is that you forgot to quote a string value in the list of attributes being added to the MASTER ad.
ID: 46969 · Report as offensive     Reply Quote
Profile MAGIC Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1009
Credit: 47,459,287
RAC: 2,185
Message 46970 - Posted: 2 Jul 2022, 21:22:09 UTC - in response to Message 46969.  

That is always on my master log page every time and I look at all of them but they still run Valid .
ID: 46970 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 881
Credit: 5,852,630
RAC: 124
Message 46975 - Posted: 4 Jul 2022, 22:56:50 UTC - in response to Message 46916.  

CERN IT also wants to recreate the VM that we run the agent on, to move it to a new Hypervisor.


I don't understand. Are they abandon VirtualBox on boinc for a new hypervisor?

No, not the hypervisor, the VM image file (.vdi).
ID: 46975 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 881
Credit: 5,852,630
RAC: 124
Message 46976 - Posted: 4 Jul 2022, 22:58:46 UTC - in response to Message 46970.  

That is always on my master log page every time and I look at all of them but they still run Valid .

Yes, it is not a problem apart from filling up the log file. One day i might convince someone to fix it...
ID: 46976 · Report as offensive     Reply Quote

Message boards : CMS Application : Upcoming WMAgent/CouchDB update - jobs will drain Sunday night


©2022 CERN