Message boards :
CMS Application :
since about 2 hours: all tasks failing after few minutes (SOLVED)
Message board moderation
Author | Message |
---|---|
Send message Joined: 18 Dec 15 Posts: 1821 Credit: 118,946,009 RAC: 20,590 |
On all my hosts, CMS tasks are failing after about 2-3 minutes - see here: https://lhcathome.cern.ch/lhcathome/result.php?resultid=407746801 exerpt from stderr: 2024-03-12 09:22:16 (6252): Guest Log: Ncat: Could not resolve hostname "vccs.cern.ch": Name or service not known. QUITTING. 2024-03-12 09:22:16 (6252): Guest Log: [ERROR] Could not connect to vccs.cern.ch on port 443 So one would see a network .problem. However, a ping to "vccs.cern.ch" works well. Atlas and Theory are being processed without any problem. Any idea what's going on? |
Send message Joined: 18 Dec 15 Posts: 1821 Credit: 118,946,009 RAC: 20,590 |
I now detected the same problem at other volunteers' hosts. So obviously the problem is not a local one, but rather at CERN :-( |
Send message Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 1,266 |
After the failing connection: Guest Log: NCAT DEBUG: Using system default trusted CA certificates and those in /usr/share/ncat/ca-bundle.crt. Guest Log: NCAT DEBUG: Unable to load trusted CA certificates from /usr/share/ncat/ca-bundle.crt: error:02001002:system library:fopen:No such file or directory |
Send message Joined: 20 Jun 14 Posts: 380 Credit: 238,712 RAC: 0 |
The server is being updated. Should be back soon. |
Send message Joined: 20 Jun 14 Posts: 380 Credit: 238,712 RAC: 0 |
It is back. |
Send message Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 1,266 |
Now the problem is not getting the X509 credentials from LHC@home and vLHC@home-dev |
Send message Joined: 29 Aug 05 Posts: 1061 Credit: 7,737,455 RAC: 298 |
|
Send message Joined: 20 Jun 14 Posts: 380 Credit: 238,712 RAC: 0 |
Yes, the idtoken is in place. The server logs suggest the service is working. |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 456 |
Laurence, every 30 sec. searching for a x509 credential from LHCatHome or LHCatHome-dev. Win11pro - Workstation https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10664116 2024-03-12 13:17:25 (2940): Guest Log: [INFO] Requesting an X509 credential from LHC@home 2024-03-12 13:17:26 (2940): Guest Log: [INFO] Requesting an X509 credential from vLHC@home-dev 2024-03-12 13:17:57 (2940): Guest Log: [DEBUG] % Total % Received % Xferd Average Speed Time Time Time Current 2024-03-12 13:17:57 (2940): Guest Log: Dload Upload Total Spent Left Speed 2024-03-12 13:17:57 (2940): Guest Log: 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 2024-03-12 13:17:57 (2940): Guest Log: 100 4924 0 4924 0 0 13462 0 --:--:-- --:--:-- --:--:-- 13490 2024-03-12 13:17:57 (2940): Guest Log: [DEBUG] 2024-03-12 13:17:57 (2940): Guest Log: ERROR: Couldn't read proxy from: /tmp/x509up_u0 2024-03-12 13:17:57 (2940): Guest Log: globus_credential: Error reading proxy credential 2024-03-12 13:17:57 (2940): Guest Log: globus_credential: Error reading proxy credential: Couldn't read PEM from bio 2024-03-12 13:17:57 (2940): Guest Log: OpenSSL Error: pem_lib.c:707: in library: PEM routines, function PEM_read_bio: no start line 2024-03-12 13:17:57 (2940): Guest Log: Use -debug for further information. 2024-03-12 13:17:57 (2940): Guest Log: [ERROR] Could not get an x509 credential 2024-03-12 13:17:57 (2940): Guest Log: [ERROR] The x509 proxy creation failed. |
Send message Joined: 20 Jun 14 Posts: 380 Credit: 238,712 RAC: 0 |
From the server logs, it looks like it started working for you since 13:11:06 +0100 |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 456 |
This is from Ivan in an other message: In this case, the two meanings of the word "proxy" are quite different. The proxy credential is an authorisation to connect to the service; the (squid) proxy server is a caching server that saves requested files so that they don't need to be transported again if re-requested. |
Send message Joined: 20 Jun 14 Posts: 380 Credit: 238,712 RAC: 0 |
It looks like there is an issue with the proxy generated. I will put the old server back until we can find the cause of the issue. |
Send message Joined: 18 Dec 15 Posts: 1821 Credit: 118,946,009 RAC: 20,590 |
It looks like there is an issue with the proxy generated. I will put the old server back until we can find the cause of the issue.Laurence, tasks still failing: https://lhcathome.cern.ch/lhcathome/result.php?resultid=407757073 |
Send message Joined: 2 May 07 Posts: 2244 Credit: 173,902,375 RAC: 456 |
Erich56, a Server is not back in a few minutes. |
Send message Joined: 20 Jun 14 Posts: 380 Credit: 238,712 RAC: 0 |
It should be back now. |
Send message Joined: 18 Dec 15 Posts: 1821 Credit: 118,946,009 RAC: 20,590 |
It should be back now.but still not working: "Could not get an x509 credential": 2024-03-12 19:58:11 (3552): Guest Log: [INFO] Reading volunteer information 2024-03-12 19:58:15 (3552): Guest Log: [INFO] Requesting an X509 credential from LHC@home 2024-03-12 19:58:16 (3552): Guest Log: [INFO] Requesting an idtoken from LHC@home 2024-03-12 19:58:17 (3552): Guest Log: [INFO] Requesting an idtoken from vLHC@home-dev 2024-03-12 19:58:47 (3552): Guest Log: [INFO] Requesting an idtoken from LHC@home 2024-03-12 19:58:48 (3552): Guest Log: [INFO] Requesting an idtoken from vLHC@home-dev 2024-03-12 19:59:18 (3552): Guest Log: [INFO] Requesting an idtoken from LHC@home 2024-03-12 19:59:19 (3552): Guest Log: [INFO] Requesting an idtoken from vLHC@home-dev 2024-03-12 19:59:49 (3552): Guest Log: [INFO] Requesting an idtoken from LHC@home 2024-03-12 19:59:50 (3552): Guest Log: [INFO] Requesting an idtoken from vLHC@home-dev 2024-03-12 20:00:20 (3552): Guest Log: [INFO] Requesting an idtoken from LHC@home 2024-03-12 20:00:21 (3552): Guest Log: [INFO] Requesting an idtoken from vLHC@home-dev 2024-03-12 20:00:53 (3552): Guest Log: [INFO] Requesting an idtoken from LHC@home 2024-03-12 20:00:55 (3552): Guest Log: [INFO] Requesting an idtoken from vLHC@home-dev 2024-03-12 20:01:30 (3552): Guest Log: [DEBUG] % Total % Received % Xferd Average Speed Time Time Time Current 2024-03-12 20:01:30 (3552): Guest Log: Dload Upload Total Spent Left Speed 2024-03-12 20:01:30 (3552): Guest Log: 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 2024-03-12 20:01:30 (3552): Guest Log: 0 0 0 0 0 0 0 0 --:--:-- 0:00:01 --:--:-- 0 2024-03-12 20:01:30 (3552): Guest Log: 0 0 0 0 0 0 0 0 --:--:-- 0:00:01 --:--:-- 0 2024-03-12 20:01:30 (3552): Guest Log: 100 196 100 196 0 0 92 0 0:00:02 0:00:02 --:--:-- 92 2024-03-12 20:01:30 (3552): Guest Log: [DEBUG] % Total % Received % Xferd Average Speed Time Time Time Current 2024-03-12 20:01:30 (3552): Guest Log: Dload Upload Total Spent Left Speed 2024-03-12 20:01:30 (3552): Guest Log: 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 2024-03-12 20:01:30 (3552): Guest Log: 0 0 0 0 0 0 0 0 --:--:-- 0:00:01 --:--:-- 0 2024-03-12 20:01:30 (3552): Guest Log: 0 0 0 0 0 0 0 0 --:--:-- 0:00:01 --:--:-- 0 2024-03-12 20:01:30 (3552): Guest Log: 100 196 100 196 0 0 92 0 0:00:02 0:00:02 --:--:-- 92 2024-03-12 20:01:30 (3552): Guest Log: [ERROR] Could not get an x509 credential |
Send message Joined: 14 Jan 10 Posts: 1422 Credit: 9,484,585 RAC: 1,266 |
This morning CMS is running OK for me. |
Send message Joined: 18 Dec 15 Posts: 1821 Credit: 118,946,009 RAC: 20,590 |
This morning CMS is running OK for me.I re-started CMS about 1 hour ago, it's working fine now :-) |
©2024 CERN