Message boards : Number crunching : Server problem?
Message board moderation

To post messages, you must log in.

AuthorMessage
Harri Liljeroos
Avatar

Send message
Joined: 28 Sep 04
Posts: 780
Credit: 59,374,388
RAC: 46,656
Message 52234 - Posted: 17 Sep 2025, 18:02:59 UTC
Last modified: 17 Sep 2025, 18:11:10 UTC

My win11 host has started to see server problems but my win10 host is still connecting OK and getting new work. I did a project reset on problem host but the problem persists. Here is what I see on the message log:
86	LHC@home	17-09-2025 20:54	[sched_op] Starting scheduler request	
87	LHC@home	17-09-2025 20:54	Sending scheduler request: To fetch work.	
88	LHC@home	17-09-2025 20:54	Requesting new tasks for CPU and NVIDIA GPU	
89	LHC@home	17-09-2025 20:54	[sched_op] CPU work request: 95040.00 seconds; 0.00 devices	
90	LHC@home	17-09-2025 20:54	[sched_op] NVIDIA GPU work request: 1001.00 seconds; 0.00 devices	
91	LHC@home	17-09-2025 20:54	[sched_op] AMD/ATI GPU work request: 0.00 seconds; 0.00 devices	
92	LHC@home	17-09-2025 20:54	Scheduler request completed: got 0 new tasks	
93	LHC@home	17-09-2025 20:54	Server can't open log file (../log_boinc01/scheduler.log)	
94	LHC@home	17-09-2025 20:54	[sched_op] Deferring communication for 00:03:34	
95	LHC@home	17-09-2025 20:54	[sched_op] Reason: project is down	

This host has BOINC 8.0.2. So the problem seems to be Server can't open log file (../log_boinc01/scheduler.log) but how to resolve this situation?
[edit] The same message appeared also before project reset.
[edit2] The situation started when I aborted a Theory task that had gone past the deadline and server had marked it: Timed out - no response. Here is the host: https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10858236 I am not sure if the project reset went through as the host shows still that it has all the same tasks as before.
[edit3] I will try to detach and reattach the host .
ID: 52234 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 28 Sep 04
Posts: 780
Credit: 59,374,388
RAC: 46,656
Message 52235 - Posted: 17 Sep 2025, 18:17:09 UTC

Removing LHC@home and adding it again didn't help. Same error appears: Server can't open log file (../log_boinc01/scheduler.log)
ID: 52235 · Report as offensive     Reply Quote
Ryan Munro

Send message
Joined: 17 Aug 17
Posts: 124
Credit: 10,716,130
RAC: 12,038
Message 52236 - Posted: 17 Sep 2025, 19:11:13 UTC - in response to Message 52235.  

Getting the same error here as well
ID: 52236 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1233
Credit: 79,150,235
RAC: 161,321
Message 52237 - Posted: 17 Sep 2025, 19:23:57 UTC

9/17/2025 12:18:06 PM | LHC@home | Server can't open log file (../log_boinc01/scheduler.log)

Same here
ID: 52237 · Report as offensive     Reply Quote
paul

Send message
Joined: 6 Feb 23
Posts: 7
Credit: 148,211
RAC: 714
Message 52238 - Posted: 17 Sep 2025, 19:34:20 UTC - in response to Message 52235.  

This is server side, not client side, issue is with LHC server.
Paul.
ID: 52238 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 28 Sep 04
Posts: 780
Credit: 59,374,388
RAC: 46,656
Message 52239 - Posted: 17 Sep 2025, 19:42:33 UTC
Last modified: 17 Sep 2025, 19:43:15 UTC

Oddly the win10 host (BOINC 7.16.5) is still communicating normally with server and getting new Atlas work. The problem host didn't get even a host id from server after adding LHC as a new project. Boinc Manager shows host id as 0.
ID: 52239 · Report as offensive     Reply Quote
CloverField

Send message
Joined: 17 Oct 06
Posts: 94
Credit: 61,148,572
RAC: 24,736
Message 52240 - Posted: 18 Sep 2025, 0:14:50 UTC
Last modified: 18 Sep 2025, 0:58:28 UTC

I was able to fix this by going out to my boinc data folder and just making a file called scheduler.log
Edit: Nevermind I just realized that error message was coming from the server and I just got lucky when I tried that .
ID: 52240 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1233
Credit: 79,150,235
RAC: 161,321
Message 52241 - Posted: 18 Sep 2025, 3:04:08 UTC

I can't speak for everyone but the reason I mentioned it was so that people on the server side would see us saying this and then they would do their part to take care of that.
If we don't do that it doesn't get fixed and next thing it is a weekend and nobody looks at anything until the next monday at best.
ID: 52241 · Report as offensive     Reply Quote
Dark Angel
Avatar

Send message
Joined: 7 Aug 11
Posts: 118
Credit: 28,531,452
RAC: 38,636
Message 52242 - Posted: 18 Sep 2025, 3:55:10 UTC

Thu 18 Sep 2025 13:53:37 | LHC@home | update requested by user
Thu 18 Sep 2025 13:53:41 | LHC@home | Sending scheduler request: Requested by user.
Thu 18 Sep 2025 13:53:41 | LHC@home | Requesting new tasks for CPU
Thu 18 Sep 2025 13:53:42 | LHC@home | Scheduler request completed: got 0 new tasks
Thu 18 Sep 2025 13:53:42 | LHC@home | Server can't open log file (../log_boinc01/scheduler.log)

Still happening here
ID: 52242 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2678
Credit: 286,460,648
RAC: 126,094
Message 52243 - Posted: 18 Sep 2025, 5:52:52 UTC

This is a server side issue on boinc01.cern.ch.
So far boinc02.cern.ch is not affected.

It can't be solved on the client side without tweaking the own DNS since load balancing between both is done via DNS, hence by random.
It's not worth to do so.

Admins are informed.
ID: 52243 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 28 Sep 04
Posts: 780
Credit: 59,374,388
RAC: 46,656
Message 52244 - Posted: 18 Sep 2025, 7:34:39 UTC
Last modified: 18 Sep 2025, 7:35:34 UTC

For me the problem was solved about an hour and a half ago. Crunching is going on and cache is full :-)
The nice people at Cern got it going, I didn't do anything. Thank you!
ID: 52244 · Report as offensive     Reply Quote
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 46
Credit: 712,633
RAC: 19,353
Message 52423 - Posted: 1 Oct 2025, 15:08:50 UTC

Any info on why all servers besides scheduler, upload and download are offline?
ID: 52423 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1904
Credit: 143,955,761
RAC: 87,834
Message 52424 - Posted: 1 Oct 2025, 16:02:08 UTC - in response to Message 52423.  
Last modified: 1 Oct 2025, 16:03:28 UTC

Any info on why all servers besides scheduler, upload and download are offline?
Here CMS tasks were uploaded, but could not get reported. When pushing the Update button on the left hand side of the BOINC manager, in the status column it says "communication deferred - and the time counts downwards from 1 hour).
The BOINC event log says "Project server temporarily down for maintenance" and "project requested delay of 3600 seconds".

Also, the number of unsent CMS tasks on the server status page is shrinking.
ID: 52424 · Report as offensive     Reply Quote
Garrulus glandarius

Send message
Joined: 5 Apr 25
Posts: 46
Credit: 712,633
RAC: 19,353
Message 52426 - Posted: 1 Oct 2025, 17:20:16 UTC - in response to Message 52424.  

All systems seem to be working now, validation queue is gone.
ID: 52426 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1904
Credit: 143,955,761
RAC: 87,834
Message 52427 - Posted: 1 Oct 2025, 18:21:00 UTC - in response to Message 52426.  

All systems seem to be working now, validation queue is gone.
here, too :-)
ID: 52427 · Report as offensive     Reply Quote

Message boards : Number crunching : Server problem?


©2025 CERN