Message boards : Number crunching : Server problem?

Harri Liljeroos
Joined: 28 Sep 04
Posts: 790
Credit: 61,351,248
RAC: 52,268
Message 52660 - Posted: 14 Nov 2025, 10:03:10 UTC - in response to Message 52659.  

Now I see on one host that it is getting only 1 Theory task at a time. And there is only 1 task waiting for crunching. So number of tasks in progress is max_concurrent (from app_config) + 1.
Have you set Max # CPUs to 1 in the prefs?

No, I have both hosts set to 4 max CPUs.

My assessment of the situation turns out to be wrong. The win10 host (BOINC 7.16.5) is still doing what I said above, but the win11 host (BOINC 8.0.2) didn't get new Theory tasks and was ending up with idle CPU cores. So I enabled CMS work for that host and immediately got 8 of those.
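
For reference, a max_concurrent limit like the one mentioned above is set in an app_config.xml file in the project's directory under the BOINC data folder. Here is a minimal sketch in Python that generates such a file; the app name "Theory" and the value 4 are placeholders for illustration, not the settings used on the hosts in this thread.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name, max_concurrent):
    """Return app_config.xml text limiting one app to max_concurrent running tasks."""
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    ET.indent(root)  # pretty-print; requires Python 3.9+
    return ET.tostring(root, encoding="unicode")


# Placeholder values: app name and limit are assumptions for illustration.
xml_text = build_app_config("Theory", 4)
print(xml_text)
```

The client only picks up changes to this file after "Read config files" is selected in the BOINC Manager or the client is restarted.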
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 888
Credit: 757,929,193
RAC: 348,245
Message 52666 - Posted: 16 Nov 2025, 17:10:59 UTC
Last modified: 16 Nov 2025, 17:11:30 UTC

There does seem to be something different with the number of WUs.

I have unlimited set for both Max # CPU & Max # jobs, with run all applications.

My higher-end computers have about 40-50 WUs each and are requesting work, but they are not given more, so they are not fully utilised.

The cap seems to be 8 for CMS and 10 for Theory. I'm not sure about ATLAS, as I don't have many of those at the moment. So it seems like unlimited is no longer unlimited.

I don't have max_concurrent set, only that Theory and ATLAS should use just 1 core per job/task.
Erich56

Joined: 18 Dec 15
Posts: 1921
Credit: 148,162,664
RAC: 132,304
Message 52667 - Posted: 16 Nov 2025, 17:34:28 UTC

I am having the same experience: for one of my PCs (16 cores) I have set the max. number of Theory tasks to "unlimited", but I get only 10. How come?
Garrulus glandarius

Joined: 5 Apr 25
Posts: 56
Credit: 1,098,480
RAC: 4,070
Message 52668 - Posted: 16 Nov 2025, 17:52:27 UTC - in response to Message 52667.  

I am having the same experience: for one of my PCs (16 cores) I have set the max. number of Theory tasks to "unlimited", but I get only 10. How come?


It might indeed be a hard cap on the server side. I noticed one such cap at TN-Grid, where each host can get at most 6 tasks per thread (not per core), regardless of any other settings.
pututu

Joined: 13 May 17
Posts: 1
Credit: 11,082,442
RAC: 266,227
Message 52669 - Posted: 16 Nov 2025, 18:18:27 UTC - in response to Message 52666.  
Last modified: 16 Nov 2025, 18:37:45 UTC

I'm also seeing a cap of 8 CMS tasks per PC, irrespective of the CPU core count or the work cache size that is set. If you have a high core count setup with sufficient RAM and want to run CMS tasks only, the only way to keep the PC at 100% utilization is to run multiple BOINC clients.

With multiple BOINC clients, on Linux machines I'm not seeing this VBoxManage.exe error about registering/attaching the CMS_2025_04_08_prod.vdi virtual hard disk (maybe on the first task, but subsequent ones are OK), but I do see it on Windows machines.

I prefer to run CMS because its run times don't fluctuate as much as those of Theory tasks; CMS tasks seem to have a run-time cap of 64,800 seconds. I have had a few Theory tasks that ran for days.
Harri Liljeroos
Joined: 28 Sep 04
Posts: 790
Credit: 61,351,248
RAC: 52,268
Message 52670 - Posted: 16 Nov 2025, 19:01:27 UTC

I'm seeing the same caps for Theory and CMS here as well. But ATLAS seems different. The last new ATLAS tasks arrived on 14 November, and I received 37 for an 8/16 core CPU and 22 for a 16/32 core CPU. The server-side limit used to be 16 for both of those CPUs. There weren't enough tasks available to see what the limit is now, but the limits have definitely changed for ATLAS too.
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2703
Credit: 290,790,338
RAC: 141,316
Message 52671 - Posted: 16 Nov 2025, 19:44:08 UTC - in response to Message 52669.  

With multiple BOINC clients, on Linux machines I'm not seeing this VBoxManage.exe error about registering/attaching the CMS_2025_04_08_prod.vdi virtual hard disk (maybe on the first task, but subsequent ones are OK), but I do see it on Windows machines.

This was not related to the number of BOINC instances running on the same host.
Instead, it was related to a possible race condition involving vboxwrapper.

Vboxwrapper 26210 used here for ATLAS/CMS/Theory mitigates/recovers from those errors on Apple/Linux/Windows.
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 888
Credit: 757,929,193
RAC: 348,245
Message 52673 - Posted: 18 Nov 2025, 8:13:49 UTC

ATLAS got a bit more work, so I have an update (tasks per host, one row per host in the order listed):

Host    Theory    CMS    ATLAS
----    ------    ---    -----
1       19        6      0
2       7         3      1
3       10        8      40
4       16        8      9
5       10        8      2
6       10        8      40


Seems like the ATLAS cap may be 40.
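
As a quick check of that guess, here is a small Python sketch that takes the per-host counts listed above and reports, for each application, the highest count seen and how many hosts sit exactly at that value. Several hosts stuck at the same maximum hints at a cap, but does not prove one; the variable and function names are just for illustration.

```python
# Per-host task counts copied from the listing above (six hosts, same order).
counts = {
    "Theory": [19, 7, 10, 16, 10, 10],
    "CMS": [6, 3, 8, 8, 8, 8],
    "ATLAS": [0, 1, 40, 9, 2, 40],
}


def summarize(per_host):
    """Return (highest count, number of hosts sitting exactly at it)."""
    top = max(per_host)
    return top, per_host.count(top)


summary = {app: summarize(values) for app, values in counts.items()}

for app, (top, hosts_at_top) in summary.items():
    total = len(counts[app])
    print(f"{app}: max {top} tasks, seen on {hosts_at_top} of {total} hosts")
```

Hosts with a small work cache may never reach a cap at all, so the counts below the maximum say nothing either way.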
Harri Liljeroos
Joined: 28 Sep 04
Posts: 790
Credit: 61,351,248
RAC: 52,268
Message 52674 - Posted: 18 Nov 2025, 8:38:29 UTC - in response to Message 52673.  

For my liking, 40 is too high, especially with these 1000-event tasks.
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 888
Credit: 757,929,193
RAC: 348,245
Message 52677 - Posted: 18 Nov 2025, 17:06:51 UTC - in response to Message 52674.  
Last modified: 19 Nov 2025, 20:18:42 UTC

In theory, BOINC will send out tasks based on your preferences: e.g. if you store 0.1 days of work and the tasks take 10 days, you would not be storing many, so a cap of 40 would not easily be reached.

e.g.

LHC@home	11/19/2025 9:17:41 PM	Not requesting tasks: don't need (CPU: job cache full; Intel GPU: no applications)
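
To put rough numbers on that, here is a deliberately simplified Python sketch. It is not BOINC's actual work-fetch algorithm, only back-of-the-envelope arithmetic that assumes single-core tasks, one running task per core, and a queue sized to cover the work buffer. The function name and the example values are assumptions for illustration.

```python
import math


def tasks_to_fill_cache(buffer_days, ncpus, est_task_days, server_cap):
    """Rough estimate of tasks a host would hold, limited by a server-side cap.

    Simplified model (an assumption, not BOINC's real scheduler):
    one running task per core, plus enough queued tasks to cover
    buffer_days of work on every core.
    """
    running = ncpus
    queued = math.ceil(buffer_days * ncpus / est_task_days)
    return min(running + queued, server_cap)


# Small cache, very long tasks: the cap of 40 is never reached.
print(tasks_to_fill_cache(0.1, 16, 10, 40))    # 17

# Larger cache, short tasks: the cap becomes the limiting factor.
print(tasks_to_fill_cache(0.5, 16, 0.25, 40))  # 40
```

In the first case the host stops asking for work long before the cap matters, which is what the "job cache full" log line above reflects.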
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 888
Credit: 757,929,193
RAC: 348,245
Message 52685 - Posted: 21 Nov 2025, 21:55:05 UTC

Things seem to have gone back to normal.

For Theory, 4/6 have more than 10. The ones with fewer than 10 are probably, as Harri and I were discussing, limited by their compute speed and small cache size.

For CMS, 1/6 has more than 8, so could be.

For ATLAS, none have more than 40 so this could still be the cap.


©2025 CERN