Message boards : CMS Application : CMS@Home -- jobs update

[VENETO] boboviz
Joined: 7 May 08
Posts: 233
Credit: 1,575,053
RAC: 0
Message 51360 - Posted: 7 Jan 2025, 11:05:47 UTC - in response to Message 51357.  

"I have some running at -dev but that site needs to be poked with a stick since it is just blank pages for everything so we can't see what is going on."


Still waiting for the lhc-dev website...
ID: 51360
mmonnin
Joined: 22 Mar 17
Posts: 66
Credit: 25,047,948
RAC: 42,717
Message 51361 - Posted: 7 Jan 2025, 12:25:13 UTC
Last modified: 7 Jan 2025, 12:26:13 UTC

All the tasks I've gotten on Linux PCs have been single-threaded, even though the LHC preferences are set to 4 threads. The lone task I've completed on Win10 was multi-threaded. Both logs say "Setting CPU Count for VM. (4)".

The dev site seems to be missing, but the services must be up since I can download a task.
ID: 51361
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2607
Credit: 262,620,721
RAC: 139,578
Message 51362 - Posted: 7 Jan 2025, 15:10:03 UTC - in response to Message 51361.  

"All the tasks I've gotten on Linux PCs have been single threaded ..."

This can't be verified since your computers are hidden.
And, no, please don't post the logs.

CERN Grafana shows that since yesterday all CMS jobs were 4-core jobs.
ID: 51362
mmonnin
Joined: 22 Mar 17
Posts: 66
Credit: 25,047,948
RAC: 42,717
Message 51363 - Posted: 7 Jan 2025, 16:42:03 UTC

https://lhcathome.cern.ch/lhcathome/result.php?resultid=418694926

Run time 46 min 22 sec
CPU time 33 min 47 sec

Ya don't want logs? Posting task links is not useful for history, as task data is purged. I've often come to BOINC forums and been unable to verify whether I am having the same issue as described because only a link was posted. Links also don't help when searching. For this PC, I was able to fix the vbox issue yesterday by searching the LHC forums for the error message. That was only possible because someone had posted their log.
ID: 51363
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2607
Credit: 262,620,721
RAC: 139,578
Message 51364 - Posted: 7 Jan 2025, 17:32:38 UTC - in response to Message 51363.  

Your VMs are configured to use 4 cores.

Check the consoles of a VM that has been active for at least 30-35 min (the setup phase must be finished).
console 3: should show 4 instances of "cmsExternalGene" each using close to 100% CPU
console 4: should show "WMAgent Run the job ..." as last line
console 5: should show a couple of lines like "INFO:root:PSS: ... PCPU: 15.0; PMEM 4.4"

If the VM finishes without all of that, the job did not fully start.
The low CPU time compared to the walltime suggests the same.
This happens sometimes if IP connections get lost.
Check if your local firewall blocks any connections initiated by the VMs.
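
The CPU time versus walltime comparison can be made concrete with the figures posted earlier in this thread (run time 46 min 22 sec, CPU time 33 min 47 sec). The snippet below is only a rough sketch: the helper names are made up for illustration, and the 4-core figure is the VM configuration mentioned above.

```python
# Rough sanity check: how many cores were busy on average?
# Helper names are hypothetical; the numbers come from the task posted in this thread.

CONFIGURED_CORES = 4  # cores the VM is configured to use


def to_seconds(minutes: int, seconds: int) -> int:
    """Convert a 'min sec' pair into seconds."""
    return minutes * 60 + seconds


def average_busy_cores(cpu_seconds: float, wall_seconds: float) -> float:
    """CPU time divided by walltime gives the average number of busy cores."""
    if wall_seconds <= 0:
        raise ValueError("walltime must be positive")
    return cpu_seconds / wall_seconds


wall = to_seconds(46, 22)  # run time
cpu = to_seconds(33, 47)   # CPU time
busy = average_busy_cores(cpu, wall)
print(f"average busy cores: {busy:.2f} of {CONFIGURED_CORES}")
```

A 4-core job that really started would be expected to push this value well above 1 once the setup phase is over; a value below 1 fits a VM that never got a scientific job.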

Also check the VM internal logs.
Stderr.txt reports the port where they can be found, like:
Detected: Web Application Enabled (http://localhost:42069)
Type this in a browser on the host running the VM:
http://localhost:42069/logs
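
If you want to script that lookup, a minimal sketch could pull the address out of stderr.txt and append /logs. The function name and the regular expression are assumptions written for this example; only the "Detected: Web Application Enabled (...)" line format is taken from the post above, and the port differs per VM.

```python
import re
from typing import Optional

# Pattern for the line quoted above; the port number varies from VM to VM.
WEBAPP_PATTERN = re.compile(
    r"Detected: Web Application Enabled \((http://localhost:\d+)\)"
)


def logs_url_from_stderr(stderr_text: str) -> Optional[str]:
    """Return the VM log URL derived from stderr.txt content, or None if absent."""
    match = WEBAPP_PATTERN.search(stderr_text)
    if match is None:
        return None
    return match.group(1) + "/logs"


sample = "Detected: Web Application Enabled (http://localhost:42069)"
print(logs_url_from_stderr(sample))
```

The URL only works in a browser on the host that runs the VM, and only while the VM is active.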
ID: 51364
Pascal
Joined: 13 May 20
Posts: 38
Credit: 2,045,533
RAC: 2,462
Message 51365 - Posted: 7 Jan 2025, 18:54:42 UTC

Hello,
I received 112 CMS tasks with a run time of 17h50 with 4 threads.
Is this normal?
ID: 51365
Toby Broom
Volunteer moderator
Joined: 27 Sep 08
Posts: 859
Credit: 703,877,653
RAC: 164,824
Message 51366 - Posted: 7 Jan 2025, 19:48:06 UTC - in response to Message 51365.  

Not really, but it happens when the estimated compute performance of your computer is high compared to the estimated compute for the WU.

On your side you can run the benchmarks in BOINC. The estimated compute for the WU is set by the project; you can override it by editing the WU. For me this is a bit too much effort, but others have done it.
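
To illustrate how the two estimates interact, here is a minimal sketch. It assumes the usual BOINC relation that the runtime estimate is roughly the workunit's rsc_fpops_est divided by the speed the client assumes for the app; the name projected_flops and all numbers below are made up for the example, not values from this project.

```python
# Sketch of a BOINC-style runtime estimate. All figures are hypothetical.


def estimated_runtime_seconds(rsc_fpops_est: float, projected_flops: float) -> float:
    """Estimated runtime = estimated operations / assumed speed of the host."""
    if projected_flops <= 0:
        raise ValueError("projected_flops must be positive")
    return rsc_fpops_est / projected_flops


def format_hm(seconds: float) -> str:
    """Format a duration in seconds as e.g. '17h50'."""
    total_minutes = int(seconds // 60)
    hours, minutes = divmod(total_minutes, 60)
    return f"{hours}h{minutes:02d}"


fpops_est = 3.6e15  # hypothetical workunit estimate (floating-point operations)
flops = 5.0e10      # hypothetical speed assumed for this host (operations per second)

print(format_hm(estimated_runtime_seconds(fpops_est, flops)))
# A host rated twice as fast gets half the estimate for the same workunit.
print(format_hm(estimated_runtime_seconds(fpops_est, 2 * flops)))
```

The point of the sketch is only the ratio: changing either side of it moves the estimate, which in turn affects how many tasks the client asks for.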
ID: 51366
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2607
Credit: 262,620,721
RAC: 139,578
Message 51367 - Posted: 7 Jan 2025, 20:25:50 UTC

CMS tasks running scientific jobs usually finish within 12 to 18 h.
The BOINC benchmarks are not relevant since each VM runs internal benchmarks that are not reported back to BOINC.
Based on those, "glidein" tries to calculate whether there is enough time left to request another job.

Each pause/resume initiated by BOINC disturbs this calculation and may result in losing the last scientific job (18 h is a hard limit).
Since this is also not reported back to BOINC, it is recommended to avoid pause/resume cycles.
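
The kind of check described above can be pictured with a small sketch. This is not the actual glidein code; the function name and the job durations are hypothetical, and only the 18 h hard limit is taken from this post.

```python
# Illustration only: a time-left check against a hard limit.
# NOT the real glidein logic; names and job durations are assumptions.

HARD_LIMIT_SECONDS = 18 * 3600  # 18 h hard limit mentioned above


def can_request_another_job(
    elapsed_seconds: float,
    estimated_job_seconds: float,
    limit_seconds: float = HARD_LIMIT_SECONDS,
) -> bool:
    """True if the next job is expected to finish before the hard limit."""
    remaining = limit_seconds - elapsed_seconds
    return estimated_job_seconds <= remaining


# Hypothetical 7 h job: fits after 10 h of runtime, no longer fits after 12 h.
print(can_request_another_job(10 * 3600, 7 * 3600))
print(can_request_another_job(12 * 3600, 7 * 3600))
```

Any estimate of this sort depends on the job duration being predictable, which is why interruptions that the VM cannot account for can cost the last job.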

The short runtimes during the last few weeks came from tasks without any scientific job.
They were just empty envelope VMs that did a basic setup and then shut down after a timeout.

To be sure a scientific job is running, check the consoles and logs as described here:
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6265&postid=51364
ID: 51367
Crystal Pellet
Volunteer moderator
Volunteer tester
Joined: 14 Jan 10
Posts: 1440
Credit: 9,663,827
RAC: 1,380
Message 51382 - Posted: 10 Jan 2025, 18:50:32 UTC

I noticed for the first time that one CMS job is now processing 120,000 events instead of the 100,000 in previous batches.
Is this the new default number of events?
ID: 51382
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 1072
Credit: 8,434,377
RAC: 6,861
Message 51392 - Posted: 13 Jan 2025, 14:25:28 UTC - in response to Message 51382.  

"I noticed for the first time that one CMS job is now processing 120,000 events instead of the 100,000 in previous batches.
Is this the new default number of events?"

Yes, I changed that a while back to adjust job timing.
ID: 51392
Harri Liljeroos
Joined: 28 Sep 04
Posts: 748
Credit: 52,076,306
RAC: 33,442
Message 51417 - Posted: 16 Jan 2025, 22:10:32 UTC

The project is now delivering a new version of the CMS application (70.60).
ID: 51417
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2607
Credit: 262,620,721
RAC: 139,578
Message 51418 - Posted: 16 Jan 2025, 22:17:37 UTC - in response to Message 51417.  

Same as before, except that this app_version uses vboxwrapper 26208, like the Theory app_version published earlier this afternoon.
ID: 51418