Message boards : Theory Application : New native version v300.08

Saturn911

Joined: 3 Nov 12
Posts: 36
Credit: 118,033,392
RAC: 128,505
Message 49145 - Posted: 6 Jan 2024, 7:47:07 UTC - in response to Message 49144.  

03:30:54 CET +01:00 2024-01-06: cranky-0.1.4: [INFO] Can't find '/etc/cvmfs/domain.d/cern.ch.local'.
03:30:54 CET +01:00 2024-01-06: cranky-0.1.4: [INFO] Can't find '/etc/cvmfs/config.d/cvmfs-config.cern.ch.local'.
What is the reason for this, and how do I fill these files with content?

Tasks are running with status zero!

https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5594&postid=48539
ID: 49145
Aurum
Joined: 12 Jun 18
Posts: 126
Credit: 53,906,141
RAC: 46,247
Message 49146 - Posted: 6 Jan 2024, 17:19:20 UTC

Will I still be able to run native ATLAS with this new 300.08 Theory configuration?
ID: 49146
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2418
Credit: 226,735,116
RAC: 130,288
Message 49147 - Posted: 6 Jan 2024, 18:24:21 UTC - in response to Message 49146.  

Yes, if ATLAS worked before.
Theory does not affect ATLAS.

But you should revise your CVMFS setup following the advice here:
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6075&postid=49145
ID: 49147
maeax
Joined: 2 May 07
Posts: 2108
Credit: 159,824,478
RAC: 104,482
Message 49149 - Posted: 7 Jan 2024, 5:51:32 UTC - in response to Message 49146.  

Will I still be able to run native ATLAS with this new 300.08 Theory configuration?

If you have an account on the -dev project, you can test it there.
ID: 49149
[AF>Le_Pommier] Jerome_C2005
Joined: 12 Jul 11
Posts: 95
Credit: 1,129,876
RAC: 0
Message 49151 - Posted: 7 Jan 2024, 21:46:23 UTC

I have 2 native Theory tasks that have been running for days (2 and 4), and I see that BOINC reports 100% progress while they are still running (using CPU).

Is this possible? Are they still running and doing useful work (not stalled or dead)?
ID: 49151
Toggleton
Joined: 4 Mar 17
Posts: 20
Credit: 8,243,431
RAC: 12,799
Message 49152 - Posted: 7 Jan 2024, 22:43:26 UTC - in response to Message 49151.  
Last modified: 7 Jan 2024, 22:51:01 UTC

I have such long-running tasks right now too. One is a Sherpa task like yours (52 hours so far), mcplots runspec: boinc pp winclusive 7000 10 - sherpa 2.2.5 default 100000 126
https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=218564767
The other has run for 30 hours so far, mcplots runspec: boinc ppbar mb-inelastic 900 - - pythia8 8.306 dire-default 100000 136
https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=218596130

Both long-running tasks have nearly the same name as yours, so I guess they are just long-running experiments.

I have had Theory tasks before that ran for days and were successful. The only unusual thing about these 2 tasks is that they don't write how many events are done to /var/lib/boinc/slots/*/cernvm/shared/runRivet.log, as all the other tasks have so far.

Runtime of recent Theory tasks in hours: average, min, max 3.09 (0.01 - 238.65)
Theory tasks can run very long.
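
For anyone who wants to check the event counter without opening the log by hand, here is a minimal sketch. It assumes progress lines of the form "N events processed" (as seen in a log excerpt elsewhere in this thread) and the slot path quoted above; the function names are made up for illustration, and other BOINC installs may keep their slots somewhere else.

```python
import glob
import re

# Assumed progress line format: "<number> events processed"
EVENTS_RE = re.compile(r"^\s*(\d+) events processed", re.MULTILINE)

def last_events_processed(log_text):
    """Return the most recent event count in the log text, or None if absent."""
    matches = EVENTS_RE.findall(log_text)
    return int(matches[-1]) if matches else None

def scan_slots(pattern="/var/lib/boinc/slots/*/cernvm/shared/runRivet.log"):
    """Map each runRivet.log found under the slots to its latest event count."""
    counts = {}
    for path in glob.glob(pattern):
        with open(path, errors="replace") as handle:
            counts[path] = last_events_processed(handle.read())
    return counts
```

Calling scan_slots() returns a dictionary keyed by log path; a value of None means the task has not written any progress line yet, which is the situation described above.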
ID: 49152
maeax
Joined: 2 May 07
Posts: 2108
Credit: 159,824,478
RAC: 104,482
Message 49154 - Posted: 8 Jan 2024, 11:40:51 UTC

11:27:47 CET +01:00 2024-01-08: cranky-0.1.4: [INFO] Using /cvmfs/cernvm-prod.cern.ch/cvm4
mkdir: cannot create directory '/sys/fs/cgroup/unified': Read-only file system
I changed the permissions of /sys/fs/cgroup from read-only to read/write.
I get the same message as before in -native, but the task finished with status zero.
No setup script used so far (CentOS 9). https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10806714
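
A "read-only file system" error usually comes from the mount options rather than from file permissions. The following is a minimal sketch for checking that, assuming a Linux host where /proc/mounts lists one mount per line with the options in the fourth field. The function name is hypothetical and this is not part of the Theory or cranky tooling.

```python
def mount_is_read_only(mounts_text, mount_point):
    """Return True if mount_point is mounted 'ro', False if 'rw', None if not listed.

    mounts_text is the content of /proc/mounts. If a mount point appears
    more than once, the last entry wins because later mounts shadow earlier ones.
    """
    result = None
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) >= 4 and fields[1] == mount_point:
            result = "ro" in fields[3].split(",")
    return result

def check_cgroup_mount(path="/sys/fs/cgroup"):
    """Read /proc/mounts and report the state of the given mount point."""
    with open("/proc/mounts") as handle:
        return mount_is_read_only(handle.read(), path)
```

If check_cgroup_mount() returns True, the directory is mounted read-only and mkdir below it will fail no matter what the directory permissions say.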
ID: 49154
[AF>Le_Pommier] Jerome_C2005
Joined: 12 Jul 11
Posts: 95
Credit: 1,129,876
RAC: 0
Message 49161 - Posted: 9 Jan 2024, 23:20:49 UTC
Last modified: 9 Jan 2024, 23:21:04 UTC

Well, that's not very nice. I realized today that BOINC had stopped without notice yesterday at noon in the VM (I couldn't figure out why). After I restarted BOINC the 2 tasks were reset: one has not even started and the second has very little computing time now :(

ID: 49161
[AF>Le_Pommier] Jerome_C2005
Joined: 12 Jul 11
Posts: 95
Credit: 1,129,876
RAC: 0
Message 49171 - Posted: 12 Jan 2024, 11:52:24 UTC

I still have them running, always stating that 100% is done!

They keep restarting forever. Now it says only 1 day of calculation, so I guess each time I need to restart the VM they restart from 0, and again they don't finish.

I decided to abort them; enough CPU cycles wasted!
ID: 49171
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2418
Credit: 226,735,116
RAC: 130,288
Message 49172 - Posted: 12 Jan 2024, 12:24:44 UTC - in response to Message 49171.  

I guess each time I need to restart the VM they restart from 0

Yes, that's what they do.
It is well known that Theory tasks start from scratch when you restart the computer (or the VM, as in this case).

So, why don't you let the tasks finish before you restart the VM?
Or, why do you shut the VM down instead of just suspending/resuming it?

Also well known:
Theory tasks run between a few minutes (min) and 10 days (max).
This depends on the task's input data.
Locate "runRivet.log" below the worker slot to check how many events a task is configured to process and how many are already done.
Together with the time already used you can estimate the remaining runtime.
BOINC is not aware of those numbers, hence presents fake estimates based on averages.
This fact has also been discussed many times in this forum.
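
The estimate described above can be sketched as a simple linear extrapolation. This is only an illustration: it assumes events are processed at a roughly constant rate, and the function name and the sample numbers are made up. The two event counts have to be read from runRivet.log by hand.

```python
def estimate_remaining_seconds(events_done, events_total, elapsed_seconds):
    """Extrapolate the remaining runtime from the progress made so far.

    Returns None when no events have been processed yet, because there is
    nothing to extrapolate from.
    """
    if events_done <= 0:
        return None
    if events_done >= events_total:
        return 0.0
    # Average seconds per processed event, applied to the events still to do.
    return (events_total - events_done) * elapsed_seconds / events_done

# Example: 25000 of 100000 events done after 10 hours of runtime.
remaining = estimate_remaining_seconds(25000, 100000, 10 * 3600)
print(f"about {remaining / 3600:.1f} hours left")
```

With these sample numbers the result is 30 hours. Early in a run the rate is often not representative, so the figure should be treated as a rough guide.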
ID: 49172
[AF>Le_Pommier] Jerome_C2005
Joined: 12 Jul 11
Posts: 95
Credit: 1,129,876
RAC: 0
Message 49175 - Posted: 12 Jan 2024, 21:53:47 UTC - in response to Message 49172.  

Thanks for the answer. I never stop this VM, but I was trying to run some yoyo tasks on it and they would kill BOINC through memory saturation (OOM killer). That cost me several BOINC restarts before I understood the issue and limited yoyo to 1 concurrent task; now it seems OK.

It's too late since I already cancelled the 2 tasks, but I'll know for next time. I had never experienced such long runners, but it had been a long time since I last worked with LHC tasks.
ID: 49175
Drago75
Joined: 22 Jan 21
Posts: 5
Credit: 257,390
RAC: 265
Message 49197 - Posted: 16 Jan 2024, 12:40:15 UTC

While setting up the native version of Theory I got as far as installing CVMFS, but when I get to the command "cvmfs_config setup", Ubuntu tells me that I need root privileges for it. Can anybody help me with that?
ID: 49197
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2418
Credit: 226,735,116
RAC: 130,288
Message 49198 - Posted: 16 Jan 2024, 13:07:20 UTC - in response to Message 49197.  

If you run a command on Linux that requires root privileges, prefix it with "sudo " and enter the password when asked (on Ubuntu, sudo asks for your own user password by default, not root's).
Hence, here run "sudo cvmfs_config setup".

Be aware that cvmfs_config allows some subcommands to be run as normal user and some subcommands require root privileges.
ID: 49198
Drago75
Joined: 22 Jan 21
Posts: 5
Credit: 257,390
RAC: 265
Message 49199 - Posted: 16 Jan 2024, 14:34:53 UTC - in response to Message 49198.  

Tried that. It did nothing, just opened a new prompt line. "cvmfs_config probe" afterwards did the same.
ID: 49199
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2418
Credit: 226,735,116
RAC: 130,288
Message 49200 - Posted: 16 Jan 2024, 15:02:09 UTC - in response to Message 49199.  

It did nothing, ...

It may have done the setup without printing a comment.


... "cvmfs_config probe" afterwards did the same.

Your CVMFS configuration may be incomplete.
If you have trouble with it follow the HowTo here and post questions here.

Leave this thread for comments/questions related to Theory native v300.08.
ID: 49200
Ryan Munro
Joined: 17 Aug 17
Posts: 77
Credit: 6,062,678
RAC: 18,640
Message 49255 - Posted: 24 Jan 2024, 15:21:38 UTC

I get the error

"Found Sudo-Version 1.9.9.
This sudo version is lower than 1.9.10.
It does not support regular expressions.
Hence, sudoers will not be modified.
Error running /tmp/prepare_theory_native_environment"

Bit of a Linux n00b; does this mean I will need to wait for software updates before I can run Theory? (Running Linux Mint)
I did have Theory running briefly, but had to reinstall, and now it's failing every time.
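
The version check in that message is a numeric comparison, which is why 1.9.9 counts as lower than 1.9.10 even though it looks "bigger" as text. Here is a minimal sketch of such a check, assuming only the 1.9.10 threshold quoted in the error message. The function names are hypothetical and this is not the code of the actual setup script.

```python
import re

# Threshold taken from the error message above.
MIN_REGEX_VERSION = (1, 9, 10)

def parse_version(text):
    """Extract (major, minor, patch) from a string such as '1.9.9' or '1.9.15p5'."""
    match = re.match(r"\s*(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"not a recognisable version: {text!r}")
    return tuple(int(part) for part in match.groups())

def supports_sudoers_regex(version_text):
    """Compare component by component, not character by character."""
    return parse_version(version_text) >= MIN_REGEX_VERSION

print(supports_sudoers_regex("1.9.9"))   # False
print(supports_sudoers_regex("1.9.10"))  # True
```

A plain string comparison would get this wrong, because "1.9.9" sorts after "1.9.10" alphabetically.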
ID: 49255
maeax
Joined: 2 May 07
Posts: 2108
Credit: 159,824,478
RAC: 104,482
Message 49403 - Posted: 6 Feb 2024, 6:57:30 UTC
Last modified: 6 Feb 2024, 7:23:27 UTC

I have one task in -native that never finishes. It has started over from the beginning more than three times:
ppbar mb-inelastic 1800 - - pythia8 8.303 dire-default 0 2 0 0 2
https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=219290397
-------- PYTHIA Event Listing (hard process) -----------------------------------------------------------------------------------

  no     id  name      status   mothers daughters   colours      p_x      p_y       p_z         e         m
   0     90  (system)     -11    0    0    0    0    0    0    0.000    0.000     0.000  1800.000  1800.000
   1   2212  (p+)         -12    0    0    3    0    0    0    0.000    0.000   900.000   900.000     0.938
   2  -2212  (pbar-)      -12    0    0    4    0    0    0    0.000    0.000  -900.000   900.000     0.938
   3     21  (g)          -21    1    0    5    6  101  102    0.000    0.000     2.291     2.291     0.000
   4     21  (g)          -21    2    0    5    6  102  103    0.000    0.000   -10.836    10.836     0.000
   5     21  g             23    3    4    0    0  101  104   -1.281   -2.649     1.025     3.116     0.000
   6     21  g             23    3    4    0    0  104  103    1.281    2.649    -9.570    10.012     0.000
Charge sum: 0.000 Momentum sum: 0.000 0.000 -8.545 13.127 9.966

-------- End PYTHIA Event Listing -----------------------------------------------------------------------------------------------
Rivet.AnalysisHandler: INFO Only using nominal weight. Variation weights will be ignored.
0 events processed
ID: 49403
rob
Joined: 4 Mar 11
Posts: 22
Credit: 3,593,654
RAC: 631
Message 49419 - Posted: 6 Feb 2024, 17:11:03 UTC

There is something strange going on with the initial estimated run time vs. the actual run time of some, if not all, of these tasks.
On my PC:
Initial estimated run time = 61.5 minutes
During the first 60 minutes of running, the elapsed time increments at about 1 second per second of clock time and continues at this rate; it is only a couple of seconds out after an hour.
However the remaining time only drops by about 15 seconds to 60.25 minutes.
At 61.5 elapsed minutes the remaining time jumps to 9 days 23 hours and 46 minutes.

This has the effect that my computer downloads a number of tasks that are initially predicted to be finished by the deadline, but at the "1 hour" adjustment the majority will not even be started, never mind finished, by the deadline (10 days). This is potentially a rather unproductive waste of bandwidth, not to mention frustrating for me. It will be interesting to see what the actual run time of these tasks is.
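
To put rough numbers on that effect, here is a back-of-the-envelope sketch. The 61.5 minute estimate and the 10 day deadline come from the description above; the 1 day work buffer, the 4 cores and the 4 day real runtime are invented for illustration. BOINC's real work-fetch logic is more involved than this.

```python
import math

def tasks_fetched(buffer_days, cores, est_minutes):
    """Rough number of tasks needed to fill the work buffer at the estimated runtime."""
    return math.ceil(buffer_days * 24 * 60 * cores / est_minutes)

def tasks_finishable(deadline_days, cores, actual_days):
    """Tasks that can complete before the deadline if each one really takes actual_days."""
    return cores * math.floor(deadline_days / actual_days)

# Hypothetical host: 1 day buffer, 4 cores, tasks that really run 4 days each.
fetched = tasks_fetched(buffer_days=1.0, cores=4, est_minutes=61.5)
finishable = tasks_finishable(deadline_days=10, cores=4, actual_days=4.0)
print(fetched, finishable)
```

Under these assumptions the client would fetch 94 tasks but could finish only 8 of them before the deadline, which is the mismatch described above.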
ID: 49419
maeax
Joined: 2 May 07
Posts: 2108
Credit: 159,824,478
RAC: 104,482
Message 49420 - Posted: 6 Feb 2024, 17:18:16 UTC - in response to Message 49419.  

Not all Theory tasks finish within a few hours.
When you get a Sherpa task... We have seen some end in a crash after 10 days.
So we have to monitor the duration ourselves.
What you can do is use an app_config.xml to control the number of incoming Theory tasks.
mcplots shows how many tasks have been done.
ID: 49420
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2418
Credit: 226,735,116
RAC: 130,288
Message 49431 - Posted: 6 Feb 2024, 19:14:04 UTC - in response to Message 49419.  

BOINC's runtime estimation (like the credit calculation) is not really good when real runtimes are highly variable.

In the case of Theory, task runtime can be anywhere between a few seconds and 10 days.
At the moment it looks like the runtimes of many tasks are much longer than usual (a couple of days), while some weeks ago many were much shorter.
BOINC usually needs a couple of days, sometimes even weeks, to catch up and adjust the average.

The only thing that helps is to slightly modify BOINC's work buffer size.
Although a myth to the contrary has resurfaced every now and then for years, app_config.xml does not support a parameter that limits the number of tasks a project server sends.
ID: 49431


©2024 CERN