41) Message boards : Theory Application : Theory Task doing nothing (Message 42615)
Posted 25 May 2020 by CloverField
Post:
There must be something wrong with your computer:
You have a sixtrack task with x86 (32-bit) and it was not finished:
https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=139948215
1. Check your OS.
2. Let only sixtrack run (prefs).

Edit: Sorry, there is an x86 version running in sixtrack:
Microsoft Windows (98 or later) running on an Intel x86-compatible CPU


1. I'm running 64-bit Windows.
2. I will switch to sixtrack only in a moment.

I don't think that task is blocking network connections. Off the top of my head, sixtrack doesn't talk to the internet.
I was also able to find the task running away happily.

42) Message boards : Theory Application : Theory Task doing nothing (Message 42611)
Posted 24 May 2020 by CloverField
Post:
Woke up to 4 more doing that this morning, along with some ATLAS tasks doing nothing.
Are there network problems at CERN?
43) Message boards : Theory Application : Theory Task doing nothing (Message 42535)
Posted 18 May 2020 by CloverField
Post:
I now have two more in my currently running tasks doing the exact same thing.
44) Message boards : Theory Application : Theory Task doing nothing (Message 42531)
Posted 17 May 2020 by CloverField
Post:
I've gotten about four Theory tasks today that seem to be doing nothing. Showing the VM console reveals this.

Top shows that nothing is running.

45) Message boards : Number crunching : How does task switching actually work? (Message 42530)
Posted 17 May 2020 by CloverField
Post:
So it kind of worked. It will still start new ATLAS tasks as they come in, ahead of Theory tasks that are already running.
However, when it is done with the ATLAS tasks it has downloaded, it lets the Theory tasks run until the first one is complete, and then it usually fetches new work.
Should I reduce the 360000 to something like 130000, or is this something I'm just going to have to live with?
46) Message boards : Number crunching : How does task switching actually work? (Message 42520)
Posted 16 May 2020 by CloverField
Post:
Here's an example. It was running 3 ATLAS tasks and 8 Theory tasks for a bit. The ATLAS tasks finished and it started working on the rest of the Theory tasks I had downloaded.
The scheduler got some work and downloaded some more ATLAS tasks, and instead of letting the Theory tasks finish, it paused them and started right up on the ATLAS tasks it had just downloaded.
47) Message boards : Number crunching : How does task switching actually work? (Message 42516)
Posted 16 May 2020 by CloverField
Post:
Sorry, I explained that poorly. Say I send a work request and I get ten ATLAS tasks, ten Theory tasks, and ten CMS tasks.
It will start running the ATLAS tasks for a little bit, get about halfway in, then switch to Theory and run those for a little bit, then switch to CMS, and so on.
I'm wondering more why it doesn't run tasks to completion and instead jumps all over the place between the tasks I have downloaded.
48) Message boards : Number crunching : How does task switching actually work? (Message 42508)
Posted 16 May 2020 by CloverField
Post:
I am currently only running the LHC@home project through BOINC, and I have selected to receive jobs from all applications.
However, I noticed that it sometimes seems to get stuck running certain applications. For example, it will get a batch of ATLAS tasks and then only run ATLAS for the rest of the day. Other times it will jump all over the place. What actually controls the order the tasks run in?
Is this something I can fix by adjusting my "switch between tasks" setting?
49) Message boards : Number crunching : Setting up a local squid cache for a home cluster - old comments (Message 42500)
Posted 15 May 2020 by CloverField
Post:
The raw data can be found in squid's access.log but it requires a couple of scripts and filters to get values like TCP_MISS__UPLOAD.
I run the refined(!) data through a customized awstats.
This results in pages like this (not that many hits of course ;-)):
http://wlcg-squid-monitor.cern.ch/awstats/bin/awstats.pl?month=05&year=2020&output=main&config=atlasfrontier.cern.ch&framename=index


Looks like I have a fun weekend project then.
Thanks for the info!
50) Message boards : Number crunching : Setting up a local squid cache for a home cluster - old comments (Message 42495)
Posted 15 May 2020 by CloverField
Post:
Well, this works surprisingly well. I set the cache up at 2 am and it has already cached 1.3 GB of data.
@computezrmle, would you mind telling me how you got this output from squid?

Downloads served by the proxy
TCP_MEM_HIT 1,017,914 requests 1.92 GB
TCP_HIT 1,516 requests 4.68 GB
TCP_REFRESH_UNMODIFIED 8,505 requests 63.26 MB

Downloads requested from lhc@home
TCP_MISS 1,363 requests 362.30 MB
TCP_REFRESH_MODIFIED 3,031 requests 11.54 MB

Result uploads to lhc@home
TCP_MISS__UPLOAD 2,037 requests 28.97 GB


I'd like to see how much data I've saved by having the cache running.
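
A rough way to estimate the savings, sketched in Python. It assumes squid writes its default native access.log format, where the fourth whitespace-separated field is `RESULT_CODE/HTTP_STATUS` and the fifth is the number of bytes delivered to the client. The function names and the set of codes counted as "served by the proxy" are illustrative choices based on the quoted output, not the scripts or awstats setup mentioned in the thread. The TCP_MISS__UPLOAD category is not a native squid result code and is not reproduced here.

```python
from collections import defaultdict

# Result codes treated as "served from the cache" (taken from the
# "Downloads served by the proxy" group in the quoted output).
CACHE_CODES = {"TCP_HIT", "TCP_MEM_HIT", "TCP_REFRESH_UNMODIFIED"}


def tally_squid_log(lines):
    """Return {result_code: (requests, bytes)} for native-format log lines."""
    totals = defaultdict(lambda: [0, 0])
    for line in lines:
        fields = line.split()
        if len(fields) < 5:
            continue  # skip blank or truncated lines
        code = fields[3].split("/")[0]  # "TCP_HIT/200" -> "TCP_HIT"
        try:
            size = int(fields[4])
        except ValueError:
            continue  # skip lines that do not match the assumed layout
        totals[code][0] += 1
        totals[code][1] += size
    return {code: (count, size) for code, (count, size) in totals.items()}


def cached_bytes(totals):
    """Sum the bytes for result codes that were answered from the cache."""
    return sum(size for code, (_, size) in totals.items() if code in CACHE_CODES)
```

To use it, pass an open log file to `tally_squid_log` and then the result to `cached_bytes`. The log location depends on the installation, so check your squid configuration for the actual path.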
51) Message boards : CMS Application : CMS Tasks Failing (Message 42492)
Posted 15 May 2020 by CloverField
Post:
... and all CMS tasks from today finished with an error.

Example:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=273333559



https://lhcathome.cern.ch/lhcathome/result.php?resultid=273134385
Everything was working fine for me until about 4 am, then all the CMS tasks started failing.

This one seems to have failed because there were no jobs available.
https://lhcathome.cern.ch/lhcathome/result.php?resultid=273225488
52) Message boards : ATLAS application : How to only get Atlas 8 core tasks. (Message 42488)
Posted 14 May 2020 by CloverField
Post:
Cool, thanks for the info.
53) Message boards : Theory Application : Limiting Theory to 28 tasks at a time (Message 42481)
Posted 14 May 2020 by CloverField
Post:
So there's no way to restrict only Theory to 28 tasks while everything else can use 100%?
If I just want Theory limited to 28 tasks, I have to restrict my preferences to only run Theory tasks?
54) Message boards : Theory Application : Limiting Theory to 28 tasks at a time (Message 42479)
Posted 14 May 2020 by CloverField
Post:
Recently Theory has been overloading my computer and causing blue screens. I was able to solve this by telling BOINC to use 87.5% of my CPUs.
However, every other project runs fine with 100% CPU usage, so I don't want to limit them when I'm running them. I saw a post about using an app_config to limit the maximum number of concurrent tasks. I've made one that I think is correct and just wanted to verify.

<app_config>
 <app>
  <name>Theory</name>
  <max_concurrent>28</max_concurrent>
 </app>
</app_config>


This should be all I need, correct?
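
For reference, a minimal Python sketch of where the 28 comes from and how the same file could be generated. It assumes a 32-thread machine (32 x 87.5% = 28) and takes the app name "Theory" from the XML above, which still has to match the name the project actually uses. The helper `build_app_config` is hypothetical and only mirrors the structure shown in the post.

```python
import math
import xml.etree.ElementTree as ET


def build_app_config(app_name, total_threads, cpu_fraction):
    """Build an app_config document limiting one app's concurrent tasks.

    total_threads and cpu_fraction are assumptions supplied by the caller;
    the limit is rounded down so it never exceeds the requested share.
    """
    max_concurrent = math.floor(total_threads * cpu_fraction)
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    return ET.tostring(root, encoding="unicode")


# Assumed values: 32 threads at 87.5% gives a limit of 28 tasks.
print(build_app_config("Theory", 32, 0.875))
```

The output is the same structure as the hand-written file, just without indentation.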
55) Message boards : ATLAS application : How to only get Atlas 8 core tasks. (Message 42467)
Posted 14 May 2020 by CloverField
Post:
So I've been running a lot of ATLAS tasks lately, and I was wondering if there is a way to only get tasks with a certain core count.
Last night I got slammed with 16 4-core tasks and ran out of RAM. I took a look at the max cores option in preferences, but that seems to mean "this number and below", so if I set it to 8 I could still get lower core count tasks.
Is there any way to do this, or do I just need to keep an eye on my box for out-of-memory exceptions?
56) Message boards : Theory Application : Extreme Overload caused by a Theory Task (Message 42404)
Posted 11 May 2020 by CloverField
Post:
Could everything running concurrently be the reason that my computer crashes when it's running 32 Theory tasks at once?
57) Message boards : Theory Application : Tasks run 4 days and finish with error (Message 42373)
Posted 4 May 2020 by CloverField
Post:
Do we think this https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5413
will fix our issues?
58) Message boards : Theory Application : Tasks run 4 days and finish with error (Message 42323)
Posted 28 Apr 2020 by CloverField
Post:
Yes, the VirtualBox Interface is usually the one that doesn't shut down. I then kill it manually from Windows Task Manager. So far I haven't lost any task because of that.


As long as no VirtualBox instances are running, you can kill this whenever you want.
59) Message boards : Theory Application : Tasks run 4 days and finish with error (Message 42205)
Posted 16 Apr 2020 by CloverField
Post:
I won't pretend I don't see problems, but the failures are definitely a minority of my Sherpas.


Yeah, I think I worded that poorly. I probably have a bit of false positive bias, because the only tasks that I see fail are sherpas, since they get stuck.

I do like the idea of pushing all the sherpas off into their own little sub-project.
60) Message boards : Theory Application : Tasks run 4 days and finish with error (Message 42199)
Posted 16 Apr 2020 by CloverField
Post:
You can only check/uncheck Theory Simulation as a whole.
If it is checked you get jobs from the currently active mcplots list:
http://mcplots-dev.cern.ch/production.php?view=control

The recent revision 2378 lists 70957 job definitions with only 3% being sherpas.
Longrunners can't be avoided since each job definition usually creates less than 30 attempts and runtimes can't be estimated before a job starts.

The reason why it appears that there are many more sherpas than other jobs is that other jobs often have very short runtimes, hence the hosts run many of them before they get a longrunner.


I've only been getting Theory tasks recently.
Would this mean I would be doing nothing until other types of tasks come back into the queue?



