1) Message boards : ATLAS application : 16, 24 or 32 core ATLAS going to become available? (Message 33179)
Posted 12 days ago by Toby Broom
Post:
Where is the 8core limit?

Maybe it was just pragmatic as most people don't have more than 8 cores?
2) Message boards : LHCb Application : Relative Efficiency of LHCb Tasks (Message 33164)
Posted 13 days ago by Toby Broom
Post:
The shutdown after 10min is by design, the VM will try to get work from the CERN servers, if it is not able to get work after 10min it quits.

This way you only waste 10min. Of course if you chose to run only LHCb then you will not be doing any processing on your computer
3) Message boards : Sixtrack Application : Run Length of wTestNew_hl13B1 units (Message 33109)
Posted 21 days ago by Toby Broom
Post:
I think they have the Virtual Memory size set wrong at ca 4GB, when they only use 3-400MB like normal. Boinc used the VM size for sechduling.
4) Message boards : Number crunching : Some WU's tagged computation error but logs looks good (Message 33063)
Posted 29 days ago by Toby Broom
Post:
I would say since condor exited with N/A not 0 then this is considdered and error.
5) Message boards : Number crunching : -152 (0xFFFFFF68) ERR_NETOPEN and 206 (0x000000CE) EXIT_INIT_FAILURE (Message 33014)
Posted 7 Nov 2017 by Toby Broom
Post:
It has there attention :)
6) Message boards : Number crunching : -152 (0xFFFFFF68) ERR_NETOPEN and 206 (0x000000CE) EXIT_INIT_FAILURE (Message 33010)
Posted 7 Nov 2017 by Toby Broom
Post:
I asked the CERN team last night, they are looking at but nothing to report so far.

i noticed some stats from a while ago where I was seeing about the same level of 206 errors in January so for it's crept up a few % from then, athough I could have dropped alot in between and I didn't notice as we have all the SixTrack issues.
7) Message boards : Number crunching : -152 (0xFFFFFF68) ERR_NETOPEN and 206 (0x000000CE) EXIT_INIT_FAILURE (Message 33008)
Posted 6 Nov 2017 by Toby Broom
Post:
I don't think Altas uses condor for work submission?

I've seen 206 for a long time, sparodically, this is a CERN issue when there is no work on there side, Boinc will create WU even if there is nothing for them to do in the WU. 207 looks the same, although I would say it ran something then couldn't get more work.

I don't know how the condor queues are filled and how to see the state of this other than the page https://lhcathomedev.cern.ch/lhcathome-dev/cms_job.php

for me I see ca 20% failure on CMS and 18% on Theory, I don't know what the project and we find to be an acceptable error rate?

Also for the 206/207 it wastes 10min at a time so it's not a huge waste of computre resources, just makes it difficult to see a real error if there is one.
8) Message boards : Sixtrack Application : Bad SixTrack workunits? (Message 32835)
Posted 15 Oct 2017 by Toby Broom
Post:
I saw the same, I sent some logs to Eric & James
9) Message boards : Sixtrack Application : SIXTRACKTEST (Message 32800)
Posted 11 Oct 2017 by Toby Broom
Post:
I got some today.

My Linux computer is taking its time though 3hrs on one task which is unusal for Sixtrack.
10) Message boards : CMS Application : CMS Tasks Failing (Message 32737)
Posted 9 Oct 2017 by Toby Broom
Post:
Thanks Laurence, got to 12min so should be good.

I took the oppertunity to upgrade VBox so not bad :)
11) Message boards : CMS Application : CMS Tasks Failing (Message 32734)
Posted 9 Oct 2017 by Toby Broom
Post:
looks the all the projects fell over not just CMS
12) Message boards : ATLAS application : 'drive limit' error (Message 32731)
Posted 9 Oct 2017 by Toby Broom
Post:
Nothing simple that you can do the project team set rac disk bound. You can edit the task to increase if you feel adventurous just multiply the number by 10 after closing Boinc and restart
13) Message boards : ATLAS application : 'drive limit' error (Message 32723)
Posted 9 Oct 2017 by Toby Broom
Post:
Could you un-hide your computers? Then we can check the WUs for error.
14) Message boards : ATLAS application : No more than 3 single core ATLAS will run. (Message 32710)
Posted 9 Oct 2017 by Toby Broom
Post:
David talked about some optimisations, it needs lot of ram on startup then less for the actual tasks so they could put a big swap file in. However I agree that 4GB is low.

When I ran 1core ones I could run 24 on a machine with 128GB with 5GB/WU. I'm not sure what you see for the BOINC caclulated ram you can see this from the task properties WorkingSetSize, this is not what you set in the app_config, so if it's say 9.8GB then boinc will not run more tasks as it thinks it's run out of memory, you can tweek this by useing the # of cores setting on the web. 3 cores = 5.18GB. If you set the web setting to 1core then it should bring down what BOINC thinks is the ram usage.
15) Message boards : ATLAS application : No more than 3 single core ATLAS will run. (Message 32697)
Posted 8 Oct 2017 by Toby Broom
Post:
If you set atlas server side to 24 then it will always get 24, the boinc work cache is ignored.

I'm running more than 3 on my computers.

I can't run a mixture of ATLAS and the other project, otherwise the cap of 24 WU kicks in.

By default I think ATLAS reports to BOINC that it uses 5.5GB so it could be that BOINC thinks it can't run more WU's. so BOINC thinks it's using 66GB
16) Message boards : Theory Application : maximium tasks? (Message 32633)
Posted 4 Oct 2017 by Toby Broom
Post:
I can confim the limit is now lifted, my 40 thread machine has 40 in progress.
17) Message boards : Theory Application : maximium tasks? (Message 32621)
Posted 4 Oct 2017 by Toby Broom
Post:
I have 0.5 and 0.01, so I should get 1 for each core.

I have 20 cores and 40 threads.
18) Message boards : Theory Application : maximium tasks? (Message 32617)
Posted 3 Oct 2017 by Toby Broom
Post:
Is the maximium number of theory tasks 10?

Can I increase the limit on my many core machines?
19) Message boards : Sixtrack Application : More Work (Message 32584)
Posted 2 Oct 2017 by Toby Broom
Post:
I got some work yesterday so they are being made avalible
20) Message boards : Number crunching : Priorities? (Message 32481)
Posted 21 Sep 2017 by Toby Broom
Post:
Its a BOINC problem as this manages the launching vor VirualBox, LHC only provides the image for VBox to run.

You can feedback on the main BOINC website

I limit the number of task to less than the max number to have a responsive machine.


Next 20


©2017 CERN