41) Message boards : ATLAS application : How is Work-Distribution calculated ? (Message 43679)
Posted 22 Nov 2020 by Profile Yeti
Post:
So you could experiment with number of CPUs in your preferences. This affects the amount of memory Boinc thinks each task is using (actual memory used can be set with app_config).
Nope, I can't play with the number of CPUs. If I raise this figure the Working-Set-Size of each Workunit will raise up to 10.200 MB. With 5-CPUs the Working-Set-Size is 7.500 MB.

The memory-setting in app_config is only responsable for the memory-setting of the Virtual-Machine.

The BOINC-Client reserves the memory that is set by Working-Set-Size, even if the Virtual-Machine needs less memory
42) Message boards : ATLAS application : How is Work-Distribution calculated ? (Message 43669)
Posted 22 Nov 2020 by Profile Yeti
Post:
Hi,

I'm a little bit irritated about Work-Distribution of Atlas-Work.

All my clients get 10 WUs. Regardless how powerfull or slow the individual workstation is.

So, my slowest PCs has enough work for up to 2 days.

My fastest PC has work for max 6 hours.

This are my LHC-Specific-preferences:



As long as boxes have 10 WUs local, the server tells "No Atlas work available". If they have less workunits, then we get exact the difference to 10.

What to do to get more work on my Power-Machines ?
43) Message boards : ATLAS application : Confused (Message 43665)
Posted 21 Nov 2020 by Profile Yeti
Post:
Just saw this thread.

The scheduler was designed in a time, where only Single-Core-WUs exist and with this it works very fine.

The scheduler has really problems to balance with Multi-Core-WUs; if you like to run these, it may be neccessary to help the scheduler. At LHC you have the possibility to tell "Give me only 1 Workunit". This is setup here: https://lhcathome.cern.ch/lhcathome/prefs.php?subset=project

Choose 'Max # jobs' and set it to one or two or whatever you would like
44) Message boards : ATLAS application : Faulty Box or Faulty WUs ? (Message 43647)
Posted 18 Nov 2020 by Profile Yeti
Post:
Okay, finally it was only a small reason.

In BOINC Proxy-Settings, I had only entered the "short" form of the machine running my Squid. SInce I changed this to the full qualified name all seems to be fine now.

So, the finally answer to my question is: Faulty Box ;-)
45) Message boards : ATLAS application : Faulty Box or Faulty WUs ? (Message 43635)
Posted 17 Nov 2020 by Profile Yeti
Post:
The link is not allowed for other users but I guess it's this computer:
https://lhcathome.cern.ch/lhcathome/results.php?hostid=10383816
You are right, that is the correct machine

Looks like the faulty WUs produced errors on all computers they have been sent to.
The interesting fact is that it's just 1 of your's that picked up all faulty tasks.
That is what made me thinking something could be wrong with this single box.

What about the tasks that are currently shown "in progress"?
Did you set them on hold or are they running fine?
I have set them on hold, because I couldn't babysit the box.

Now, after a restart, I have started one WU again to see what happens. And I have switched off the proxy-setting via Squid to see, if this has something todo with the problem
46) Message boards : ATLAS application : Faulty Box or Faulty WUs ? (Message 43633)
Posted 17 Nov 2020 by Profile Yeti
Post:
Hi together,

have come back to Atlas, all my Clients are back to Atlas, but one of them produces a lot of "Validate error".

If I track the Result, it looks as if Input-File from Atlas is empty ? !

Can you follow this link: https://lhcathome.cern.ch/lhcathome/results.php?userid=555&offset=0&show_names=0&state=5&appid=

Here is the Pilot-Log:

020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:16:59,709 | WARNING | queue_monitoring | pilot.util.common | should_abort | data:queue_monitoring:received graceful stop - abort after this iteration
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:02,711 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | will not set job_aborted yet
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:02,711 | DEBUG | queue_monitoring | pilot.control.data | queue_monitoring | [data] queue_monitor thread has finished
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:45,609 | WARNING | job_monitor | pilot.control.job | check_job_monitor_waiting_time | no jobs in monitored_payloads queue (waited for 61 s)
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:45,609 | DEBUG | job_monitor | pilot.util.processes | threads_aborted | aborting since the last relevant thread is about to finish
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:45,609 | DEBUG | job_monitor | pilot.control.job | job_monitor | will proceed to set job_aborted
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:45,609 | DEBUG | job_monitor | pilot.control.job | job_monitor | [job] job monitor thread has finished
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:46,458 | INFO | MainThread | pilot.workflow.generic | run | end of generic workflow (traces error code: 0)
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:46,458 | INFO | MainThread | root | wrap_up | traces error code: 0
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:46,459 | INFO | MainThread | root | wrap_up | pilot has finished
2020-11-17 12:17:47 (6324): Guest Log: 2020-11-17 11:17:46,495 [wrapper] ==== pilot stdout END ====
47) Message boards : Number crunching : VirtualBox needed? (Message 40859)
Posted 8 Dec 2019 by Profile Yeti
Post:
Ah, sorry, I'm doing this with VMWare ESX
48) Message boards : Number crunching : VirtualBox needed? (Message 40856)
Posted 8 Dec 2019 by Profile Yeti
Post:
AFAIK nested 64bit vbox is only possible on AMD CPUs.

Nope, I'm doing this on my XEONs
49) Message boards : ATLAS application : New console monitoring (Message 40363)
Posted 6 Nov 2019 by Profile Yeti
Post:
Crystal,

is it okay if I use your hardcopy for the checklist ?

Yeti

Since last night ATLAS (vbox) uses v2.2.0 of ATLAS Event Monitoring.
Example:
50) Message boards : ATLAS application : New console monitoring (Message 40362)
Posted 6 Nov 2019 by Profile Yeti
Post:
computezrmle, I like your new monitoring, this is much much better than what we had in the past !

Good work, thank you very much

I should remember to update the checklist
51) Message boards : Number crunching : Checklist Version 3 for Atlas@Home (and other VM-based Projects) on your PC (Message 40295)
Posted 27 Oct 2019 by Profile Yeti
Post:
From your hardcopys I would say you only need to enter your bios and switch VT-X on
52) Message boards : ATLAS application : ATLAS vbox version 2.00 (Message 40231)
Posted 21 Oct 2019 by Profile Yeti
Post:
Please upgrade to 6.0.x. (with ExtPack)

Maybe I missed it but I never saw a word from Projektteam that VirtualBox Version 6.x is okay to use with Atlas. And so long I will stay with 5.x

David,

which version of VirtualBox do you want us to use for the V2 ?

Latest 6.x or 5.x ?????
53) Message boards : ATLAS application : ATLAS vbox version 2.00 (Message 40199)
Posted 18 Oct 2019 by Profile Yeti
Post:
I don't think the project team let us know, it's normally one of us that tries it.

HM, not really a good idea. In the past, the wrapper of Atlas had some particularities so that it was not a good idea to switch to a new Major-Release from VirtualBox without the okay from the projectteam. Sometimes they had to make special preparations for the wrapper ...
54) Message boards : ATLAS application : ATLAS vbox version 2.00 (Message 40194)
Posted 18 Oct 2019 by Profile Yeti
Post:
Please upgrade to 6.0.x. (with ExtPack)

Maybe I missed it but I never saw a word from Projektteam that VirtualBox Version 6.x is okay to use with Atlas. And so long I will stay with 5.x
55) Message boards : Number crunching : Checklist Version 3 for Atlas@Home (and other VM-based Projects) on your PC (Message 40156)
Posted 15 Oct 2019 by Profile Yeti
Post:
For some reason, Atlas is using 10200 MB per instance of Atlas, despite an app_config setting it at 6600 MB for a 4 core job. Any ideas why this might be happening?


The needed memory is calculated by the LHC@Home-Server that doesn't know your app_config settings. It uses the Web-Preferences, so you should change your Web-Preferences regarding Max # jobs and Max # CPUs

As far as I know there is a bug in the server-software so these parameters are crossed interpreted, so you will have to play a little bit around
56) Message boards : Number crunching : LHC on android (Message 40097)
Posted 9 Oct 2019 by Profile Yeti
Post:
The arm/android executable is aarch64, your host is armv7, which is 32bit arm, thus you will not get sent anything.

James, thank you, i wasn't aware that this is only a 32-Bit System
57) Message boards : Number crunching : LHC on android (Message 40087)
Posted 7 Oct 2019 by Profile Yeti
Post:
Okay, I have one Android-6-Handy that asks for work since weeks, but never got any task: https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10603561

The Flag allowing Test-Applications is set, also sixtracktest is checked, nothing has done the trick so far.

Does anyone have any idea what to try ?
58) Message boards : ATLAS application : Task will not end despite being 100% (Message 38873)
Posted 16 May 2019 by Profile Yeti
Post:
I'm having massive issues trying to get an ATLAS task to complete without giving a COMPUTATION ERROR.
The tasks are all 6 CPU tasks (I'd previously tried 8, but they never finish or error out)
The 6CPU tasks typically run for almost 3 days on my machine.

The computer is dual E5-2683 v3 giving 28 actual cores and RAM is 128GB.
Windows 10 Pro, Hyper-V is disabled.
BOINC Manager 7.12.2 (x64), Virtual Box 6.0.4 r128413 with extension pack

It crunches though Theory Simulation (6 CPU) tasks with no issue.

Please if someone could help me out, I really do want to contribute.

Thank you,
Ewin

I invite you to take a walk through my checklist
59) Message boards : News : Database problems (Message 38855)
Posted 15 May 2019 by Profile Yeti
Post:
Pentathlon hasn't startet yet...but whatever...

Nope, it HAS already started. With telling the participiants the name of the Sprint-Project, all try to get WUs immediatly ...
I want my Atlas tasks back, but i can not get any :-(
It is possible, I got some
60) Message boards : News : Database problems (Message 38850)
Posted 15 May 2019 by Profile Yeti
Post:
Could it be that LHC was victim of a hacker attack?

Nope, it is not a hacker attack, it is Pentathlon-Time and LHC is Project for Sprint


Previous 20 · Next 20


©2024 CERN