Message boards : ATLAS application : BOINC downloads only 2 ATLAS tasks

Erich56

Joined: 18 Dec 15
Posts: 1681
Credit: 99,384,100
RAC: 110,641
Message 35035 - Posted: 18 Apr 2018, 16:25:44 UTC

With my 12-core processor (6+6HT) and 32GB RAM, I'd like to process at least three 2-core ATLAS tasks concurrently. Some time ago, it was no problem at all to download a sufficient number of tasks.
Now, however, BOINC downloads only 2 ATLAS tasks, regardless of whether I set 3, 4, 5, ... or even an unlimited number of tasks in the web settings. It says "no tasks available" (although the Project Status page consistently shows a few thousand unsent tasks).
This means that I can run only 2 ATLAS tasks concurrently.

Any idea what's going wrong?
ID: 35035
Toby Broom
Volunteer moderator
Joined: 27 Sep 08
Posts: 798
Credit: 642,831,911
RAC: 283,213
Message 35036 - Posted: 18 Apr 2018, 17:23:55 UTC - in response to Message 35035.  

I can get 3 with the "No limit" setting.
ID: 35036
Jim1348
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 35037 - Posted: 18 Apr 2018, 17:27:43 UTC - in response to Message 35035.  

I am also getting only 2 at a time, regardless of the setting. Maybe it is the priority they are putting on SixTrack?
I would do more ATLAS if they would send it.
ID: 35037
Erich56
Joined: 18 Dec 15
Posts: 1681
Credit: 99,384,100
RAC: 110,641
Message 35038 - Posted: 18 Apr 2018, 18:30:49 UTC

So maybe there is some policy behind it; if so, it would be nice if they told us - just so that we know.
ID: 35038
Harri Liljeroos
Joined: 28 Sep 04
Posts: 674
Credit: 43,012,577
RAC: 17,256
Message 35039 - Posted: 18 Apr 2018, 19:44:15 UTC

I think that this setting is not working the way it should. Currently I'm running one host with 3 cores, which gets 3 tasks, and another host with 4 cores, which gets 4 tasks. Both are set to "No Limit". A while back I was running a third host with 1 core and it got just 1 task at a time. All hosts had a 0.4+0.2 days cache.
ID: 35039
Jim1348
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 35040 - Posted: 18 Apr 2018, 21:57:38 UTC - in response to Message 35039.  

Harri Liljeroos wrote:
I think that this setting is not working the way it should. Currently I'm running on one host with 3 cores and this gets 3 tasks and another host with 4 cores and this gets 4 tasks. They are both set up for "No Limit". A while back I was running a third host with 1 core and it got just 1 task at a time. All hosts had 0.4+0.2 days cache.

That could be it. I am set for 2 cores, and am getting 2 tasks. It looks like they got the two settings a bit mixed up.
ID: 35040
Erich56
Joined: 18 Dec 15
Posts: 1681
Credit: 99,384,100
RAC: 110,641
Message 35045 - Posted: 19 Apr 2018, 10:05:09 UTC - in response to Message 35040.  

Harri Liljeroos wrote:
I think that this setting is not working the way it should.

Jim1348 wrote:
That could be it.

Yes, I agree, it seems so. So I am wondering when this will be repaired.
ID: 35045
Erich56
Joined: 18 Dec 15
Posts: 1681
Credit: 99,384,100
RAC: 110,641
Message 35070 - Posted: 22 Apr 2018, 8:02:13 UTC - in response to Message 35040.  

Harri Liljeroos wrote:
I think that this setting is not working the way it should. Currently I'm running on one host with 3 cores and this gets 3 tasks and another host with 4 cores and this gets 4 tasks. They are both set up for "No Limit". A while back I was running a third host with 1 core and it got just 1 task

Jim1348 wrote:
That could be it. I am set for 2 cores, and am getting 2 tasks. It looks like they got them a bit mixed up.

I guess we can now confirm that there is this mix-up in the settings: when I changed the number of cores setting from 2 to 3, I could download 3 ATLAS tasks.

So let's hope that someone back at CERN will get this error straightened out.
ID: 35070
Toby Broom
Volunteer moderator
Joined: 27 Sep 08
Posts: 798
Credit: 642,831,911
RAC: 283,213
Message 35073 - Posted: 22 Apr 2018, 10:21:24 UTC - in response to Message 35070.  

I asked them to take a look.
ID: 35073
Jim1348
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 35074 - Posted: 22 Apr 2018, 15:26:53 UTC - in response to Message 35073.  

Toby Broom wrote:
I asked them to take a look.

Thanks. This project actually has remarkably good support as compared to some that I could name.
ID: 35074
Jim1348
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 35121 - Posted: 30 Apr 2018, 15:53:10 UTC

There is a way to deal with this. Just set "Max # CPUs" to 8, so that it will download 8 work units.

Then, use an app_config.xml to set the number of cores that you actually want to run. I usually run on 2 cores, for maximum efficiency.
(This includes a max_concurrent, which is optional.)

<app_config>

<app>
  <name>ATLAS</name>
  <max_concurrent>4</max_concurrent>
</app>

<app_version>
  <app_name>ATLAS</app_name>
  <plan_class>native_mt</plan_class>
  <avg_ncpus>2</avg_ncpus>
</app_version>

</app_config>


I hope they are working on a more permanent fix, with an increased number of work units.
ID: 35121
Erich56
Joined: 18 Dec 15
Posts: 1681
Credit: 99,384,100
RAC: 110,641
Message 35124 - Posted: 30 Apr 2018, 18:51:21 UTC - in response to Message 35121.  

Jim1348 wrote:
There is a way to deal with this. Just set "Max # CPUs" to 8, so that it will download 8 work units.

Jim, that's an interesting approach. And it really works?
I ask because, when doing this, each downloaded task is shown in the BOINC Manager as "Ready to start (8 CPUs)". Do these 8-CPU tasks then actually run as 2-core tasks (as set in the app_config)?
ID: 35124
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2375
Credit: 221,697,371
RAC: 142,937
Message 35125 - Posted: 30 Apr 2018, 19:31:52 UTC - in response to Message 35124.  

Jim1348 wrote:
There is a way to deal with this. Just set "Max # CPUs" to 8, so that it will download 8 work units.

Erich56 wrote:
Jim, that's an interesting approach. And it really works?
Why do I ask? When doing this, each downloaded task in the BOINC Manager is shown as "Ready to start (8 CPUs)" - and these 8 CPU-tasks work as 2-core tasks (as set in the app_config)?

Are you aware that Jim1348 is running ATLAS (native) on his Linux boxes?
On a Windows box you'll have to set "<plan_class>vbox64_mt_mcore_atlas</plan_class>" instead.
You may want to check that this is set correctly.

In addition you may add the line:
<cmdline>--nthreads 2 --memory_size_mb xxxx</cmdline>

with
xxxx=4400 (according to the project's RAM formula)
xxxx=5000 (safe setting if you have enough resources)
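
Putting these pieces together, a complete app_config.xml for a Windows (VirtualBox) host might look roughly like the sketch below. The plan class and the cmdline values are the ones given in this post; the max_concurrent value of 3 is only an assumption, based on the three concurrent 2-core tasks asked about at the start of the thread, and 4400 can be swapped for 5000 if enough RAM is available.

```xml
<app_config>
  <app>
    <name>ATLAS</name>
    <!-- assumption: at most three ATLAS tasks at once; adjust or omit -->
    <max_concurrent>3</max_concurrent>
  </app>
  <app_version>
    <app_name>ATLAS</app_name>
    <!-- VirtualBox plan class for Windows (native Linux uses native_mt) -->
    <plan_class>vbox64_mt_mcore_atlas</plan_class>
    <avg_ncpus>2</avg_ncpus>
    <cmdline>--nthreads 2 --memory_size_mb 4400</cmdline>
  </app_version>
</app_config>
```

The file belongs in the project's folder inside the BOINC data directory, and the client should pick it up after a restart or after "Read config files" in the BOINC Manager.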
ID: 35125
Jim1348
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 35126 - Posted: 30 Apr 2018, 20:19:59 UTC - in response to Message 35125.  

computezrmle wrote:
Are you aware that Jim1348 is running ATLAS (native) on his Linux boxes?

Thanks for the clarification. I tend to forget about Windows on LHC, though I use it on other projects.

Erich, yes it works, at least in my setup.

Good luck.
ID: 35126
Erich56
Joined: 18 Dec 15
Posts: 1681
Credit: 99,384,100
RAC: 110,641
Message 35199 - Posted: 9 May 2018, 6:03:12 UTC - in response to Message 35121.  

Jim wrote:
There is a way to deal with this. Just set "Max # CPUs" to 8, so that it will download 8 work units.

Then, use an app_config.xml to set the number of cores that you actually want to run. I usually run on 2 cores, for maximum efficiency.
(This includes a max_concurrent, which is optional.)

<app_config>

<app>
  <name>ATLAS</name>
  <max_concurrent>4</max_concurrent>
</app>

<app_version>
  <app_name>ATLAS</app_name>
  <plan_class>native_mt</plan_class>
  <avg_ncpus>2</avg_ncpus>
</app_version>

</app_config>


I hope they are working on a more permanent fix, with an increased number of work units.

Jim, your procedure for downloading a higher number of ATLAS tasks works fine here as well.
However, I can crunch only up to three 1-core ATLAS tasks concurrently, regardless of my app_config settings. If I set max_concurrent to "4" (as in your example above), still only 3 tasks are processed.
With 32GB RAM (and 6+6HT CPU cores), processing four 1-core ATLAS tasks concurrently should not be a problem, should it?
BTW: on my system, experience shows that efficiency is highest with 1-core tasks (I have also tried 2-core and 3-core).
ID: 35199
Jim1348
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 35208 - Posted: 9 May 2018, 14:19:44 UTC - in response to Message 35199.  
Last modified: 9 May 2018, 14:32:45 UTC

Erich56 wrote:
However, I can crunch only up to three 1-core ATLAS tasks concurrently, regardless of what my settings in the app_config are. If I set it to "4" (as in your example above), still only 3 tasks are being processed.
With 32GB RAM (and 6+6HT CPU cores), processing of four 1-core ATLAS tasks concurrently should not be a problem, should it?
BTW: on my system, experience shows that highest efficiency is with 1-core tasks (I had tried it also with 2-core and 3-core).

Erich,
I usually run 2-core tasks (three at a time). But yesterday I tried the 1-core tasks, and was able to crunch six at a time (I did not try seven). So it should be possible.
I used the same app_config that I posted above, except with <avg_ncpus>1</avg_ncpus>
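
Spelled out, that earlier app_config with the single change applied would look something like the sketch below. The max_concurrent value of 6 is an assumption chosen to match the six concurrent tasks mentioned above (the earlier example used 4), and the element remains optional.

```xml
<app_config>
  <app>
    <name>ATLAS</name>
    <!-- assumption: raised from 4 so that six tasks can run at once; optional -->
    <max_concurrent>6</max_concurrent>
  </app>
  <app_version>
    <app_name>ATLAS</app_name>
    <plan_class>native_mt</plan_class>
    <!-- one core per task instead of two -->
    <avg_ncpus>1</avg_ncpus>
  </app_version>
</app_config>
```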

By the way, I can now download only seven ATLAS tasks at a time rather than eight. As I recall, the first time I tried I got eight, but the first one quickly errored out, and it has been only seven since.
There is something wrong with the LHC server settings anyway, so who knows how it will work in any given case.

But if you want to try something else, it has been suggested to me by another very knowledgeable user (who I think likes to remain anonymous) that this is a better one, and I will try it later today:

1-core-setup

<app_config>
  <app>
    <name>ATLAS</name>
    <!-- replace "2" with your favorite number -->
    <max_concurrent>2</max_concurrent>
    <report_results_immediately/>
  </app>

  <app_version>
    <app_name>ATLAS</app_name>
    <plan_class>native_mt</plan_class>
    <avg_ncpus>1.0</avg_ncpus>
    <!-- memory_size_mb can be lower than with vbox -->
    <cmdline>--nthreads 1 --memory_size_mb 3600</cmdline>
  </app_version>

  <!-- replace "2" with your favorite number -->
  <project_max_concurrent>2</project_max_concurrent>
  <report_results_immediately/>
</app_config>

2-core-setup

<app_config>
  <app>
    <name>ATLAS</name>
    <!-- replace "2" with your favorite number -->
    <max_concurrent>2</max_concurrent>
    <report_results_immediately/>
  </app>

  <app_version>
    <app_name>ATLAS</app_name>
    <plan_class>native_mt</plan_class>
    <avg_ncpus>2.0</avg_ncpus>
    <!-- memory_size_mb can be lower than with vbox -->
    <cmdline>--nthreads 2 --memory_size_mb 3600</cmdline>
  </app_version>

  <!-- replace "2" with your favorite number -->
  <project_max_concurrent>2</project_max_concurrent>
  <report_results_immediately/>
</app_config>


Have fun.

PS - My LHC settings are now:
Max # jobs No limit
Max # CPUs No limit
for whatever difference that makes.
ID: 35208
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2375
Credit: 221,697,371
RAC: 142,937
Message 35209 - Posted: 9 May 2018, 14:55:05 UTC

What I really miss is a comment from the experts at CERN.
Is there anybody who takes a look at the server setup?
ID: 35209
Erich56
Joined: 18 Dec 15
Posts: 1681
Credit: 99,384,100
RAC: 110,641
Message 35210 - Posted: 9 May 2018, 14:56:55 UTC - in response to Message 35208.  

Jim1348 wrote:
But if you want to try something else...

Thanks, Jim, for the suggestion. What I am wondering about, though, is the low amount of memory: 3600MB (for both 1-core and 2-core).
What I have observed for a long time is that - at least here - console 3 shows memory usage of up to 5900MB (so, to be on the safe side, my memory setting in the app_config is 7000MB).
ID: 35210
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2375
Credit: 221,697,371
RAC: 142,937
Message 35211 - Posted: 9 May 2018, 15:04:38 UTC - in response to Message 35210.  

Jim1348 wrote:
But if you want to try something else...

Erich56 wrote:
Thanks, Jim, for the suggestion. What I am wondering, though, is the low amount of memory: 3600MB (for both 1-core and 2-core).
What I have observed for a long time is that - at least here - console 3 shows memory usage of up to 5900MB (so, to be on the safe side, my memory setting in the app_config is 7000MB).

That's exactly the pitfall.
Jim runs ATLAS (native) on Linux.
Erich runs ATLAS (vbox...) on Windows.
The latter usually needs more RAM as the VM has to set up an internal CVMFS cache.
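
Applied to a Windows host, the 1-core setup quoted earlier in the thread might be adapted along these lines. This is only a sketch: the plan class is the VirtualBox one named earlier in the thread, 7000 is the memory value Erich reports using, and the limit of 4 is an assumption taken from his wish to run four 1-core tasks.

```xml
<app_config>
  <app>
    <name>ATLAS</name>
    <!-- assumption: four concurrent tasks, as asked about earlier in the thread -->
    <max_concurrent>4</max_concurrent>
  </app>
  <app_version>
    <app_name>ATLAS</app_name>
    <plan_class>vbox64_mt_mcore_atlas</plan_class>
    <avg_ncpus>1.0</avg_ncpus>
    <!-- vbox needs more RAM than native; 7000 is the value reported above -->
    <cmdline>--nthreads 1 --memory_size_mb 7000</cmdline>
  </app_version>
</app_config>
```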
ID: 35211
Jim1348
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 35212 - Posted: 9 May 2018, 15:28:43 UTC - in response to Message 35211.  

computezrmle wrote:
Jim runs ATLAS (native) on Linux.
Erich runs ATLAS (vbox...) on Windows.
The latter usually needs more RAM as the VM has to set up an internal CVMFS cache.

I keep forgetting. You need more than one mind on this problem.
ID: 35212