Message boards : ATLAS application : ATLAS long simulation 1.00

David Cameron
Project administrator
Project developer
Project scientist
Joined: 13 May 14
Posts: 387
Credit: 15,314,184
RAC: 0
Message 44544 - Posted: 25 Mar 2021, 9:21:42 UTC

Hi all,

We are testing out a new app which allows you to run much longer ATLAS simulation tasks. These tasks run over 1000 events instead of the usual 200 and so will take 5 times longer. The input data download is the same but the output data to upload is 5 times larger.
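As a rough illustration of the scaling described above, here is a minimal sketch. The event counts come from this post; the baseline runtime and upload size passed in are hypothetical placeholders, and linear scaling with the number of events is assumed.

```python
# Event counts are taken from the announcement.
EVENTS_SHORT = 200
EVENTS_LONG = 1000


def scale_task(baseline_runtime_h, baseline_upload_mb):
    """Estimate runtime and upload size of a long task from a short one.

    Assumes both scale linearly with the number of events. The input
    download is unchanged, so it is not part of this estimate.
    """
    factor = EVENTS_LONG / EVENTS_SHORT  # 1000 / 200 = 5.0
    return baseline_runtime_h * factor, baseline_upload_mb * factor


# Hypothetical baseline: a short task taking 2 hours and uploading 50 MB.
runtime_h, upload_mb = scale_task(2.0, 50.0)
print(runtime_h, upload_mb)
```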

These tasks are designed for large powerful machines which are dedicated to running ATLAS and the aim is to reduce the load on the BOINC servers from large clusters currently running ATLAS tasks, and allow larger clusters to join in the future. They are not suitable for background running or less powerful hosts so please think carefully before trying them out! We will of course keep the regular shorter tasks going.

The app is currently only available for native Linux users and requires a minimum of 4 cores per task; you may run up to 48 cores per task.

To try it out, select "ATLAS (long simulation)" in your LHC@Home preferences, and since it is still in the testing phase you must select "Run test applications" too.

Note that the first batch of tasks will be for test purposes and the data produced will not be used, so if you want your machines to produce only data useful for science then wait until the real batches start :)

Let us know of any issues or feedback and happy crunching!
ID: 44544
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,184,186
RAC: 104,580
Message 44545 - Posted: 25 Mar 2021, 10:46:09 UTC

ID: 44545
David Cameron
Project administrator
Project developer
Project scientist
Joined: 13 May 14
Posts: 387
Credit: 15,314,184
RAC: 0
Message 44546 - Posted: 25 Mar 2021, 11:22:41 UTC

There was a minor configuration issue with 1.00 so we just released 1.01.
ID: 44546
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,184,186
RAC: 104,580
Message 44547 - Posted: 25 Mar 2021, 11:55:56 UTC - in response to Message 44546.  

ID: 44547
Michael Goetz
Joined: 26 Jul 05
Posts: 4
Credit: 54,921
RAC: 0
Message 44548 - Posted: 25 Mar 2021, 14:46:14 UTC
Last modified: 25 Mar 2021, 14:50:15 UTC

I think there's still a configuration problem...

This is the host https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=10686009

It is an Intel i5-4670K, 4c/4t Haswell CPU, 24 GB of memory, running Linux Debian 10 (Buster). It's native Linux, not a VM.

I was using it yesterday to successfully run the long ATLAS tasks on the development server.

Relevant snippet from the client log:

3/25/2021 10:25:23 AM | LHC@home | Sending scheduler request: To fetch work.
3/25/2021 10:25:23 AM | LHC@home | Requesting new tasks for CPU
3/25/2021 10:25:25 AM | LHC@home | Scheduler request completed: got 0 new tasks
3/25/2021 10:25:25 AM | LHC@home | No tasks sent
3/25/2021 10:25:25 AM | LHC@home | No tasks are available for ATLAS (long simulation)
3/25/2021 10:25:25 AM | LHC@home | Message from server: VirtualBox is not installed


(The vbox warning probably isn't important.)

There are two things I know are NOT the problem.

1) It's not my preferences, because another computer assigned to the same venue (default) just got a long ATLAS task.
2) I didn't break this computer's configuration after testing on the dev system, because I moved it to running the production short ATLAS tasks and it got a task successfully.

The only obvious conclusion is that there's something wrong in the server configuration, most likely the plan class. The two machines have different CPUs (Intel vs. AMD), different cores (4 vs 8), and different memory (24 GB vs. 32 GB.)

EDIT: Another difference is that the computer that is able to get tasks is a Linux VM running on a Windows host, while the computer that is not able to get tasks is not a VM. Neither system has VBOX installed.
Want to find one of the largest known primes? Try PrimeGrid. Or help cure disease at WCG.

ID: 44548
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2386
Credit: 223,011,553
RAC: 136,310
Message 44549 - Posted: 25 Mar 2021, 15:23:15 UTC - in response to Message 44548.  

There are/were just a few long tasks in the queue.
It might be that all of them have already been sent out.

Or:
You forgot to activate the "beta app" box in your preferences:
https://lhcathome.cern.ch/lhcathome/prefs.php?subset=project
ID: 44549
Michael Goetz
Joined: 26 Jul 05
Posts: 4
Credit: 54,921
RAC: 0
Message 44550 - Posted: 25 Mar 2021, 15:41:05 UTC - in response to Message 44549.  

There are/were just a few long tasks in the queue.
It might be that all of them have already been sent out.

Or:
You forgot to activate the "beta app" box in your preferences:
https://lhcathome.cern.ch/lhcathome/prefs.php?subset=project


All is GOOD on your end.

There was an ID 10T error. :)

I realized something wasn't right when the short Atlas task was running only on one core. I had set that machine to run only on 1 core while trying to get Theory (native) to run on it (before understanding why it didn't, and giving up.) The long tasks won't download with less than 4 cores, so problem solved. My bad!
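For anyone hitting the same thing, here is a minimal sketch of the check involved. The 4 and 48 core limits come from the announcement in this thread; the rounding rule for the "use at most N% of the CPUs" setting is an assumption for illustration and may not match what the BOINC client or scheduler actually does. The function names are hypothetical.

```python
MIN_CORES_LONG = 4   # minimum cores per long task (from the announcement)
MAX_CORES_LONG = 48  # maximum cores per long task (from the announcement)


def usable_cores(host_cores, max_cpu_pct):
    """Cores the client may use under a CPU percentage limit.

    Assumed rule: round down, but never below 1 core.
    """
    return max(1, int(host_cores * max_cpu_pct / 100))


def cores_for_long_task(host_cores, max_cpu_pct):
    """Return the cores a long task could use, or 0 if the host is too small."""
    usable = usable_cores(host_cores, max_cpu_pct)
    if usable < MIN_CORES_LONG:
        return 0  # a host limited like this would get no long tasks
    return min(usable, MAX_CORES_LONG)


# A 4-core host limited to 25% of its CPUs exposes only 1 core.
print(cores_for_long_task(4, 25))
# The same host with no limit exposes all 4 cores.
print(cores_for_long_task(4, 100))
```

With a limit like the one described above (1 usable core), the check fails even though the hardware itself has 4 cores.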
Want to find one of the largest known primes? Try PrimeGrid. Or help cure disease at WCG.

ID: 44550
Greger
Joined: 9 Jan 15
Posts: 151
Credit: 431,596,822
RAC: 0
Message 44555 - Posted: 26 Mar 2021, 8:09:32 UTC

ID: 44555
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,184,186
RAC: 104,580
Message 44565 - Posted: 26 Mar 2021, 18:47:48 UTC - in response to Message 44546.  

No more long-running tasks since midnight, but the server status page shows tasks available.
https://lhcathome.cern.ch/lhcathome/server_status.php
ID: 44565
tullio
Joined: 19 Feb 08
Posts: 708
Credit: 4,336,250
RAC: 0
Message 44566 - Posted: 26 Mar 2021, 19:12:49 UTC

Completed two Atlas tasks on my Linux laptop with good results. Got two more.
Tullio
ID: 44566
Crystal Pellet
Volunteer moderator
Volunteer tester
Joined: 14 Jan 10
Posts: 1268
Credit: 8,421,637
RAC: 1,939
Message 44567 - Posted: 26 Mar 2021, 20:35:24 UTC - in response to Message 44566.  

Completed two Atlas tasks on my Linux laptop with good results. Got two more.
Tullio

Tullio,
The tasks you mention are not the long ATLAS simulations this thread is about, and they are not even 'native'; they are the 'normal' VBOX ATLAS tasks.
ID: 44567
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,184,186
RAC: 104,580
Message 44569 - Posted: 27 Mar 2021, 7:36:51 UTC - in response to Message 44567.  

Long Simulation is back.
ID: 44569
tullio
Joined: 19 Feb 08
Posts: 708
Credit: 4,336,250
RAC: 0
Message 44574 - Posted: 27 Mar 2021, 16:40:27 UTC - in response to Message 44567.  

Yes, I know. But I have only two cores and cannot get them. In QuChemPedIA@home I am competing with hosts that have more than 124 processors, and my Intel i5-9400F has only three cores and 6 processors, yet my RAC ranking, using VirtualBox on a Linux native project, is 27.
Tullio
ID: 44574
maeax
Joined: 2 May 07
Posts: 2071
Credit: 156,184,186
RAC: 104,580
Message 44575 - Posted: 27 Mar 2021, 19:55:43 UTC - in response to Message 44574.  
Last modified: 27 Mar 2021, 20:07:19 UTC

Tullio,
both of your Win10 PCs have 6 or 8 CPUs available for the ATLAS long-running tasks if you set up a Linux VM.
ID: 44575
tullio
Joined: 19 Feb 08
Posts: 708
Credit: 4,336,250
RAC: 0
Message 44583 - Posted: 28 Mar 2021, 12:04:17 UTC - in response to Message 44575.  

I have a Linux VM on a Windows 10 PC running SuSE Tumbleweed with kernel 5.11.6-1-default, but it is updated so frequently that I have to reboot it almost daily.
Tullio
ID: 44583
tazzduke
Joined: 24 Jun 10
Posts: 39
Credit: 4,971,347
RAC: 4,992
Message 44588 - Posted: 29 Mar 2021, 0:28:26 UTC - in response to Message 44583.  

Well, I am still trying to get one of these long ones lol

Sooner or later, I will get one.

Cheers
ID: 44588
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2386
Credit: 223,011,553
RAC: 136,310
Message 44589 - Posted: 29 Mar 2021, 5:03:07 UTC - in response to Message 44588.  

Did you activate "Run test applications?" for the venue your computer is attached to?
https://lhcathome.cern.ch/lhcathome/prefs.php?subset=project

Even then you may still need a couple of attempts.
ID: 44589
tazzduke
Joined: 24 Jun 10
Posts: 39
Credit: 4,971,347
RAC: 4,992
Message 44591 - Posted: 29 Mar 2021, 5:08:06 UTC - in response to Message 44589.  

Hi computezrmle,

Well, that could be it. I will have a look when I get home from work.

Cheers
ID: 44591
tazzduke
Joined: 24 Jun 10
Posts: 39
Credit: 4,971,347
RAC: 4,992
Message 44592 - Posted: 29 Mar 2021, 9:09:21 UTC - in response to Message 44591.  

Hi computezrmle

That did it, came home and found one crunching away.

Thank you
ID: 44592
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Joined: 15 Jun 08
Posts: 2386
Credit: 223,011,553
RAC: 136,310
Message 44593 - Posted: 29 Mar 2021, 10:02:18 UTC - in response to Message 44592.  

Are you aware that your CVMFS is misconfigured?
[2021-03-29 08:58:28] VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) RX(K) SPEED(K/S) HOST PROXY ONLINE
[2021-03-29 08:58:28] 2.9.0.0 2551 808 62280 81656 0 62 3785025 4194305 2 130560 0 369983 99.977 17734 5 http://s1asgc-cvmfs.openhtc.io:8080/cvmfs/atlas.cern.ch http://128.142.248.156:3126 1

This shows that all requests are sent to a CERN fallback proxy in Geneva (128.142.248.156).
From there they go to openhtc.io in Europe.


Since your cluster reports far more than 5 worker nodes (=cores), the recommended approach would be to use a local HTTP proxy.
The second-best option would be to use openhtc.io DIRECT and switch off the fallback proxies.
This would ensure that the data comes from openhtc.io proxies in Australia rather than Europe.
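To make the status output above easier to read, here is a minimal sketch that pairs each column name with its value and pulls out the HOST and PROXY fields. The header and value strings are copied from the log lines in this post with the timestamp prefix removed. The helper names are hypothetical, and the check assumes that a client going direct reports the literal string DIRECT in the PROXY column.

```python
HEADER = (
    "VERSION PID UPTIME(M) MEM(K) REVISION EXPIRES(M) NOCATALOGS "
    "CACHEUSE(K) CACHEMAX(K) NOFDUSE NOFDMAX NOIOERR NOOPEN HITRATE(%) "
    "RX(K) SPEED(K/S) HOST PROXY ONLINE"
)
VALUES = (
    "2.9.0.0 2551 808 62280 81656 0 62 3785025 4194305 2 130560 0 "
    "369983 99.977 17734 5 "
    "http://s1asgc-cvmfs.openhtc.io:8080/cvmfs/atlas.cern.ch "
    "http://128.142.248.156:3126 1"
)


def parse_cvmfs_stat(header, values):
    """Pair each whitespace-separated column name with its value."""
    keys = header.split()
    vals = values.split()
    if len(keys) != len(vals):
        raise ValueError("header and value columns do not line up")
    return dict(zip(keys, vals))


def uses_proxy(stat):
    """True if requests go through a proxy (assumes 'DIRECT' marks no proxy)."""
    return stat["PROXY"].upper() != "DIRECT"


stat = parse_cvmfs_stat(HEADER, VALUES)
cache_fill = int(stat["CACHEUSE(K)"]) / int(stat["CACHEMAX(K)"])

print("host: ", stat["HOST"])
print("proxy:", stat["PROXY"])
print("goes through a proxy:", uses_proxy(stat))
print("cache fill:", f"{cache_fill:.1%}")
```

In this case the PROXY column holds the 128.142.248.156 address, which is the fallback proxy mentioned above.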
ID: 44593