Message boards : ATLAS application : queue is empty
Author | Message |
---|---|
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
hadron wrote: "Are you overclocking your CPU? Or maybe running each task on multiple threads?" Actually, your CPU is 18% faster according to benchmarks. Of course I use multiple threads; ATLAS is designed for this. I've left it on the default of 8 threads per task. It tends to average 7, so that's what I told the scheduler via app_config: `<app_version> <app_name>ATLAS</app_name> <plan_class>vbox64_mt_mcore_atlas</plan_class> <cmdline></cmdline> <avg_ncpus>7.000000</avg_ncpus> <ngpus>0.000000</ngpus> </app_version>` I haven't overclocked. |
Joined: 4 Sep 22 Posts: 90 Credit: 15,270,206 RAC: 31,802 |
Quote: "Of course I use multiple threads, ATLAS is designed for this. I've left it on the default of 8 threads per task, it tends to average 7, so that's what I told the scheduler via app_config." When I set the max threads on LHC, all I got were Atlas tasks. I also want to run Theory tasks, so I set LHC to use only 1 thread, then added a `<cmdline>...` entry to app_config to control the number of threads directly. It did what I wanted, so that is what I will do again once these 4 tasks are finished (I've temporarily disabled Atlas tasks on LHC). In addition to the LHC tasks, I'm also running Rosetta, Einstein and Cosmology tasks, so I am certainly not inclined to dedicate 7 threads to each Atlas task. Initially, I'll set my app_config file to run only 1 Atlas task at a time, on 4 threads, and see where that takes me: `<app> <name>ATLAS</name> <max_concurrent>1</max_concurrent> </app> <app_version> <app_name>ATLAS</app_name> <avg_ncpus>4</avg_ncpus> <plan_class>vbox64_mt_mcore_atlas</plan_class> <cmdline>--nthreads 4</cmdline> </app_version>` Once I see how this works out, I can easily make changes to suit my preferences. Thanks for the replies, they've been quite helpful. I'll post again once I see where this takes me. |
Joined: 2 May 07 Posts: 2228 Credit: 173,753,070 RAC: 18,707 |
;-)), ok it is so nice to have Atlas back. Thank you Cern-IT. |
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
Quote: "When I set the max threads on LHC, all I got were Atlas tasks. I also want to run Theory tasks," Yes, you never get what you ask for here. If I select to get any tasks, I only get CMS. Every other project gives you an even mixture. Since I have more than one computer, I just set some to get CMS, some Theory, some Atlas, but leave the "if nothing available give me something else" option ticked. Then if everything is available, I get some of each. I have to do this as CMS maxes out my uplink and Theory maxes out my downlink. Half of each and it seems to manage without throttling the CPUs. If I had only one computer, I've no idea how I'd get a mixture. |
Joined: 2 May 07 Posts: 2228 Credit: 173,753,070 RAC: 18,707 |
When you have all LHC projects enabled in your prefs, BOINC Manager takes whichever project CERN IT gives us. So you can use the default, home, school or work venues to assign different work to each of your PCs. Running all projects together on one PC doesn't work very well. |
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
Quote: "When you have in prefs all LHC-Projects active," Yes, that's what I do. I have a venue for each type of task, and put a few machines in each. Quote: "All Projects together in one PC don't work really good." I have no problem running them all on one PC, if I happen to get a selection. |
Joined: 28 Sep 04 Posts: 720 Credit: 48,274,050 RAC: 30,220 |
Four Atlas tasks running here on two hosts, 4 cores used for each task. Run times are currently 48...66 hours (CPU times 156...248 hours) and all 4 cores are still active as seen on top (console Alt+F3). If you view the console Alt+F2 you can just make out the number of jobs finished so far (mine are at 1200...1800 finished), averaging about 2000 seconds per job. So they are taking forever to finish. :-( |
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
Quote: "Four Atlas tasks running here on two hosts, 4 cores used for each task. Run times are currently 48...66 hours (CPU times 156... 248 hours) and all 4 cores still active as seen on Top (console Alt+F3)." I estimate your fastest machine has about 3/4 of the per-thread speed of my Ryzen 9 3900XT. So if you allowed it 8 threads, it would take 1 day compared to my 17.5 hours. With only 4 threads, I'd expect you to take 2 days, so they're fine. Any particular reason you only use 4 threads? |
Joined: 28 Sep 04 Posts: 720 Credit: 48,274,050 RAC: 30,220 |
Quote: "Four Atlas tasks running here on two hosts, 4 cores used for each task. Run times are currently 48...66 hours (CPU times 156... 248 hours) and all 4 cores still active as seen on Top (console Alt+F3)." Quote: "I estimate your fastest machine is 3/4 of the speed per thread as my Ryzen 9 3900XT. So if you allowed it 8 threads, it would take 1 day compared to my 17.5 hours. With only 4 threads, I'd expect you to take 2 days, so they're fine. Any particular reason you only use 4 threads?" I selected 4 threads to keep the utilization high (= trying to keep the time during which only a single thread is used short). With these long tasks I will probably raise the thread count per task to shorten the run time. Let's see how long these tasks will actually take. My slower host only has 8 threads, of which I leave 2 free and 2 are busy with Einstein GPU tasks, so that leaves only 4 threads for LHC. |
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
Quote: "I selected 4 threads to keep the utilization high (= trying keep the time that only a single thread is used short). With these long tasks I probably will raise the thread count for single task to shorten the run time. Let's see how long these tasks actually will take." If you're not getting full utilisation, just run more of them. For example, I have a 24-thread machine and tell four Atlases to run. I let them use up to 8 threads, but tell Boinc they use an average of 6. If you use BoincTasks, you can see what percentage they used and change it accordingly. Not sure if Boinc Manager does this, as I never use that horrid interface. For example, when I had them set to use 8 threads, it said 75% usage. 75% of 8 is 6, so I set them to 6. Quote: "My slower host only has 8 threads of which I leave 2 free and 2 are busy with Einstein GPU tasks, so that leaves only 4 threads for LHC." GPU tasks don't always need a full core. I adjust and make sure the GPU usage stays high. You can also run more than one task per GPU so the GPU has something to do while waiting for the CPU. |
Joined: 4 May 17 Posts: 5 Credit: 118,785,284 RAC: 0 |
I'm getting that people don't like the 2k events, so we'll continue by submitting 400evt jobs. The MC coordination agreed to have a stream of dedicated tasks for this purpose - just waiting for the samples to be requested. |
Joined: 2 May 07 Posts: 2228 Credit: 173,753,070 RAC: 18,707 |
Yes, 400 is a good number of events, because the hardware among us volunteers varies so much. |
Joined: 18 Dec 15 Posts: 1785 Credit: 117,066,079 RAC: 67,415 |
Hi guys, I have a question: where do you download ATLAS tasks from? According to the Server Status page, there are none available ... |
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
Quote: "hi guys, I have a question?" They appear now and again; they're just testing now. Should be a steady stream shortly. Set your account to get Atlas only, with other types if none are available. |
Joined: 14 Jan 10 Posts: 1411 Credit: 9,413,084 RAC: 12,519 |
I got a 2k job and let it run into an error because the first event needed 787 seconds and the second 2800 seconds. Say 1500 seconds on average, times 2000 events, makes 3 million seconds of CPU time = 1 million seconds on my 3-threaded VM = 11.5 days running 24/7. Received: 16 Sep 2023, 9:33:40 UTC. Report deadline: 24 Sep 2023, 8:44:17 UTC. Make up your mind. |
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
Quote: "I got a 2k job and let it run into an error cause the first event needed 787 seconds and the second 2800 seconds." You would have made it on 8 threads. Why throttle the poor thing? |
Joined: 28 Sep 04 Posts: 720 Credit: 48,274,050 RAC: 30,220 |
A few new tasks have been generated (actually over 300), and I got two of them. They are still 2000-job tasks. The server has cancelled all my tasks that were still running. Several hundred hours of crunching wasted. |
Joined: 12 Aug 06 Posts: 429 Credit: 10,430,610 RAC: 4,841 |
The last one I did was on 12th September (not sure if that's the 2000 size?) and took 21 hours on my Ryzen 9 3900XT. I estimate your slowest PC would take double that, just under 2 days. Sounds doable, what's the problem? https://lhcathome.cern.ch/lhcathome/result.php?resultid=399275439 I shall put everything on Atlas here and see what happens; I have very slow computers and reasonably fast ones. Great: I changed all the settings and cancelled work from other projects, then found there's none left to get. Well, that was a waste of time. |
Joined: 18 Dec 15 Posts: 1785 Credit: 117,066,079 RAC: 67,415 |
Quote: "A few new tasks has been generated (actually over 300), I got two of them. They are still 2000 job tasks. The server has cancelled all my tasks that were still running. Several hundreds of hours of crunching wasted." This for sure is something that should not happen. Not really nice of the project people :-( The same happened here, but with only a few tasks, so the waste is not more than some 18 hours. But still, that's annoying. Hence, I will not try to download ATLAS for the time being. My favorite would be CMS, but tasks are still available on an irregular basis only :-( So, no other choice than to stick with Theory. |
Joined: 28 Sep 04 Posts: 720 Credit: 48,274,050 RAC: 30,220 |
Well, I was counting the CPU hours. One task had 181 CPU hours on it and another had 216 hours. When the server cancels them, those values usually don't get reported to the server and are shown as 0 hours (but sometimes they are reported; I don't know what makes the difference). These tasks were re-sends, but they still had time left before the deadline; the server cancelled them anyway. These 2000-job tasks take somewhere between 250 and 400 hours of CPU time to finish on my hosts. |
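
The back-of-envelope estimates in this thread (seconds per event times events, divided by threads, compared with the report deadline) can be reproduced with a short sketch. The function names here are hypothetical and not part of BOINC or ATLAS, and the model assumes events split perfectly evenly across threads, which the thread itself suggests is optimistic (8-thread tasks are reported to average only 6 to 7 busy threads).

```python
from datetime import datetime, timedelta


def estimated_wall_time(sec_per_event: float, n_events: int, n_threads: int) -> timedelta:
    """Rough wall-clock estimate; assumes ideal scaling across threads."""
    return timedelta(seconds=sec_per_event * n_events / n_threads)


def fits_deadline(received: datetime, deadline: datetime, wall_time: timedelta) -> bool:
    """True if a task started at `received` and run 24/7 would finish by `deadline`."""
    return received + wall_time <= deadline


def avg_ncpus_from_usage(max_threads: int, usage_fraction: float) -> float:
    """Average CPU count to declare to the scheduler, from observed utilisation."""
    return max_threads * usage_fraction


# Figures quoted in the thread: 1500 s/event on average, 2000 events per task
received = datetime(2023, 9, 16, 9, 33, 40)
deadline = datetime(2023, 9, 24, 8, 44, 17)

for threads in (3, 8):
    wall = estimated_wall_time(1500, 2000, threads)
    days = wall.total_seconds() / 86400
    ok = fits_deadline(received, deadline, wall)
    print(f"{threads} threads: {days:.1f} days, fits deadline: {ok}")

# 75% observed usage of an 8-thread task, as described for BoincTasks above
print("avg_ncpus:", avg_ncpus_from_usage(8, 0.75))
```

With the thread's figures, the 3-thread estimate comes to roughly 11.6 days against a reporting window of just under 8 days, while the 8-thread estimate is roughly 4.3 days, which matches the "you would have made it on 8 threads" reply under the ideal-scaling assumption.
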
©2024 CERN