Message boards :
Number crunching :
Checklist Version 3 for Atlas@Home (and other VM-based Projects) on your PC
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · Next
Author | Message |
---|---|
Send message Joined: 7 Apr 18 Posts: 20 Credit: 137,327 RAC: 0 |
But there isn't described Window 10 version. Unfortunatelly I can't past here how looks like the BOINC file in my Program Files but there isn't this file :( |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
But there isn't described Window 10 version. Unfortunatelly I can't past here how looks like the BOINC file in my Program Files but there isn't this file :( You will never find it in ProgramFiles use the path given for Windows 8/8.1 Windows Vista/Windows 7/8/8.1: C:\ProgramData\BOINC\ (*) (*) This directory is hidden, so put the path to it directly into the path bar in Windows Explorer and press Enter. Supporting BOINC, a great concept ! |
Send message Joined: 7 Apr 18 Posts: 20 Credit: 137,327 RAC: 0 |
Thanks, I've got it :) A have one doubt yet. Have I open ALL the ports or one (or two) choosen of them is enought? |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
|
Send message Joined: 7 Apr 18 Posts: 20 Credit: 137,327 RAC: 0 |
Everything seems right apart proxy. Do you know that pages: http://www.freeproxylists.net/?c=&pt=3128&pr=HTTP&a%5B%5D=0&u=0 http://proxy-list.org/english/index.php ? Which servers you recommend? I'm sorry if it seems funny but I've never use proxy. I know that is theoretically safe but in the other hand if all data go throught one server it's easy to scan them. |
Send message Joined: 15 Jun 08 Posts: 2541 Credit: 254,608,838 RAC: 34,609 |
Everything seems right apart proxy. Do you know that pages: It does not make any sense to configure one of the proxies from your lists. What would make sense is to configure your own local proxy: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4611&postid=36101 Without a local proxy your VMs request the required data via CERN's CVMFS and Frontier systems. Both of them make heavy use of squid proxies. Theory VMs are already configured to use Cloudflare's openhtc.io proxy system which is also available for linux users who run ATLAS native. |
Send message Joined: 7 Apr 18 Posts: 20 Credit: 137,327 RAC: 0 |
Are Theory Simulation tasks so sensitive as ATLAS ones? I get credit every time but ATLAS did it, too for a long time :/ |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
|
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
|
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
|
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
Are you using VirtualBox or are you running Atlas direct on Linux without VirtualBox ? With Linux I can't help you, my domain are Windows-Machines, aahh you are using VirtualBox 6.0.2. I'm not shure whether this is a good idea. Can a Linux-User tell us, which VirtualBoxVersion (s)he is runninng Atlas succesfull ? Supporting BOINC, a great concept ! |
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
Can a Linux-User tell us, which VirtualBoxVersion (s)he is runninng Atlas succesfull ? VirtualBox is a pain in the butt. Fortunately Linux users don't have to suffer that pain just to run ATLAS tasks. We can run the native version of the ATLAS app. Directions for how to set it up are in the ATLAS specific section. |
Send message Joined: 16 Sep 17 Posts: 100 Credit: 1,618,469 RAC: 0 |
Please open a new topic to further discuss. Let's keep this thread tidy. I haven't run VM in a long time, but it should be straight forward. You are encountering common issues that are described at length and have been solved many times. ATLAS tasks cannot be restarted reliably. You'll have to run only ATLAS and set switch between tasks ludicrously high (~24 hours). https://lhcathome.cern.ch/lhcathome/result.php?resultid=214262091 2019-01-18 16:59:13 (2898): Stopping VM. 2019-01-18 16:59:25 (2898): Successfully stopped VM. https://lhcathome.cern.ch/lhcathome/result.php?resultid=214264076 2019-01-20 15:50:39 (21966): Guest Log: ccopioepdi etdh e twheeb awpepb atpop /tvoa r//vwawrw/ww 2019-01-20 15:50:39 (21966): Guest Log: ccopioepdi etdh e twheeb awpepb atpop /tvoa r//vwawrw/www 2019-01-20 15:50:40 (21966): Guest Log: TThish ivsm vdmo edso enso tn onte ende etdo tsoe tuspe thutpt hpt tppr opxryox 2019-01-20 15:50:40 (21966): Guest Log: TThish ivsm vdmo edso enso tn onte ende etdo tsoe tuspe thutpt hpt tppr opxryoxy 2019-01-20 15:50:40 (21966): Guest Log: AATHENAT_HPERONCA__PNRUOMCB_ENR=UM2BE 2019-01-20 15:50:40 (21966): Guest Log: AATHENAT_HPERONCA__PNRUOMCB_ENR=UM2BER=2 https://lhcathome.cern.ch/lhcathome/result.php?resultid=214359243 2019-01-21 15:23:42 (1269): Stopping VM. 2019-01-21 15:23:55 (1269): Successfully stopped VM. also looks like triple work? 2019-01-21 13:53:39 (1269): Guest Log: Copying input files into RunAtlas. 2019-01-21 13:53:39 (1269): Guest Log: Copying input files into RunAtlas. 2019-01-21 13:53:39 (1269): Guest Log: Copying input files into RunAtlas. 2019-01-21 13:53:50 (1269): Guest Log: Copied input files into RunAtlas. 2019-01-21 13:53:50 (1269): Guest Log: Copied input files into RunAtlas. 2019-01-21 13:53:50 (1269): Guest Log: Copied input files into RunAtlas. 2019-01-21 14:01:32 (1269): Guest Log: copied the webapp to /var/www 2019-01-21 14:01:32 (1269): Guest Log: copied the webapp to /var/www 2019-01-21 14:01:32 (1269): Guest Log: copied the webapp to /var/www 2019-01-21 14:01:32 (1269): Guest Log: This vm does not need to setup http proxy 2019-01-21 14:01:33 (1269): Guest Log: ATHENA_PROC_NUMBER=2 2019-01-21 14:01:33 (1269): Guest Log: This vm does not need to setup http proxy 2019-01-21 14:01:33 (1269): Guest Log: This vm does not need to setup http proxy 2019-01-21 14:01:33 (1269): Guest Log: ATHENA_PROC_NUMBER=2 2019-01-21 14:01:33 (1269): Guest Log: ATHENA_PROC_NUMBER=2 2019-01-21 14:01:34 (1269): Guest Log: Starting ATLAS job. (PandaID=4214632869 taskID=16698164) 2019-01-21 14:01:34 (1269): Guest Log: Starting ATLAS job. (PandaID=4214632869 taskID=16698164) 2019-01-21 14:01:34 (1269): Guest Log: Starting ATLAS job. (PandaID=4214632869 taskID=16698164) https://lhcathome.cern.ch/lhcathome/result.php?resultid=214356357 2019-01-21 11:15:31 (5955): VM state change detected. (old = 'running', new = 'paused') 2019-01-21 11:15:45 (5955): Error in resume VM for VM: -2135228414 |
Send message Joined: 5 Apr 15 Posts: 18 Credit: 5,910,849 RAC: 0 |
Hi Yeti, Many thanks for the check list, you've saved me twice in less than 1 month ! Nasty things BIOS upgrades and issues with the lasting memory of the client_state.xml file... 1) downgraded BIOS back to last known stable (wanted to put in last one which fixed an Intel security bug, but turned out to be a bummer), lost the Hyper-V settings in the process. 2) downgraded Virtual Box back to version 5.2.26 from version 6.0.4 (including expansion pack) I had all WU's VM's stalled... 3) had to modify the client_state.xml back to enable vm_extensions. Wouldn't have been able to find all these details if it weren't for your exhaustive list... LHC seems now to run once again without issues ! Have a nice weekend ! Friendly Greetings, K. |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
|
Send message Joined: 4 Mar 19 Posts: 2 Credit: 3,739,572 RAC: 0 |
Hello Yeti, That is a great checklist, just what I was looking for.... I got stuck already on the VT-X check, but I have contacted the developers of the tool already. VT-X is enabled in BIOS and running a Xeon E5 2620v4 I would tend to think I get two green checks, but I don't. Should I get any updates, I'll let you know here. The BOINC installer I believe also checked that, I seem to recall that I got two messages in the process that virtualization was a) supported and b) enabled. Of course I still read through the rest of your checklist, now I have one, quite general, question.... Does it make sense to try all that on a box with 8Gig of RAM? It feels as if it is generally quite fruitless to try with that amout of RAM, specifically if you try running other projects/stuff on the box as well. I have a Windows Home Server 2011 OS on the box, and though that is based on Server 2008 R2, it comes with a - quite annoying - 8Gig RAM limitation. Best Regards, Oliver |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
...running a Xeon E5 2620v4 I would tend to think I get two green checks, but I don't.I just checked it with my XEON E5 2620v3 and I get two green checks Does it make sense to try all that on a box with 8Gig of RAM? You hit the nail on the head ! I have run Atlas on similar boxes with 8 GB, but finally it didn't make me happy. One Atlas with 4-Cores should work, but you should limit Atlas to run really only 1 WU at a time via app_config.xml I have switched this box to run 3 or 4 Theory-Tasks and this works fine. Theory likes machine that restart only seldom. Supporting BOINC, a great concept ! |
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
Does it make sense to try all that on a box with 8Gig of RAM? I used to do it on 8GB. At first I got a lot of invalids. Then I boosted the "switch between tasks every..." setting to 24 hours and got a considerably higher success rate (but not 100%). I also learned to schedule OS updates and reboots around the ATLAS tasks so as not to suspend them. In theory ATLAS VBox tasks should not be bothered by suspending/resuming but in practice I found they are. YMMV. |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,268,029 RAC: 6,930 |
|
Send message Joined: 24 Jun 10 Posts: 43 Credit: 6,160,703 RAC: 1,317 |
Greetings Yeti Thankyou for your updated checklist and all you and your teams efforts. I have now managed to complete 2 x ATLAS jobs (using mt 4 cores per job). I believe my earlier (okay way earlier) troubles, were caused by insufficient RAM in the computer, had 8 GB, now has 16 GB. Yes both have the HITS file entry in the output. Cheers |
©2024 CERN