Message boards : Number crunching : VM Applications Errors

[TA]Assimilator1
Joined: 29 Nov 13
Posts: 58
Credit: 4,010,807
RAC: 28
Message 44429 - Posted: 3 Mar 2021, 18:42:03 UTC

Thanks for your reply. That's a very large post you linked; is there a particular part I can get away with reading? I don't want to spend hours on this.
Team AnandTech - SETI@H, Muon1 DPAD, F@H, MW@H, A@H, LHC@H, POGS, R@H, DHEP, CPDN, E@H.
Main rig - Ryzen 3600, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RX580 8GB, Win10 64bit
2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64bit
Yeti
Volunteer moderator
Joined: 2 Sep 04
Posts: 453
Credit: 193,369,412
RAC: 10,065
Message 44430 - Posted: 3 Mar 2021, 22:40:21 UTC - in response to Message 44429.  

Thanks for your reply. That's a very large post you linked; is there a particular part I can get away with reading? I don't want to spend hours on this.
Hm, if you really want to crunch Atlas, Theory or CMS, you really need to go through the list point by point, as I already mentioned there:

Please check this list and be sure to check every detail, step by step; all of them are important.
...



Supporting BOINC, a great concept !
[TA]Assimilator1
Joined: 29 Nov 13
Posts: 58
Credit: 4,010,807
RAC: 28
Message 44461 - Posted: 8 Mar 2021, 18:10:32 UTC
Last modified: 8 Mar 2021, 18:11:01 UTC

In the short to medium term I would've just quit running LHC if I needed to go through all that.

Anyway, some teammates managed to help me: https://forums.anandtech.com/threads/weekly-dc-stats-28feb2021.2591268/post-40454305
It turns out AMD-V was disabled in the BIOS, where it is called SVM Mode, which is why I didn't spot it in the manual or the BIOS.
Since enabling it, I've had no errors.
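For anyone wanting to check this from within Windows before rebooting into the BIOS, here is a minimal sketch in Python. It assumes an English-language Windows 8 or later, where the `systeminfo` command prints a "Virtualization Enabled In Firmware" line under "Hyper-V Requirements". The function names are hypothetical, and when a hypervisor is already running that line may be missing, in which case the function returns `None`.

```python
import re
import subprocess


def firmware_virtualization_enabled(systeminfo_text):
    """Return True/False from the 'Virtualization Enabled In Firmware' line,
    or None when the line is not present in the given text."""
    match = re.search(
        r"Virtualization Enabled In Firmware:\s*(Yes|No)",
        systeminfo_text,
        re.IGNORECASE,
    )
    if match is None:
        return None
    return match.group(1).lower() == "yes"


def read_systeminfo():
    """Run the Windows 'systeminfo' command and return its text output.
    Only works on Windows; it is not called automatically here."""
    return subprocess.run(
        ["systeminfo"], capture_output=True, text=True, check=True
    ).stdout
```

On a Windows machine, `firmware_virtualization_enabled(read_systeminfo())` returning `False` would suggest that AMD-V (SVM Mode) or Intel VT-x is still switched off in the BIOS.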
[TA]Assimilator1
Joined: 29 Nov 13
Posts: 58
Credit: 4,010,807
RAC: 28
Message 44462 - Posted: 8 Mar 2021, 22:15:03 UTC

But I now have a WU stuck at 100% and 1.5 days elapsed time!
etcarine

Joined: 13 Feb 21
Posts: 1
Credit: 113,521
RAC: 310
Message 44466 - Posted: 9 Mar 2021, 21:07:08 UTC

LHC@home ran for 10,161 units of work, but now says "No work available to process". I have no idea what to do.
Are there actually no work units? If there are, why aren't they available?
Harri Liljeroos
Joined: 28 Sep 04
Posts: 674
Credit: 43,152,620
RAC: 15,489
Message 44467 - Posted: 9 Mar 2021, 21:45:30 UTC - in response to Message 44466.  

LHC@home ran for 10,161 units of work, but now says "No work available to process". I have no idea what to do.
Are there actually no work units? If there are, why aren't they available?

Your computers are hidden, so no one can see what kind of work you have been running. But if it has been sixtrack tasks (not using a Virtual Machine), then the message you get is true: no new work units are available for that subproject, only a few re-sends of tasks that have failed on someone else's computer.
[TA]Assimilator1
Joined: 29 Nov 13
Posts: 58
Credit: 4,010,807
RAC: 28
Message 44498 - Posted: 16 Mar 2021, 7:27:43 UTC
Last modified: 16 Mar 2021, 7:31:32 UTC

Well, this takes the biscuit. I'd set BOINC to 85% computing time to leave some CPU power for GPU folding.
Last night and this morning I found my system had ground to a crawl and the LHC VM was taking 100% CPU time! So I restricted BOINC to 50% and now LHC is 'only' taking ~70%.
What with BOINC's messed-up way of trying to balance credit, and now LHC hogging resources, I'm done with it :(

But why was it taking 100% when I'd set 85%!? It should have left any spare threads for Rosetta, not tried to run another 8 with the LHC VM (that's with Atlas WUs).
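BOINC's computing preferences contain two separate limits that are easy to mix up: "Use at most N % of the CPUs" caps how many threads may be used, while "Use at most N % of CPU time" repeatedly suspends and resumes the running tasks. The rough sketch below illustrates the difference. The function names, the round-down rule and the cycle length are assumptions made for illustration, not taken from the BOINC source.

```python
def usable_threads(logical_cpus: int, max_ncpus_pct: int) -> int:
    """Threads available under 'Use at most N % of the CPUs'.
    Assumption: the result is rounded down, with a minimum of 1."""
    return max(1, (logical_cpus * max_ncpus_pct) // 100)


def duty_cycle(cpu_usage_limit: int, period_s: float = 10.0) -> tuple[float, float]:
    """Seconds (running, suspended) per cycle under 'Use at most N % of CPU time'.
    The cycle length period_s is a hypothetical value chosen for illustration."""
    running = period_s * cpu_usage_limit / 100
    return running, period_s - running


# A Ryzen 3600 exposes 12 logical CPUs (6 cores, 2 threads each).
threads_at_85 = usable_threads(12, 85)
threads_at_50 = usable_threads(12, 50)
```

Under these assumptions, an 85% limit on the number of CPUs would leave 2 of 12 threads free, whereas an 85% limit on CPU time would still allow every thread to be used, with short pauses in between.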
[TA]Assimilator1
Joined: 29 Nov 13
Posts: 58
Credit: 4,010,807
RAC: 28
Message 44499 - Posted: 16 Mar 2021, 18:29:34 UTC - in response to Message 44498.  

Just thought of a better answer: rather than not running LHC at all (I want to run it!), I've disabled the Atlas app. Let's see how the other apps behave....
anarchic teapot
Joined: 15 Feb 06
Posts: 67
Credit: 460,896
RAC: 1
Message 44781 - Posted: 21 Apr 2021, 21:30:41 UTC - in response to Message 44499.  

It's CMS that's not working for me. So far, everything else seems fine. Perhaps the short break I took from LHC saved our relationship.
sQuonk
Plague of Mice
Intel Core i3-9100 CPU@3.60 GHz, but it's doing its bit just the same.
Yeti
Volunteer moderator
Joined: 2 Sep 04
Posts: 453
Credit: 193,369,412
RAC: 10,065
Message 44823 - Posted: 26 Apr 2021, 12:44:15 UTC

LHC@Home is not a plug-and-play project like other BOINC projects are.

You can easily run LHC@Home like a plug-and-play project if you run Sixtrack only.
You can easily run LHC@Home like a plug-and-play project if you run only one of Atlas / Theory / CMS and keep this setting: "Use at most 100 % of CPU time" (VMs don't like CPU-time throttling).

If you want to run all the kinds of applications LHC@Home offers, you will have to micro-manage your client; BOINC will not always be able to give you what you want.
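One common way to do this kind of micro-managing is an `app_config.xml` file in the project's directory, which can cap how many tasks of each application run at once. The sketch below generates such a file with Python. The application names "ATLAS" and "CMS" and the limits are placeholders; the real short names have to be looked up in the client's `client_state.xml`. `build_app_config` is a hypothetical helper, not part of BOINC.

```python
import xml.etree.ElementTree as ET


def build_app_config(limits: dict[str, int]) -> str:
    """Build app_config.xml text with one <app> entry per application,
    each limited to the given number of concurrently running tasks."""
    root = ET.Element("app_config")
    for app_name, max_concurrent in limits.items():
        app = ET.SubElement(root, "app")
        ET.SubElement(app, "name").text = app_name
        ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    ET.indent(root)  # pretty-print; requires Python 3.9 or later
    return ET.tostring(root, encoding="unicode")


# Placeholder names and limits, for illustration only.
example_config = build_app_config({"ATLAS": 1, "CMS": 2})
```

After saving the generated text as `app_config.xml`, the client has to re-read its config files (or be restarted) before the limits take effect.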


[TA]Assimilator1
Joined: 29 Nov 13
Posts: 58
Credit: 4,010,807
RAC: 28
Message 44898 - Posted: 6 May 2021, 10:17:25 UTC
Last modified: 6 May 2021, 10:22:11 UTC

I don't recall seeing that on the front page.

Anyway, disabling Atlas seems to have done the trick: LHC no longer causes any problems. But I currently have 14 errored WUs, 13 of them for CMS sim, and no idea why (the exit codes mean nothing to me). Common ones are:
207 (0x000000CF) EXIT_NO_SUB_TASKS
194 (0x000000C2) EXIT_ABORTED_BY_CLIENT
And a single 1 (0x00000001) Unknown error code
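The task pages show each exit status as a decimal number, the same value as eight hex digits, and a symbolic name. A small sketch that reproduces that format for the codes listed above is given here. The lookup table contains only the two names quoted in this post, and `describe_exit_code` is a hypothetical helper.

```python
# Only the names quoted in the post above; anything else is reported as unknown.
EXIT_CODE_NAMES = {
    207: "EXIT_NO_SUB_TASKS",
    194: "EXIT_ABORTED_BY_CLIENT",
}


def describe_exit_code(code: int) -> str:
    """Format an exit code as 'decimal (0xHEX) NAME'."""
    name = EXIT_CODE_NAMES.get(code, "Unknown error code")
    # Mask to 32 bits so negative codes also print as eight hex digits.
    return f"{code} (0x{code & 0xFFFFFFFF:08X}) {name}"
```

The name alone only hints at the category of failure; the task's stderr output on the project website usually says more about what actually went wrong.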

Also the estimated times for some Theory sim WUs are 8-9 days! Lol. But they aren't actually taking that long.
tullio

Joined: 19 Feb 08
Posts: 708
Credit: 4,336,250
RAC: 0
Message 44899 - Posted: 6 May 2021, 13:55:29 UTC

I can run SixTrack, Atlas and Theory tasks on my Windows 10 PC with 12 GB RAM with no problem, always using the latest version of VirtualBox. But CMS tasks all fail after about ten thousand seconds of Condor. God knows why. I have a 30 Mbit/s connection to my Internet provider and a WiFi connection from modem to PC which reaches 250 Mbit/s.
Tullio
Erich56

Joined: 18 Dec 15
Posts: 1686
Credit: 100,401,459
RAC: 102,305
Message 44900 - Posted: 6 May 2021, 16:13:22 UTC - in response to Message 44899.  

@tullio: I had something like this last weekend; see the CMS thread. I had no idea what the reason was. A day and a half later, everything ran okay.
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 798
Credit: 644,777,836
RAC: 231,768
Message 44901 - Posted: 6 May 2021, 17:06:48 UTC - in response to Message 44899.  

It's normally because the back end at CERN ran out of work; there is no sync between the BOINC jobs and the CERN jobs.
[AF>France>Est>Alsace]PFLIEGER...

Joined: 30 Nov 20
Posts: 9
Credit: 950,380
RAC: 0
Message 45003 - Posted: 25 May 2021, 9:11:09 UTC

I had computation errors with CMS simulation.
They started at 02:30 UTC, while I was sleeping.
All of my computers were producing errors.
This morning I updated and cleaned all of my computers with CCleaner.
I checked whether a new version of the virtual machine exists, but no update is needed at this time because I am already using the latest version.

Maybe the errors have some other cause.

GP
Harri Liljeroos
Joined: 28 Sep 04
Posts: 674
Credit: 43,152,620
RAC: 15,489
Message 45004 - Posted: 25 May 2021, 9:27:41 UTC - in response to Message 45003.  

I had computation errors with CMS simulation.
They started at 02:30 UTC, while I was sleeping.
All of my computers were producing errors.
This morning I updated and cleaned all of my computers with CCleaner.
I checked whether a new version of the virtual machine exists, but no update is needed at this time because I am already using the latest version.

Maybe the errors have some other cause.

GP

The errors probably happened because CMS ran out of jobs last night and everybody's tasks errored out. See more details at https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5090. Scroll to the end of that thread to see the latest explanation. This error is quite common in CMS.

Everything should be OK at the moment and new jobs and tasks are available.
[AF>France>Est>Alsace]PFLIEGER...

Joined: 30 Nov 20
Posts: 9
Credit: 950,380
RAC: 0
Message 45012 - Posted: 26 May 2021, 17:41:39 UTC

VMs need security today:
- Avira Free
- Malwarebytes
- Avast Free
- CCleaner

Computing speed increased by 15% after a complete check of the computers.
Minimum investment cost: 0 €
[AF>France>Est>Alsace]PFLIEGER...

Joined: 30 Nov 20
Posts: 9
Credit: 950,380
RAC: 0
Message 45116 - Posted: 11 Jul 2021, 12:25:24 UTC

Today the AMD Ryzen machine produced a lot of errors, but none of the other machines working on CMS did.
I stopped new downloads on the AMD machine and am testing it with Universe@Home.
The AMD Ryzen had been working for 3 weeks without errors.
I do not know what happened!

Guy PFLIEGER
[AF>France>Est>Alsace]PFLIEGER...

Joined: 30 Nov 20
Posts: 9
Credit: 950,380
RAC: 0
Message 45117 - Posted: 11 Jul 2021, 12:34:02 UTC - in response to Message 45116.  

The problems with the AMD Ryzen started at 23:37:12 UTC last night, while I was sleeping.

Guy PFLIEGER
maeax

Joined: 2 May 07
Posts: 2071
Credit: 156,150,336
RAC: 105,728
Message 45118 - Posted: 11 Jul 2021, 12:44:29 UTC - in response to Message 45117.  

Ivan wrote about databridge errors in the CMS section of the message boards.


©2024 CERN