Message boards : Theory Application : Tests of Theory app on LHC@home
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2401
Credit: 225,541,770
RAC: 120,747
Message 27938 - Posted: 24 Nov 2016, 9:37:39 UTC - in response to Message 27926.  

... and VB and my DSL connection have problems ...

@ MAGIC Quantum Mechanic

Have you ever thought about using a local squid proxy together with policy based routing?
This would relieve your DSL connection.

My squid serves 2 BOINC hosts and has a hit rate of 95% and a byte rate of >50%.
ID: 27938 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1274
Credit: 8,480,870
RAC: 2,011
Message 27939 - Posted: 24 Nov 2016, 9:51:05 UTC - in response to Message 27936.  

I finally have 2 Theory simulations on the Windows 10 PC, my fastest machine. But where can I see the MCPLOTs?
Tullio

It's not yet consolidated on LHC@home, but your cpu-seconds, jobs and events from your Windows machine (4 jobs done yet),
can be displayed in MCPLOTS with the hostid 10407309 under userid 'Unknown'.
http://mcplots-dev.cern.ch/production.php?view=hosts&display=active
http://mcplots-dev.cern.ch/production.php?view=user&userid=Unknown#10407309
ID: 27939 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1118
Credit: 49,727,516
RAC: 13,445
Message 27965 - Posted: 25 Nov 2016, 22:07:37 UTC - in response to Message 27938.  

... and VB and my DSL connection have problems ...

@ MAGIC Quantum Mechanic

Have you ever thought about using a local squid proxy together with policy based routing?
This would relieve your DSL connection.

My squid serves 2 BOINC hosts and has a hit rate of 95% and a byte rate of >50%.


No that is something I don't want to even try here with my 7 computers running 3 versions of Windows and the main problem is here with Centurylink is the server is about 400 miles away.

The reason here is this DSL does not run nearly as fast as one in a city like Seattle where they get 10X the speed that I do......mine can run as fast as 325K at times but most often closer to 35K which is dialup speed.

So I have to make sure I have all the Cern VB tasks running beyond the HTCondor ping before I try to load all of my pc's with Einstein GPU files since I get hundreds of those tasks and each task gets lots of files/bins

It can take hours to d/l those and the same thing when I have to d/l a new .vdi when a VB task is updated to a new version.

(along with the wife on netflix)
Volunteer Mad Scientist For Life
ID: 27965 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1118
Credit: 49,727,516
RAC: 13,445
Message 27966 - Posted: 25 Nov 2016, 22:19:31 UTC

If any members joining here are having problems that turn out to be not having enough Ram you will see the CMS tasks take more memory each than these Theory tasks take.

I have all different types of CPU's and amounts of Ram but on my 8-core with only 8GB ram it can run as many as 6 of these Theory tasks at the same time (good idea to have them spaced out several minutes each)

Right now the one I am on is running 3 LHC Theory VB tasks and 3 vLHC Theory VB tasks (along with 2 Einstein GPU tasks).....and me typing this.

Now it doesn't leave much free memory but only uses 33% of the CPU
Volunteer Mad Scientist For Life
ID: 27966 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1118
Credit: 49,727,516
RAC: 13,445
Message 27977 - Posted: 28 Nov 2016, 2:36:42 UTC
Last modified: 28 Nov 2016, 2:45:10 UTC

Well as usual things were running fine and then in the last hour I am getting Error after error.

no heartbeat

Watching the VM Console and they don't get beyond the ....

Starting libvirtd daemon [OK]

It happened on 3 different computers I tried to start up new tasks

So after several of those 11 minute crashes I just decided to not start any new ones for now and start up the GPU's again.

I am having no luck with VB tasks and Cern right now other than the ones already running.

Same thing happened with the vLHC-dev tasks. (and probably will when the vLHC Theory tasks are finished)

Which means it never gets this far right now.....


Volunteer Mad Scientist For Life
ID: 27977 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1118
Credit: 49,727,516
RAC: 13,445
Message 27979 - Posted: 28 Nov 2016, 7:42:57 UTC

Ok it looks like maybe it came back to life since the pc in front of me got a new task and got beyond the HTCondor ping this time.

So I will go see if the other pc's will do the same.
ID: 27979 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1118
Credit: 49,727,516
RAC: 13,445
Message 28007 - Posted: 30 Nov 2016, 9:14:02 UTC

https://lhcathome.cern.ch/lhcathome/result.php?resultid=109035302

Right now I am unable to start new tasks because of the

[ERROR] Could not connect to Condor server on port 9618

and checking around and I see everyone having that problem with new tasks starting and it happens faster than usual with these Condor problems

Since it is after 1am here it looks like when the ones I have running are done they will be suspended until tomorrow when I can check and see if this has been taken care of since I would rather not get 100 Invalids in a row....like I see happening for others.
Volunteer Mad Scientist For Life
ID: 28007 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1118
Credit: 49,727,516
RAC: 13,445
Message 28016 - Posted: 30 Nov 2016, 11:57:23 UTC

Ok well 3 hours later it appears to be back up and running so I guess whoever sat on the Condor while having breakfast over there got up and let it get back to work and do the X509 credentials and the HTCondor ping.

I have got several to start up again and will finish doing that with the rest since I had to stay up until 4am waiting for this to work.

BUT it still is not working over at vLHC and it just repeats the X509 credentials over and over so I will suspend those again.

So I guess everyone else may have their start running again now too and wonder why they have all those computer error tasks.

Well just in case somebody trips over this thread they may see it wasn't on their end.

Goodnight
Volunteer Mad Scientist For Life
ID: 28016 · Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Theory Application : Tests of Theory app on LHC@home


©2024 CERN