Message boards :
Theory Application :
New version 300.00
Message board moderation
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Send message Joined: 2 May 07 Posts: 2242 Credit: 173,902,375 RAC: 2,798 |
It's not possible to ping sft.cern.ch. Seems to be down or unreachable. |
Send message Joined: 15 Jun 08 Posts: 2531 Credit: 253,722,201 RAC: 41,981 |
It's not possible to ping sft.cern.ch. Seems to be down or unreachable. It's not possible because it's not a server name. Hence it doesn't have a DNS entry. Instead it's the name of a CVMFS repository. Status of the stratum 1 servers can be checked here: http://cernvm-monitor.cern.ch/cvmfs-monitor/sft.cern.ch/ Be aware that stratum 1 servers should not (tends to must not) be used directly by LHC@home volunteers. Instead it's recommended to use their openhtc.io counterparts. |
Send message Joined: 7 Feb 14 Posts: 99 Credit: 5,180,005 RAC: 0 |
Jobs will last on average 2 hours rather than 12.Very long task: https://lhcathome.cern.ch/lhcathome/result.php?resultid=251991795 |
Send message Joined: 14 Jan 10 Posts: 1417 Credit: 9,441,051 RAC: 798 |
The longest known Theory task of batch 2279 lasted 376 hours and 55 minutes, the second longest 236.5 hours ;)Jobs will last on average 2 hours rather than 12.Very long task: https://lhcathome.cern.ch/lhcathome/result.php?resultid=251991795 |
Send message Joined: 7 Feb 14 Posts: 99 Credit: 5,180,005 RAC: 0 |
The longest known Theory task of batch 2279 lasted 376 hours and 55 minutes, the second longest 236.5 hours ;)Not so long then. Anyway that host avg time is about 1.6 hours, so 18 hours is quite a bit. :) What's the host avg time and theory app version for an almost 377-hours task? |
Send message Joined: 14 Jan 10 Posts: 1417 Credit: 9,441,051 RAC: 798 |
What's the host avg time and theory app version for an almost 377-hours task?That info comes from MC Production -> http://mcplots-dev.cern.ch/production.php?view=revision&rev=2279 There is no host-info available. You see my 2 mentioned jobs are real extreme out-liers. When clicking on the graph, you'll see the number of jobs per 5 minutes interval. |
Send message Joined: 14 Jan 10 Posts: 1417 Credit: 9,441,051 RAC: 798 |
Testing Theory vbox32:This task was ready within a few seconds. 2019-11-18 09:31:49 (6420): Guest Log: [INFO] ===> [runRivet] Mon Nov 18 09:31:38 CET 2019 [boinc pp jets 7000 25,-,760 - herwig7 7.1.3 default 100000 186] 2019-11-18 09:32:33 (6420): Guest Log: [INFO] Preparing output. 2019-11-18 09:32:33 (6420): Guest Log: [INFO] Job Finished Maybe the base memory for Theory vbox32 of 320MB is too low for herwig7. Pythia's are running fine. I increased the memory requirement to 384MB waiting for future herwig's. For BOINC you are reserving 700,000,000 bytes of memory by setting this in rsc_memory_bound. Maybe you could change this in line with the real needed memory. Could you also have a look to my remarks mentioned here. |
Send message Joined: 7 Feb 14 Posts: 99 Credit: 5,180,005 RAC: 0 |
Oh, I didn't notice that graph is clickable. A lot of njobs=1 looks to me there are many uncommon jobs or large runtime from a few slow hosts so that 5mins-bins are too thin. |
Send message Joined: 14 Jan 10 Posts: 1417 Credit: 9,441,051 RAC: 798 |
Maybe the base memory for Theory vbox32 of 320MB is too low for herwig7.Herwig++ is running OK with 384MB Base memory. Lowest free memory seen 6MB with only 3MB swap used. |
Send message Joined: 24 Oct 04 Posts: 1172 Credit: 54,748,185 RAC: 13,931 |
Yeah we don't get much Ram on a Windows X86 no matter how many you plug in (<4GB) I finally got to start my X86 Theory 300.02 (just ran 3 Sixtrack that was almost 100 hours each) This Theory task is now at 1% after 1 hour so far.....it says remaining time 4days 4mins 30 sec |
Send message Joined: 14 Jan 10 Posts: 1417 Credit: 9,441,051 RAC: 798 |
Remarks:I got Console ALT-F2 (vbox32) working to show the progress of events processing. It should be repaired otherwise, but this is what I did: Suspended a task with LAIM off. With VirtualBox Manager I discarded the saved state. I started the VM outside of BOINC with VirtualBox Manager. Wait until the pythia, herwig etc has started in 'top'. Switch to ALT-F2 and it works. Saved the machine state and resumed the task in BOINC. Console ALT-F2 is now also working in Remote Display Port. btw: I increased the Base Memory to 512MB. I also had a Pythia8 ready within a few seconds without doing real work https://lhcathome.cern.ch/lhcathome/result.php?resultid=252193424 |
Send message Joined: 18 Dec 15 Posts: 1811 Credit: 118,336,097 RAC: 25,785 |
Although more than unsent 600 tasks are shown in the Server Status page, my host received only 1 task, and from then on it keeps saying "no tasks available for Theory similation". Why so? |
Send message Joined: 2 May 07 Posts: 2242 Credit: 173,902,375 RAC: 2,798 |
Laurence is searching for this fauxpas. When you have a idea... |
Send message Joined: 14 Jan 10 Posts: 1417 Credit: 9,441,051 RAC: 798 |
Although more than unsent 600 tasks are shown in the Server Status page, my host received only 1 task, and from then on it keeps saying "no tasks available for Theory similation".Probably you have a limit of 1 for # CPUs. Set 'No Limit' and you will get as many tasks as you have cores or # of jobs, if the latter is less. |
Send message Joined: 18 Dec 15 Posts: 1811 Credit: 118,336,097 RAC: 25,785 |
Probably you have a limit of 1 for # CPUs. Set 'No Limit' and you will get as many tasks as you have cores or # of jobs, if the latter is less.yes, I did set the limit of "1 CPU", since the new version of Theory has 1-core tasks now (in opposite to multicore tasks as until short time ago). But this limit I set already a few days ago, and still I could download more than 1 task (as long as my setting for # of tasks was/is >1). But thanks anyway for the hint, I'll try it. |
Send message Joined: 7 Feb 14 Posts: 99 Credit: 5,180,005 RAC: 0 |
I have 3 sherpa jobs (not native). 1) CPU time 74:10:23, Elapsed time 72:59:24 2) CPU time 72:45:38, Elapsed time 71:43:31 3) CPU time 05:12:12, Elapsed time 42:06:48 Should I abort all of them? |
Send message Joined: 18 Dec 15 Posts: 1811 Credit: 118,336,097 RAC: 25,785 |
I had a similar situation recently, at the end they somehow failed, and I got no validation. So I would abort them. |
Send message Joined: 7 Feb 14 Posts: 99 Credit: 5,180,005 RAC: 0 |
|
Send message Joined: 29 Sep 04 Posts: 281 Credit: 11,866,264 RAC: 0 |
Have a look in "Show graphics" or "Console". Graphics then logs. The top line will show what flavour of simulation you have and the number of events to be processed. Some will only process a couple of thousand events instead of the 100,000 that Pythias do. Console then ALT-F2 will show current events and, every so often, an estimate of time remaining. If that estimate is a reasonable number and it is going down then the task is probably healthy. If that number is increasing or is something daft, like 2,000 days, then it is unlikely to finish successfully. If you are in doubt, post the first line and last 10? or so. Cross-posted so too late. I should be working so can't look at error logs just y. |
Send message Joined: 12 Jun 18 Posts: 126 Credit: 53,906,164 RAC: 0 |
Set 'No Limit' and you will get as many tasks as you have cores or # of jobs, if the latter is less.None of my rigs have been supplied with more than ten Theory WUs and all specify No Limit/No Limit in Prefs.. |
©2024 CERN