41) Message boards : Number crunching : I think we should restrict work units (Message 14011)
Posted 17 Jun 2006 by Philip Martin Kryder
Post:

....than helping the project get work done as soon as possible. ....


Why is this a goal?

It has been pointed out repeatedly that the work is getting done "fast enough" and if it were not, then the LHC folk could adjust the deadlines and max downloads per day.

Why do you try posit a position of
"I know better than you do, what you should do with your life or computer or whatever...?"
42) Message boards : Number crunching : ??what does average turnaround time mean?? (Message 13995)
Posted 15 Jun 2006 by Philip Martin Kryder
Post:
maybe that means you take less then a day to complete/report your workunit.


Maybe, but, I'm one of those cache hogs with connect to time at 10 days...
43) Message boards : Number crunching : ??what does average turnaround time mean?? (Message 13992)
Posted 15 Jun 2006 by Philip Martin Kryder
Post:
I'm seeing ZERO as the average turn around time on my computer.

How could that be correct?

Total Credit 516.46
Recent average credit 24.16
CPU type GenuineIntel
Intel(R) Celeron(R) CPU 2.53GHz
Number of CPUs 1
Operating System Microsoft Windows 2003
Standard Server Edition, Service Pack 1, (05.02.3790.00)
Memory 502.07 MB
Cache 976.56 KB
Swap space 4442.6 MB
Total disk space 23.29 GB
Free Disk Space 11.64 GB
Measured floating point speed 1233.62 million ops/sec
Measured integer speed 1881.52 million ops/sec
Average upload rate Unknown
Average download rate Unknown
Average turnaround time 0 days
Maximum daily WU quota per CPU 100/day
Results 0
Number of times client has contacted server 667
Last time contacted server 15 Jun 2006 5:37:17 UTC
% of time BOINC client is running 99.0571 %
While BOINC running, % of time work is allowed 99.5332 %
Average CPU efficiency 0.938361
Result duration correction factor 0.704289
44) Message boards : Number crunching : ??How did CERN intend to build the LHC **before** BOINC?? (Message 13991)
Posted 15 Jun 2006 by Philip Martin Kryder
Post:

They would have run the Sixtrack program on a bank of computers.

When doing an initial design, you don't need to run a million cycles to pick out how big of a magnet you need to bend a particle beam around a huge circle.
Running a million cycles lets you fiddle with the controls under a variety of conditions.

In the old days, they just built the cyclotron, and then fiddled with the controls.




So how much did they save by not having to buy this bank of computers?
45) Message boards : Number crunching : ??How did CERN intend to build the LHC **before** BOINC?? (Message 13986)
Posted 15 Jun 2006 by Philip Martin Kryder
Post:

Does anyone have any insight into how CERN intended to build and configure the LHC **before** they joined BOINC?

LHC has been planned and budgeted for YEARS.

They must have had a plan....

Anyone?
46) Message boards : Number crunching : I think we should restrict work units (Message 13985)
Posted 15 Jun 2006 by Philip Martin Kryder
Post:
I've started another thread asking if there is any way to view the WUs.

Why do I get the feeling that you'd like to see the wu's, so that you can try and associate them with a user whom you may believe can be pressured into returning them faster?

I smell a witchhunt brewing. :(

I no longer micromanage my systems, I rely on the infrastructure to work as designed. Granted, in the early days under certain conditions (quite a few actually :) the infrastructure failed to work properly. Those days are pretty much history, rendering micromanaging unnecessary.

The system has worked well enough to date. If the project team thought there was a flaw, or the scientists were clammering for answers earlier, they may be inclined to address a perceived "problem".

IMHO, impatient users are not a problem and the system doesn't look broke...Does it really need fixing?


I'm not witch hunting.

I don't know why you feel as you do, that is a question only you (and perhaps not even you) can answer.

I wanted DATA to try to stop the inuendos about "hogging and hoarding".

I don't think anything is "broke".

phil




47) Message boards : Number crunching : I think we should restrict work units (Message 13965)
Posted 13 Jun 2006 by Philip Martin Kryder
Post:
Perhaps this is a Social or Psychological experiment and not one of Physics...

I just see the paper on "Volunteer Responses to Contrived Shortages in an Altruisic Computing Environment..."


Does anyone have any insight into how CERN intended to build and configure the LHC **before** they joined BOINC?
48) Message boards : Number crunching : I think we should restrict work units (Message 13957)
Posted 12 Jun 2006 by Philip Martin Kryder
Post:
The way I see it, there's a way to "exploit" the system to make sure you get a lions share of the work. However, this wasn't kept a secret, so if anyone wants to they can - hence there's no secret advantage being used against people.

As someone said, if LHC want's results returned faster, they can modify the BOINC server settings, so no worries there.

I think the motive for doing it is stupid (e.g. "it's more important for me to score points than for the work to be shared and completed faster") but now that the secret is out, the best way to get back at the originators is to do the same yourself. Then they won't be getting a tactical advantage. And if the problem gets severe enough, LHC will do something about it, and we'll all be back on the level playing field.


It is interesting to me that some consider raising the cache (or time to connect) to be "unfair".

But, raising the resource percentage to favor LHC by large factors is "ok" and even somewhat encouraged.

Both have the INTENT of getting more LHC work.
One is more effective than the other.

One is considered "ok".
While the other is considered "unfair" (by some)....

Is this rational?






49) Message boards : Number crunching : How can one look at the uncompleted Work Units (Message 13956)
Posted 12 Jun 2006 by Philip Martin Kryder
Post:
I don't know of any built-in system function that allows viewing of all currently unfinished work units.


Thank you.

I should have been more clear.

I'm interestested in the work units that are shown on the homepage as still in progress.
Not, the ones on my account.

50) Message boards : Number crunching : I think we should restrict work units (Message 13951)
Posted 11 Jun 2006 by Philip Martin Kryder
Post:
... but maybe I'm wrong...


Perhaps.
We all run that risk - particularly when we speculate without data.

I've started another thread asking if there is any way to view the WUs.

Is it possible that those WUs have really reached Quorum, and that the fact that there are no WUs available is that there is some other asynchronous work (perhaps being done within CERN - perhaps even by **humans**) that needs to complete before more work can be sent out?

We really don't know why there is no work, or do we?


51) Message boards : Number crunching : Solution for LHC Long Term debt problem ? (Message 13950)
Posted 11 Jun 2006 by Philip Martin Kryder
Post:
That makes as much sense as anything.
Because it definitely is wrong. My switch time is 120 mins and my LTD are bigger than +-7200. I think Steve mixed LTD and STD. LTD has no influence in how much work is asked for but only in the decision if work is asked for. And to keep track of "long term debt" it must keep bigger values.

Norbert

If you force it to get more work and cause boinc to go into EDF, yes it can go beyond the the values that I stated. But if you leave it alone then what i said is true.



I don't get it - I have much larger values...
see below - from BoincDV

PRJ: ROSETTA@HO STD: -86400.000000 LTD: -291355.970175 RSRC: 1 -------------------------------
PRJ: EINSTEIN@H STD: 0.000000 LTD: -291177.619805 RSRC: 1 -------------------------------
PRJ: LHC@HOME STD: 0.000000 LTD: 260828.410659 RSRC: 10000000 -------------------------------
PRJ: SZTAKIDESK STD: 86400.000000 LTD: 286210.923153 RSRC: 100000 -------------------------------
PRJ: PRIMEGRID STD: 0.000000 LTD: 35494.256168 RSRC: 1 -------------------------------
52) Message boards : Number crunching : Solution for LHC Long Term debt problem ? (Message 13947)
Posted 11 Jun 2006 by Philip Martin Kryder
Post:
what is the downside of just setting the "connect to time" to 10 days and letting things run?
53) Message boards : Number crunching : How can one look at the uncompleted Work Units (Message 13946)
Posted 11 Jun 2006 by Philip Martin Kryder
Post:
Is there a web page that will allow one to look at the work units that are not yet complete?

There is a lot of speculation in other threads about the machine settings of machines running the Work Units not yet complete.

I'm wondering if it is possible to replace the speculation with data.

Thanks
Phil

54) Message boards : Number crunching : I think we should restrict work units (Message 13945)
Posted 11 Jun 2006 by Philip Martin Kryder
Post:
...
its painful to watch the servers wait for a timeout for less than 30 workunits to be returned...


do we really know that the servers are waiting for stragglers?

Or, could they be waiting for their own analysis to complete in order to build the next work units based on the work completed.

55) Message boards : Number crunching : Solution for LHC Long Term debt problem ? (Message 13930)
Posted 11 Jun 2006 by Philip Martin Kryder
Post:
...handles Long Term debt, it actually and continually shrinks the effective Cache of all machines....


Can someone explain the rationale behind this Cache shrinkage due to long term debt?




The description in the Wiki is confusing, at best. This is how I understand it. I could be wrong.

First, there really is no such thing as a cache setting. BOINC loads enough work to keep the machine busy until the next "Connect Interval". A long "connect interval" will, of course, mean more work is loaded at one time. There really is no "load x days work" setting.

Long term debt determines what program is queried for work. When Boinc needs to download (Download OK mode), it tries to get work from projects with a high LTD. It won't try to get work from projects with low LTD. Later, it decides that it must get work from somewhere because it is going to run out of work (Download required mode), it will get work from anyplace that has work, regardless of LTD.

So, I think in the case when LHC is out of work, it typically will have a relatively large LTD. As a result, BOINC won't download work until the scheduler goes into "Download Required".


There is no "cache setting" - got it.

What seems not to match my experience is:
1)The earlier posts about LHC being limited to 10000 LTD - mine was much higher.
Some of my other projects went negative.

2) your second to last paragraph seems to say that projects with a HIGH LTD will try to get work in Download OK Mode.
While your last paragraph seems to say that it won't go to "LARGE LTD" (is large different than high?) until "Download Required" mode...

So, If I have a High (?Large? LTD) - shouldn't that attract work?

Sorry to be so dense.

Phil
56) Message boards : Number crunching : Solution for LHC Long Term debt problem ? (Message 13925)
Posted 10 Jun 2006 by Philip Martin Kryder
Post:
...handles Long Term debt, it actually and continually shrinks the effective Cache of all machines....


Can someone explain the rationale behind this Cache shrinkage due to long term debt?

57) Message boards : Number crunching : I think we should restrict work units (Message 13919)
Posted 10 Jun 2006 by Philip Martin Kryder
Post:



While I can acknowledge and appreciate your feelings, I'm surprised to find them so common in a scientific study.

Instead of casting wide nets of guilt by inuendo, based on intuition and emotion, why not gather the data that you describe?

Some folks might even consider gathering data to support a hypothesis **before** posting.

As for deciding based on the attributes of the machine or network if any given cache size is "legitimate", who am I or who is anyone to decide what "legitimate " is?

I denigrate the underlying premise that somehow "greed is bad" but having a slow connection or slow computer is somehow makes it "proper" to have a large cache.
A person who was less "greedy" might buy a faster computer or a faster Internet connection or a second phone line.

Would that then make them "good"?

What if they bought a larger computer and then increased their cache, would they then be "greedy" or would they be "good?"


In summary:
1) LHC could control hoarding by limiting deadlines.
2) LHC could control server crashes by limiting max connections relative to known server capacity.
3) If the use of large caches is "improper" or "illegitimate", then this is the type of feedback that the BOINC folks need in order to establish the need for new and better controls and algorithms in BOINC.
4) There is lots of other great science to do on other projects when LHC runs dry.

crunch on!




how do you know that the "hoarders" didn't have a "proper" reason such as a slow modem or dial up line?



or an older machine, or one that doesn't crunch 24/7?


That's possible, sure, and I don't think people would complain much if people have a legitimate reason for using a large cache. One way to check this would be to find some work units that are still pending, then look at the computer's details. Some of the information there must give you an idea what the computer is like in terms of age, speed, percent of the time BOINC runs, network connectivity, etc. That should help you decide if the person is keeping a large cache for a legitimate reason or not. The on and off work status of LHC is new to me, so it's tough for me to understand exactly what's going on, why, and how to solve the problems. I do know that it's disappointing to see so many work units in progress, waiting to be done, while my computer has been sitting dry for over 1 week. I think many others share this feeling, and that's one of the reasons for the complaints. We suspect that not everybody has a proper reason for maintaining a huge cache of work, and the people who don't have a proper reason are just being greedy, taking work from others for themselves and preventing work from being done promptly.










58) Message boards : Number crunching : I think we should restrict work units (Message 13914)
Posted 10 Jun 2006 by Philip Martin Kryder
Post:
As I post this (23.00 hrs UK time) the front page of the LHC site shows that there are still 340 WUs out there somewhere, still unprocessed. This is days after I (and I'm sure many other people!) had returned the final WU unit from the last batch of WUs to be issued.

So those 340 WUs are being held in the cache of some irresponsible crunchers who 'hoard' work. This must ultimately slow down the whole LHC project!

Jeez - when will some people learn that we are working for LHC, not LHC working for our egos.....



how do you know that the "hoarders" didn't have a "proper" reason such as a slow modem or dial up line?

59) Message boards : Number crunching : Please sign BOINC-related petition (Message 13885)
Posted 4 Jun 2006 by Philip Martin Kryder
Post:
I suppose then if your browser did not handle the web pages well then you would care. Perhaps your browser isn't handleing the code well and you don't know it. Of course, it is more likely that those that are blind and would like to help can't because of the non-compliance. That is what W3C is about, well that and a few other things. So, you don't care if handycapped people ever help the differant projects of BOINC? You don't care if people that have to use text to speach browsers can't access LHC or Seti or what-have-you. I am sure that those that require help to use the internet don't care about you either. So, enjoy your non-caring life and don't complain about IE.

Tatheg





I'm not sure to whom you are addressing this, since the quotes seem to have gotten messed up.

But, this is an excellent example of why I feel that Rytis should build his Universal Boinc Portal, rather than bothering David Anderson.

By building the portal, we could see how much it really mattered.
And,
We we could learn this without impacting other development efforts.







60) Message boards : Number crunching : I think we should restrict work units (Message 13882)
Posted 4 Jun 2006 by Philip Martin Kryder
Post:
Thanks to your clear explanation, I raised my cach for .01 to 10 days.
And yup,
As soon as there was work to do, I was able to get a bunch of it to work on.

Please, don't do that.

Engouraging people to keep that kind of a cache without a proper reason on fast computers will bring only a problems. As we have seen servers at Cern (or where ever they are) won't stand the excessive amount of downloads what around 80 computers downloading WUs for ten days cause.

So don't come here whining when server is down and you wan't download work. You know the reason exactly.


So, you are saying that you have direct knowledge that the servers are so badly configured that they FAIL rather than throttle back connections to a level that they can handle?

Can you tell us where and how you learned this?

If your assertion is true, shouldn't be handled more directly by reconfiguring the servers rather than expecting 65k crunchers to configure their machines in some special way?

And by the way, I was never intending to be "...Engouraging [sic] people to keep that kind of a cache..."
I think that YOU should set YOUR cache to .01 and leave it there.
You should NEVER raise YOUR cache above .1.

By the way, what do you mean by "...a proper reason???"









Previous 20 · Next 20


©2024 CERN