1) Message boards : Number crunching : Exceeding maxium disk space. (Message 21303)
Posted 11 Mar 2009 by (banished: ID 70524)
Post:
Gas Giant wrote:
tpieloni wrote:
Yes, I am from CERN/LHC and I am new with LHC@home...
I am running some studies for the LHC and trying to learning how to use this powerful pool of CPUs.
I will try to stop by the forum every now and then to update you on the outcomes of the study.
Let\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\'s keep running.


Excellent. Welcome from us all!


tpieloni is one Gas Giants\\\' numerous sockpuppets

2) Message boards : Number crunching : Exceeding maxium disk space. (Message 21272)
Posted 26 Feb 2009 by (banished: ID 70524)
Post:
My Linux box, host 9780021, returned a Success on this one.
3) Message boards : Number crunching : Exceeding maxium disk space. (Message 21268)
Posted 26 Feb 2009 by (banished: ID 70524)
Post:
Are we seeing another HR scenario where the same science app behaves differently on different hosts?


I think Eric Macintosh licked that problem a long time ago with new floating point libs.

My guess is that given the same work unit, Vista under-reports the disk space the app is using. Or maybe XP and Linux over-report the disk space used by the app. Either way, if the WU simply sets the max a little higher there should be no problem. Just guessing, I have no evidence to suggest Vista over-reports or that Linux/XP under-report.
4) Message boards : Number crunching : Exceeding maxium disk space. (Message 21254)
Posted 25 Feb 2009 by (banished: ID 70524)
Post:
So what could nI do ? Is it the new Sixtrack or is it the new BOINC version ?
Thanks for quick help
Dr.Mabuse


It is not a problem with your settings. There is nothing you can do.

Every work unit from every project has a maximum disk space number. That number is determined by the project, not by BOINC client. In this case it looks like the maximum is 28.61MB. It is just one more safety mechanism. LHC is telling your BOINC client that the task\'s output files should never use more than 28.61 MB. Either BOINC or the Sixtrack application found the task was using 38.99MB so it aborted the task. Either the new Sixtrack application has a bug which causes the output files to grow beyond the maximum or else they are underestimating the maximum size of the output files.

5) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20838)
Posted 29 Nov 2008 by (banished: ID 70524)
Post:
Pay at the first window. Would you like to upsize your order to 1500 tasks for just $12.99?
6) Message boards : Number crunching : WU \\\"Aborted by Project\\\" (Message 20835)
Posted 28 Nov 2008 by (banished: ID 70524)
Post:
...it sends a message to your Computer to abort the WU.


That could be taken 2 different ways. It doesn\'t really \"send a mesage\" in the same sense that you send a letter. In other words it doesn\'t initiate a connection with your computer. It waits until your computer contacts the server, then the cancel message is delivered.

Very often your computer does not contact the server in time to receive a cancel message so it goes ahead and crunches a task that doesn\'t need to be crunched. That is why using the cancel mechanism is not a good way to eliminate the redundant work and waste inherent in the IR > minQ strategy.

7) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20834)
Posted 28 Nov 2008 by (banished: ID 70524)
Post:
Donations are a good idea, FalconFly.

Here\'s another... Since so many crunchers here think it\'s just the most wonderful thing in the world to be able to crunch LHC@home tasks, they won\'t mind paying for such an incredibly fulfilling privilege. Sell the work units for 1 penny per task. $10 would get you 1000 tasks.

8) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20796)
Posted 22 Nov 2008 by (banished: ID 70524)
Post:
YAY WORK!
Can someone explain why I have on each machine, a bunch of WUs that finish properly, and a large amount that are tagged as redundant result, or cancelled by server. I did a search in the message boards, but got no results.
Is that because the search doesn\'t work either?

Does this have anything to do with the BOINC 6.x problem you are all talking about?

Cragg


They send out 5 copies of each task, each copy goes to a different computer. Consider those 5 to be an ad hoc temporary work group. The other 4 member are your wingmen for the mission. 5 is the Initial Replication or IR for short. They need only 3 matching results to verify that they have \"the right answer\". 3 is the minQ, short for Minimum Quorum. If they get 3 that match before you start crunching your copy, they cancel yours because it is not needed (redundant). It would be a waste of your CPU time if they didn\'t cancel it. It has nothing to do with the BOINC 6.x problem directly. The need to send out more than 3 copies is a direct result of the fact that this project has always, even before the BOINC 6.X problem, had a high rate of compute errors and results that fail to match the other results returned by members of the work group.

The cancel mechanism does not always work because the cancel order frequently does not reach your computer in time. Then you waste CPU time doing work that doesn\'t need to be done. It is far more efficient (less wasteful) to send only 3 tasks (IR = minQ) and wait until those all return and then, if the 3 don\'t match, send a fourth copy. The reason they use IR > minQ is because they know from experience that some 25% of reults will fail to match. Thus IR > minQ speeds up the completion of the batch but at the expense of precious CPU time. There are other strategies for speeding up the completion of the batch that are much more efficient. The IR > minQ strategy is an anachronism from long ago.

The Advanced Search works much better but it\'s far from perfect.

9) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20784)
Posted 18 Nov 2008 by (banished: ID 70524)
Post:
Nice try but guess again.


So, where in the world can you divert funds from a gov\'t funded research project by simply \"[speaking] to the right person\"?


Just about everywhere in the world, in every public and private enterprise. It\'s simply bad management to lockdown every penny and require \"the board\" to vote on every penny spent. Every enterprise has at least 1 person who has the authority to divert funds and spend small amounts to meet unforseen circumstances. It\'s absolutely necessary to avert disaster and/or keep the enterprise on an even keel.

Fools and their money always part quickly.


I\'m gladly giving less than $0.1 a month to what I consider the most impressive project since the great pyramids were built. If you think that\'s foolish, fine for you, I couldn\'t care less. Personally I think it\'s great that I can contribute, even if it\'s just a tiny tiny bit.


It IS the most impressive project since the great pyramids, don\'t think even for an instant that I don\'t appreciate what a marvel of engineering it is and how badly we need the knowledge the collider will provide. That doesn\'t mean we should piss away CPU cycles needlessly just to make the collider work. The fact is you could contribute even more if they were not wasting so much. Hey, it\'s not like the waste is unavoidable. I don\'t whine about waste that is unavoidable. But the waste here IS avoidable to a large measure.

Now about the money...

OF COURSE I don\'t worry about the $.1 you waste every month. It\'s always been about the TOTAL waste in the project. And it\'s about the TOTAL wasted CPU cycles, not really the wasted money. Yeeesh! I thought you were reading my posts and on the same page.

10) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20779)
Posted 18 Nov 2008 by (banished: ID 70524)
Post:
Regarding diverting funds... there is someone who can divert funds, there always is. If funds haven\\\'t been diverted then you just haven\\\'t spoken to the right person or else you have but they\\\'re pretending to be deaf.


In other words, you have absolutely no idea what you\\\'re talking about at all.


Nice try but guess again.

As for me, I consider the scientists time more valueable than my CPU\\\'s idle cycles.


Fools and their money always part quickly.

11) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20767)
Posted 14 Nov 2008 by (banished: ID 70524)
Post:
the very least they can do and it would take much from their budget


The least they can do is nothing and I have explained this the budget is set, they asked for money to do certain things and they got the money to do certain things and LHC@home was not in that plan THEY ARE NOT ALLOWED DIVERT FUNDS THAT HAVE BEEN ALLOCATED.


Correction => The very least apathetic sloths would do is nothing. Responsible people would eliminate the waste. Regarding diverting funds... there is someone who can divert funds, there always is. If funds haven\\\'t been diverted then you just haven\\\'t spoken to the right person or else you have but they\\\'re pretending to be deaf. Either way there is a solution and I intend to help. More on that below.


And if they don\\\'t like that then let them buy the CPU time they pee down the drain. I have a hunch things would get fixed real quick if they had to do that.

The work just wouldn\\\'t get done in that case, there is no money for this, no money at all zilch, zero, nada, nothing if they had to buy the CPU this work would not get done, this is not the perfect scenario for the LHC but that is what they would have to do.


You seem to be saying they didn\\\'t know the magnets would require tuning/alignment when they designed the machine so they didn\\\'t budget for that. That would make them idiots which would mean the entire venture has little chance of accomplishing anything. I can\\\'t believe that.

You also seem to be saying there is absolutely 0 contingency planning and funds. If so then how will they pay for fixing the recent coolant system failure?

You seem to think we relish \\\"stealing CPU\\\" and working on a best effort basis with no money.


If that\\\'s what you think then you\\\'ve acquired the wrong impression.

I can assure you this project is very frustrating to work on without you telling me it is (by the way thanks for the reminder I was actually having a good day).


No problem. I was having a good time BOINCing until I came to understand what a wasteful, screwed up project this is aand how it hurts so many other projects that are equally broke but make every effort to be efficient. (One snide remark deserves another, goose<=>gander)

Do you think we have a massive pile of money we control and we are just sitting on it laughing maniacally? If we could make money and more effort appear we would have by now, all we can do is keep bidding for money from various bodies.


I am quite aware of how broke LHC@home is.

If the LHC has spent/allocated every penny and cannot afford to provide for its own needs then it\\\'s doomed to be shutdown soon by some unforseen failure/circumstance because they just won\\\'t be able to afford to fix it. If that\\\'s the case then there\\\'s not much point in continuing the magnet tuning work, is there?. No, Neasan, LHC has money. You know it and I know it. And we both know the only reason they haven\\\'t rubbed a little of it on this project\\\'s problems is because suckers here just keep donating CPU time, blissfully unaware of how much of it goes to waste. Preying on the ignorance of one\\\'s benefactors... is that how honorable men get jobs done?


Yes this what this project does is important but when they look at the list of things to do be done the see more important things, like actually fixing the machine after the failure in September.


The means to end the tragic waste of CPU time has been at hand for well over a year, long before the machine fired up and broke. So your argument is just spin, the shell game I referred to in an earlier post.

Your badgering doesn\\\'t \\\"shame the powers that be\\\" it just winds me up. We know you have misgivings about the IR minimum quorum and we have been talking to the scientists about it the new SixTrack they are working on should also to drop both of these numbers. In this case the squeaky wheel does not get the oil it merely makes the user consider walking


Keep the quorum of 3 if that much assurance is needed, nobody has a problem with that, at least I don\\\'t. Just reduce the initial replication to 3 or whatever the minimum quorum is.

Now back to your rant...

So we have the shaming strategy versus educating thousands of users to the point where they decide to spend their resources on efficient projects rather than LHC@home. Seems to me that if the former does not work then the latter is bound to because benefactors eventually tire of shameless beneficiaries and cut them off. That is the corner LHC has backed LHC@home into. Or maybe LHC@home put itself into the corner. Whatever. There is a graceful way out and LHC@home seems to be finally headed in that direction. Too bad it took the \\\"collapse\\\" of SixTrack under the \\\"burden\\\" of v.6 BOINC API to light the necessary fires under the buttocks of the powers that be but better late than never.

12) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20765)
Posted 13 Nov 2008 by (banished: ID 70524)
Post:
Well, $6trillion or $7billion, parent project or not, one would think they could ante up either money or manpower to set things right around here. Tuning the magnets is critical, not like it\'s to do with choosing the color of the paint on the bathroom walls. If they\'re going to leave it up to volunteers to provide the CPU time then the very least they can do and it would take much from their budget is to make sure the app runs well enough to not require IR > minQ or else adopt one of the newer strategies for getting the work back fast. If they can\'t make that minimal contribution then they should just be made to wait for whatever they get out of IR = minQ. And if they don\'t like that then let them buy the CPU time they pee down the drain. I have a hunch things would get fixed real quick if they had to do that.

I might not know how science funding works but I do know a welfare bum when I see one and it\'s obvious to me that the LHC is a welfare bum that bites the many generous BOINC hands that feed it. Worse than that, they \"steal\" CPU time away from other projects that are fighting just as hard for funding and are just as broke.

I detached my hosts from LHC@home some time ago when it became obvious there is little motivation to treat crunchers and the BOINC community with the respect they deserve. The ONLY reason I am here is to lobby and if necessary shame the powers that be into ending the needless waste going on here. I do that for the sake of all the other BOINC projects that need the spare CPU cycles LHC@home callously pees down the drain. Don\'t expect me to go away though I will turn the volume down a notch or 3 now that someone is looking into doing something about the problem.

13) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20748)
Posted 7 Nov 2008 by (banished: ID 70524)
Post:

Nice try, Tomas, but you are wrong :)

I do run other projects. None of my computers are attached to LHC because more than 30% of the work done for LHC is wasted effort. It is wasted because they have set IR > minQ. Using the IR > minQ to get results verified sooner was justifiable years ago when BOINC server did not have as many features as it does now. Modern versions of BOINC server has features that permit efficent strategies for getting results verified quickly. Unfortunately, LHC steadfastly refuses to implement those strategies and use the CPU time donated to them efficiently, the way professionally run projects attempt to do.

The reason I complain about LHC\'s wasteful practices is because they steal CPU time away from other worthy projects.



Do you mean like what is suggested by the WCG in this document?
http://boinc.berkeley.edu/trac/attachment/wiki/WorkShop08/ServerManagement-BOINC2008.pdf?format=raw

Same doc as a powerpoint:
http://boinc.berkeley.edu/trac/attachment/wiki/WorkShop08/ServerManagement-BOINC2008.ppt?format=raw




That document mentions a number of new BOINC server features/options and offers several excellent suggestions. I read it several weeks ago but not this morning. Since the document\'s contents are not fresh in my mind, I won\'t say each and every suggestion and option in the document should be implemented here at LHC@home but I will say some combination of those new options and suggestions should be tested, evaluated, tweaked and implemented soon, very soon, so that the long outdated and wasteful IR > minQ strategy can be abandoned. That strategy hurts not only this project but other projects as well.


14) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20745)
Posted 7 Nov 2008 by (banished: ID 70524)
Post:
Why should I work for them for free when their parent project has a $6 trillion budget and more money on the way? If they want me they can pay me a respectable wage.



So that\\\'s the reason why you are complain time after time after time after time on the same issue instead of just choosing to run an other project. You are one of those consults who is looking for a job. :)


Nice try, Tomas, but you are wrong :)

I do run other projects. None of my computers are attached to LHC because more than 30% of the work done for LHC is wasted effort. It is wasted because they have set IR > minQ. Using the IR > minQ to get results verified sooner was justifiable years ago when BOINC server did not have as many features as it does now. Modern versions of BOINC server has features that permit efficent strategies for getting results verified quickly. Unfortunately, LHC steadfastly refuses to implement those strategies and use the CPU time donated to them efficiently, the way professionally run projects attempt to do.

The reason I complain about LHC\'s wasteful practices is because they steal CPU time away from other worthy projects.

15) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20743)
Posted 6 Nov 2008 by (banished: ID 70524)
Post:
I think they don\'t have much time to run the project.


You are correct, the existing staff don\'t have much time. However, LHC@home\'s parent project has a $6 trillion budget and they will be receiving more money to design/build upgrades to the collider. The proper professional solution to their problems is for them to hire more staff and get the job done properly instead of expecting us to pay for the needless waste they produce. The solution they have been using for the past 2 years or more is the welfare bum\'s solution... do next to nothing and let everybody else take care of you and cover for you.

Maybe you could volunteer your services to help setup the system for free?


Why should I work for them for free when their parent project has a $6 trillion budget and more money on the way? If they want me they can pay me a respectable wage.

16) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20741)
Posted 5 Nov 2008 by (banished: ID 70524)
Post:
Stephen,

It always amazes me when I see people whining about not getting enough work from this project when more than 30% of the CPU cycles donated to this project are spent on needlessly duplicated work, a deplorable situation you can blame on this project\\\'s IR > minQ strategy for getting results verified quickly. Not one word from you on that deplorable waste of your money, just a whine for more abuse.

It\\\'s not surprising nothing important ever changes at this project. The majority of crunchers here are, like you, are either totally oblivious to the way this project abuses them or else they just don\\\'t care or else they love being abused. With such a blind, uncaring band of crunchers they can get away with just about anything they want.

If thousands of you sheep would just detach your hosts, they would be forced to fix their crap project. As long as you keep begging for more abuse they\\\'ll just rest on their long worn out laurels and do absolutely nothing about it.

They could spread the work around a little thinner and to more hosts simply by turning on a server option that limits each host to having no more than 2 or 3 tasks per core in the cache at any time. That alone would speed up the return and validation process. They could use homogenous redundancy which would reduce mismatches and speed up the validation process. Then they could reduce the IR to minQ which would shrink their database and reduce the load on the server and bandwidth. But why should they even think about lifting their little finger to do anything when you ignore all that crap?

In a round about way, Stephen, your apathetic attitude and/or ignorance and/or love for abuse has bought you exactly what you deserve... frustration and abuse. The question is, are you going to beg for more abuse or are you going to do the only thing that will force them to stop? Hmmmm?


17) Message boards : Number crunching : Why BOINC 6.X has issues with LHC@home and other things (Message 20725)
Posted 30 Oct 2008 by (banished: ID 70524)
Post:
Thanks for sticking with us folks especially Dagorath who I have only contemplated killing on one or two occasions ;-)


Are you saying my truth telling and dragging issues out from under the carpet they\'ve been swept under has finally embarassed the muppet masters into doing something about the huge, deplorable and needless waste caused by the IR > MinQ anachronism used at this project?

Or is this just another shell game wherein sleight of hand and obfuscation portrays the recent screensaver related compute errors as the villain and allows the IR> MinQ devil to live?

18) Message boards : Number crunching : Getting Work Units? (Message 20712)
Posted 30 Oct 2008 by (banished: ID 70524)
Post:
According to the server status, there is about 17000 new work units being processed. I was not able to get 1 work unit. Is there any trick to get work units when they are available?


Of course there are tricks. To understand the tricks and use them you first have to understand the problem. The problem has 2 parts:

1) the work units are usually gone before your host contacts the LHC server again.

2) your computer might have all the work it can handle from other projects when work is available from LHC so it won\'t download from LHC even though LHC has work available when your computer contacts the server

One trick is to sit at your computer and click the update button every few minutes. That forces your computer to contact the LHC server. Eventually you will click it at just the right moment when your computer wants more work and LHC has some available. Problem is your finger will get sore and your boss will wonder why you\'re not at work. Impractical and probably problematic too.

A better trick is to use the boinccmd tool to automate the clicking of the update button for you. The --project URL operation command detailed in the Control Operations section is the way. The command would be:

boinccmd --project http://lhcathome.cern.ch/lhcathome/ update

Put that line in a script and run it as a cron job if using Linux or do the Windows equivalent (batch (.bat) file run periodically by Windows Event Scheduler or whatever it\'s called).

Mind you, that doesn\'t guarantee your computer will actually be wanting work when it contacts the LHC server, as mentioned in 2) above. To be absolutely sure it wants work you also need to manipulate your cache and/or \"connect every\" settings. That can be automated too with a little batch/script magic. It works very well, I know because I\'ve used it myself. I haven\'t been using it lately because this project is the most screwd up project in the BOINC world and they waste about 33% of the CPU time donated to them. Never mind their lame \"we don\'t have any money\" excuses... this project\'s parent project has a $6 trillion budget with more funding on the way for collider upgrades. No money? Bullshit!

Not trolling here, just explaining the facts and keeping the karma flowing to where it should be :)

19) Message boards : Number crunching : Please note: this project rarely has work (Message 20704)
Posted 28 Oct 2008 by (banished: ID 70524)
Post:
changed my mind
20) Message boards : Number crunching : Actual LHC data to crunch? (Message 20677)
Posted 19 Oct 2008 by (banished: ID 70524)
Post:
I don\\\'t think it is a technical problem but a mentality problem.


Wrong. It\\\'s a technical problem. We simply don\\\'t have the bandwidth. They have already explained that. When is it going to sink in?


My ADSL has 7 Mbit/s, but big cities in Italy reach 20 Mbit/s.


Apparently the IT people at CERN think that\\\'s not enough. Do you know something they don\\\'t know?

Fastweb is offering fiber to the home in selected cities. Once fiber is installed there is no limit to bandwidth.
Tullio


Wonderful! When enough BOINCer get fibre to their homes and a GPU in their box CERN might change their policy. The reality of today\\\'s world is that CERN thinks our resources are inadquate and they are probably right. So why not crunch some other worthy projects along with the LHC tasks you get and be happy about it.

21) Message boards : Number crunching : Actual LHC data to crunch? (Message 20675)
Posted 19 Oct 2008 by (banished: ID 70524)
Post:
I don\\\'t think it is a technical problem but a mentality problem.


Wrong. It\'s a technical problem. We simply don\'t have the bandwidth. They have already explained that. When is it going to sink in?

22) Message boards : LHC@home Science : Nvidia CUDA support for Boinc in future? (Message 20635)
Posted 6 Oct 2008 by (banished: ID 70524)
Post:
Is there any possibility for the Boinc/LHC@home community to write CUDA for number crunching support for LHC and/or other projects? It just seems like a treasure trove of threaded processing power waiting under the hood of some gaming boxes.

-Stu


Check the action on PS3GRID. There you can download a test version of BOINC with CUDA support and get work units that run on nVidia GPU.

The reasons why this project\'s current app will likely never run on GPU has been discussed in at least 1 other thread already. Look around, you\'ll find it.

23) Message boards : LHC@home Science : no work units (Message 20602)
Posted 3 Oct 2008 by (banished: ID 70524)
Post:
anyone getting the problem of no work downloaded or is there somthing wrong with my connection lol


This project rarely has work. When it does have work the chances of getting some are slim.

There are plenty of other very worthwhile projects you can contribute your spare CPU cycles to while you wait for this project to send work. It appears you are new to BOINC so may I suggest attaching to an easy project, in addition to this one, that has a steady supply of work and very few decisions for you to make. Run that project for a few weeks until you see how it all works then maybe try a few other projects.

Some easy projects with steady work that are on the list in the Attach Project Wizard:

  • ABC@home
  • Einstein@home
  • Spinhenge@home
  • Quantum Monte Carlo at Home
  • Malariacontrol.net
  • Rosetta@home


Other projects that do not have steady work


  • SIMAP
  • LHC@home
  • Proteins@Home



P.S. For any project that rarely has work, you should set the Resource Share to a small percentage or else you'll eventually have problems getting work from any project due to a problem with the work scheduler.

24) Message boards : LHC@home Science : LHC @ Home on Playstation 3 (Message 20598)
Posted 2 Oct 2008 by (banished: ID 70524)
Post:
Another problem with the graphics cards/ps3 approach is that the cards are extremely fast single precision machines. Sixtrack however requires double precision. So the speed up is not as big.


The older graphics cards are single precision. I have heard the latest models from Nvidia are double precision but I have not verified that.

Another problem is with rounding errors. Sixtrack is very succeptible to incorrect rounding of floating point numbers. It took ages to work out the differences in the intel and amd architectures. It would be a lot of work to be certain that the GPU chips calculate correctly.


Maybe there is no need to do that work? The new server code LHC@home is using allows for homogenous redundancy which means it is possible to send all result replications for a given work unit to hosts that meet platform criteria. You can, for example, send a given work unit to just Linux hosts and/or hosts with AMD but not Intel. I\'m not sure if it\'s possible to select only hosts with a particular graphic card. If that\'s not possible with current code then it probably wouldn\'t take much effort to make it so.

Thirdly SixTrack as far as i remember relies on a lot of complex floating point math (sin/cos/exp/log) another area where the speed up of the graphics cards are not as impressive as when you can do simple vector operations.


I have heard the latest Nvidia cards do transcendentals too. Again, I have not confirmed it but the source is usually pretty reliable.


Cheers,
Chrulle
Ex-LHCatHome developer


Ex-developer, huh? Well, I don\'t know much weight you carry now that you\'re \"Ex\" but if you could possibly light a few fires under a few frozen-in-place butts (we don\'t know their names but it\'s obvious they exist) and get someone to rub a little money on this project, it would do this project and the crunchers who make it happen a world of good. We just don\'t buy the \"there\'s no money\" excuse anymore, not when this project\'s parent project has a $6 billion construction budget and more $ earmarked for upgrades. Think about it... a +$6 billion project getting it\'s magnet operating parameters computed on a 2 penny DC project.

25) Message boards : Number crunching : Segmentation violation (Message 20594)
Posted 2 Oct 2008 by (banished: ID 70524)
Post:
I restarted lhc@home and shall report. I am still thinking that the problem arises when the server deletes a redundant result.
Tullio


Could be. If it is then it would likely happen less than a second after the client contacts the server. Can you correlate the times of result cancels with the times of the SEGVs? Too bad the messages in your first post don\'t have times for the lines below the \"Scheduler request\". We can\'t say from that how much time elapsed between the request and the SEGV. It might have been less than a second, it might have been several minutes, we can\'t be sure. Can you think of any other way to correlate the times?

Here\'s another idea....

I had a SEGV in a Linux app (not BOINC) today. The app died or got killed or whatever and then came a popup showing the error message. The popup had the title K Stack Trace or something similar. The error was in 1 tab, the other tab was Stack Trace or something similar. So I clicked the stack trace tab, it thought for a few secs then said like \"cannot trace because gdb not present\".

Gdb, I think, is the gcc compiler debugger. I think if gdb had been installed on that machine, I might have got some solid clues as to where the error occured. The app also needs to be compiled with \"debug info\" or symbol tables.

If you go the index of all boinc versions and find a version for Linux that ends with \"_debug.sh\", it will have the symbol tables. Install it, run it and attach to LHC. Install gdb and the K Stack Tracer (or whatever it\'s actually called) too. You might get some very helpful clues to put in a Trac bug report.

Sorry, I can\'t tell you exactly how gdb and stack traces work on Linux, I\'ve never done it before. I\'ll install gdb and a debug version of BOINC here too. If we compare notes and get some hints/tips from people who know how, we can get to the bottom of it, I\'m sure.
26) Message boards : Number crunching : Segmentation violation (Message 20591)
Posted 1 Oct 2008 by (banished: ID 70524)
Post:

No, it is not sending that message, only this:
Scheduler request succeeded, got 0 new tasks


Hmmm. Well, I attached a host running Fedora and BOINC 5.10.45 and turned on <work_fetch_debug> in cc_config.xml. Right now it is saying only
Wed 01 Oct 2008 02:03:15 PM MDT|orbit@home|[work_fetch_debug] work fetch: project not contactable; skipping
Wed 01 Oct 2008 02:03:15 PM MDT|lhcathome|[work_fetch_debug] work fetch: project not contactable; skipping

Maybe some clues will turn up.

27) Message boards : Number crunching : Segmentation violation (Message 20587)
Posted 1 Oct 2008 by (banished: ID 70524)
Post:
It seems this happens also on other projects when the client asks for works and receives none. See this on the BOINC message boards:
BOINC client exits
Tullio


From the trials Jean-David reports in that thread it really looks like Hydrogen@home was causing the SEGV. It seems he got the SEGV only (mostly?) when he requested work from Hydrogen but didn\'t get any.

Now, one thing that is peculiar about Hydrogen is that when that server has no work, the client log shows...
 
[Hydrogen@Home] Message from server: No work sent


...in addition to the standard
24-Aug-2008 06:52:57 Scheduler request succeeded: got 0 new tasks


Is the \"Message from server: no work sent\" logged only when one has debug/logging options set in cc_config.xml?

I don\'t get that message from any of the projects I am attached to when they have no work. I am wondering if that message somehow triggers the SEGV and does LHC send that message too?

28) Message boards : LHC@home Science : article about what we\'re doing? (Message 20571)
Posted 1 Oct 2008 by (banished: ID 70524)
Post:
Nope. It\'s not about LHC@home. It\'s mainly about the Grid which connects the computers that will crunch the data the collider collects. We are crunching data to tune the magnets that keep the beams in the center of the collider\'s tube. The article doesn\'t mention that so it\'s not at all about us.

29) Message boards : Number crunching : Segmentation violation (Message 20556)
Posted 27 Sep 2008 by (banished: ID 70524)
Post:
The BOINC boinc is not static executable


[dagorath@Henry64 BOINC]$ file boinc
boinc: ELF 64-bit LSB executable, x86-64, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.8, stripped


I could have sworn they are dynamically linked. In fact I even wrote and article in the wiki stating they are statically linked and had that verified by someone who should know.

Well, I am sure they were static at one time. Guess they changed that policy.

I\'ve searched the OpenSuse site for a BOINC installer but could not find it. Cheers.
Tullio


If there is a BOINC installer for SuSe (it might never have made it out of beta test for all I know), it may be in some other repository rather than the OpenSuSe site. SuSe\'s software installer/updater (is it apt?) may be able to locate it for you though you may have to point apt to some non-official repository.

30) Message boards : Number crunching : Segmentation violation (Message 20554)
Posted 27 Sep 2008 by (banished: ID 70524)
Post:
I installed from BOINC but I am running also SETI, Einstein, QMC, CPDN, CPDN Beta and I never had any problem. On SETI I am running also an optimized application and Astropulse and never had a compute error. Only LHC@home gives me some problems.
Tullio


About a year ago I was getting SIGSEGV on ABC@home tasks when I was running Berkeley\'s version of the client on SuSe. After receiving the same advice I\'m giving you, I decided to try a client compiled on a SuSe system from the guys who were building and testing the BOINC installer for SuSe. That fixed my problem. I no longer run SuSe but I think the BOINC installer for SuSe has been out of the testing phase for some time.

You having no errors from your other projects proves only 1 thing... you get no errors from them. It proves nothing about this project\'s SixTrack app. Their apps may not use the libraries the way SixTrack does. The fact that you run the optimized SETI app and Astropulse is irrelevant.

31) Message boards : Number crunching : Segmentation violation (Message 20552)
Posted 27 Sep 2008 by (banished: ID 70524)
Post:
Would this be happening when the server cancels redundant work units? (Just speculating out loud)

Yes. it happens after the client tries to connect to the server. I\'ve suspended LHC@home until I get an answer. I have 5 other projects running with tight deadlines and I cannot afford a boinc client stopping when I am not present.
Tullio


How long after contact with the server? This part of the log you quoted...
26-Sep-2008 21:06:02 [lhcathome] Scheduler request succeeded: got 0 new tasks
SIGSEGV: segmentation violation

... suggests it happens after contact with the server but since there is no time date-time on the SIGSEGV line it\'s difficult to say how much time elapsed between the scheduler request and the SIGSEGV. You might know if you happened to be at the computer watching. My point is that if there is a minute or 2 between the scheduler contact and the SIGSEGV then it\'s hard to put the blame on the scheduler request.

I am no expert on interpreting stack traces but these 2 lines...
/lib/libc.so.6(__libc_start_main+0xe0)[0xb7d49fe0]
./boinc(__gxx_personality_v0+0x165)[0x804bc21]

... suggest the SEGV happened in 1 of those modules? Now I\'m wondering if you installed BOINC from SuSe repositories or from the Berkeley installer (the installer aavailable from the BOINC site). The binaries in the Berkeley installer are compiled on Fedora, I believe, and are definitely static linked which means they may not run well on other Linux distros. Perhaps the libc.so.6 and/or gxx_personality are the problem?

If you installed BOINC from SuSe repos then the BOINC binaries are dynamic linked and more likely to run well on SuSe.

32) Message boards : Number crunching : work , but not for my host (Message 20547)
Posted 26 Sep 2008 by (banished: ID 70524)
Post:
Now, from your latest post I\'m getting a different picture. I could be all wrong but it sounds like maybe you are trying to make a political statement by not upgrading.


No, I just mentioned that because I do think that they have their priorities out of order and that they knew that there were older clients downloading tasks, yet either they did not test that scenario or they did and considered 0-credit as \"A-OK\" so long as it returned a valid result. IOW, they speak with forked tongue in regards to the value of credit parity.


Ah, ok, now I understand and I think you have a point.

Anyway, my primary reasons for not going for any newer versions are, in order:

  • The installer blowing up and leaving BOINC in a non-functioning state that took multiple reinstalls of 5.8.16 to fully correct.
  • No real need for any of the newer features.
  • The bugs of the newer versions (apppears to be sloppy testing and/or poor judgement on \"go / no go\" decisions to me).



Well, once bitten twice shy but I doubt you\'ll experience that again. Must have been some wierd glitch when you tried 5.10.28. Or else there is something very different about your system. There may be other reasons but I can\'t hink of any at the moment.

Only when this project started doing the cancels did I have a \\\"need\\\", but since I do not run this project very often, and considering that when I do I run it exclusively until I\'ve cleared out the work, then I can manually abort tasks just as easily.


I\'m confused. What am I missing here? To abort redundant tasks manually one must first go to the LHC website and see which of his unstarted tasks have reached quorum then abort them in the manager. I don\'t see how that is just as easy as letting the software cancel the tasks for you. Furthermore, if you run a version that will comply with server requests to check in, for example every hour, then you can have redundant tasks canceled even while you are away at work or on vacation or whatever, assuming the server is configured to ask hosts to check in every hour.

33) Message boards : Number crunching : work , but not for my host (Message 20544)
Posted 26 Sep 2008 by (banished: ID 70524)
Post:
I tell you what. When David Anderson and/or Eric Korpela make 5.10.45 the minimum version that will get work from SETI instead of allowing 3.x and 4.x clients to continue to download work, then I\'ll consider it. The \"fear\" of not forcing a newer version is that people were behind NTLM proxies and they would be \"cut off\" and that those people MIGHT be sponsors and that they just absolutely could not \"cut off\" a sponsor, nor could they even ask any of the sponsors if they were having that problem or if they knew people who did have problems, nor could they contact a sponsor and ask them to update. Meanwhile, knowing all this, they did not do testing of the newest science applications there under those versions of BOINC.

That\'s what I call \"shameful\"...


I have heard a little about that situation but I do not fully understand it so I refrain from commenting on their actions and motives.

You should run whatever version of BOINC you think suits your goals best. I am not trying to convince you one way or the other. I was just trying to get you over the fear of trying a newer version. Now, from your latest post I\'m getting a different picture. I could be all wrong but it sounds like maybe you are trying to make a political statement by not upgrading. Do your politics whatever way you deem best. I am just glad you are crunching, no matter which version you run :)

34) Message boards : Number crunching : Something is wrong managed!!! (Message 20540)
Posted 26 Sep 2008 by (banished: ID 70524)
Post:
So you mean if I start to crunch the WU before connect with the server, the WU will be completed without cancelling even when during the crunching, BOINC communicates with the server?


Yes.


35) Message boards : Number crunching : work , but not for my host (Message 20538)
Posted 26 Sep 2008 by (banished: ID 70524)
Post:

I think LHC@home does not use HR. I have primarily Linux hosts. I have not received LHC work for quite some time but if I recall correctly they were almost always grouped with Windows hosts so heterogenous redundancy seems to be in effect here.


That is the case. My line of thinking this moring was related to being involved in a Homogeneous Redunancy project that currently has a very severe problem with HR (Cosmology). I was in a bit of a rush, and have something else on my mind too (death of someone I grew up with), so I was a bit...terse (and perhaps tense) this morning...


No problem here with terse if terse means short and to the point. That\'s the way I like it. Now I see you were a bit distracted and in a rush, possibly a full bladder too. No problem.

Anyway, I noticed after posting that the status box listed the project as \"down\", but with work available. I didn\'t have the time to go back to clarify...


Actually the scheduler was down but everything else was running. There were several thousands results available but if the scheduler was down then the results won\'t get sent when hosts request work.

As for the newer versions of BOINC, I\'m still not convinced to try even 5.10.45, let alone 6.x.x. Many things are for Vista, which I don\'t have and don\'t plan to have anytime soon. As for the non-Vista stuff, most of that seems to be geared towards making the messages more palitable for \"non-technical\" users that might stumble across them. I think it also has some fix for NTLM proxies, but since I don\'t use one, that\'s not an issue I need fixed.


I recall you saying you tried to install 5.1.x but it did not go well, crashed or something. I vaguely recall one release in the early 5.x series which was quickly withdrawn because it caused a lot of grief. Perhaps you were unlucky enough to try that one.

I\'ve installed 5.10.45 on XP on several of my hosts and I can assure you it\'s very stable, both the install and the day-to-day running. I don\'t recall exactly which version first acquired the cancel feature but if your current version does not have it then 5.10.45 is well worth it just for that feature alone.

36) Message boards : Number crunching : Something is wrong managed!!! (Message 20537)
Posted 26 Sep 2008 by (banished: ID 70524)
Post:

I just think its unfair for the people who finish the WU before the deadline but only a bit later than other people. Not all the computers are running and connecting to Internet and crunching LHC@Home 24 hours.


It wasnt very long ago that I joined this project, and I must admit that initially I felt the same way you do. It just seemed totally unfair that after waiting SO long for WUs to crunch, that when I finally get some, it seems that most get aborted before my clients barely get started on them.

I dont feel that way any longer. Why, you might ask? Well, not long ago, while I was in the process of composing a post for a new thread(not unlike the first in this thread), I had an epiphany. I suddenly realized that, it really shouldnt matter to me how may WUs I crunch and get credit for. The important thing is that the work just gets done period. It doesnt matter if my total credit is 1 or 1 million. For me, its not a contest, because the only "winner" should be the LHC and the scientists who have devoted their lives to the science.

By participating, I am offering my available resources to the project. If they are used, great! If they are not, thats OK too. Either way, I am still participating and feel like I am a part of something very important thats much bigger than me.

Leevis


Hi Leevis

I understand what you mean, as long as you can contribute something to the project, you are satisfied.

But please don't forget crunching needs electricity, and in Australia, the power stations burn coal(85%). LHC@home aborted the results, this means some CO2 was generated for nothing.

It's not sustainable.

Cheers


They are canceling only redundant results that have not started crunching. If they haven't started then next to 0 CPU time and electricity has been spent on them. It's the tasks that are redundant but don't get canceled because the host has already started them... those are the results that are wasting CPU time and electricity. If hosts would contact the server more often then more redundant tasks would get canceled. The project managers can direct hosts to contact the server more frequently but they are not doing so.

There are BOINC options you can set on your end (the host end) which will tend to cause your computer to contact the server more often. More frequent contact increases the odds of your computer receiving a cancel order before it starts crunching a redundant task rather than after.
37) Message boards : Number crunching : work , but not for my host (Message 20531)
Posted 25 Sep 2008 by (banished: ID 70524)
Post:
As for the sched_op_debug thing, I thought that it would still tell you the HR-related messages, but since I do not use the newer versions of BOINC, I could very well be mistaken.


I use only 5.10.45 and higher, mostly 6.2.x. Those versions might give HR (homogenous redundancy) related messages without setting options in cc_config.xml. Unfortunately, at this time I am not attached to any projects using HR so I cannot say for sure.

I think LHC@home does not use HR. I have primarily Linux hosts. I have not received LHC work for quite some time but if I recall correctly they were almost always grouped with Windows hosts so heterogenous redundancy seems to be in effect here.

38) Message boards : Number crunching : work , but not for my host (Message 20526)
Posted 25 Sep 2008 by (banished: ID 70524)
Post:
To clarify, if there was work available for other hosts, you will be told that there was work available for other platforms in the response message from the scheduler.


Close, except littleBouncer\'s hosts are Intel and AMD processors running on WinXP. A non-supported platform is not the issue here. The thread title is a tad misleading for the uninitiated.

There are tasks \"in progress\" right now, but no tasks needing to be sent out. This project is at least fairly reliable when it comes to sending correct messages to BOINC hosts when there is no work, unlike the major problems being encountered over at Cosmology.


Older versions of BOINC display those messages but newer versions do not display them unless you turn on the <sched_op_debug> option. With that option turned on, newer clients will display an explanation for why they get no work.

To turn on the <sched_op_debug> option, create a text file named cc_config.xml in your BOINC data directory (if it does not exist already). The data directory is mentioned in the message tab of BOINC manager. With a text editor, enter the following into that file:
<cc_config>
<log_flags>
<sched_op_debug>1</sched_op_debug>
</log_flags>
</cc_config>

Note there is a digit \'one\', not letter \'el\' between <sched_op_debug> and </sched_op_debug>.

More information on cc_config.xml is available in the official Boinc wiki in Client configuration.

39) Message boards : Number crunching : work , but not for my host (Message 20518)
Posted 25 Sep 2008 by (banished: ID 70524)
Post:
What a childish staement, my little Bouncer !


Well, if that had been the first time the server had been down and if that were the only, ummm... let\'s just say deficiency... to be polite...

and if this project were only 4 months old rather than 4 years old...

and if this project\\\'s parent had a $6 budget instead of a $6 billion budget...

then yah, it would be a childish statement.

If it\'s the vulgarity in the vocabulary that bothers you, Mabuse, then you must surely detest the vulgarity in this project\'s attitude and actions.

No reflection on our esteemed admins, Neasan and Alex. They\'ve done so much for so long with so little they are now qualified to do absolutely anything with absolutely nothing.

40) Message boards : Number crunching : Actual LHC data to crunch? (Message 20513)
Posted 24 Sep 2008 by (banished: ID 70524)
Post:

I got several of those WUs recently. And since the machine that I've made available currently to LHC is a slower box, I wasn't one of the first to complete a WU. My machine checks in with LHC@HOME frequently, and so got the request from the server to cancel the processing of that WU.

So, even though some of the "extra" time wasn't used, the project was nice enough to let me know it didn't need it. That's better than letting me finish the whole WU and then not give me credit. It's a good balance.

Mark


It worked good for you on 1 occasion but in the grander scheme it\'s just slightly better than it used to be and a far cry from what it could be.
41) Message boards : LHC@home Science : LHC shut down for months after helium leak (Message 20511)
Posted 24 Sep 2008 by (banished: ID 70524)
Post:
That means that a 3 meter long steel rail cooled down to -250 Celsius would shrink 250x3/100 = 7.5 mm. How come that these magnets shrinks so much as 200 mm?


Well the magnets are not made of steel used for building rails. They are made of different materials like maybe Alnico & a combination of many other alloys - each having their own expansion coefficinets, so you see the difference.


I just did not realized that the difference between different metal was that big.

Which metal has the biggest temperature dependent change?


I cannot find a table that shows all the different metals but this table shows several.

Mind you those are the coefficients at 20 Celsius. The coefficients are different above and below 20. They are probably not much different between 10 and 30 Celsius but they may be very different down around absolute zero. What I wonder is... do they increase or decrease with decreasing temperature?


P.S. Another table listed at the end of the Wikipedia article. Zinc an lead have coefficients about double that of steel, according to the table.
42) Message boards : LHC@home Science : LHC shut down for months after helium leak (Message 20505)
Posted 23 Sep 2008 by (banished: ID 70524)
Post:
The whole apparatus works under extreme conditions. I\\\\\\\'ve read that the 3 m long magnets get shortened by 20 cm when supercooled. Imagine the mechanical problems. I think that a failure was almost inevitable.


Yes. The pipe carrying the helium to/from the magnet has to accomodate the 20 cm shrinkage/expansion in the magnet. How? You can get a pipe run to expand/shrink that much by putting and S bend in it but that gets scary at ultra-low temperatures. I\\\'ve worked with some incredibly strong steel reinforced synthetic rubber hose but it gets brittle at ultra-low temps too. A sliding seal between a fixed section and a moving section? Seals become hard and brittle at low temps too.

That\\\'s just the pipe carrying the liquid helium. Then there\\\'s stress in the magnet itself due to expansion coefficient differences between the various materials in the magnet. There are other stress inducers too. Yes, failure was almost inevitable.


43) Questions and Answers : Preferences : Preferences - BOINC Manager (Message 20500)
Posted 23 Sep 2008 by (banished: ID 70524)
Post:
Why does BOINC Manager not remember how I want it to appear whenever I call it up?


The developers have been working on that problem for quite some time. They say it\'s not an easy problem to solve since the goal is to have the same code, or as much of the code as possible, run on a variety of operating systems. If you have any insight into how the problem can be solved, the developers would love to hear from you.

44) Questions and Answers : Preferences : why i cant get any work ? (Message 20498)
Posted 22 Sep 2008 by (banished: ID 70524)
Post:
i still cant get any work :(

there are Results ready to send 13,308


No, there were not 13,308 results ready to send when your computer requested work. There might have been 13,308 results 10 minutes before your computer requested work but during that 10 minutes other computers took all of them.

so what should i do ??


There is nothing you can do.



45) Message boards : Number crunching : Something is wrong managed!!! (Message 20495)
Posted 22 Sep 2008 by (banished: ID 70524)
Post:
THx for all replies.

Sure is better to speed-up the server-side; when 3 sucess results are returned, then the other work is not necessary.

But you can do this by sending out only 3 WU !!
(or 4 , for assurance, when one fails)


The problem with LHC tasks is that the first 3 Success returns frequently fail to match. Frequently, even a 4th Success result fails to match. And frequently 1 or more of the first 3 returns Compute Error. That happens so frequently that they find it is faster just to send 5 results at the beginning, in other words, to use Initial Replication = 5 (IR = 5).

More precisely, it was faster to use the IR = 5 strategy when they were using the old server code. In fact, that was the only speed up strategy possible with the old server code. The new server code they recently installed has new features and capabilities that make possible new strategys where maybe they can have IR = 3 and complete the batch of work just as quickly as with IR = 5. But that strategy also includes implementing some as yet undetermined combination of these new features:

1) limiting crunchers to having no more than X tasks per CPU in their cache, to spread the work out over more hosts
2) the server keeping a list of fast reliable hosts to which additional tasks will be sent if 1 or more of the first 3 tasks fails or the first 3 don\'t achieve quorum.
3) possibly reducing the deadline
4) making BOINC client version 5.8.16 a minimum requirement to participate in the project
5) other new features I have forgotten at the moment

Using those new features, other projects that were forced to use IR = 5 a few years ago have found they are now able to reduce IR to 3 and still finish the batch quickly but they likely had to rewrite some/all of their verifier/assimilator/feeder/transitioner scripts to accomplish all that. That takes time and money. We can only hope that LHC will find a way to incorporate some of those new features into their strategy, if they can. Canceling redundant tasks is a good start.


46) Message boards : Number crunching : Am I that unlucky? (Message 20487)
Posted 22 Sep 2008 by (banished: ID 70524)
Post:
The short version:
I got work on the 13th, completed on the 13th.
Since then, I get nada.


The reason for that is plastered all over these forums. Read and learn.

47) Message boards : Number crunching : Something is wrong managed!!! (Message 20486)
Posted 22 Sep 2008 by (banished: ID 70524)
Post:
But compared with how it used to be this will definitely speed up the process.


It used to be that hosts did not abort redundant tasks. Recently LHC implemented cancels to reduce wasted CPU time by aborting redundant tasks that have not been started. That strategy also speeds up the process by freeing the host from some redundant tasks in order that they can crunch tasks that are not redundant. littleBouncer\'s plan insures that his host will NOT abort redundant tasks. His plan defeats the recently implemented waste reduction and speed up speed up strategy. littleBouncer\'s plan is not progress, it is regression to the old, slow, wasteful way.


48) Message boards : Number crunching : Something is wrong managed!!! (Message 20478)
Posted 22 Sep 2008 by (banished: ID 70524)
Post:
Next time I will set the manager of \\\"no more work\\\" from LHC, when I reached daily-quota, and contact the server when all WUs from LHC is done (not earlier !!!), so I can crunch all work which was send to me...


But then you will crunch work that does not need to be crunched. Which is more important... to crunch everything they send you? or to crunch only what needs to be crunched?

P.S. You are correct, though. Something is wrong managed... VERY wrong managed.
49) Message boards : Cafe LHC : Group Protests Treatment of Hadrons at CERN (Message 20427)
Posted 18 Sep 2008 by (banished: ID 70524)
Post:
In my sleep I was having nightmares about crashing into something I could not see and dying. My analyst asked me if I had ever been in a car accident, aairplane crash or similar incident. I had not. We tried some deep hypnosis so see if more details concerning the cause of my nightmares could be determined, to free a memory deeply hidden in my sub-conscious self.

It turns out that in a previous life I was a hadron. Someone at some collider crashed me into another hadron. Damn those scientists! I never had to pay taxes when I was just a measly old hadron!!! Leave the hadrons alone, they aren\'t hurting anybody.

50) Message boards : Number crunching : lhc@home and simd (Message 20418)
Posted 18 Sep 2008 by (banished: ID 70524)
Post:
At most I think mmx.,but not sure.


I am not sure either but I suspect that since they have difficulty getting results to match due to differences in CPUs and operating systems, they likely keep it as simple as possible and avoid simd.


51) Message boards : LHC@home Science : how to download software? (Message 20416)
Posted 18 Sep 2008 by (banished: ID 70524)
Post:
i just signed up.. where do i download the software to LHC@home??


Welcome to LHC@home, djfred.

I see you don't have any computers attached to this project yet so I assume you have not downloaded the software yet.

The software you need is called BOINC. Visit the BOINC website. Near the top of the page there is a download link. If you click that link it will detect your operating system and give you the option to download the latest version of the BOINC software. Normally that would be the best version to get but right now the latest version does not work properly with LHC@home. For the time being you should get the previous version of BOINC which is version 5.10.45. Therefore you need to go to this page which lists the various operating systems BOINC runs on and the various versions of BOINC. Find your operating system and then version 5.10.45.

You will also want to read the BOINC user manual. The user manual is being moved from one format (Trac) to the wikipedia format. On the BOINC website\'s homepage there are 2 links to the user manual, both are named Documentation. You want the Documentation link near the top of the page under the Volunteer heading. Lots of good reading there, do take a look at the user manual as it answers lots of questions. Be aware that the BOINC software has 2 separate parts: the manager and the client. The manager is the part you will interact with and it has 2 different views: Simple View and Advanced View. You should read enough of the user manual to learn how to switch between the Simple View and Advanced View before you go too far. I'm going to give you a few hint to get you started. Those hints pertain to the Simple View.

As Michael already suggested, this project does not always have work for us to do. In fact, as I write this post, there is no work and there may not be for several more days or weeks. In the meantime there are about 50 other very worthwhile projects that you can help out using the BOINC software. So, after you get the BOINC software downloaded and installed it will automatically start up for you and tell you that you are not attached to a BOINC project. It will also show you a list of projects that you can attach to. Scroll through that list and click on LHC@home. It will ask you for an email address and a password. You can use the same email address, password and username you used to create the account you created for posting here or you can give different info to create a different account. I suggest using the account you've already created. Then click OK or Finish a couple times and you'll be attached to LHC@home.

Since this project does not have any work for us to do at this time, BOINC will just kind of sit there doing nothing. If you want to attach to a second project at that time then in the Simple View just click Add Project and you'll get the Attach to Project wizard again. It is important to use the same email address and username for each project you attach to. You can attach as many projects as you like but I would suggest no more than 2 or 3 until you get the hang of things. You can detach from any project at any time.

Help and advice for BOINC in general is available at Boinc dev forums though you can always get advice here too :)

___________________________
Team Canada
52) Questions and Answers : Windows : Stuck at 100%, still Running, high priority. (Message 20407)
Posted 17 Sep 2008 by (banished: ID 70524)
Post:
They haven\\\'t said why version 6 clients don\\\'t work but the most likely explanation is that the science application isn\\\'t compiled against the version 6 API.

Version 6 clients do work, it seems to be just some workunits on some peoples computers that have this problem. I haven\\\'t had any problems running LHC on 6.2.18, and as WildKard said above, 1 workunit completed and 1 failed on his PC.

Michael[/quote]

Thank you. I stand corrected :)

53) Questions and Answers : Preferences : why i cant get any work ? (Message 20406)
Posted 17 Sep 2008 by (banished: ID 70524)
Post:
hi i don\'t get any work from lhc@home but at einstein and seti @ home it works. why ?


Is your computer requesting work? If it is then the most likely reason you're not getting any is that they have no work to give you or anybody else when your computer requests work. The work here comes in small batches with a wait of days between batches.

Look around the forums here, There is a thread called This Project Rarely Has Work or something similar. Read it.

Watch the Server Status box on the home page. When it says Results Ready To Send 0 there is no work for anybody. The numbers in that box are updated about every 15 minutes which means the box can say work is available but it actually all got sent 5 minutes ago.
54) Message boards : Number crunching : No Credit for WU (Message 20404)
Posted 17 Sep 2008 by (banished: ID 70524)
Post:
I seem to have received no credit for the following WU:

http://lhcathome.cern.ch/lhcathome/workunit.php?wuid=2850021

It\'s not a lot of credit but could happen to a WU that runs for hours.


Nobody has received any credit because the project has not received the 3 matching results they need to verify they have the proper result.

Since 11 hosts have returned Compute Error, the maximum 10 error rule takes effect which means they\'ll mark it as a defective WU and nobody will get credit.

At this time, there are 2 Successes for that WU and 1 result still in progress on computer 9715557. If just 1 more Success had returned before it racked up the 10 errors and if that 3rd Success had matched the other 2 Success results then those 3 would have received credit. Unfortunately, with the 10 strikes against the WU, it\'s game over. I\'m not sure but I think nobody will get credit because even if computer 9715557 happens to return Success they won\'t bother trying to verify it against the other 2 Successes. Gotta draw the line somewhere.

55) Questions and Answers : Preferences : ????? (Message 20393)
Posted 16 Sep 2008 by (banished: ID 70524)
Post:
Your computer either had all the LHC@home tasks it could finish before the deadline or it asked for work at a time when there was no work available.

Maybe read this thread.

56) Questions and Answers : Windows : Stuck at 100%, still Running, high priority. (Message 20392)
Posted 16 Sep 2008 by (banished: ID 70524)
Post:
The only thing I could suggest would be to try upgrading to the latest 6.2.18 client. I don\\\\\\\\\\\\\\\'t think it\\\\\\\\\\\\\\\'s helped other people, but you never know.

Michael


Okay, I\\\\\\\'ve done this and am now running 6.2.18.


From the News on this project\'s home page

06.09.2008 12:30 BST -

Hi,
Further to yesterday\'s news item, we are recommending that you download/use the 5.10.X version of the BOINC client for your opertaing system. We hope to have this sorted as soon as possible but with LHC turn on in 4 days everyone is busy preparing for that.
Later days,
Neasan


They haven\'t said why version 6 clients don\'t work but the most likely explanation is that the science application isn\'t compiled against the version 6 API.

It hasn\'t helped. I gave up on expecting this result to finish properly and decided that maybe the best thing to do was reset the specific computation/task. Unfortunately I messed up on this and ended up causing the task to end with a Compute Error (is this better than a result that hangs indefinitely? I don\'t know.


The reset button resets a project. It does not reset specific tasks. The reset button aborts all of the selected project\'s tasks, downloads the project\'s app again and does some other cleanup and reinitialisation. Basically it puts the project back to the way it was when you first attached it.

If you want to abort just 1 task then you have to select the task and click the abort button. If you have just 1 task for the project then reset is pretty much the same as abort.

Note that reset does NOT start a task over from the beginning which is what I think you may have been trying to do.


57) Message boards : Number crunching : Actual LHC data to crunch? (Message 20382)
Posted 15 Sep 2008 by (banished: ID 70524)
Post:
Dagorath --

Is it possible that the fourth and fifth crunchers are not continuously attached to the Internet? Perhaps they connect only once a day, or maybe even once every other day, because they utilize a dial-up connection.


Well of course it\'s possible they\'re not attached continuously but what does that have to do with anything I said in this thread?

I grant you, that the method being used may not be the most efficient, but it does work (most of the time).


Well of course it works. I\'ve never doubted that for a moment. Some of us would like it to work much better so that it doesn\'t waste so much of our CPU time and money. You have a problem with that? Or do you prefer wasting your electricity and money?

Like you, I do not get all the WUs that I can handle


Wrong. When they have work I get all I can handle. You get the leftovers. I\'ll send you my \"get as many as you want\" script if you want it.

, and remember, I FREELY DONATE MY TIME TO THIS PROJECT (among others).


Really? Thanks for telling me that. Silly me, I thought they were paying you.

Last time I looked, there is nobody holding a revolver to my head, and forcing me to make time available on my machines. I am here BY MY OWN CHOICE. If I feel at any time, that my services are being mis-used, there\\\'s a button on the PROJECTS page of BOINC called DETACH ... I understand it works very well


Gasp! How very sassy. I\'ve never heard that one before. I bet you copied that from somewhere, saved it in a secret little file called \"sassy_replies_guaranteed_2_impress.txt\" and just pasted it into your post? Mmmmm. I bet you\'ve just been itching to spring that sassy little bit of \"originality\" on someone, huh? Tell us, did you add all that yelling yourself? If so then Bravo!!! Well done, OA!!! You should write up your technique and build a website dedicated to that. Yah, you could have a blog and everything :)

Here\'s another one for you to copy and paste later....

If you don\'t like my posts then plonk me. Nobody\'s holding a gun to your head forcing you to read them.

There. You got your cookie. Off you go now. Add a little yelling to it then spirit it away in that special little sassy response file you keep. If you\'re nice I\'ll send you a like totally kewl skull \'n crossbones icon you can use for that file. When your skater chicks see it they\'ll just know you\'ve got some edge.
58) Message boards : Cafe LHC : LHC cooling problems! (Message 20360)
Posted 14 Sep 2008 by (banished: ID 70524)
Post:
Many thanks for the replies.

I have been doing distributed computing for many years,
be it for Seti@home, the Cliamte prediction project,
or stardust@home, and only very recently I started
with LHC@home, that\\\'s why my knowledge about it is rather
limited, so thank you again for the info.

Physics was always my passion, and I am more than happy
to contribute to one of the greatest scientific experiments
of our time, whether it is a simulation or real data,
and even if it is only needed once every 6 months! ;-)


Glad to hear you\'re no stranger to BOINC and welcome to LHC@home. Yes, it is one of the greatest scientific experiments of our time, no doubt about that.

The magnet tuning work done so far has at least allowed them to switch the collider on. I am guessing they\'ve taken some readings and will be tuning the magnets further. It seems like they run a batch of work units through the simulation, examine the results and then create more work units from whatever they learn from the previous batch. In the past the wait between batches has been rather long sometimes. Now that the collider is functional the batches might come more frequently. We may never get to analyze actual experiment data because the data transfer bandwidth to crunch time ratio is not favorable but the magnet tuning work is vital to the experiment too.

59) Message boards : Cafe LHC : LHC cooling problems! (Message 20356)
Posted 14 Sep 2008 by (banished: ID 70524)
Post:
ayarka,

The work we have been doing here at LHC@home for the past few years and the work you have been getting since you joined is not data from collider experiments. We are running a simulation that helps the scientists tune the magnets that guide the beam through the tunnel. We got that work before the collider was even built and it is possible for them to generate that work even when the collider is shutdown for repairs/maintenance.

You really should do some reading in these forums to understand exactly what it is you\'re doing here. You may be pleasantly surprised or you may leave in disgust. But so far you seem to not have a clue what\'s going on here.

60) Message boards : Number crunching : Actual LHC data to crunch? (Message 20317)
Posted 13 Sep 2008 by (banished: ID 70524)
Post:
Yeah, it would be nice if the max amount of WUs we get queued up at any time was less than it is now. I get the feeling there\'s a lot of slow computers out there chugging away at more WUs than they should have received. Not that their help isn\'t appreciated, but it would be more efficient with shorter queues as the faster PCs would be able to pick them up after finishing their first sets.


The admins at this project don\'t see it that way. Instead they send out 5 results when they need only 3 to make the quorum. They think doing more work than is necessary will somehow get all the work done faster. <sigh>

Take this result for example. Computers 9690526 7016646 and 9631907 returned 3 Success results which made the quorum of 3 and allowed the Canonical Result to be declared. Computer 124273 therefore wasted 3.25 hours crunching a task that didn\'t need to be crunched. At this time computer 78890 hasn\'t returned his result so we don\'t know how many hours he will waste but whatever the number is it will be wasted effort that could have been spent on a task that does need crunching. Shameful.

61) Message boards : Number crunching : Bandwidth/Data Size Question... (Message 20256)
Posted 12 Sep 2008 by (banished: ID 70524)
Post:
Just wondering about the size of the data files download and uploaded by BOINC.. I don\\\\\\\'t have an unlimited internet connection..

The program lets you limit bandwidth(speed), but not a data limit.

Any one able to give a weekly or monthly estimate??



Most people get so few work units from this project (because they rarely have work) it isn\'t a concern.

If you want to see the size of the uploads then turn off network activity after you download a work unit to stall the upload. When the task shows status Uploading, switch to the Transfers tab where you\'ll see the file(s) and size(s). Then resume network activity to allow the upload to proceed. No guarantee all uploads will be the same size.

62) Message boards : LHC@home Science : New Tasks? Yes or No !! (Message 20254)
Posted 12 Sep 2008 by (banished: ID 70524)
Post:
Why is it, when the Server Status says:-
--‘Up, 4200 workunits to crunch’
And I hit the ‘Update’ button to get more tasks
The message reply says
--‘Scheduler request succeeded: got 0 new tasks’

This is really irritating

Mike


It lies. There were 0 workunits to crunch when you hit the update button. The number of work units to crunch gets updated every X minutes. There were 4200 a few minutes before you hit Update but they were taken before your computer requested work. If you had checked a few minutes later (after the number was updated) it would have said 0. So it's a crap shoot: sometimes you get lucky, most times you don't. Attach your computer to one of the many other worthy projects if you want to contribute to science because unless you have a script that checks for availability of LHC work every 5 minutes and forces an update when work is available you'll likely wait a loooooong time to get any.
63) Message boards : Number crunching : What does RAC mean? (Message 20177)
Posted 11 Sep 2008 by (banished: ID 70524)
Post:
What does RAC under my mane mean?

Thanks :-)


Runned And Cruncheded.

No wait. Recent Average Credit. See the wiki article on Computation Credit.

64) Message boards : Number crunching : I get no \"workload\" (Message 20091)
Posted 10 Sep 2008 by (banished: ID 70524)
Post:
The admins have estimated they need about £60k to do all the upgrades and enhancements they would like to, but they don\\\'t have the funding. (Given the cost of the LHC itself, that seems rather short-sighted of CERN, but that is the current situation.)


I\'m beginning to think we\'re not really calculating anything of any importance at this project. The magnet work is beyond important, it\'s absolutely critical. If not done properly there will be major damage. Allowing the magnet work to be done on a project funded and managed as badly as this project? Nope, I can\'t believe it any longer.

A more plausible explanation, in my mind, is that LHC@home is little more than an effort to help sell the collider (I heard they had a tough time getting the funding back then) by giving it the appearance of the people\'s project, thousands of common folk believing in and wanting the collider so bad they donate time on their own computers to help it along. I think the work units we\'re getting are bogus units that have nothing at all to do with the magnets or anything of any importance.

65) Message boards : Number crunching : BOINC 5.10.x?? (Message 20083)
Posted 10 Sep 2008 by (banished: ID 70524)
Post:
Maybe Mr. Anderson should try fixing his software before worrying about his other pet peeves?


Actually, Brain, Dr. Anderson\'s software isn\'t the problem here. The problem is that LHC didn\'t compile their app against the new version 6 API. Until LHC does that, their app won\'t run properly (if at all) under version 6 BOINC clients.

The v6 API was released several months ago. Guess they were too busy upgrading the server to bother with such a minor detail. Right, the server software that was obsolete a year ago. Not blaming Neasan and Alex as it sounds like they\'re doing all they possibly can.

66) Message boards : LHC@home Science : 64 Bit proccessing (Message 20077)
Posted 9 Sep 2008 by (banished: ID 70524)
Post:
This said, BOINC is 32-bit typically and in any case the client files are.


There is a 64 bit version of BOINC for Linux, Mac and Windows. There are several projects that have 64 bit applications. Some of those are much faster than the 32 bit version, see ABC@home for example.


67) Message boards : LHC@home Science : 64 Bit proccessing (Message 20059)
Posted 9 Sep 2008 by (banished: ID 70524)
Post:
Is there any advantage in running an O/S in 64 Bit config with the workload in coming months,I run Ubuntu/Linux at moment in 32 Bit mode.
jon


This project has been sending a 32 bit application to 64 bit Linux hosts. As for the workload in coming months, well, ya never know for sure but I suspect they won\'t bother with a 64 bit version. It likely wouldn\'t run much faster than the 32 bit version.



68) Message boards : Number crunching : The project giveth then taketh away (Message 19922)
Posted 13 Aug 2008 by (banished: ID 70524)
Post:
Only my two linux machines running 6.2.14 are getting redundant WUs.
My two linux and four windows boxes running older clients haven't got any.

That, the stderr message, and considering 6.2.14 is a pre release version, a "development version that may not function propely" as it warns at start, seem to point to a core client problem. But it could be just coincidence.


I follow the version changes rather closely. Changes between 5.xx.xx and 6.2.xx are mostly to the Windows installer for the purpose of making BOINC install properly on Windows. There have been only minor changes to the Linux client and none have been changes to anything that could affect cancels and redundancy. See the client does not initiate cancels anyway, the server does that.

I am switching those two machines to 6.2.15 today, when they crunch their last cached WUs. Let's see if that gets rid of the problem.


It's not a problem. It is an attempt by the admins to fix a problem.

Am I the only one experiencing problems with the posting text box?
Words disappear or mess up and backlashes get added to single and double quotes when I hit the preview button. It's pretty awkward.


No, you are not the only one. It is a bug in the forum code on the server. Avoid it by not using contractions in your post.


P.S. Ooops! Avoiding contractions doesn't prevent the backslashes being added to quote characters, sorry.
69) Message boards : Number crunching : The project giveth then taketh away (Message 19916)
Posted 13 Aug 2008 by (banished: ID 70524)
Post:
I got eight WUs labeled redundant and canceled.
This one
lived only 46 minutes, from creation to return time.

Stderr says: <core_client_version>6.2.14</core_client_version>, but I crunch for SETI and E@H too, without redundant WUs so far.

Any redundant WUs with the core client 6.2.15?


Not sure why you are asking that. If you are thinking canceling tasks (WUs) and/or redundant tasks is a bug in client 6.2.14, it is not. It is a feature (not a bug) since approximately version 5.10.x. Thus anyone running client 6.2.15 might have had redundant tasks that got canceled.

The reason you are not seeing canceled SETI and E@H tasks is because, I suspect, those 2 projects cancel very few tasks. I have not crunched much SETI or E@H lately but I think they have very few redundant tasks, if any, thus they have very infrequent need to cancel a task.

@ Ocean Archer: Thanks for the info :) I was thinking: new app -> tons of compute errors -> cancels
70) Message boards : Number crunching : The project giveth then taketh away (Message 19908)
Posted 12 Aug 2008 by (banished: ID 70524)
Post:
Like you both mention - the discussion of initial replication is one for another thread, but for a 'junior cruncher' like myself, it works ...


Yes, another thread some other time. The cancels could have been due to a batch of corrupt work units or some other reason, not necessarily redundant results.

I didn't get any tasks from the last batch. Did the tasks run on the SixTrack app or are they using a new app for the dynamic aperture work? I suspect a new app though I don't see anything new on the Applications page.

Anyway, glad to hear "new intense efforts are being devoted to massive numerical simulations" from Massimo Giovanozzi. Sounds like there may be a lot of work coming up.

71) Message boards : Number crunching : The project giveth then taketh away (Message 19899)
Posted 11 Aug 2008 by (banished: ID 70524)
Post:
This is a feature which came with the new server code recently installed. Might or might not have been consciously activated by the admins.

Michael

I hope it was activated on purpose. I have long stood behind the project's decision to issue extra tasks to increase turnaround. However I do not like my machines doing unneeded work. This feature allows the extra initial tasks without the unneeded ones.


But if you crunch a task to 90% and then it gets canceled as redundant then your computer has done unneeded work. And, if the cancels work the way they do on other projects I've seen, you don't get any credits for the work you did (which doesn't bother me but I have a hunch it might bother lots of other folks).

The better way to get rapid completion without all the wasted effort that setting Initial Replication greater than Quorum entails, is to make the deadlines shorter. If there are complaints that some people cannot handle shorter deadlines then tough, not every project is for every cruncher.

72) Message boards : LHC@home Science : Colder than space (Message 19880)
Posted 7 Aug 2008 by (banished: ID 70524)
Post:
Maybe they just don't want rogue states to have access to the data.



What are they going to do, build their own black hole and suck the whole world into it?


No. Just the bits between Canada and Mexico, where all the trouble makers and the presidunce lives. Then they'll turn it off before the people on the other side realize they've been slimed and push it all back through the hole to us.

Geez, Tom! If *I* knew what they could do with the data then there would be no reason to build a collider and get the data. Huh?


73) Message boards : LHC@home Science : LHC Images (Message 19872)
Posted 4 Aug 2008 by (banished: ID 70524)
Post:
Great pictures from the LHC

http://www.boston.com/bigpicture/2008/08/the_large_hadron_collider.html


Congratulations! You've won this week's "Post The Same Link A Day Later In A New Thread Award"

http://lhcathome.cern.ch/lhcathome/forum_thread.php?id=2817&nowrap=true#19860

;)

j/k

Al.


Maybe Pletsch plonked ya and didn't see that link.

;)

l/m

Bob.


74) Message boards : LHC@home Science : Colder than space (Message 19864)
Posted 4 Aug 2008 by (banished: ID 70524)
Post:


They've already said the average BOINCer doesn't have the bandwidth required to transfer the humongous data files that will be generated when the collider is running. Maybe more SixTrack work for us? Or some new app related to training the magnets? If not then why did they update the server software? So we can have a nicer forum to discuss why there's no work? How considerate of them.



The basic Seti work likewise generates large data file but they have managed to figure out how to "chunk" the data out so us lowly peons not running Crays can participate. This just sounds like an lame excuse.


Could be politics. Maybe they just don't want rogue states to have access to the data. I don't know what their official policy is on that matter. Not that official policy really means much to some people/organisations. They can say one thing for show and do the opposite. If rogue states complain about not having access to the data they can be ignored. (Not saying they should be ignored.)

75) Message boards : LHC@home Science : Colder than space (Message 19832)
Posted 28 Jul 2008 by (banished: ID 70524)
Post:
Looks like lots of work coming up for us here then :)

Al.


I wouldn't bank on it!


They've already said the average BOINCer doesn't have the bandwidth required to transfer the humongous data files that will be generated when the collider is running. Maybe more SixTrack work for us? Or some new app related to training the magnets? If not then why did they update the server software? So we can have a nicer forum to discuss why there's no work? How considerate of them.

76) Message boards : Number crunching : Stand back....... (Message 19643)
Posted 19 May 2008 by (banished: ID 70524)
Post:
'bout time I test something here.

EDIT: yup, as I expected, the security hole I used on previous posts in this thread to get into Neasan's account is still open.

Have a nice day. Keep your passwords safe! :)


Yeah, I suspect it was you who deleted his post in which he said something like "all you can do is laugh". Or maybe you posted it from his account and he deleted it. That's why I changed all my passwords at all my projects and quit posting and crunching here for a while. That and the fact that some 25% of the WUs crunched here are wasted effort but nobody seems to care. Biggest screw up of a project I've ever seen.

77) Message boards : Number crunching : Stand back....... (Message 18542)
Posted 4 Nov 2007 by (banished: ID 70524)
Post:
Maybe a little taste of their own medicine will get us, the people who donate the CPU time that makes this project work, the respect we deserve.



Respect is earned, and not always deserved.


If crunching LHC doesn't earn respect from LHC then do tell what we have to do to get respect. Stand on our heads and spit out gold ingots?

Where I come from people almost always get respect from a charity when they donate labor, money or whatever it is the charity needs. That respect usually includes some more or less detailed info on the charities activities, how the donations are utilized, some bean counting, etc.

What has LHC shared with us regarding how our donation is used? Seems about all we know is that we're helping to align the magnets. Wonderful! I'm sincerely glad to help but I think we deserve a few more details about IR when there seems to be no need for it to be as high as it is plus better server software.

This is not a poke at you, but a comment on this process. You can have the well intentioned desire to improve this project, but ultimately this does fall into the hands of those runing the project. I am sure that they would love to upgrade the server systems to the latest and greatest, but may have other internal politics to deal with that cannot be discussed in this open forum.


It could be internal politics, budget/manpower constraints, incompetence, or other reasons. Experience has taught me that it doesn't hurt to voice one's concerns, ask questions and pursue issues. Those who prefer to just shut up and crunch are free to exercise that option. Those whose lives have been ruined by reading my posts and other posts that ask questions and pursue issues please email me stating how your lives have been ruined and how much money it will take to remedy your sorry situation and I will promptly put a cheque in the mail, dress in sack cloth and sit on ashes. Sorry, I don't do PayPal.

And to quote a cliche "all good things take time". "Patience is a virtue".


One good cliche deserves another... "A fool and his money are soon parted." Same applies to CPU cycles.

78) Message boards : Number crunching : Stand back....... (Message 18517)
Posted 1 Nov 2007 by (banished: ID 70524)
Post:
No, Masud, I feel terrible. I ate all the unclaimed Hallowe'en candy. It went down just fine but now it wants to come back up. Sorry to hear about your mom.

79) Message boards : Number crunching : Stand back....... (Message 18501)
Posted 1 Nov 2007 by (banished: ID 70524)
Post:
You and others have expressed valid concerns, but I think that you should ease up a bit on the direct insults, but that's just my opinion.


Is an insult any less of an insult if it's indirect? You'll notice people here who speak in numerous cliches that are insults though somewhat indirect and therefore "clever". Somehow those traits make them acceptable, precedent seems to have something to do with it. I call them Netspeak, cliche, boring, mundane, insults none the less. Sorry you don't like my direct to the point style but it ain't gonna change because at least people know what the hell i'm talking about. You'll notice how often the Netspeakers' points are lost on readers. And besides they're cliche and all cliche smacks of mediocrity.

80) Message boards : Number crunching : Stand back....... (Message 18500)
Posted 1 Nov 2007 by (banished: ID 70524)
Post:
guess you dont have much of a memory either.... have a nice day! you need to re read that last post.....wheres all those GREAT THINGS? I wanna see you help for a change instead of trashing everyones effort on the message boards!!


I don't deny trashing some people's efforts but those are efforts that I think deserve to be trashed, same as you think my effort of late at this project deserves to be trashed. As far as helping LHC... what you expect me to do? Install the new server software for them? Praise their work and ignore the deficiencies? As I said earlier, criticism was given politely, they were invited politely to join discussions and provide information that would have fostered polite discussions amongst crunchers but they chose to do otherwise so now me and a few others choose to do otherwise too.

As far as great things I've done... do your own homework and stop expecting me to do it for you. (btw, I don't think I've done anything great but I have, on many ocassions and at many boards, been helpful)

ya right back at ya,... boy, u'r to smart for me!!!!!


Definition of a hypocrite: one who stops whining about insults just long enough to throw an insult. Now stop your yelling, we can all hear you just fine.

81) Message boards : Number crunching : Stand back....... (Message 18491)
Posted 1 Nov 2007 by (banished: ID 70524)
Post:
Now thats interesting.... you consider this project a charity.... personally I consider it an opportunity to help on a project that may help ALL mankind....


It's that too but it's also an expense to all of us who crunch.

u'r worried about respect but u show NONE!


Well obviously you either can't read English or your memory is very short.


Dago........ for the last time......... ARE YOU GOING TO BE PART OF THE SOLUTION, OR ARE YOU GOING TO BE PART OF THE PROBLEM?


You seem to think being part of the solution is to shut up and take whatever we get and don't complain. If that works for you then do it.


DUH... IS THAT A TOUGH ONE?



Right back at ya, duh

82) Message boards : Number crunching : Stand back....... (Message 18486)
Posted 31 Oct 2007 by (banished: ID 70524)
Post:
Seems to me everyone should "stand back" and let Alex and Neason do their job without interference,


That works sometimes but unfortunately, as we all know, some charities become complacent and adopt the attitude that it doesn't matter what they do with the resources we donate to them. That is very disrespectful. The security faults with LHC's outdated server code have been mentioned politely several times. Unfortunately, the admins here either choose to ignore critical issues or just don't have time to do their jobs properly. Fine, consider all these protest messages as us doing what we gotta do to teach them what they need to learn in order to run a respectful project.

and while constructive criticism is helpful it seems that all this arguing is counterproductive.


It may seem that way to you but to others it seems like they are now starting to take an interest in what's posted here rather than just laughing and walking away. Yes, at least that much has been accomplished, finally.

considering what they inherited I think they're doing a great job and yes the server needs to be updated. NOW I'm sure their painfully aware of that. I guess it comes down to this, who's gonna be part of the problem and who's gonna be part of the solution? if their not getting it done as fast as you think they should then give them a hand instead of creating more headaches for them.


They DID a fine job in the beginning getting the project functioning again but it functions at a minimal level and very inefficiently. Since then there has been little change other than superfluous and unnecessary changes to the look and feel of the website. They're resting on laurels and that won't do. Advice and requests for improvements have been made politely as well as requests for information they should have at their fingertips if they're doing their jobs properly but it gets shrugged off... very disrespectful. So it seems that trying to be part of the solution earns nothing here. Maybe a little taste of their own medicine will get us, the people who donate the CPU time that makes this project work, the respect we deserve.

83) Message boards : Number crunching : Stand back....... (Message 18442)
Posted 29 Oct 2007 by (banished: ID 70524)
Post:
Good thing time was spent on changing the look and feel of the web site, rather than fixing the servers.

They succeeded at making the upgrade even harder; now they have to merge the new look changes with the thousand changes from the official code. Seriously, that was totally the wrong order.

I tell you what I'll do my job and you do yours mmm'kay?

The quota has been altered. We've been talking to the scientists about the workflow and at the moment they are unsure whether it will be more constant when we know you'll know


Oh dear, he's told them off now hasn't he.

Zombie and PovAddict you've got the wrong impression. What they are trying to achieve here is more like art or perhaps a fashion statement... a marvel of modern engineering juxtaposed beside a quaint little BOINC server, sub-atomic particles whirring about at close to c beside server that just trundles along ka-bump, ka-bump, kabump. Relax! Breathe in the ambience. I'll put Mario Lanza on the phonograph if you'll crank the handle. We'll make up lies to tell our grand-children about how you and I, greatest particle fizzicists in the world, found der Higgs.

Now that we have this beautiful new website that does nothing the old one didn't we are sure to attract zillions of new crunchers to crunch work we're not too sure will ever exist. Ingenious!! Ahh, here it is... Mario singing Whutderfuk Ist Bain Happenink Dis Plaze.





84) Message boards : Number crunching : Initial Replication (Message 18335)
Posted 22 Oct 2007 by (banished: ID 70524)
Post:
but in fact, scientists could use the results at the moment the quorum is okay...


Not necessarily. In some types of analysis the scientists cannot use any results until the canonical result from all the work units have been determined.

In that case the best strategy (the strategy that obtains the canonical result for all work units in the least amount of time) is to have IR = minimum quorum.

85) Message boards : Number crunching : Initial Replication (Message 18327)
Posted 20 Oct 2007 by (banished: ID 70524)
Post:
so why you don't get your own project instead?


@ Forum moderators... is the above considered on topic or not?


86) Message boards : Number crunching : Initial Replication (Message 18326)
Posted 20 Oct 2007 by (banished: ID 70524)
Post:

Good point but having 5 crunchers working on WU A when quorum = 3 means WU B gets delayed (because 2 of those 5 crunchers could be working on WU B rather than A).


You could argue that replicating anything more than the quorum is causing a "delay" in work being done, but you have to keep in mind that the insertion into the science database is the ultimate goal, and it may or may not be delayed by more replication.


Continuing with the example scenario I gave you involving 2 WUs (A and B) and responding to your assertion that the ultimate goal is insertion into the science data base, I ask you to ponder this question... Is the ultimate goal to get result A into the database as quickly as possible and worry about B later or is the ultimate goal to get both results into the database as quickly as possible? Developing that question further...

There seems to be 2 different types of projects:

1) Those that are able to generate a run of several thousands of WUs all at once. For example by inserting all the combinations of 10 known parameters into individual WUs one could build 10^10 WUs all at once.

2) Those that can only generate WUs on the fly, in other words they cannot generate the WUs required all at once because the parameters for the later WUs depend on the results of the earlier WUs.

If LHC is type 2) then the need for IR=5 is obvious... they need to rush result A because WU B cannot be generated until result A is returned and verified.

However, if LHC is like type 1) and if analysis of the run cannot begin until all the results are returned and verified then IR=5 makes little sense, in fact it only delays completion of the entire run due to unnecessary duplication of work. In that case the strategy that gets the job done quickest is to wait and see if further replications are required rather than replicate beyond the quorum number simply because more replications might be required.

So there is a question or 2 for Neasan... Is LHC generating WUs on the fly as in type 2) above or is LHC more like type 1)?

The fact that LHC units are so short at this point in time and that nobody is holding a large cache makes it difficult to give you a good example of what can happen if you get a reissue.

To give you a better idea of what can happen, take a look at this example from Einstein numerous reissues.


Well that's all fairly elementary. Nothing new there, at least not for me though it may be food for thought for others. I think it illustrates how a single WU could have completed earlier with a higher IR but that doesn't make the argument that the entire job would get done sooner with higher IR.


As for task A and task B waiting, the bigger cause of any "wait" right now is the forced low quota...


True. But the forced low quota is only a temporary measure. Still, they have delayed WUs on purpose on past occassions, for example just a week or 2 ago when they delayed sending work while they were waiting to receive additional WUs from the scientists in order to spread the work around a little more. I mention it only because it makes me think the scientists are not in as big a rush to get results back as some people assume so let's not waste a bunch of CPU cycles over it, especially when the wasted CPU cycles may not be speeding up the work anyway (depends on whether LHC is type 1) or 2) as argued above).
[/quote]

Bear in mind that it may need a server upgrade, and folks like me, that use BOINC 5.8.16, would not process the server-side requests due to the support for it was added in BOINC 5.8.17 (I believe). I know 5.8.16 doesn't support it...


Good point. So then they would be faced with another problem.... Do they make 5.8.17 a minimum requirement and face all the criticism from people who don't want to upgrade to 5.8.17? Or do they just invoke the KISS principle (keep it simple stupid), re-examine the need for IR=5 and possibly reduce IR to 3?
87) Message boards : Number crunching : Initial Replication (Message 18299)
Posted 19 Oct 2007 by (banished: ID 70524)
Post:
you simply don't like my posts and you don't like them because it means less WUs.


I cut out all the noise and boiled it down to this. The "you" mentioned above is to be taken in general, not specific, as your words were aimed at those who disagree with you...

I think you can tell that I don't totally disagree with you. At least I hope you can tell that... Having said that, in my opinion, you are spamming multiple threads and what you're doing does border on hijacking the thread.

I've laid out some reasons why the additional replication can help speed the process up. Neasan has said that they will revisit the issue with the project scientists. Alex and Neasan are only the administrators of the servers. They have to abide by what the project scientists want.



I read you loud and clear, Brian. Thanks for your support and reasoned arguments. I'll limit my comments on IR to this thread.

I will also respond to your earlier post that explained in some detail why IR=5 is necessary, just too tired and too busy at the moment.

88) Message boards : Number crunching : Initial Replication (Message 18298)
Posted 19 Oct 2007 by (banished: ID 70524)
Post:
Anybody who is unhappy with my posts is welcome to either ignore me or kiss my ass. The issue, C0M you insufferable twit,


Rules:
No messages whose only intention is to annoy or antagonize other people.
No messages that are deliberately hostile or insulting.


I find CoM's "love it or leave it" attitude more than a little annoying and antagonizing.

Dagorath and Fat Loss 4 Idiots (and anyone else) you are volunteers here and as such are free to take your computers elsewhere. If you wish to detach that is fine but by staying attached you're implicitly agreeing to do things our way. You have both made your points and we have not just ignored them but we have set IR to 5 and are leaving it as such.


You could have avoided a lot of the hostility/bickering/nonsense in this thread by partaking in the discussion rather than offering flippant comments like you did near the top of this thread where you said "all I can do is laugh". Fat Loss 4 Idiots opened the thread with a valid concern and you make a comment like that? And then refuse to comment further, clarify facts that we as crunchers are not privy to and answer questions? Considering what we donate here and the fact that this project would not exist without that donation, we deserve to be informed about how our donation is utilised.

If you do detach I will not delete your credit, if you read the post you will see that I was pointing out that if you stayed attached and willing to do the work but continued to bitch, moan and whine I would consider taking the drastic step of banning you and deleting credit.


You can delete my account any time you please. I'm not here for the credit. I am here to hep get an important job done. Delete me if you will but I'll return from a different IP address and continue the discussion of IR=5.

Also the predictor@home thing was a bit much don't you think? I've let you have your say and only as a last resort have I threatened to do anything drastic.


Not sure but you seem to think the Predictor thing was not mine. It wasn't.
89) Message boards : Number crunching : Initial Replication (Message 18297)
Posted 19 Oct 2007 by (banished: ID 70524)
Post:
Anybody who is unhappy with my posts is welcome to either ignore me or kiss my ass. The issue, C0M you insufferable twit, is primarily to save CPU cycles and secondarily to save electricity, if you would care to read the thread (though I doubt reading is one of your basic skills.

As for spam, the definition of spam is invariably linked to unsolicited messages. You numbskulls fail to realise ALL the messages here are unsolicited therefore your own drivel is spam too, by your definition. The reason you asses trot out the spam word is that you simply don't like my posts and you don't like them because it means less WUs.


Wow, sounds like schoolyard, i didn't expect to hear something like that here, its even worser than one of our Collaboration Meetings.(Yes, i am a particle physicist. And i doubt you really know what this project wants to accomplish.)
By the way, i had to look up some of these nice/nasty words, cause thats not the language i am used to.


Yes, your "love it or leave attitude" it is very schoolyard. Particle physicist, huh? Perhaps you were one of my students?

90) Message boards : Number crunching : Initial Replication (Message 18276)
Posted 18 Oct 2007 by (banished: ID 70524)
Post:
Anybody who is unhappy with my posts is welcome to either ignore me or kiss my ass. The issue, C0M you insufferable twit, is primarily to save CPU cycles and secondarily to save electricity, if you would care to read the thread (though I doubt reading is one of your basic skills.

As for spam, the definition of spam is invariably linked to unsolicited messages. You numbskulls fail to realise ALL the messages here are unsolicited therefore your own drivel is spam too, by your definition. The reason you asses trot out the spam word is that you simply don't like my posts and you don't like them because it means less WUs.

91) Message boards : Number crunching : The press release (Message 18260)
Posted 18 Oct 2007 by (banished: ID 70524)
Post:
It is my mPU time to waste. I am here because I choose to be. I hope the LHC proves the Standard Model to be wrong, but if I'm wrong that would be OK too.


Waste whatever you wish whenever you wish, I have no problem with that. I just think all the new crunchers the news release might bring should be made aware ahead of time what really goes on here at LHC. The journals should be better informed about what they are selling to the public. Far too many people assume that just because projects are run by scientists and Ph.D.'s that they are without fault. Bad assumption.

92) Message boards : Number crunching : Initial Replication (Message 18259)
Posted 18 Oct 2007 by (banished: ID 70524)
Post:

Which pretty much proves they don't need the results back ultra-fast (the argument some people were using to justify IR=5 in spite of the fact IR=5 only slows things down).

<snip>

Well, if your objective is do unnecessary work then you can be happy. The fact is the project could be getting the same results (a quorum of 3) with 40% less work. Nobody should be happy about that.



What does indeed get slowed down is archiving the completed workunits. The workunit as a whole must remain so long as there is one resultID that hasn't been turned back in and that has not passed the deadline for the result.

From what I've been able to read (and experience), this project is much more sensitive to floating point math differences than others. I had a couple of results that were declared invalid just over this past week. In both cases I was either first or second to report a completed result. If the replication had been at 3 and quorum at 3, then there would've been at least one more replication made. That replication would have the same amount of time to be returned as the initial replication, but it causes the workunit as a whole to be waiting longer to be stored in the Master Science Database than perhaps a replication of 5, all with the same deadline, would have.


Good point but having 5 crunchers working on WU A when quorum = 3 means WU B gets delayed (because 2 of those 5 crunchers could be working on WU B rather than A). Therefore B has to wait to be stored in the Master Science Database. If it's bad for A to have to wait then it must also be bad for B to be forced to wait. Or do I not understand how the Master Science Database works?

To make the determination you're making that 5 is "wasteful", you really need to know the exact error rates on the first replication. I don't think someone outside of the project team can know that for a fact...


Earlier in this thread someone mentioned the error rate is 25%. Since that's never been disputed, I've assumed it's true. I've heard other projects have around 15% to 20% error rate. Given that LHC results are more sensitive to floating point errors than other projects, 25% makes seems reasonable.

I think the best thing to do is to implement the server-side aborts, like what SETI did, but leave the replication at 5.


If Neasan or Alex would agree to that then I would shut up.

93) Message boards : Number crunching : The press release (Message 18237)
Posted 17 Oct 2007 by (banished: ID 70524)
Post:

Thanks for listing all the publications. I'll be writing to all of them to explain how LHC wastes 40% of the CPU cycles donated to it for no good reason. Maybe they'll print a retraction and recommend that their readers stay away from LHC, would be the responsible thing to do when LHC refuses to act responsibly.

94) Message boards : Number crunching : Initial Replication (Message 18235)
Posted 17 Oct 2007 by (banished: ID 70524)
Post:
Other than the fact that some WU's may get crunched 2 more times than needed (with credit granted), I'm not sure where this is causing harm. Sure you're using electricity, but it's up to the project.


You're also wasting precious CPU cycles that other projects you crunch for would dearly love to have.

People have been complaining about "lack of work" here for years, and to cut IR from 5 to 3 means that there's 40% less work right off the bat.


That 40% is work that doesn't need to be done. It's "make work" not "work". The complaints about lack of work are the complaints of loons sqwaking about nothing and everything, they can be safely ignored with no ill effects. Say what? They'll quit? Who cares? Name one harmful effect of their leaving? Say what.... there won't be enough crunchers left to waste 40%? Good!

Right now, today, LHC, has taken some measures to keep work in the pipeline longer - the 2/day/cpu, the 1h delay, etc. with the press release and all.


Which pretty much proves they don't need the results back ultra-fast (the argument some people were using to justify IR=5 in spite of the fact IR=5 only slows things down).

I think we should all just step back and be happy that there has been a flow of work (be it 2/day) for the longest time I've seen in years.


Well, if your objective is do unnecessary work then you can be happy. The fact is the project could be getting the same results (a quorum of 3) with 40% less work. Nobody should be happy about that.

If you don't like the way the project is being managed, speak with your feet and crunch for another project.


I may do that in the future. For now I lobby the admins here to act responsibly and put an end to this deplorable waste of CPU cycles for those cycles are badly needed elsewhere. If you don't like my lobbying then that's just too bad :)

95) Message boards : Number crunching : Initial Replication (Message 18222)
Posted 16 Oct 2007 by (banished: ID 70524)
Post:
With BOINC 5.10 it should be possible to abort the already send workunits when three canocial results are back!?


That would satisfy me.

96) Message boards : Number crunching : Initial Replication (Message 18220)
Posted 16 Oct 2007 by (banished: ID 70524)
Post:
My 2 cents worth...

1: I doubt that the engineers assembling this beast really wait much longer, if at all with the IR set to 5 rather then 3. Why? Because we have more hungry systems then work to do. The work gets done much faster then the "due by" date. I'm sure they are not just "sitting around" waiting for us to get done; I'm sure they have scheduled work as needed and don't just wait for us. [Yes I know you think that we waste 40% that could be used by other projects - see point #4]


For me it's primarily about wasted CPU cycles that could be used at other projects. The wasted energy is an issue too.



2: We work for "them", in this case being LHC@home. If "they" wanted to set the IR to what ever, then let them set it. Yes voice your opinion, but don't keep hammering it when you don't get your way.


Your attempt to paint this as a temper tantrum is lame.

The reasons I'm hammering it are:

1. So far NOBODY has given a good reason why 5 iterations are required.
2. We and they all work together for the common wealth, I don't see it as we working for them.
3. People who receive donations have a responsibility to use those donations efficiently rather than squandering them.



3: From my understanding the internal and international coordination efforts probably totally swamp this piddly 3 vs 5 IR issue. You need to keep some perspective.


You need to explain that point better.


4: Act locally. If you are really that concerned with efficiency and waste what are YOU doing about it? Are you using solar/battery power to run your systems? Have you gotten rid of all other applications on your systems and removed any non essential tasks so that back ground task switching does not use CPU time? Are your running RAID 5 or 10 to speed disk access? Do you have enough real memory so that you never swap out to disk? Are you using Dual Quad CPU's? Are your systems dedicated? Have you installed a virus/spam gateway device (like IronPort)and removed antivirus/antispam software from your systems that don't access the internet except for BOINC?


Yep. Done most of that stuff. Now it's LHC's turn to trim some of the inefficiency and waste and act responsibly.

97) Message boards : Number crunching : Initial Replication (Message 18217)
Posted 16 Oct 2007 by (banished: ID 70524)
Post:
IR = 5

That is the decision that has been made by the project, with input from all sides


Why 5 rather than 3?
98) Message boards : Number crunching : Initial Replication (Message 18197)
Posted 16 Oct 2007 by (banished: ID 70524)
Post:

**bump**

@Neasan

Now that you're back and active once again could we have some comment on this issue, please.

99) Message boards : Number crunching : server 4.67 ???? (Message 18186)
Posted 16 Oct 2007 by (banished: ID 70524)
Post:
Not an explicit criticism/insult, Masud, but an implicit one. He figured the admins are so dense they downloaded the wrong server software, read the docs and change log and failed to realise they had a version that predates the current version. They would have to be thick as a brick to make such a mistake. And no he didn't ask a polite, sincere question, it was rhetorical.
100) Message boards : Number crunching : server 4.67 ???? (Message 18184)
Posted 16 Oct 2007 by (banished: ID 70524)
Post:
You're right about that... if I don't know it then it's likely not worth knowing it.

Now about this criticism thing. Who came here criticizing/insulting/attacking the admins for going backwards? Matters not that you were mistaken it was criticism all the same. You reap what you sow.

101) Message boards : Number crunching : server 4.67 ???? (Message 18182)
Posted 16 Oct 2007 by (banished: ID 70524)
Post:
6 track is the science application that crunches the work units, an entirely different critter than the server. You didn't really think they would go backwards, did you?

102) Message boards : Number crunching : work units?? (Message 17951)
Posted 21 Sep 2007 by (banished: ID 70524)
Post:

:) thanks Dagorath ;) never say die, LOL.
Regards
Masud.


Good to hear from you again, Masud ;)

I think some people just don't want their secret method for insuring that they get WUs to be made public.

103) Message boards : Number crunching : work units?? (Message 17949)
Posted 21 Sep 2007 by (banished: ID 70524)
Post:

Lesson 2

Yesterday I discussed boinccmd, properly known as boinccmd.exe. It comes from Berkeley and it gets installed in your BOINC folder when you install BOINC. I also drew your attention to the --project parameter you can give to boinccmd and the update parameter as well. Today we will use boinccmd and the aforementioned parameters to force BOINC (boinc.exe more precisely) to update project LHC. The reason why we want to use boincmd to force an update is explained below.

One of the reasons some crunchers don't get WUs when they are available is because their computer has deferred communications with the LHC server for several hours. By the time the deferral expires the WUs are all gone. However, if you click the Update button you force your computer to contact the LHC server immediately and request work (assuming LHC's debt is high enough which it likely will be). The problem, of course, is that the WUs are made available on the server when you are not at your computer so you can't click the Update button. This is where boinccmd comes in handy. We will use boinccmd to force boinc to update (contact the server and request work) when you are away from your computer. First, I will show you how use boinccmd in a batch file to force an update, just 1 update. In Lesson 3 I will show you how to automate the entire process and force an update every hour, every 2 hours, every 10 minutes, as often as you like, whether you're at you computer or not. It will be entirely hands free after you set it up.

1. Browse to your BOINC folder.

2. Create a new text file in your BOINC folder. One way to do that is to right click in the BOINC folder, click New, then click Text Document. It will create a new file called something similar to New Text Document.txt.

3. Double click the New Text Document.txt to open it in the Notepad text editor.

4. Cut and paste the following line into the text document:
boinccmd.exe --project http://lhcathome.cern.ch/lhcathome update

5. Save the document and exit Notepad.

6. Right click on the document you just created and edited, click Rename, rename it to lhc_update.bat or whatever name you prefer as long as it ends with the .bat extension. Windows will warn you about changing the extension from .txt to .bat but ignore that warning.

7. Now you should have a file named lhc_update.bat in your BOINC folder and it should contain the single line "boinccmd.exe --project http://lhcathome.cern.ch/lhcathome update" without the quotes.

8. Open BOINC Manager an click the Messages tab to see the messages. Now double click the lhc_update.bat icon. In a second or 2 you should see boinc contacting the LHC server to request work. If it doesn't contact the LHC server then go back to step 1 above and check your work.

You may be wondering "What is the difference between clicking the lhc_update.bat icon and clicking the Update button in BOINC Manager?". Really, they accomplish the same thing. The difference is that we can easily automate running the batch file but we cannot easily automate clicking the Update button. OK, it can be done with VBscript but this way is easier.

Tomorrow in Lesson 3 I will show you how to automate running the batch file.

104) Message boards : Number crunching : work units?? (Message 17939)
Posted 20 Sep 2007 by (banished: ID 70524)
Post:
Already I hear folks saying like "How dare you threaten/coerce this project?" My response to that is... "How dare these project admins waste precious donated resources when it would take only a few minutes to correct the problem."

You can't get your way, so you throw a tantrum and resort to blackmail? Wow - that's incredibly lame. Congratulations on just flushing any credibility you had on these boards right down the shi**er. You've belittled yourself and any point you may have hoped to make. Good work. I actually thought your arguments had some degree of sense to them, although I find your level of emotional commitment to the subject more than a little disturbing. But now, this is just sad. Turning to tactics like this doesn't make you powerful, or any more of a man. It makes you a weasel and a crybaby. And yes, you can take those as insults if you like. You've earned them.


As I said earlier... the admins here don't seem to respond to reason and common sense so now I will attempt to motivate them through different means. It's not like they're being asked to devote a lot of time and manpower to the problem, 5 minutes is all it would take to tweak a few parameters.

The fact that you don't respect me and my methods makes little difference to me. It would make a difference if you had any respect for common sense and reason but you don't seem to. And yes you can take that as an insult if you like because you've earned it. Maybe direct your anger toward the the cause of the problem next time rather than toward the one who is trying to cure the problem.

105) Message boards : Number crunching : work units?? (Message 17935)
Posted 20 Sep 2007 by (banished: ID 70524)
Post:
Aye, it will make the admins unhappy. But if crunchers don't hammer the server they won't get work and they'll be unhappy. And if only a few crunchers get work (like now) then LHC will have to have IR=5 and then I will be unhappy. It's better for admins to be unhappy than crunchers be unhappy.

Users unhappy -> users quitting the project -> less users asking for work -> more work for the ones who stay. But by telling everybody how to hammer the server, it would be the same as if nobody did it, with the difference that server overload may make it even more difficult to get work.

Only the server admins can. It's server-side parameters.

Exactly. Server side.

I think the admins should set min_sendwork_interval to an insane value like 3600 to make your stupid "solution" stop working.


Setting it to an insane value would tend to make them insane, would it not? They can chose to be more insane than they are now or to be more sane... that's the wonderful thing about being an admin.

The wonderful thing about being me is that I can chose to not tell folks how to increase their chance of getting work. So far I'm only at Lesson 1 which reveals very little to your average Windoze lemming. If the admins choose to be more sane (i.e. take 5 minutes to tweak the parameters that will spread the work out more evenly) then I will be inspired to end it with Lesson 1. If not then we proceed to Lesson 2 and learn how to invoke boinccmd from a .bat. If they cancel my account then I'll just put my info on a website they can't control and post links here from new accounts opened from a multitude of IPs other than my own.

Already I hear folks saying like "How dare you threaten/coerce this project?" My response to that is... "How dare these project admins waste precious donated resources when it would take only a few minutes to correct the problem."

106) Message boards : Number crunching : work units?? (Message 17933)
Posted 20 Sep 2007 by (banished: ID 70524)
Post:
can anyone say "hammer the server"?
I can say it; but it's a very bad idea.


Aye, it will make the admins unhappy. But if crunchers don't hammer the server they won't get work and they'll be unhappy. And if only a few crunchers get work (like now) then LHC will have to have IR=5 and then I will be unhappy. It's better for admins to be unhappy than crunchers be unhappy.

Can anyone tweak the parameters mentioned in this thread.
Only the server admins can. It's server-side parameters.


Exactly. Server side.

107) Message boards : Number crunching : work units?? (Message 17931)
Posted 20 Sep 2007 by (banished: ID 70524)
Post:
Daemion's suggestion requires too much work and attention. There is a far better way to dramatically increase your chance of getting LHC work units. If you use my method you can almost guarantee you'll get some work. But I'm not going to hand it to you on a silver platter, oh no. You're going to have to actually learn something and think a little, yah, like Linux users do. omg!

Lesson 1:

1. Open a Command Prompt. (Click Start->All Programs->Accessories-> Command Prompt)

2. Change directory to your BOINC directory by typing in the Command Prompt:
cd "c:\\program files\\boinc" including the quotes.
If your BOINC directory is on drive D: in directory My Programs then you would type: cd "d:\\my programs\\boinc" instead

3. Now type: boinccmd --help. You should see the following list of commands (parameters, actually) that you can pass to boinccmd.exe. Boinccmd is an executable that you can use to control boinc and do many of the things you can do via the GUI interface (BOINC manager).
Commands:
 --get_state
 --get_results
 --get_file_transfers
 --get_project_status
 --get_simple_gui_info
 --get_disk_usage
 --result url result_name op
   op = suspend | resume | abort | graphics_window
 --project url op
   op = reset | detach | update | suspend | resum
 --project_attach url auth
 --file_transfer url filename op
   op = retry | abort
 --set_run_mode mode duration
   mode = always | auto | never
 --set_network_mode mode duration
 --get_proxy_settings
 --set_proxy_settings
 --get_messages seqno
 --get_host_info
 --acct_mgr_rpc url name password
 --run_benchmarks
 --get_screensaver_mode
 --set_screensaver_mode on|off blank_time [deskto
 --get_project_config url
 --get_project_config_poll
 --lookup_account url email passwd
 --create_account url email passwd name
 --read_global_prefs_override
 --read_cc_config
 --network_available
 --get_cc_status
 --set_debts URL1 std1 ltd1 [URL2 std2 ltd2 ...]
 --quit

4. We are interested in the --project url op parameter(s) which we will discuss tomorrow in Lesson 2. Specifically we will discuss the update parameter and write a batch file. In Lesson 3 we will learn how to run the batch file periodically in Windows Scheduler.

Your homework for this evening is to work through steps 1 to 3 until you can make the list of parameters shown above appear in a Command Prompt. Keeners and Neasan may already know where we're going with this but I'll give everyone a clue.... can anyone say "hammer the server"? Can anyone tweak the parameters mentioned in this thread. Hmmmmm?

108) Message boards : Number crunching : dear Neasan & Alex (Message 17922)
Posted 19 Sep 2007 by (banished: ID 70524)
Post:
There are two quotas:
1) The amount of WUs you can download at a time
2) The number of WUs you can download for each CPU


I'll clarify that... There are four parameters that can be tweaked:
1a. The maximum amount of WUs you can download on a single scheduler request. (max_wus_to_send)
1b. The minimum amount of time you must wait between requests. (min_sendwork_interval)
2. The maximum amount of WUs you can download for each computer *per day*. The counter to check for this quota is reset (for that host) every 24 hours. (daily_result_quota)
3. The maximum amount of WUs a computer can have on its cache at a time (max_wus_in_progress; not supported by your ANCIENT server code).

A high min_sendwork_interval and low max_wus_to_send would improve things quite a bit. But the real solution is max_wus_in_progress. That would really avoid hosts with a hundred vs hosts with nothing.


It would fix another problem too. It would eliminate the need for an initial replication of 5 when a quorum of only 3 is required which would eliminate the ridiculous waste of CPU cycles.

How difficult is it to tweak the parameters? Is it just a matter of opening a config file in a text editor and altering a few numbers? Or does it take hours of tedious work? Would the admins have to swim the English Channel to tweak the parameters? Stand on their heads and spit pennies into a box 10 meters away? I really have no what the job would require so I ask.

109) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 17909)
Posted 18 Sep 2007 by (banished: ID 70524)
Post:

OJ is innocent this time too.

110) Message boards : Number crunching : Initial Replication (Message 17904)
Posted 16 Sep 2007 by (banished: ID 70524)
Post:
The argument that LHC needs to get individual results back as soon as possible even if it means delaying the entire batch and wasting precious donated resources has always sounded fishy. Here is proof from the mouths of the admins that the argument is false... they are delaying work units.

When can we expect the project admins to put an end to the blatant, unnecessary and disrespectful waste happening here at LHC and reduce the initial replication to 3?

111) Message boards : Number crunching : Initial Replication (Message 17748)
Posted 7 Aug 2007 by (banished: ID 70524)
Post:
Girls, girls! Please!

Has everyone quite finished telling the guys actually planning and running this project how to do their jobs?

Let it ride, will you?


To both questions the answer is nnnnnnnnnnnnno! I am a partner in this job and so is anybody else who choses to crunch LHC. We claim the right to make whatever suggestions we deem proper on the grounds that it's our resources that ultimately get the job done and we do not want those resources squandered. I believe the LHC devs and admins appreciate hearing about our concerns and I will continue to believe that until they state otherwise.

Now this girls, girls remark of yours. I think real men eat quiche and voice their concerns when they have concerns. You will not bully us into silence and submission by calling us girls. If you don't like our conversation then you can ignore this thread and still enjoy all the other wonderful benefits LHC offers. Or you can join the conversation and tell us IR = 5 nixers why we are wrong/right/nuts/whatever. Life is grand, don't you think?



112) Message boards : Number crunching : Initial Replication (Message 17744)
Posted 6 Aug 2007 by (banished: ID 70524)
Post:
and here we have it: Flame Fest 2007 Part II

I sincerely hope your pointless e-arguments inflate your e-ego to the point where you feel better about yourself in the real world. At least something good will come of this nonsense.


Definition of a hypocrite: one who stops complaining about a flame war long enough to pull out the ol' flame thrower and toast the crowd. You got us all with 1 well aimed whoosh, very efficient, which is kind of what this thread is about. Well done daemion :)
113) Message boards : Number crunching : Initial Replication (Message 17743)
Posted 6 Aug 2007 by (banished: ID 70524)
Post:
The initial studies when the IR was decieded upon showed an almost aysomatic(sp?) curve in turnaround time when increasing the IR. The point of diminishing returns was between 5 and 6 copies for a quorum of 3, so 5 was chosen. Once the design and testing phase of the collider is finished a different IR will most likely be desireable, until then the reasons for this decision are still valid.

Basically that means any work with the sixtrack application should continue to have the same IR and quorum. Work with other applications will most likely have different needs and therefore different IR and quorum.


I think you mean asymptotic, i.e. the function "rises" sharply and approaches the limit but of course never quite reaches it.

Funny how this graph is used to justify the one method that creates wasted effort. Really when you THINK about it, the graph only says 5 and higher gets results back faster than 3 initial replications. It does not say/prove/suggest that tweaking the IR up is the ONLY way or the BEST way to get results back sooner.

Does the graph take alternative approaches into consideration? If not then the graph validates absolutely nothing more than the fact that 5 is faster than 3. The decision is therefore not at all what wise men call valid. Wise men take all the factors into consideration.

So, John, what were the reasons for tossing out all the other methods for speeding up the work, the methods that don't waste our resources, and favoring the one method that does waste our resources. I would dearly love to hear that line of reasoning and how the graph supports that line.

114) Message boards : Number crunching : Initial Replication (Message 17736)
Posted 6 Aug 2007 by (banished: ID 70524)
Post:
I've also added four machines to this project. If you’re still upset about wasting time, feel free to leave the project as you've mentioned. I'll pick up your slack.
I can always tell when I am dealing with someone of lesser intelligence when they resort to overworked cliches like the above. You have demonstrated your ability to repeat rote and very little creativity or ability to analyse the situation and generate new ideas unfetered by dogma. Now take your hackneyed "I can cover for you" attitude and ram it up your nose.


After re-reading what I said, I'll admit that was a stupid comment on my part. And reading your reply, I'd have to say our intelligence about the same. <g>


In your dreams. Your remark was the lamest of lame brinksmanship, the last desperate words of a man who hasn't understood a single word of the entire conversation, a man with nothing to offer other than canned cliches he cuts and pastes from other people's posts that have, in his feeble mind, turned a good phrase. Nothing you say is ever original, you're just a parrot. Yet you think you belong to some exclusive club. You do not. You are a wannabe.

Rude I am? Yes. But tomorrow I will wake up and deal with normal people and have no need to be rude. You, on the other hand, will still be an idiot when you wake up tomorrow. You can either get PO'd or be appreciative of the favor I do you by taking the time to tell you you're a zero.

115) Message boards : Number crunching : Initial Replication (Message 17735)
Posted 6 Aug 2007 by (banished: ID 70524)
Post:
I agree the setting IR 5 to can be a waste, but I see my questions haven't been answered, lets try again.

Sometimes the results don't match what then? What if a host machine crashes and the work is lost?


So, with IR set to 3, what happens? Does a new batch have to be created for the missing/corruped work? How long does this take (identifiying what work is missing/corrupted, creating the new batch, waiting for the work to get completed, and hoping that runs 100% error free)? Maybe 4 would be better?



There is no need to create a new batch or delete the WU from the current batch and put it in the next batch. On all the projects I've ever crunched, if you return a result that has errored on your system then a replacement copy of the WU gets generated and put in the outbound queue within a few seconds. It gets sent out whenever another host requests work, could be only a matter of minutes if lots of hosts are requesting work. Some projects put the replacement at the head of the queue, some put it at the tail. If it goes to the tail then no problem, other WUs get sent and completed a little sooner, the replacement waits. It doesn't matter which one gets done first, it does matter how soon they ALL get done. Ya have to look at the entire forest not just individual trees.

OK, so far I've described what happens when 1 of the results errors on the host. Now here's what happens if you have initial replication of 3 and only 2 match after all 3 have been returned... they just replicate another copy of the WU and shove it in the queue, takes about 2 seconds. Again, it could go to the front of the queue or it could go to the tail, makes no difference in the long run.

Now, what if a host crashes, you asked. Basically the same thing as in the first 2 cases, it just takes a little longer. If the result hasn't been returned by the deadline then it (come on, you should know this by now) replicates another copy of the WU, shoves it in the queue, yada, yada, yada, same story as above.

See? It just doesn't make sense, when you have quorum = 3 to send 5 WUs on the initial replication just because 1 or 2 MIGHT fail. You wait until you're SURE 1 has failed before you send another WU. If you send 5 immediately then, in many cases, you're duplicating work and wasting effort for no reason. It matters not 1 bit, in the long run, that an additional WU MIGHT also fail to match or that it might return with errors. You wait until you're SURE and in the meantime you feed other WUs to the crunchers. You waste absolutely ZERO time while you're waiting to see if the result has errors.

OK, now here's how you add some finesse to the whole shebang. You keep a list of hosts that have very low error rates and fast turn around times. You call them your fast reliable hosts. If you need to replicate another WU then you send the replication to one of your fast reliable hosts.

Don't let the fear of what might go wrong lead you into making irrational choices. Don't get all excited and instigate a plan that appears to be a "safety factor" but which, in reality, only makes matters worse. Think it through. It's not magic, the process was not invented by gods with super intelligence. It was invented by men who are no more intelligent than you or I. It is within your power to understand the events and circumstances. You don't need to throw crunching power at the problem needlessly to get the job done as fast as it can possibly be done. We CAN do this with minimal wasted effort. We do NOT need to be fools spending CPU cycles like a drunken sailor spends his paycheque.

116) Message boards : Number crunching : Initial Replication (Message 17726)
Posted 5 Aug 2007 by (banished: ID 70524)
Post:
Free Ads said
Why waste computing time? If you only need a quorum of 3 why an initial replication of 5???
... if this project continues to waste resources that other (just as deserving projects) could have used to actually accomplish something then I will donate my humble resources to them.


Dagorath said
No, it does NOT give a safety margin. If you think it does then you obviously are not aware of how the quorum works. If the first 3 results match then they declare the canonical result. They do not wait for the 2 remaining results to return to see if they match with the canonical result. So how can you possibly say 5 provides a safety margin? Man, THINK about it.



If this were a perfect world, I could see your argument. But it's not. Sometimes the results don't match what then? What if a host machine crashes and the work is lost? And the one I've experienced a few times, upgrade BOINC and it wipes out the work? (Again, if this were a perfect world, I would ALWAYS back up the BOINC folder before upgrading) Now you have to reissue the missing work, and if I remember correctly, that won't happen until the batch is complete. Now the scientists have to wait, wasting time.


When you think about it you can see that it really doesn't matter if work unit A gets delayed until the end of the batch because then all other work units will finish that much sooner and the entire batch will finish sooner.

Your method makes the scientists wait longer than necessary. My method gets the entire job done sooner. Again, you cannot speed up the process by wasting time crunching WUs that don't need to be crunched again. To think that you can is ludicrous.

See, it doesn't matter whether B finishes before A. The order is totally irrelevant. The order matters ONLY when WUs are generated on the fly based on returned results. Yes, then you need A before you can generate B. But that's not the way they do it here and the fact that they put up 20,000 at a time and they're all gone in a matter of a few hours proves they are not generating on the fly. If they were generating on the fly then we would be able to get work for several days as new work continued to be generated. Therefore, the goal is NOT to waste time getting a match on individual WUs but rather to get the entire batch done as soon as possible.

Your method delays the entire batch in order to get individual WUs completed earlier when they don't need individual WUs completed earlier. The benefit you claim to be driving from wasting time just doesn't happen.

Let’s also not forget that this project has just moved to a new home with new admins. I think I read somewhere that this is not a typical BOINC installation. I'm sure time is needed to figure it out, a few jobs will need to be run to make sure things are running correctly.


Exactly... new home, new admins. They probably inherited a whole pail full of stupid stuff from the previous admins. It's definitely going to take some time to get it all sorted out. I would say that at the moment we should just be happy that they're generating work and redeiving WUs back without crashing the server/database/scheduler, etc. This issue with the initial replication of 5 probably should not be high priority for now. I can live with it this for now, it's not like I need it changed immediately.

I've also added four machines to this project. If you’re still upset about wasting time, feel free to leave the project as you've mentioned. I'll pick up your slack.


I can always tell when I am dealing with someone of lesser intelligence when they resort to overworked cliches like the above. You have demonstrated your ability to repeat rote and very little creativity or ability to analyse the situation and generate new ideas unfetered by dogma. Now take your hackneyed "I can cover for you" attitude and ram it up your nose. I intend to stick around and encourage this project to improve and grow in spite of your lack of intelligence and/or your need to do unnecessary work just so your LHC RAC matches your RAC at other projects or whatever your foolish reason is for turning this into a make work project.


117) Message boards : Number crunching : Initial Replication (Message 17725)
Posted 5 Aug 2007 by (banished: ID 70524)
Post:
How's your blood pressure? Not too high, I hope.

I am not persuaded by arguments that resort to yelling and name calling.


@BobGuy

I'm not sure where you stand on the initial replications = 5 issue but if you are one of those who cannot be persuaded by the simple common sense in the argument that crunching WUs that don't need to be crunched can never speed up the completion of the entire batch then all that remains is to call you an idiot and yell at you until you either STFU or see the light of day. Notice I said "IF you are one of those..."

118) Message boards : Number crunching : Initial Replication (Message 17720)
Posted 5 Aug 2007 by (banished: ID 70524)
Post:
If you want to be too literal:

"At least three" might be construed to mean "not less than three", but it does not mean "not more than three".

So five is OK, gives a saftey margin, provides more work from a limited supply, ensures that a quorum is met sooner rather than later and allows for evaluation of the possible AMD not equal to Intel issue.


No, it does NOT give a safety margin. If you think it does then you obviously are not aware of how the quorum works. If the first 3 results match then they declare the canonical result. They do not wait for the 2 remaining results to return to see if they match with the canonical result. So how can you possibly say 5 provides a safety margin? Man, THINK about it.

You are correct when you say it provides more work from a limited supply. And th e fact you approve of that tells me your thinking is all mixed up. We want to do only the work that is necessary to get the job done right. Any additional work is wasted effort unless it has some other benefit. So far NOBODY has stated what those other benefits are. There have only been guesses that there might be other benefits.

Your third point has been shown to be a false economy several times. When are you going to learn? Are you REALLY that thick between the ears? The initial replication of 5 does get a quorum for 1 WU quicker BUT IT DELAYS THE QUORUM ON ALL OTHER WUS. Are you REALLY so STUPID you cannot see that?

Finally your point about allowing for evaluation of the Intel vs. AMD issue... when the admins here state that they are using the data for that purpose and that the initial replication of 5 is for that evaluation THEN your point is valid. Until then you are just guessing.

The issue of wasting resources can be carried to extremes. If I wanted to be a dictator I might order that only recent AMD CPUs and C2Ds and C2Qs be used in a project because all the others are wasteful of resources due to operating inefficiency. That's not going to happen but it might be a reason for an individual to voluntarily retire their own old computer.


Now you are confusing 2 separate issues simply because they share 1 common word... efficiency. The issue of crunching on an old inefficient machine has NOTHING to do with the issue of waste produced by the initial replication of 5.
And of course nobody other than you should decide what machine you can crunch on . However, you trying to twist that point around and use it to argue that reducing the initial replication to 3 would be an act of a dictator or somehow extreme is .... well... it's just so stupid I can hardly believe you said it. My gawd what some people try to pawn off as critical thinking!
119) Message boards : Number crunching : Initial Replication (Message 17685)
Posted 2 Aug 2007 by (banished: ID 70524)
Post:
[quote]:-) if admin thinks its is important then they will implement your ideas ;-) but why should they go to all that hassle if their requirements are being met? B-)

To demonstrate that they are responsible adults rather than selfish, wasteful sloths, just for openers.

[quote] ;-) Now heavy lift gear is rated to 250 percent of Lifting Load, what would you call it? a waste of material, power and man power? :-)[quote]

No, I call that sound engineering and the only way to meet a need, the need being to hoist loads without dropping them (inertia considerations and all that). This project can achieve its goals without wasted effort. So give me one good reason why they should waste our resources?

Thanks for the article but if you want it to carry any weight then you need to quote the source. Anyway, I don't see anything in the article that says LHC needs initial replication of 5. Do you?

I don't see anything in the article that says they can't do their research into DC without wasting our resources. Do you?

120) Message boards : Number crunching : Initial Replication (Message 17680)
Posted 2 Aug 2007 by (banished: ID 70524)
Post:

It seems that many hosts attached to LHC@home aren't getting work because other hosts grab all the available work units first. In other words there are resources available that are not being utilized. To make matters worse, the initial replication of 5 for a quorum of 3 means the few hosts that are getting work are being used inefficiently because they are frequently crunching WUs for which a quorum has already been achieved.

The solution is obvious:

1. reduce the initial replication to 3
2. limit the number of WUs a host can download in X hours so that more hosts can get work, the more hosts working on the batch the sooner the batch will be completed
3. shorten the deadline to 4 or 5 days

I don't know what an appropriate value for X hours would be but I would guess anywhere in the neighborhhod of 10 to 15 hours would work well.
121) Message boards : Number crunching : Initial Replication (Message 17677)
Posted 1 Aug 2007 by (banished: ID 70524)
Post:
I am assuming that one day in the not too distant future there will be a steady supply of LHC work or that the runs will become so big (or hosts so few) they won't be gobbled up in a few hours.

While I haven't been with the project that long I don't remember this being true in the past and I wouldn't hold my breath that it will ever be true.


OK, let's assume the runs remain as small and infrequent as they are now. Would it not be a good idea to crunch the runs as quickly as we are now but use fewer CPU cycles to do it? And then spend the cycles we saved at another project? Or would you rather just take what we could save and pee it down the drain?

We could save cycles by reducing the initial replication. We could get the entire run crunched in the same amount of time by reducing the deadline.

122) Message boards : Number crunching : Initial Replication (Message 17669)
Posted 1 Aug 2007 by (banished: ID 70524)
Post:
Your observation is absolutely true.

Not efficient? Probably true.

Why we do it the way we're doing it? Probably easier to implement.

Are you saying that it's normal for you to doubt? (A deliberate misconstruction by me. I'm just teasing you!)


I say it's normal and quite healthy. Teasing? If it bothered me I'd soon learn to shut up and lay low. Thick skinned I guess.

There's nothing wrong with going with the the easy implementation if that's what must be done to get the ball rolling but eventually you want to knock off the bumps so the ball rolls with less effort. Seems like there are alternatives to homogeneous redundancy that shouldn't be too hard to implement, for example, shorten the deadline and reduce the initial replication (unless they need IR set to 5 to gather data on error patterns).





123) Message boards : Number crunching : Initial Replication (Message 17666)
Posted 1 Aug 2007 by (banished: ID 70524)
Post:
I think there is some concern that the AMD results don't match or might not match the Intel results. That's still a valid reason to get as many WUs run that have both AMD and Intel results represented for a single WU.


It could also be that results from different operating systems don't match. Either way, throwing more CPU cycles at the problem via more replications is not an efficient way to fix the problem. Homogeneous redundancy is a better way though I'm not sure they can apply that method at this project. If they can't then I suppose we're stuck with working hard rather than working smart.

124) Message boards : Number crunching : Initial Replication (Message 17657)
Posted 1 Aug 2007 by (banished: ID 70524)
Post:
...why make other work wait while you waste time crunching results that don't need to be crunched?


Even with the initial replication of 5, I don't see ANY work waiting on the servers here since it all gets distributed so fast, or are you referring to the work on your host? Sure somebody's host will be taking time away from some other project. Where are you trying to optimize useful results : work done? Considering only LHC, and all of it's odd work availability, it's OK to have a higher replication to reach quorum faster since it is unknown how fast certain hosts will be, and there is FAR more hosts available than there is work. Taking a step back and looking at all projects, LHC could be seen as (not in my opinion) a "resource hog" but that is what LHC admins/scientists have decided what they need in order to complete their task, which is to build a damn sweet machine. Taking a step in and looking at a single host, a result that is not needed for quorum would be wasted effort and redundant. Which is exactly why the new feature that Scarecrow mentioned came to fruition. Draw your line and take your stance.


I am assuming that one day in the not too distant future there will be a steady supply of LHC work or that the runs will become so big (or hosts so few) they won't be gobbled up in a few hours. I assume the short runs we see now are just test runs. I have no problem with an initial replication of 5 for these test runs because 5 does get these runs completed sooner. I don't like the fact that it wastes CPU cycles that could be spent on other projects but construction of the collider will soon be complete and the sooner we prove the configs and hardware the sooner we get can get into the real work. Then, I assume, we will see much longer queues that take weeks to crunch because there won't be enough hosts available to gobble up all the WUs in a matter of a few hours. At that point I hope LHC admins will reduce the inital replication to whatever the quorum is at that time.

125) Message boards : Number crunching : Initial Replication (Message 17651)
Posted 31 Jul 2007 by (banished: ID 70524)
Post:
If you feel this project is such a waste of your time, there are other projects that you apparently don't have issues with. Why not just stick with those. That is normally what a person does if they don't like something, they move to something else.


That's normal behavior at buffet restaurants, candy counters and bawdy houses. In LHC@home I see a very important project, more important than all others in my opinion. It has a few rough edges but those can be smoothed out in time. I don't give up and walk away from things just because they don't match my vision of perfection when I first lay eyes/hands on them.

I'm looking at the tasks this way: There have been multiple migration issues and the configs need to be tested. So if I am just running results to help with testing the hardware, I am fine with it. The high number of replicated tasks could easily be to check for patterns of errors.


I am fine with spending a little CPU time testing the configs and hardware too. If the high replications is for checking/exposing patterns of errors then I'm fine with that as well. However, if the only reason for the high initial replication is to speed up production then that notion needs to be exposed for the nonsense that it is.
126) Message boards : Number crunching : Initial Replication (Message 17647)
Posted 31 Jul 2007 by (banished: ID 70524)
Post:
Einstein can afford this kind of thing occasionally, LHC can't.


Thanks for the opinion but I'll wait for LHC to confirm that statement before I believe it.

LHC has people waiting on the results to get physical work done (aligning the magnets). Einstein does not. Since LHC has real world deadlines, LHC has a problem with maximum turn around time for the WUs, therefore a higher initial replication is required. Einstein and S@H on the other hand are trying to get through a huge pile of work, and that leads to an initial replication to match the minimum quorum.

This is documented in earlier threads if you care to go looking (try a couple of years ago).


It sounds like it is an explanation of how you can speed up the job by wasting effort. I won't waste my time digging up garbage. I note that none of the folks who support initial replication of 5 think enough of the explanation to provide a link to it either. It sounds rather like the explanation given to one Chris Columbus, "Look, we settled that argument years ago. Everybody knows Earth is flat."

Again, if you want to minimize turn around time for ALL your work then you don't want to crunch each and every work unit 5 times when you need only 3 matches. You want to replicate work units IF and ONLY IF one of the 3 original replications returns a compute error or a result that fails to match the other 2. LHC may have a high failure rate but that's no reason to assume that an inital replication 3 will never produce 3 matches. The fact is a 25% failure rate means there is a 42% chance (.75 ^ 3) that an initial replication of 3 will give 3 matches. In other words, 42% of the time, 2 results will be wasted effort. The prudent and more productive approach is to wait and see if you don't get a match before replicating more than 3.

Ooops! There's that dirty word... wait. Nope, we can't wait. Well then why make other work wait while you waste time crunching results that don't need to be crunched? And that's EXACTLY what happens much of the time when you replicate 5 right off the bat when you need only 3 matches.

The bottom is, was and always will be... you cannot speed up the job by wasting effort.

127) Message boards : Number crunching : Initial Replication (Message 17642)
Posted 31 Jul 2007 by (banished: ID 70524)
Post:
I don't care what you think Dagorath.




Translation: I can't defeat Dag's reasoning so I'll just pretend I don't care instead of admitting defeat.

BWAHAHAHAHAHAHAHAHAHAHAAAAAAAAAAAAAAA!

ROFLMAO!

Hey, all you k00k whiners. Waste whatever resources you own to whatever extent your k00k brain dictates you should waste resources. But DO NOT come around here proposing policy that wastes MY resources as well. Not everybody in the world wants to be as silly as you and it's a fortunate thing that most of us are not.

128) Message boards : Number crunching : Initial Replication (Message 17640)
Posted 30 Jul 2007 by (banished: ID 70524)
Post:
As an example I give you a wu from Einstein

Einstein unit

basically the wu was initially issued on 8th June but did not reach quorum until 27th July !! enought said?


Nope. I think a lot more needs to be said :)

The project admins/scientist are the only people who have the whole picture, we who contribute our resourses can only trust that they are choosing the correct parameters. Thats not to say we cannot suggest what we think is better.


I have found that whenever and wherever government takes my money/resources and whenever/wherever I donate my money/resources to charity/friends/neighbors they almost always have far less concern for MY resources/money than they have for their own. Furthermore, who says LHC can't afford this kind of thing ocassionally?

Einstein can afford this kind of thing occasionally, LHC can't.


Thanks for the opinion but I'll wait for LHC to confirm that statement before I believe it.

Anyway, Colin is making the same mistake Alex makes, he focuses on 1 tree and neglects the forest. Yes, throwing 5 hosts onto 1 WU when you only need 3 matches will sometimes get a canonical result for that WU sooner but it delays work on all other WUs and more often than not is wasted effort. Again, you cannot speed up the job by wasting effort. Inefficiency can only retard progress.

I definitely like what they're doing at SETI... telling hosts to abort WUs for which the canonical result has been determined (unless host has started crunching it already). I hope LHC acquires that ability soon.

In the meantime, it seems to me these 2 extra replications are just a make work project to satisfy the k00k whiners who think their world is going to come to an end if they don't get some LHC work. Never mind the k00ks and their incessant, monotonous, brain-dead whining and drivel.
129) Message boards : Number crunching : Initial Replication (Message 17637)
Posted 30 Jul 2007 by (banished: ID 70524)
Post:

:-)welcome back Dagorath. we were computing the adjustments of the magnets for the machine at CERN. i think the power required to warm it up will be far more then what we spend on crunching extra WU's. keep the safety factor in mind.
Regards
Masud.


Masud, what you are saying about the power is true but it doesn't mean we should waste it. Anyway, neither my argument nor Alex's has anything at all to do with wasting power. It has everything to do with getting the job done faster and with less wasted time. It is all about lost opportunity. Every time you set host A to crunching WU X, you lose the opportunity to have host A crunch WU Y. That is a simple fact. Now, why would you want to lose the opportunity to have A crunch WU Y when you gain absolutely nothing by having A crunch X? For "safety"? More nonsense!!

How does wasting effort translate into greater safety? They take the first 3 results that match and call it a done deal. If they required 4 matches we could say the comparison is safer but 3 is all they need.

130) Message boards : Number crunching : Initial Replication (Message 17627)
Posted 30 Jul 2007 by (banished: ID 70524)
Post:
Why waste computing time? If you only need a quorum of 3 why an initial replication of 5???



There's a mathematical reason that there's 5 sent out instead of 4.
20-25% of the crunching instances have errors. (I looked at the most recent work units that MY pc crunched, and there were 5 out of 27 client errors reported by all pc's crunching)
With that error rate, a replication of 4 ends up with the server having to resend out work units again after the deadline has passed for 20% of the work units.

For a 20% error rate:
1000 work units, will need to have 200 work units redone. 200 reworked units will fail again, resulting in another 40 to be redone, those 40 will have 8 that needs to be redone, and another 8 will be redone.
This results in having to wait 3 iterations for all results to come in.

If you have a replication of 5.
1000 work units will need to have 70 redone. Those 70 will need 10 redone, and those 10 may need 1 redone.

It's when you have a 25% error rate, that having a replication of 4 ends up in more work than an initial replicaton of 5.

Crunching for LHC means that the Intel PC's don't always agree with the AMD PC's, so you will always see a higher redo rate than Seti (which is just a fast fourier filter performed on data which is not sensitive to floating point errors in the the last precision digit)

Then again, some people's computers just suck: http://lhcathome.cern.ch/lhcathome/results.php?hostid=88058





Your argument is mathematical in the sense that it bandys some numbers around but it does not prove the need for IR = 5 if 3 matching results is all you really need. In fact your analysis of what happens to 1000 WUs with x% errors completely ignores all the work that is duplicated unnecessarily when IR = 5. Your analysis ignores the fact that in most instances 5 of the 5 returned results will match which means 2 were wasted effort. Those 2 hosts could have been doing something useful instead of wasting their time.

If the goal is to accomplish as much as possible in as little time as possible then you want to eliminate wasted effort not create it. IR = 5 only creates wasted effort. Indeed it yields a 3 match on any given WU sooner but only at the expense of delaying the iteration of other WUs. See, we are focused on getting ALL the work done sooner not just getting 1000 WUs done sooner. If the project needs only 3 to match then IR = 5 is a false economy. You simply cannot get the job done sooner by wasting effort. To think that you can is utter nonsense.


131) Message boards : Number crunching : Project dead ? (Message 17164)
Posted 29 Jun 2007 by (banished: ID 70524)
Post:

Old Chinese Ploverb:

I crick rinks and read my finger aching too many crickings.
You crick rinks and read and tell me what you read then you finger aching.
Better you have sore finger than me have sore finger.

132) Message boards : Number crunching : Good things come to those who wait. (Message 17089)
Posted 24 Jun 2007 by (banished: ID 70524)
Post:
They writhe in agony from the pain you inflict upon them, KAMasud. Surely they will learn a lesson from your horrible wrath and vow to never make you angry again. We stand in awe of the power you wield and the fact you ain't afraid to use it ;)

133) Questions and Answers : Windows : 'You used the wrong URL for this project' (Message 17026)
Posted 14 Jun 2007 by (banished: ID 70524)
Post:
That problem was discussed in this thread.
134) Message boards : Number crunching : Good things come to those who wait. (Message 16972)
Posted 4 Jun 2007 by (banished: ID 70524)
Post:

Alex and Neasan,

Congratulations on what appears to be a successful migration to Queen Mary. Keep up the good work! We are thankful for tiny steps and remain patiently at your disposal.

135) Message boards : Number crunching : Because you asked.... (Message 16971)
Posted 4 Jun 2007 by (banished: ID 70524)
Post:

Nice work, Skip!
136) Message boards : Number crunching : 64-bit Linux (Message 16970)
Posted 4 Jun 2007 by (banished: ID 70524)
Post:
I second that.

Oink! Oink!
137) Message boards : Number crunching : Please note: this project rarely has work (Message 16969)
Posted 4 Jun 2007 by (banished: ID 70524)
Post:
Was just wondering, at cern the big particle thingy (cant remember the name of it at the moment) should be up and running by th end of this year. Once it is running, will we get lots of work?


There may be lots of WUs available but I intend to grab most of them. You will be left to fight other crunchers for the few crumbs I leave behind. If you oink nice i might share my piggie proggie with you. We'll be Team WUP at LHC (Work Unit Pigs) and we'll bask in infamy for the rest of our days.

Oink! Oink!
138) Message boards : Number crunching : Please note: this project rarely has work (Message 16915)
Posted 18 May 2007 by (banished: ID 70524)
Post:
I like your "why fight it" attitude. So I'm joining the pigs instead of fighting them. From this day forward I am dedicating a significant portion of my spare time to enhancing my reputation as a pig.

To achieve that goal, I will build a wee proggie that updates LHC every few minutes. It will run on the only machine I have attached to LHC. When that machine snags an LHC WU it will attach the other machines on my LAN and temporarily adjust various settings on those machines to grab as many WUs as possible (eg. boost the benchmarks sky high, tweak up the WU cache, etc.)

Oink! Oink!

139) Message boards : Number crunching : Because you asked.... (Message 16900)
Posted 15 May 2007 by (banished: ID 70524)
Post:

k00k whiner

Ooops!

My apologies to Dag (and the kcuf wits, who can reasonably feel insulted by me).


No problem. So nice of you to post an apology. So sad the KWs have detached and quit else they could read your kind words too. Oh well, maybe when they're done sulking they'll come out and play again.

140) Message boards : Number crunching : Because you asked.... (Message 16892)
Posted 14 May 2007 by (banished: ID 70524)
Post:

k00k whiner
141) Message boards : Number crunching : Because you asked.... (Message 16885)
Posted 14 May 2007 by (banished: ID 70524)
Post:


Alex, I rarely do this, so please know that I do so out of necessity

This project has had 5 months to transition the server and get things running.

I call BS, and you sir - smell of it!

On an afternote, there would be nothing to please me more than for you to prove me wrong. I dare you. Double dare you!

Course.. that would mean we would get the stats exporting problem fixed... HAHAHAHAHAHAHAHAHAHAHAHAHAHAHAHA! - that'll happen.


Aaron, I rarely do this, so please know that I do so out of necessity

You are a DH

- The necessity here is that the minus ratings and red x's aren't working.



He's a DH? Hmmm. I am convinced he's a KW but I've been wrong before. It's just that DHs usually have both feet on the ground and he clearly does not. Yikes, the lad suffers from the delusion that this project owes him something and is obligated to meet deadlines he sets, as if he knows anything about the task. He also seems to think everybody else here "thinks" just like he does. That screams KW to me.

142) Message boards : Number crunching : Because you asked.... (Message 16865)
Posted 11 May 2007 by (banished: ID 70524)
Post:
... when I find something out it solves 1 problem but reveals 2 more...


Alex, look at the bright side... you're getting 2 for 1. It's going to take only half of the usual time ;)

143) Message boards : Number crunching : Because you asked.... (Message 16861)
Posted 11 May 2007 by (banished: ID 70524)
Post:

Q is Barry White? Or you mean I have Barry's aura? I have no clue about Barry.

Nah, you didn't strike a nerve. I get a chuckle out of all this "gimmee WUs now or I'll throw a hissy fit" stuff. Reminds me of being at the grocery store and watching a 4 year old throw a fit because mom won't buy him a candy bar. Saw one just today and he was hilarious. His mom smacked his butt and told him his picture would be on the milk carton tomorrow if he didn't behave.

144) Message boards : Number crunching : Because you asked.... (Message 16853)
Posted 9 May 2007 by (banished: ID 70524)
Post:
The admins should post stating they are rewriting the entire server side code in a new language called ANAL (a non algorithmic language), developed at QMC just for BOINC Server scripts, which uses COMEFROM instead of GOTO for flow control. Would be fun to watch the KKBs (k00ky kiddie brats) rush off to the other forums and repeat the kewl news for geek points.
145) Message boards : Number crunching : Because you asked.... (Message 16852)
Posted 9 May 2007 by (banished: ID 70524)
Post:

There would be no point is sending WUs if nothing has changed for the results would only prove that the same bugs/faults are still present. I therefore assume they altered some code and that they sent the WUs to test the new code. If the new code works then they've made progress. If it does not work then it's still progress in the sense they will learn what does not work and will be steered in a new direction.

Has the app version changed then? (I honestly don't know, my host hasn't picked up any WUs from the last few batches - I don't have it on 24x7) In the past new batches (as I understand it) have been with the same code using different parameters.
Since the initial Sixtrack runs were finished some time ago, I guess that this flurry of new WU batches is either related to the problems LHC had a couple of weeks ago, or is part of testing by the new Admins to see that they have the new server set-up etc. working. I could be wrong of course, (and the work could be for a different reason) but it would be nice to know...


OK, so they post like "we're tweaking Sixtrack and the result verifier". Is that going to keep the k00ks happy? Probably not. So then the k00ks ask questions like "what are you doing to Sixtrack and the verifier". The team responds with technical details. Then the k00ks need to know what the technical details mean. And if the team doesn't respond with a crash course on BOINC Server the k00ks get all pissy that the team doesn't care and threaten to quit anyway. Give the k00ks the boot, cancel their accounts, they're just attention seeking kids that yammer on and on about nothing important just to yank people's chains. Yah, you got it, they're trolls.

146) Message boards : Number crunching : Because you asked.... (Message 16844)
Posted 8 May 2007 by (banished: ID 70524)
Post:
The Admins have specifically stated that they aren't working on this project even remotely full-time. That is the reality.


Indeed it is the reality and no amount of k00k whining is going to change it. No amount of pretending that the lack of WUs/dialogue/credits/whatever has caused any cruncher any real grief is going to change it. One might argue that the squeaky wheel get the grease but that can't happen when there is no grease.

Of course, that begs the question "Why wasn't the project given to a team that can devote more time to it?" Maybe there is no other team that has adequate knowledge of the Six Track application? Maybe there are security issues that make the current team the best choice?

The fact they sent out a few thousand WUs last week is proof something is happening behind the scenes. I call it progress.


I don't. Sporadic unannounced batches of WUs has been the nature of this project for several months. Nothing has changed, so where is the progress?


There would be no point is sending WUs if nothing has changed for the results would only prove that the same bugs/faults are still present. I therefore assume they altered some code and that they sent the WUs to test the new code. If the new code works then they've made progress. If it does not work then it's still progress in the sense they will learn what does not work and will be steered in a new direction.

147) Message boards : Number crunching : Because you asked.... (Message 16837)
Posted 7 May 2007 by (banished: ID 70524)
Post:
For example, continuous, neurotic whining about the lack of WUs from lhc just begs the response I gave a few posts up. It's beyond passion, beyond hyper-passion, it's bloody sick and to not call it so does a disservice because they get the impression their behavior is normal and healthy.


Although I'd rather tend to take it with humor, this isn't simple whining you're witnessing.

People came here to support an Idea and since more than a year now, find the Project claiming be active, but in reality continue to find it dead as a doornail.
Visible Progress is zero, updates (if any) come in at a pace that would disgrace any freeware SourceForge Developer.

It's all about expectation, and if people are told to "hang in there", that's what they'll simply do (at least most of them). That's not "bloody sick", it's what you get from some of the most dedicated, best bunch of people any Distributed Project can wish for.


I didn't say hanging in there is bloody sick. I said whining about lack of WUs is sick. Are you purposely twisting what I said?

Had they closed down and just said :
We don't expect to have this baby up & running before mid-2007...
...folks would have accepted it (no other choice anyway) and checked in periodically and waited for the projected timeframe.


Well then just pretend that's what happened. It's far healthier than pretending neurotic whining is going to change anything.


What is wrong here is that this Project is hanging inbetween and going nowhere, for more than a year now.
Noone made the decision to shut it off, yet noone made the decision to put an appropriate effort to affect the other either.


Whoa! Stop right there. Who the hell are you to decide whether or not an appropriate effort has been made? As far as evaluating effort and progress on BOINC projects goes you are a nobody with an inflated ego. Your evaluation is not based on reality. It is based on your own selfish whims and fantasy.

That's basically what ticks people off, at least the few that remained here to date.


Suppose you tell us how many remain, Mr. Know-It-All.

Although someone recently called that "the best proof for a successful BOINC Project", I'd call it a "DC Community Test : how long can a BOINC Project last without any maintenance or progress made"
I'm surprised it didn't permanently go offline at some point e.g. after a power outage by now, because Staff forgot it was still there.


The fact they sent out a few thousand WUs last week is proof something is happening behind the scenes. I call it progress. The fact they are not here answering stupid questions about "why Harry snagged 20 of those and I got only 1" shows they have their priorities in proper order and realise most who post here are k00k whiners who don't deserve a minute of their time.

Personally, I don't come here to expect anything running anymore, it's just out of curiosity and time to spend to read through a few Forum threads a week.


Well, it that is truly the way you approach this project then I would say you have a healthy perspective.
148) Message boards : Number crunching : Project dead ? (Message 16830)
Posted 7 May 2007 by (banished: ID 70524)
Post:
By by !

Martin


If you leave quickly you can take your time coming back.
149) Message boards : Number crunching : Because you asked.... (Message 16829)
Posted 7 May 2007 by (banished: ID 70524)
Post:
It's just that I'm not the kind of guy who blows sunshine up everybody's butt. That distorts one's perception of reality. I call it the way I see it and I don't pull punches. I don't criticise people, I criticise their ideas/thoughts/behavior. I don't find that offensive when I am on the receiving end and I feel those who get offended are far too sensitive. Really, if one cannot stand having their thoughts/behavior criticised then perhaps one ought to refrain from posting, get a thicker skin or just think about what they're saying. For example, continuous, neurotic whining about the lack of WUs from lhc just begs the response I gave a few posts up. It's beyond passion, beyond hyper-passion, it's bloody sick and to not call it so does a disservice because they get the impression their behavior is normal and healthy.
150) Message boards : Number crunching : Because you asked.... (Message 16822)
Posted 6 May 2007 by (banished: ID 70524)
Post:
Dagorath, I like you.

But you're the biggest ass wipe in the lhc forums. Do you always drink a few cocktails before posting? Next time try posting sober. Works for me. - - ;)



Gee, I like you too, Ariel, but I'm not convinced it's working for you. What do you mean by ass wipe and why am I the biggest? I've been D&A free for a long time, never went to AA though.
151) Message boards : Number crunching : Because you asked.... (Message 16818)
Posted 5 May 2007 by (banished: ID 70524)
Post:
There's no harm in being hyper-passionate about a project.


Obsessing pointlessly over something one cannot change and demanding pointless dialogue about topics most here will never understand anyway cannot be healthy. Pray as they do at Alcoholics Anonymous...

God, grant me the courage to change the things I can change
The serenity to accept the things I cannot change
And the wisdom to know the difference.

Or maybe get a Lavalife account and find someone to play with. Or get a pet or something.

152) Message boards : Number crunching : Gone in a flash! (Message 16802)
Posted 2 May 2007 by (banished: ID 70524)
Post:
I hope your friend is including in his costs all the gas he's burning running his route. That's gotta be a significant impact againt his profits ...


Would be for some people but Richard is no dummy. He has other lucrative business near many machines. Other machines are tended by other people for a cut. Like I said, he doesn't work for peanuts, peanuts work for him.


153) Message boards : Number crunching : Gone in a flash! (Message 16800)
Posted 2 May 2007 by (banished: ID 70524)
Post:
I disabled work requests on this project.
I don't like the idea that somebody, sometimes throws me peanuts.


What's wrong with peanuts? :-)


My friend Richard owns 500 peanut vending machines and earns a tidy sum from them. Some people work all their lives for peanuts but not Richard. He has peanuts working for him ;)

154) Message boards : LHC@home Science : LHC magnets failed during test (Message 16723)
Posted 16 Apr 2007 by (banished: ID 70524)
Post:
Well, in that case, they could work around the problem by adding a pressure relief valve to their plumbing to reduce pressure load.


3 possible reasons why that's a bad idea:
- You end up spraying Helium into a confined space (tunnel) giving an asphysiation risk
- Spraying very cold Helium all round electrical equipment is a bad idea
- Helium's in short supply and getting darn expensive

As I implied before, the whole LHC is _very_ complex, and the specific magnet flaw is only part of the problem exposed here...

Henry

(and why do we have to have this idiot BBCode thing that can't manage simple lists?)


You mean lists like:

  • you would pipe helium escaping a relief valve away from the confined space
  • then no danger with electrical equipment
  • ok, who's hoarding all the helium?



155) Message boards : Number crunching : Please note: this project rarely has work (Message 16717)
Posted 15 Apr 2007 by (banished: ID 70524)
Post:
Can we get some more computer experts in


Well of course they can, silly. All you gotta do is send them 1000 $1 coins. They'll plant the coins and the coins will grow into 1000 money trees that the additional manpower you request can harvest at will.

Huh? You don't have 1000 $1 coins to spare? No problem! I have these magic beans I can send you. If you plant them they'll grow into a huge beanstalk that reaches up above the clouds to where a giant who has a chicken that lays golden eggs lives. All ya gotta do is climb the beanstalk, snatch the chicken while the giant sleeps and send the eggs to LHC@home. I know it works because I pulled the same scam on this other giant a while back. Tip: if the giant says "fee, fi, fo, fum" then he's onto you and you best run fast.

-- Jack

156) Message boards : Number crunching : Versions for x86_64 platforms avaible ? (Message 16710)
Posted 14 Apr 2007 by (banished: ID 70524)
Post:
What's the use of a 64-bit BOINC-Client?!?
BOINC does not do any calculation, so there's no other use in optimising it than cheating.


In this post at ABC@Home, Augustine explains that 32-bit Linux apps can spawn 64-bit apps on 64-bit Linux. Thus, there seems to be no need for a 64-bit BOINC core client for Linux. But it doesn't hurt, I think.

Windows is different, however. Only a 64-bit Win app can spawn 64-bit apps so if one wants to crunch with a 64-bit science app on Windows then one must be running a 64-bit BOINC core client.


157) Message boards : Number crunching : Because you asked.... (Message 16703)
Posted 14 Apr 2007 by (banished: ID 70524)
Post:
You can wait for BOINC to repair the damage or you can just exit BOINC and edit the client_state.xml file and change the long term debt for each project to 0. There is freeware utility available to do that for you but I can't remember the name or the download link.

158) Message boards : LHC@home Science : LHC magnets failed during test (Message 16673)
Posted 7 Apr 2007 by (banished: ID 70524)
Post:
Well, in that case, they could work around the problem by adding a pressure relief valve to their plumbing to reduce pressure load.


Nah, they would have put a relief valve in the helium vessel, that's standard pressure vessel design. But was it the proper relief valve?

The 4th post in this thread says they didn't take into account stress along the longitudinal axis of the tube. That leads me to believe they overestimated the strength of the helium container and might have installed a 10 atm (for example) relief valve when the vessel was capable of containing only 5 atm. When the helium pressure spiked the vessel ruptured before the pressure required to actuate the relief valve was achieved. That's my guess.

159) Message boards : Number crunching : What do we crunch for? (Message 16650)
Posted 31 Mar 2007 by (banished: ID 70524)
Post:
Yeeesh! Beverley Hillbillys proves that.

Yeeesh! You learn about the world from TeleVision? or Hollywood? That speaks volumes.


Harumph on your Yeesh!ing of my Yeesh! I certainly do learn a few things about the world from TV. Radio too. I watch News Live at 6 every day, don't you? Doesn't everybody? After NL@6 there is often good stuff on The Learning Channel and History Channel. You think the tube is just for sports?

160) Message boards : Number crunching : What do we crunch for? (Message 16644)
Posted 30 Mar 2007 by (banished: ID 70524)
Post:
And how do you deduce that hill-folk speak that way?


Yeeesh! Beverley Hillbillys proves that.

Let me tell you a story 'bout a man named Jed
A poor mountaineer who barely kept his family fed
Then one day he was shootin' at some food
And up through the ground came a bubblin crude.

Oil that is
Black gold
Texas tea


Y'all come back now.[/quote]


Jethro Beaudein


161) Message boards : Number crunching : What do we crunch for? (Message 16643)
Posted 30 Mar 2007 by (banished: ID 70524)
Post:
And how do you deduce that hill-folk speak that way?


Yeeesh! Beverley Hillbillys proves that.

Let me tell you a story 'bout a man named Jed
A poor mountaineer who barely kept his family fed
Then one day he was shootin' at some food
And up through the ground came a bubblin crude.

Oil that is
Black gold
Texas tea


Y'all come back now.
162) Message boards : Cafe LHC : Can you help to stop this inhumane slaughter? (Message 16641)
Posted 29 Mar 2007 by (banished: ID 70524)
Post:
Maybe my "grand scheme of things" is to grand, or possibly, inapplicable. But for better or worse, that's how I perceive the World.


Your perception is correct, I think. Man is selfish. We will destroy the ecosystem to the point where it plus our engineered systems can no longer support human life. We won't become extinct but our numbers will decrease to the point where we can neither produce smog pumps nor perceive a need to kill thousands of dolphins. We'll go back to living in mud and grass huts and hunting what little prey we haven't killed. We will kill only what we can eat and be one with Nature rather than Nature's enemy. We will hunt with pointy sticks until we learn how to make arrow heads and bows again, bows made from springy saplings not the fancy sort you see in stores today, the technology for that will be lost though you might see icons of those bows hung for worship in whatever we call churches. In short, Nature will adjust our numbers as Nature requires, without mercy.

In our spare time we'll write a new bible that blends vague memories of the past with some new ad hoc theology. Maybe we'll call him Joe this time instead of God. Maybe we'll get bit a little mixed up and call him Dog. Maybe some shaman will convince us Joe Dog wants us to "go forth and multiply" one more time. Whatever. Eventually, from that theology there will emerge philosophy and eventually, after a thousand oppressive kings and cruel empires that philosophy will become science. If we're lucky, the selfishness will have been "bred" or "evolved" out of us but not likely. That part of us seems to be our destiny. Or will it simply be our choice, again? Our choice because it's the path of least resistance, again?

Hey, it's not that *I* have a problem with all that, no not the *me* that sits before these keys "full of sound and fury" as Will said, pounding out pleas to stop the insanity. Why would *I* complain! I live in a warm cozy house and when I want steak I just hop in my smog pump, drive to the store and buy it and a bottle of fine Burgundy to wash it down with, a cheesecake to follow that and a young girl with slim milky white thighs and huge plastic breasts to follow that. And when that's done this *me* burps to denote its pleasure, scratches out a cheque to some poor kid in Africa to ausuage its conscience and crunches a few WUs in case the cheque to the kid in Africa doesn't take, they never bounce but sometimes they just don't take. See? Who cares about dolphins or what my grandson will be doing 100 years from now!! Reincarnated back into the body of my grandson? Forced to live through the hell this *me* helped create through my selfishness? Phapp!!!! Give me one shred of scientific evidence that we are reincarnated. Me fending off big cats with a pointy stick and a pine pitch flare!! Ha! No way! There won't be any pine trees then!!!

Read "A Canticle to Liebowitz", Bantam books, circa 1965, can't remember the author, if you can find a copy. You'll never read another novel that will make you wonder so much about who you really are and what you're going to become.

EDIT: Sorry for the rant. Deep down inside I do know that the solution is to genetically engineer the dolphins to eat Big Macs like everybody else does. Then they wouldn't eat the fish and the Japanese wouldn't have to kill them so they can have the fish instead.

163) Message boards : Number crunching : Trojan used by dishonest BOINC cruncher (Message 16636)
Posted 27 Mar 2007 by (banished: ID 70524)
Post:
There may be some movement on this issue at Predictor.
According to the QMC Forum, Mr. Braun has frozen the account.


But Braun doesn't have the cahones to post on the Predictor forums that he was initially wrong and that he has now frozen the Wate account. I never expected him to have the cahones but it does not matter because I don't have any intention of crunching Predictor again under any circumstances

Shun Predictor!! Wipe the words Predictor and David Braun from every document and never utter the words again.



164) Message boards : Number crunching : Trojan used by dishonest BOINC cruncher (Message 16628)
Posted 26 Mar 2007 by (banished: ID 70524)
Post:
Grrr I was really out of it when I posted that last. My thoughts on the matter are the same, but here is the real text from that post. The mistakes I put in the last post are fair evidence of why the mob should not go on a rampage before doing some fact checking.

<quote>This thread is closed. Please do not create a new one.

Since this thread was first created I have deleted two other threads incorrectly accusing volunteers of cheating.

If someone has installed the boinc client on machines that they do not have permission to that is wrong. I have no way of knowing if this has or hasn't happened. Just because "some guy" posted something on the internet is not good enough reason to take any action against anyone.

dlb</quote>

The major complaints I keep seeing are "I started a new thread discussing Wate and it was deleted". Not surprising since they asked that people treat the problem as handled & get on with their lives.

As for false accusations, their final statement says it well; "Just because "some guy" posted something on the internet is not good enough reason to take any action against anyone"

One of those banned said he had 7 new threads deleted ... maybe the Admins just got tired of swatting flies and put up screens,


It was not just "some guy" who made the accusations. It was a respected moderator who passed along information given to her by respected project admins at a respected project, CPDN. The information was later corroborated by the project admins in the same thread in which the moderator initially related the details. Other respected admins at other respected projects apparently weighed the evidence against Wate and arrived at the same conclusion... Wate hijacked computers to use for crunching. Now think about this Fritz and think carefully... no project admin throws away the crunching power Wate claimed to own without giving it some serious consideration. You can be sure the admins at CPDN weighed the evidence carefully before deciding Wate was guilty. You can bet your last dollar they considered how they would look if, after deleting Wate, he came back and proved he was legitimate. Braun, if he had been smart, would have thought about that too or at least would have been open to discussing it. Instead, he censored and banned any and all discussion.

See, Fritz, nobody claims Braun had to immediately buy into it and follow suit. A discussion would have sufficed. We all know the majority can be wrong but that is the purpose of discussion... to weigh the evidence, sift out the chaff from the grain and act prudently. Instead, Braun censored and banned those who were merely attempting to convince him to weigh the evidence and do the right thing. Now he and Predictor look like fools and many of us have little doubt they are exactly what they appear to be.

165) Message boards : Number crunching : Trojan used by dishonest BOINC cruncher (Message 16623)
Posted 25 Mar 2007 by (banished: ID 70524)
Post:

@Fritz

If you read the final message in the thread linked to, you will find that the Admins there banned 2 people for posting false accusations. They also state that where there is no evidence they cannot assume a breaking of the rules. That seems to be fair enough.


The accusations are not false. There is plenty of evidence and it is conclusive, Wate broke the rules. The evidence (at least links to the evidence) was given to Predictor. David Braun refused to read the evidence, refused to discuss the evidence and decided to censor rather than read and discuss. His attitude is plainly arrogant. Scientific investigation requires humility, not arrogance. People like Braun who think they know everything and cannot benefit from discussion are incapable of conducting science. And since the project leader, Brooks, has not indicated he disagrees with Braun then the only reasonable assumption is that Brooks is either disinterested or equally as arrogant as Braun. They're not scientists, they're goofs with a BOINC server, nothing more.

I haven't looked at their message board rules, but if there is mention of posting false or unsupported info that harms, or is intended to harm. another user, that would justify the 2 banishments mentioned.


Well, if deleting Wate's account and not using CPU cycles stolen from unsuspecting people is harm in your books then you need your head read.

If Admins there declared the matter closed and asked that new threads beating this dead horse not be created, then it would be well within their rights to delete threads as they appear, again haven't looked for this, but they have stated that they are dealing with the person who has been proven to have violated their rules and from the Admin's comments & Pscheofer's complaint they are containing the issue in a manner they deem best.


The horse is not dead. If they would do the right thing and ditch Wate and stop using the computer's he hijacked then the isue would me more or less dead. But they refuse to even discuss it. Then spin doctors like you call it "containing". In fact, Predictor is engaging in a criminal activity by using stolen CPU cycles.

166) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16571)
Posted 17 Mar 2007 by (banished: ID 70524)
Post:
What about that fried rice receipt?

Well, it's "recipe" not "receipt", and I truly appreciate that you are so candid about language skills. This thread is a great place to learn English, and I will be happy to help. Be prepared to have someone say something negative about that; ignore them.

As for the fried rice recipe, I'm not sure what Quantum was thinking but I have one of my own.

Spam Fried Rice
1. Put a very small amount of cooking oil in a wok or large frying pan.
2. Heat oil to medium-high temperature.
3. Add 4 to 6 ounces of thinly sliced or diced Spam (1/2 inch or less.)
4. Cook Spam until light brown or slightly crispy.
5. Add previously steamed white rice and mix until evenly coated with oil.
6. Add steamed green peas, corn, diced carrots, and/or other steamed vegetables.
7. Mix and turn for about 5 to 10 minutes, or until rice begins to clump.

Yum.


This thread was philosophical at the beginning since it wrestled with the question of whether the desires (not needs) of a few should be allowed to create a bunch of unnecessary work for project devs/admins.

The point where it turned to discussion of Godwin's Law is where most of us folks who were incorrectly branded as trolls stopped posting because it's really hard to type when you're rolling on the floor laughing your buttocks off. (Are we allowed to say buttocks, booger and barf?) We never have lived under a bridge and never will. The fact that some some people bandy about words like troll, spam, philosophy and receipt with no apparent understanding of the meaning of such words is amusing and, at the same time, slightly disgusting.

As for cooking... peas, corn and rice are basically the same thing, starch. The only significant difference is the color and flavor. Forget peas and corn, sub in brocolli, thin sliced cabbage, bok choy, asparagus, green pepper, etc. for the peas and get some vitamins/minerals instead of predominately starch, no need to pre-steam them. Toss a little smashed garlic into the oil before any other ingredients (essence of garlic is an oil and therfore non-polar so it dissolves nicely in other oils which helps suffuse it into/onto other ingredients). Sprinkle a healthy bit of curry on the rice after it is hot, before adding veggies. After spooning onto plate, garnish with dollop (or 2/3) of ketchup, Heinz of course.

167) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16545)
Posted 12 Mar 2007 by (banished: ID 70524)
Post:
and as the author of BOINCprog, I'm satisfied that I've given back to the community sufficiently to be forgiven for the occasional whinge or lapse in judgment.


Your bigger than Manhattan ego and elitist attitude endears you to every Nazi hiding in S. America. As for BOINCprog, lol, I wrote more and better than that for weekly assignments for Comp Sci 101. You're no software engineer. You're just another picker in another bar band pretending to be somebody. And it's whine, not whinge.

168) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16533)
Posted 11 Mar 2007 by (banished: ID 70524)
Post:
Francks, I would rather be a troll than an idiot like you. There is hope for trolls but unfortunately there is no hope for morons like you. You'll always be as stupid as you are today.

How stupid are you? You're so stupid you can't tell the difference between me attacking someone's ideas and me attacking the person. That makes you an idiot and you'll never be anything more.

And if any of you other morons thinks I care about being banned or thinks I actually can be banned then get your head read. Maybe you'll believe a doctor when he tells you you're an idiot.





169) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16530)
Posted 11 Mar 2007 by (banished: ID 70524)
Post:

You insult me by calling me a troll. You can't defeat the arguments with logic so you launch a personal attack. You broke the rules.

ALL your damned posts are personal attacks


That's a blatant lie and a personal attack and the d* word is profane. I am insulted. You broke the rules with 4 infractions in 7 words. That's gotta be a record, congrats.

YOU are breaking the rules, and basically shouting asking for a ban.


So are you. I guess you want a ban too.


170) Message boards : Number crunching : claimed credits (Message 16528)
Posted 11 Mar 2007 by (banished: ID 70524)
Post:
I've seen this but that is not an answer of my question. Sorry!
What have Intel and AMD to do with claimed credits? I only know there are differences between Windows and Linux in claiming credit.



Claimed credit depends on more than just CPU time, it depends on benchmark scores too. It is not unusual for host A to take less time than host B but claim more credits. It can happen if host A's benchmarks are higher than host B's.

171) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16527)
Posted 11 Mar 2007 by (banished: ID 70524)
Post:
I honestly do believe you to be trolling. It is not intended to be an insult, but you should feel free to take it as such.


Right back at you. I honestly do believe you to be an imbecile. It is not intended to be an insult but you should feel free to take it as such.

172) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16525)
Posted 11 Mar 2007 by (banished: ID 70524)
Post:
Odd, the k00k whiners who masquerade their ridiculous little yearnings


Troll. Go away. 0 credits, 0 RAC, 0 credibility.


You insult me by calling me a troll. You can't defeat the arguments with logic so you launch a personal attack. You broke the rules. And your statement is by far the most ridiculous thing posted in this thread so far, even more ridiculous than the "I feel so helpless" statement, lol, as if a poster's RAC has anything to do with the truth of his statements. Man you are an imbecile for sure!


I will not lose a moment of sleep fretting over the fact I have failed to endear myself to selfish imbeciles.


If you would refrain from personal attack and insults, you might have SOME credibility. As it is, you have none. Troll.


You just don't get it, imbecile. If you don't want to be called an imbecile and a k00k then don't act like an imbecile and a k00k. It really is that simple.



As for profanity, if you don't like words like crap then don't write crap


Crap wasn't the word to which I was referring. It just happened to be the one I quoted. Have a splendiferous day, troll.


Have an even more splendiferous day, moron.
173) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16522)
Posted 11 Mar 2007 by (banished: ID 70524)
Post:
You post that crap as if you're the only one in this forum who has figured that out.


You, sir, have a gift for not endearing yourself to others. You would also do well to skip the profanity.


Odd, the k00k whiners who masquerade their ridiculous little yearnings as essential needs which, if unfulfilled will lead to their demise as well as the demise of this project, have a gift for not endearing themselves as well. Believe me, I will not lose a moment of sleep fretting over the fact I have failed to endear myself to selfish imbeciles.

As for profanity, if you don't like words like crap then don't write crap for there is no other way to refer to crap than to call it by it's proper name... crap.

174) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16519)
Posted 10 Mar 2007 by (banished: ID 70524)
Post:
"Helpless because I can't crunch them", I haven't heard anything so ridiculous for a long time.

Rules:
# No messages whose only intention is to annoy or antagonize other people.
# No messages that are deliberately hostile or insulting.

Was that comment really needed?


Yes, it is important to let k00ks know they are k00ks. Otherwise they continue posting their k00k crap. Their nonsense is annoying, antagonising and insulting. It breaks all the rules.

He's expressing an OPINION, and it's in fact the most accurate description of the problem I have seen.


The problem? WHAT problem? It is NOT a problem if he doesn't get WUs. NOBODY INCLUDING THE ADMINS give a rusty rat's ass if he doesn't get WUs or whether he feels helpless/happy/hopeful/hungry. The fact he thinks they should care proves he's a k00k.

If there is any problem then it's a problem with the way his/your synapses fire. For some reason, he/you thinks LHC owes him/you WUs and that something is amiss if he/you doesn't get any. Idiotic, moronic, lunatic, stoopid, k00k whining.... those are the only words that describe his/your "logic".

it's not fair credit-wise.


Next to the "i feel so helpless" crap, that has to be the 2nd most ridiculous thing I've read in a long time. You seem to think this project exists to serve you credits and serve them fairly. You've got it completely backwards. You could not possibly be more wrong. Please, just stop posting. The crap you post here is an insult and it irritates.


Imagine this: the project has a thousand hosts, and sends a thousand workunits, which are grabbed by 10 hosts (a hundred each). That's wasted computing power. They would get it done a hundred times faster if it was one WU per host. It's useless to have so many hosts attached if only a few will have work at a time. Of course, this "hundred times faster" is an exaggeration, I don't know what are the actual numbers (average number of units per host when work is sent).


You post that crap as if you're the only one in this forum who has figured that out. BWAAAHAHAHAHAHAHAHAHAHA. Imbecile. And I say that not to insult you or irritate you. I say that to help you get a grip on reality and know thyself.

The fact is, witholding WUs from any host may very well cause the WUs to go to a host that takes even more time to return them. If they had been sent to the host that was denied then that host might have crunched them all really quick and returned them the same day. The point is, it is impossible to know whether the requested feature will speed up or slow down return of results. And there may be no need because the WU flow may be continuous. So why should the project bother implementing it? Maybe, MAYBE some time in the future when it's all running smooth and they have absolutely nothing else to do and they are all bored stiff. But NOW?? You k00ks just prove you're k00ks by pressing the point now and engaging in childish emotional blackmail like "I feel so helpless".



175) Message boards : LHC@home Science : Close Account (Message 16445)
Posted 1 Mar 2007 by (banished: ID 70524)
Post:
Can Linux run underneath a full-fledged GUI... somewhat the way DOS used to run underneath Windows and Solaris can, but doesn't necessarily need to, run underneath Java Desktop?


You've got it all backwards. GUIs, including Windows, always have and always will run on top of the basic command line operating system. You seem to think it's the other way around. It is not.

Also, does BOINCview run well on Wine? What about KBoincSpy?


Some say BOINCview runs on Wine, others say it runs but not well. It never worked at all under Wine for me but I didn't try very hard. I bet it would run best on Xen, a virtual machine for Linux but have never tried that so not sure.

Finally (hope I'm not annoying you all) is openSUSE/KDE something that runs with Linux or instead of Linux? (Hey, don't roll your eyes like that!)


No with or instead about it. OpenSUSE is Linux. Not the only Linux, but it is a Linux. OpenSUSE is one of several Linux distributions, distro for short.

KDE is a desktop that runs on the X graphic subsystem which in turn runs on Linux. Gnome is another desktop that runs on X. The Knoppix Live CD uses a desktop called Flux Box. These are not just different wallpapers glued over top of the same basic desktop as in Windows. These are different desktops.

I'm thinking of converting one of my Windows boxes to a Linux box, so any helpful comments would be much appreciated.


Just remember, Linux is very modular and very customisable because it's all open source. That is generally a good thing but it has led to many different flavors of Linux and some confusion until you get the hang of it. Just when you think you've got it figured you'll realise it works that way only for This Distro or That Distro and only if you have modules A, B and Z installed and not C. If you're going to convert a Windows machine then you'll be a little frustrated at first until you get a feel for Linux.

Live CDs like Knoppix are great for getting a feel for Linux but they run the Flux Box desktop which most Linuxes do not use. Also, Live CDs don't give quick and easy access to your disk. Try a Linux Live CD but don't get the impression that is what Linux is all about. Install a mainstream Linux like openSUSE, Fedora, Debian or others if you want to give Linux a proper chance. Each distro has 1 or more forums dedicated to that distro so use those forums as primary sources of help, at first, when you want to be sure you're getting advice that applies to the distro you installed and not some other distro. There are also forums that are not dedicated to any distro. Those have excellent help too but always specify which distro you ask about and always remember the advice you read may not apply to your distro.

176) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16440)
Posted 28 Feb 2007 by (banished: ID 70524)
Post:
This will benefit the project because you won't have a single host sitting on hundreds of WUs that won't finish within a month


Then all the project has to do is put a 1 week deadline on the WUs. If they feel that's not necessary then so be it, it's their project, love it or leave it. And as the batch winds down to the final few hundred WUs they can shorten the deadline even more to prevent stragglers.

That is what I don't find to be "fair", it is NOT fair to the project.


Funny, but I don't hear the project complaining about it. If it were a problem for them they would do something about it. Like Finney, you are making unwarranted assumptions about how quickly results need to be returned and what the project's needs are. MYOB. You're imagining a problem where none exists.

I am not bothered by the fact that I don't have any LHC WUs to crunch. I am bothered by the fact that there are WUs to crunch and I feel helpless because I can't crunch them.


Relax, pour yourself a tall cold one and crunch the WUs you CAN get. You're making mountains out of mole hills. All you're going to get out of this messianic complex of yours is a heart attack. "Helpless because I can't crunch them", I haven't heard anything so ridiculous for a long time.

177) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16425)
Posted 27 Feb 2007 by (banished: ID 70524)
Post:
I can't believe that you would INNOCENTLY sit there and tell me that my computers need to be limited so that your *single* computer can get it's 3 alotted work units...


But he's not telling you that. Read his/their proposal. I see nothing in it that would limit your computer or anybody else's on MOST days. The only time their proposal might limit someone else is at the very end of the WU batch where it might work out that he could be sitting with a few WU in his cache when you have none. But I can also see where it could work out the other way too. In the end, I don't think a 1 or 2 day delay of a few WUs at the end of a batch will make an appreciable difference to the way the universe unfolds.

Where you are absolutely correct, Finney, and this is why I know you and I are birds of a feather, is that there are far too many crunchers who think that the act of attaching a 'puter to a project creates a binding contract requiring the project to drop everything else and bestow eternal bliss upon them first by contorting and convoluting the code in the most unusual ways. And they always seem to end their poasts with "...that shouldn't be too hard to implement" when they probably couldn't write a 2 line windows .bat file and make it work.

It's really become a gimmee, gimmee, gimmee thing when it was always intended to be the other way around. The ONLY thing we can reasonably expect is regular updates on problems and developments so we can make informed decisions, AND THAT'S ALL FOLKS. If you feel your million and 1 petty litle yearnings are the god given rights and basic necessities then... hey! roll up your sleeves, grab a free compiler or maybe just learn how to script and damn well make what you want, debug it and submit it to the devs. What's that you say? You don't have time? And the feature won't really be needed in 2 months? So it would all be a waste? Well, now maybe you know how the admins and devs feel!! And maybe you understand why some of us ROFL when ya poast, yah poast, not post.
178) Message boards : Number crunching : Fairer distribuiton of work(Flame Fest 2007) (Message 16409)
Posted 26 Feb 2007 by (banished: ID 70524)
Post:

@Aaron Finney

Actually you are inflammatory. Flamatory too, probably. Of course that means we're birds of a feather and I like you already so please don't think I'm flaming you.

Difference of opinion I guess, but I would like to think that you could find a project you like enough to actually give them something worthwhile.


You would like that, huh? And now you're sooooo disappointed, huh? Well, dirty cream o' wheat AF, I would like to think you're bright enough to realise that maybe the good doctor loves all the projects equally and just can't make up his mind which project he wants to be his One and Only True Love. Please don't disappoint me and remember... I like you, don't feel I've flammatoried all over ya. OK?



Previous 20


©2024 CERN