Message boards :
Number crunching :
most unpolite host of the day
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 7 · Next
Author | Message |
---|---|
Send message Joined: 22 Jan 18 Posts: 32 Credit: 2,756,359 RAC: 0 |
Here is something fun. I couldn't get a Cosmology VBox WU all weekend and into this week (0 WU available) so I crunched some other stuff including Universe@Home, SRBase, Yafu, YoYo, and some massive Citizen Science Grid WU (3+ days each). Every task was successful and validated. So I finished up all tasks, uninstalled VBox, uninstalled BOINC, rebooted, and reinstalled a fresh copy of the latest BOINC (7.12.1 + VBox 5.2.8). Added LHC again and got the exact same results with a 8 CPU WU...it finished in 600~ seconds as invalid. So I switched my preferences to 4 CPU max and it downloaded some of those. This time around it took what I consider a better time, a little over 3 hours. However the couple I did all gave a "error while computing" and actual errors in the stderr output. Here is one of them: https://lhcathome.cern.ch/lhcathome/result.php?resultid=204064087 Can anyone interpret this as anything? |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,136,943 RAC: 31,282 |
... and reinstalled a fresh copy of the latest BOINC (7.12.1 + VBox 5.2.8). You better had taken 5.2.16. I'm just testing it on my win10 1803 boxes, with BOINC 7.12.1 and VB 5.2.16 and it works fine. But my machines don't crunch 8-Core-WUs instead 2x (3x) 4-Core-WUs. Supporting BOINC, a great concept ! |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,136,943 RAC: 31,282 |
|
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
The "Exit status 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED " in the stderr output pretty much says it all... not buggy software, host not configured properly for the job, classic PICNIC. Anyway, that host leads back to a user named Anonymous which may or may not be you which means your example task may or may not be yours. If you refuse to expose your hosts then there is no way anybody can help. I believe you're here just to yank chains. |
Send message Joined: 28 Sep 04 Posts: 728 Credit: 48,853,904 RAC: 21,484 |
The "Exit status 196 (0x000000C4) EXIT_DISK_LIMIT_EXCEEDED " comes from a situation where the disk usage of a slot directory where the task is running exceeds the limits set in init_data.xml <rsc_disk_bound>xxx</rsc_disk_bound>. Either the value was set too low for the task by the server when task was created or for some reason there were some extra files in the slot directory left behind from previous tasks or extra files were created during the task calculation which bloated the disk consumption over the <rsc_disk_bound> value. This is not related to any user setting of allowed disk usage for Boinc. |
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
Thank you. My mistake. |
Send message Joined: 28 Sep 04 Posts: 728 Credit: 48,853,904 RAC: 21,484 |
No worries, it is an easy enough mistake to make (most people do). I just happen to know because I've been around here long enough. Sixtrack has had a few of those occasions with this error and virtualbox tasks as well when it was introduced. If you google the error, you find links back to these message boards talking about them. It is too bad that in this case the error makes solving the problem in hand even more difficult than normal. |
Send message Joined: 22 Jan 18 Posts: 32 Credit: 2,756,359 RAC: 0 |
and did you ever take a ride through this checklist ? Yes...multiple times. Even tried running just 1 WU by itself. Same results. If you refuse to expose your hosts then there is no way anybody can help. I believe you're here just to yank chains. I did expose my hosts, for a week, and one person looked at things but didn't know what was wrong. Every reply you've given me has been negative and not helpful. I'm trying to figure out what is wrong and all you do is think I'm trying to cheat. Which doesn't make any sense. Please stop replying to my posts if you dont have anything helpful to say. It is too bad that in this case the error makes solving the problem in hand even more difficult than normal. So is there anything I can do other then abandon this project on this host? Can I manually put a setting in the app config the Tells it to use more disk space? The server has 100+ gigs free space so its definitely not a resource issue. I ended up doing 4 4-core WU and all ended with the same error. |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,136,943 RAC: 31,282 |
|
Send message Joined: 22 Jan 18 Posts: 32 Credit: 2,756,359 RAC: 0 |
So is there anything I can do other then abandon this project on this host? Currently it is set for use no more than 100 gigs of disk space, leave at least 10 gigs free, and use no more than 90%. The drive is a 200Gb SSD which currently only has 50 gigs used. Surely an Atlas task can't use a 100 gigs of space, can it? |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,136,943 RAC: 31,282 |
Currently it is set for use no more than 100 gigs of disk space, leave at least 10 gigs free, and use no more than 90%. The drive is a 200Gb SSD which currently only has 50 gigs used. Surely an Atlas task can't use a 100 gigs of space, can it? This should be okay. Does the client show the same figures as the WEB ? Perhaps you set a local profil in the past ? Supporting BOINC, a great concept ! |
Send message Joined: 2 May 07 Posts: 2242 Credit: 173,898,210 RAC: 2,810 |
This is one from vseven from a earlier message. https://lhcathome.cern.ch/lhcathome/result.php?resultid=204064087 Is this your Computer? |
Send message Joined: 22 Jan 18 Posts: 32 Credit: 2,756,359 RAC: 0 |
Currently it is set for use no more than 100 gigs of disk space, leave at least 10 gigs free, and use no more than 90%. The drive is a 200Gb SSD which currently only has 50 gigs used. Surely an Atlas task can't use a 100 gigs of space, can it? Yes, same settings on client. I unchecked all three anyway so it has no limits, removed the project completely, verified the project directory was gone, and readded the project. Its downloading the VDI and a 4 CPU task right now....I'll know in 5 - 6 hours if it worked (maybe more if it doesn't error out.) This is one from vseven from a earlier message. Yup, same computer. |
Send message Joined: 22 Jul 17 Posts: 1 Credit: 5,047,977 RAC: 0 |
and did you ever take a ride through this checklist ? Sorry folks, I can't follow you. The concept of BIONC is a platform for distributed computing. Usable for interested people without the need to take special care. Without the need to read several pages of quite specialized computer-stuff. Part of the design is to scale down the tasks in a way most PC are able to deal with flawlessly. Imho the problem with LHC-tasks like ATLAS is pushing out overly demanding tasks from overambitious projects. No wonder this kind of setup is causing trouble. It makes no sense to insult people not able to compute those monster-jobs properly. It's the result of a project not respecting the limits of BOINC. Such elitism-computing-approach is bad for the overall reputation of BOINC. If you want to participate from the large numbers of the "mass market" I suggest to civilize those tasks into a form the mass market is able to crunch. (see Cosmology dealing with VirtualBox, f.e.) Else call out an extra-project and ask for users willing to prepare the extras needed. And protect that extra-project with a password from the dumb folk. |
Send message Joined: 7 Aug 14 Posts: 27 Credit: 10,000,233 RAC: 2,319 |
As noted on this thread on the ATLAS boards, the validation logic has been tightened so that credit is not allocated for these short unsuccessful tasks. Another couple of tasks sent 4th August and returned 7th August got validated for 325+ credits... https://lhcathome.cern.ch/lhcathome/result.php?resultid=203599212 https://lhcathome.cern.ch/lhcathome/result.php?resultid=203606471 So catching most but some still get through. Shame he hasn't been able to get tasks to work properly. |
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
Such elitism-computing-approach is bad for the overall reputation of BOINC. <sarcasm> Elitist computing is a hoax invented by the Chinese, embodied by the abacus, for the purpose of bankrupting IBM and Microsoft. It has been expanded by leftist, virtue signaling globalists planted here by the UN cabal. Anybody who can crunch ATLAS tasks is obviously Illuminati, born in Kenya for sure. They all sell uranium to Putin and hookers from family owned pizza joints. </sarcasm> |
Send message Joined: 22 Jan 18 Posts: 32 Credit: 2,756,359 RAC: 0 |
Shame he hasn't been able to get tasks to work properly. So this test task, after telling BOINC there was no hard drive space limits at all by unchecking all three options, was a success (marked valid) and appears to have created a HITS file. It took just under 7 hours using 4 CPUs. I don't know what the difference between telling BOINC it can use 100Gb and not to exceed 90% when I have 100+ Gb free and telling it there are no limits but it didn't give the disk space error. Going to run some more 4 CPU tasks then re-try some 8 CPU tasks and see what happens. |
Send message Joined: 13 Apr 18 Posts: 443 Credit: 8,438,885 RAC: 0 |
I don't know what the difference between telling BOINC it can use 100Gb and not to exceed 90% when I have 100+ Gb free and telling it there are no limits but it didn't give the disk space error.You have misinterpreted the disk space error the same way I misinterpreted it. See Harri Liljeroos's post upthread for a good explanation of what it means. Going to run some more 4 CPU tasks then re-try some 8 CPU tasks and see what happens.I think the experts here will agree that with respect to efficiency you're better off running 2 X 4-core tasks than 1 X 8-core tasks but it's your rig so experiment as you see fit. 2-core tasks are even more efficient than 4-core tasks, the trade-off is that 4 X 2-core tasks requires more memory than 1 X 8-core task. |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 201,136,943 RAC: 31,282 |
I don't know what the difference between telling BOINC it can use 100Gb and not to exceed 90% when I have 100+ Gb free and telling it there are no limits but it didn't give the disk space error. HARDDISC-spaces have grone so much in the last years and if the developers (or the System itself) uses a wrong value, this will give unwanted side-effects. Example of former computer-days. If you have a real integer-variable and the variable contains 32767. What happens, if you add 1 to this variable ?
Supporting BOINC, a great concept ! |
Send message Joined: 22 Jan 18 Posts: 32 Credit: 2,756,359 RAC: 0 |
And something similar this way could make the difference you saw here Sure. But why would it only affect Atlas tasks and no other projects? I'm pretty positive VBox would know the difference since its kept up to date.....would it be in the older VBox wrapper this project uses? I completed another 4 CPU tasks without errors (around 9 hours) and with another 160Mb HITS file so my host seems to be happy. Still going to try a 8 CPU and 2 CPU task to make sure those both work. |
©2024 CERN