Message boards : Number crunching : Request for database cleanup
Message board moderation

To post messages, you must log in.

AuthorMessage
Toby

Send message
Joined: 1 Sep 04
Posts: 137
Credit: 1,691,526
RAC: 48
Message 13508 - Posted: 4 May 2006, 23:53:20 UTC

This is primarily for the admins of this project but feel free to chime in if you have something to add :)

So I'm sure you guys aren't just sitting around over there eating swiss cheese all day long however it seems like you have no plans to perform some database maintenance that needs to be carried out on this project. There are 2 issues I would really like to see fixed at some point as they are creating problems for the external stats sites and annoying users.

1) Clear out the old orphaned work units. I (and many other users) have several hosts that no longer exist or are not able to run BOINC for me anymore. I would really like to delete them so that my cross-project host stats are cleaner and more accurate. They have been deleted from all of the other projects they once participated in however I can't delete them here on LHC because they still have work units from 2005 associated with them. There was a bug in the BOINC backend that caused this problem. I believe it has been corrected and there are ways to clean up the mess. If you need help, I know the staff of seti@home have battled with this problem several times and I'm sure they would be glad to point you in the right direction on what to do with these orphaned work units.

2) There are still a couple hundred host records that are mangled from the upgrade that went bad and corrupted all the data in the host table. This is creating all kinds of trouble for some of the features I would like to implement on my stats site. It has been so long now that I think it is safe to assume that these hosts are no longer participating in the project. As such they could be safely deleted and thereby eliminated from the XML stats export.

Obviously these problems aren't the end of the world but if you have a few spare moments I think the users around here would appreciate these things being looked at. I know I would :)

I have mentioned this at least once before in another thread but I'm not sure if any of the admins saw it or not. If the answer is "no, we don't think it is worth our time" then so be it - I will let it go. Just wanted to make sure someone knew what issues it is creating for us users.

Toby
- A member of The Knights Who Say NI!
My BOINC stats site
ID: 13508 · Report as offensive     Reply Quote
Gaspode the UnDressed

Send message
Joined: 1 Sep 04
Posts: 506
Credit: 118,619
RAC: 0
Message 13509 - Posted: 5 May 2006, 1:14:41 UTC

This is primarily for the admins of this project but feel free to chime in if you have something to add :)


It's unclear whether we have an admin on this project at present. Chrulle has left for pastures new, and no new name has appeared. Ben Segal has overall responsibility for the project, but he's not involved on a day to day basis. Other names I have seen associated with the technical background either haven't been seen for a long time, or have no involvement beyond the application development role, or both.


the staff of seti@home have battled with this problem several times


Here's the real issue behind some of the maintenance you ask for: it's not easy. Previous requests of this type have been turned down because the solution could introduce more problems than it fixes. It would be nice to think that the downtime could be used to tidy things up, but the reality is that there's no resource available to do it.


Gaspode the UnDressed
http://www.littlevale.co.uk
ID: 13509 · Report as offensive     Reply Quote
Profile Chrulle

Send message
Joined: 27 Jul 04
Posts: 182
Credit: 1,880
RAC: 0
Message 13528 - Posted: 8 May 2006, 14:54:35 UTC

It's unclear whether we have an admin on this project at present.


When i left, there was talk about a new spanish ph.d coming to replace me. But i have not heard anything more than that.


Other names I have seen associated with the technical background either haven't been seen for a long time, or have no involvement beyond the application development role, or both


All the people that have worked as sys admins through time has gone back to their studies. Jakob, Karl, Kalle and Markku all left to finish their studies, as did I. I returned and worked as a sysadmin for a year, but I am back in Denmark now enjoying the unseasonably warm weather and writing job applications.

The sys admin job has now been transferred to Ignacio Reguero and Phil Defert. Phil is working on automating the installation of a boinc server(so it is as simple as an rpm package), this makes it a lot easier to integrate the two new servers we got in the cern computer centres. Ignacio is mainly working on porting the new applications to lhc@home. Furthermore lhc@home is only a small part of their jobs. I have promised that I will be available if they have any technical question or need a few hours help with something I can guide them through remotely.

So you should not expect any major work being undertaken for the moment. Dave Anderson has also just released a request that people update the server software as soon as possible. I did an update just before I left so some of the features are already there. So I would guess they are mainly working on this.

2) There are still a couple hundred host records that are mangled from the upgrade that went bad and corrupted all the data in the host table. This is creating all kinds of trouble for some of the features I would like to implement on my stats site.


Couldn't you make a list of those hosts' id numbers and then use that list to discard those records from your stats program? You could also post that list here that would make the job of removing/fixing the records a lot easier for any coming admin.

cheers,
Chrulle


Chrulle
Research Assistant & Ex-LHC@home developer
Niels Bohr Institute
ID: 13528 · Report as offensive     Reply Quote
Toby

Send message
Joined: 1 Sep 04
Posts: 137
Credit: 1,691,526
RAC: 48
Message 13530 - Posted: 8 May 2006, 19:40:11 UTC - in response to Message 13528.  
Last modified: 8 May 2006, 19:42:01 UTC

I haven't really been paying attention to the "behind the scenes" news of this project - just crunching work units whenever they are available :) So I wasn't aware of all of the changings/leavings that have taken place. I can understand this issue not being at the top of the To-Do list.

Couldn't you make a list of those hosts' id numbers and then use that list to discard those records from your stats program?

Technically yes but all my stuff is programmed in a project-agnostic way so I would have to add in some special cases specifically for LHC. There have been problems like this before but so far I have managed to get projects to clean up their data - since that is usually more desirable for both me and the projects.

You could also post that list here that would make the job of removing/fixing the records a lot easier for any coming admin.


Sure. I'm just going off of the host CPID. It looks like the corruption caused some number from another field to be stored in the CPID field. Normally the CPID is either null or a 32 character string so any records where it isn't one of those two is a pretty good indication that things are bad. Of course I don't have access to all the data so I may not be seeing all of them but I get a list of 197 hosts with those criteria. As a double-check I included the OS name in the list. In corrupt hosts it is also some number from another field. Here is the query and its results in my database. Warning! SQL ahead! :)

edit: I see the formatting is kind of ugly. At least <pre> works a little better than <code>

mysql> select id, os_name, host_cpid 
from lhc_hosts where host_cpid is not null 
and length(host_cpid) > 0 
and length(host_cpid) < 32 
order by id;
+-------+------------------+------------+
| id    | os_name          | host_cpid  |
+-------+------------------+------------+
|   548 | 4323203497.21169 | 1          |
|  3734 | 977385294.095796 | 1          |
|  8971 | 0                | 0          |
|  9677 | 1983243362.2538  | 1          |
| 14537 | 1561268999.72147 | 1          |
| 22501 | 2600236416.96948 | 11         |
| 23004 | 0                | 0          |
| 23006 | 0                | 0          |
| 23865 | 3726697966.29887 | 0          |
| 24898 | 2049053722.11362 | 2          |
| 26495 | 3152835385.64328 | 0          |
| 26525 | 4534006265.63645 | 150727     |
| 28290 | 3939925429.47743 | 1          |
| 28896 | 1                | 14421      |
| 29301 | 0                | 0          |
| 30120 | 2424569495.06303 | 1          |
| 31447 | 1145744152.96676 | 0          |
| 32919 | 4697424811.79383 | 0          |
| 33680 | 0                | 536330240  |
| 34303 | 0                | 0          |
| 34351 | 0                | 0          |
| 34666 | 0                | 0          |
| 37112 | 2621600241.63275 | 0          |
| 37513 | 2271822513.8661  | 1          |
| 38015 | 0                | 2147483647 |
| 38273 | 5942798540.3082  | 0          |
| 38725 | 0                | 0          |
| 39614 | 893110268.234828 | 1          |
| 39677 | 1021501403.30489 | 1          |
| 39758 | 1704788578.98496 | 0          |
| 40016 | 0                | 0          |
| 40481 | 0                | 0          |
| 40941 | 2376273010.74518 | 1          |
| 40968 | 0                | 0          |
| 41119 | 0                | 0          |
| 41650 | 2024136132.97026 | 2          |
| 41816 | 1                | 71         |
| 42071 | 1154146892.35444 | 0          |
| 42242 | 0                | 0          |
| 42577 | 3906708995.76546 | 3          |
| 42690 | 874842098.407367 | 0          |
| 43113 | 0                | 0          |
| 43467 | 0                | 0          |
| 43786 | 2550269936.19907 | 0          |
| 43920 | 0                | 0          |
| 44058 | 0                | 0          |
| 44155 | 0                | 0          |
| 44193 | 0                | 0          |
| 44240 | 0                | 0          |
| 44352 | 916493809.072506 | 0          |
| 44380 | 1107791056.29853 | 0          |
| 44719 | 894119581.103467 | 0          |
| 44827 | 0                | 0          |
| 44849 | 923379409.435186 | 0          |
| 45156 | 0                | 0          |
| 45686 | 2384312321.68865 | 1          |
| 46788 | 0                | 0          |
| 46817 | 1482348646.4883  | 3          |
| 47087 | 6962793579.15997 | 2177       |
| 47546 | 3097332144.9154  | 0          |
| 47686 | 0                | 0          |
| 48450 | 1787129178.26463 | 0          |
| 48947 | 3098252970.98247 | 1          |
| 49311 | 1630824338.1244  | 1          |
| 50119 | 3105935765.27067 | 0          |
| 50518 | 1                | 1977       |
| 50876 | 0                | 0          |
| 51426 | 0                | 0          |
| 51526 | 1535821154.37697 | 0          |
| 52085 | 1604303277.76273 | 8          |
| 52300 | 1410306766.77178 | 0          |
| 52448 | 0                | 0          |
| 52561 | 2782578814.00683 | 1          |
| 52570 | 1045892216.92531 | 25         |
| 52820 | 2495761027.37635 | 0          |
| 52918 | 1721466069.97402 | 0          |
| 53172 | 0                | 0          |
| 54300 | 3260408918.25324 | 2          |
| 54668 | 2720697448.85741 | 0          |
| 54792 | 2109371781.35409 | 3          |
| 55474 | 0                | 0          |
| 55941 | 0                | 0          |
| 55942 | 1                | 470        |
| 55962 | 0                | 0          |
| 56181 | 2972593717.45706 | 0          |
| 56348 | 4408857667.7467  | 9          |
| 56882 | 1936218873.61121 | 0          |
| 56951 | 3700086870.56286 | 0          |
| 57368 | 2056652271.56663 | 2          |
| 57463 | 0                | 0          |
| 57773 | 4                | 361        |
| 57798 | 1628833427.76863 | 0          |
| 58472 | 2900334075.22634 | 4          |
| 59086 | 2571805226.50791 | 6          |
| 59101 | 2427238943.38612 | 0          |
| 59195 | 0                | 0          |
| 59340 | 2854493973.79037 | 0          |
| 59844 | 1896798558.89864 | 0          |
| 59929 | 3298933645.47202 | 0          |
| 59977 | 851239982.752834 | 1          |
| 60083 | 3319233446.64212 | 1          |
| 60659 | 1726060356.98958 | 4          |
| 61275 | 5058899163.32357 | 2          |
| 62573 | 1894066700.27905 | 0          |
| 62690 | 527413546.689693 | 2          |
| 62743 | 0                | 0          |
| 62892 | 1505613196.78777 | 1          |
| 63037 | 1671314966.35334 | 66         |
| 63365 | 1802745389.40112 | 0          |
| 63442 | 3534942433.77073 | 0          |
| 63452 | 2904865441.34413 | 0          |
| 63595 | 1062675980.88734 | 0          |
| 63601 | 0                | 0          |
| 63603 | 1                | 1374       |
| 63738 | 2205100976.72589 | 0          |
| 63892 | 1827142463.38993 | 2          |
| 64163 | 2903289176.73199 | 15         |
| 64201 | 1                | 9778       |
| 64351 | 0                | 468799488  |
| 64537 | 1194402711.70521 | 5          |
| 64746 | 2501185522.06349 | 1          |
| 64951 | 2932006551.62088 | 2          |
| 65110 | 0                | 0          |
| 65226 | 3044592506.52268 | 3          |
| 65246 | 1644682011.05807 | 0          |
| 65456 | 0                | 0          |
| 65460 | 0                | 6668805    |
| 65485 | 0                | 0          |
| 65601 | 4213261330.1001  | 2147483647 |
| 65736 | 0                | 0          |
| 65826 | 0                | 2147483647 |
| 65885 | 0                | 0          |
| 66103 | 2237757890.91319 | 1          |
| 66134 | 2502604673.85323 | 1          |
| 66170 | 4038019172.32875 | 3          |
| 66292 | 3363466357.20569 | 0          |
| 66519 | 1511912119.28598 | 2          |
| 66597 | 3361844752.0287  | 0          |
| 66603 | 0                | 0          |
| 66680 | 2453943624.00024 | 11638166   |
| 66839 | 1                | 14593      |
| 66894 | 991318413.199696 | 0          |
| 66895 | 1641673934.77377 | 1          |
| 66912 | 2284916341.8593  | 2          |
| 66984 | 1899837246.51473 | 4          |
| 66997 | 2                | 756        |
| 67007 | 566809880.768021 | 1          |
| 67034 | 1788512637.06459 | 0          |
| 67070 | 1099446288.59139 | 1          |
| 67116 | 0                | 0          |
| 67124 | 3157278646.94204 | 1          |
| 67281 | 2513385598.52156 | 13         |
| 67347 | 0                | 0          |
| 67363 | 0                | 0          |
| 67381 | 0                | 1213607936 |
| 67425 | 0                | 0          |
| 67543 | 0                | 2147483647 |
| 67554 | 2378907051.09849 | 1          |
| 67563 | 3625212730.48433 | 1          |
| 67679 | 0                | 0          |
| 67702 | 2386615932.43266 | 2          |
| 67720 | 4551851295.73843 | 0          |
| 67726 | 0                | 0          |
| 67727 | 0                | 0          |
| 67732 | 0                | 0          |
| 67736 | 2810077628.356   | 2          |
| 67767 | 2490350511.8638  | 3          |
| 67818 | 3030430380.00519 | 2          |
| 67839 | 0                | 0          |
| 67854 | 0                | 0          |
| 67875 | 3315368082.77732 | 0          |
| 67886 | 3074956964.42975 | 0          |
| 67901 | 1612025877.46335 | 0          |
| 67903 | 1739529447.3969  | 0          |
| 67920 | 0                | 0          |
| 67926 | 1                | 343127     |
| 67928 | 0                | 0          |
| 67943 | 2638663556.13652 | 0          |
| 67945 | 1                | 64298      |
| 67949 | 2497764045.85739 | 0          |
| 67951 | 0                | 1306050560 |
| 67954 | 0                | 0          |
| 67955 | 1525466615.48148 | 0          |
| 67976 | 0                | 502775808  |
| 68020 | 1137702378.70946 | 1          |
| 68042 | 1419980554.40945 | 0          |
| 68067 | 0                | 0          |
| 68134 | 0                | 0          |
| 68166 | 2269560040.04591 | 3          |
| 68284 | 2815066697.11811 | 0          |
| 68287 | 0                | 0          |
| 68308 | 1349263078.25237 | 3          |
| 68343 | 1                | 64334      |
| 68430 | 2680801997.35765 | 2          |
| 68434 | 2842800520.66051 | 1          |
| 68461 | 2944340031.67415 | 0          |
| 68475 | 2902763254.84331 | 0          |
+-------+------------------+------------+
197 rows in set (0.23 sec)

- A member of The Knights Who Say NI!
My BOINC stats site
ID: 13530 · Report as offensive     Reply Quote

Message boards : Number crunching : Request for database cleanup


©2024 CERN