Message boards :
CMS Application :
CMS jobs are becoming available again
Message board moderation
Previous · 1 · 2 · 3 · 4 · Next
Author | Message |
---|---|
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
The CMS jobs graphs are failing both here and at dev. Back up again; perhaps a major contributor had some down-time. I'm supplying nearly 40 job slots myself at the moment, so if my Uni gets cut off (one of our two redundant feeds has been back-hoed this week) you would see a similar dip in the graph. |
Send message Joined: 15 Jun 08 Posts: 2413 Credit: 226,502,723 RAC: 131,911 |
This is mainly caused by CMS Cache information for squid: Today, 0:00 - 22:40 Statistics include uncacheable traffic like uploads and via special ports, e.g. 443 or 9618 (WMAgent). Requests served: 1,546,134 Bytes served: 104.48 GB and there's still 1 h to go. Average HTTP requests per minute since start: 800.6 (restarted last Sunday evening) Hits as % of all requests: 5min: 90.8%, 60min: 94.2% Hits as % of bytes sent: 5min: 72.3%, 60min: 40.2% Memory hits as % of hit requests: 5min: 98.7%, 60min: 94.9% |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
|
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
Well, the output consoles are working again, but there are a couple of buglets. The system console (Alt-F1) is displaying a grep error from time to time, which has the hallmarks of a typo. Also the Finished_nnn.Log in the Web interface is writing a file with new nnn several times per minute rather than overwriting the old file -- I guess the index increment got put inside the wrong loop.The responsibles have been informed... [Edit] Hmm, I might have just been unlucky and had the task start when the configuration was not fully changed. Two others that have started since don't show the symptoms. [/Edit] [Edit2] It's been confirmed that there was a bad version of the patches "in the wild" for about 30 minutes, so I was unlucky enough to catch it. [/Edit2] |
Send message Joined: 14 Jan 10 Posts: 1280 Credit: 8,496,817 RAC: 2,374 |
Well, the output consoles are working again, but there are a couple of buglets. The output to Console ALT-F2 (events processing) was working at first, but now the output is killed by a typing failure in the script of that output directory: /vr/lib/condor/execute/dir_nnnn etcetera what should have been /var/lib/condor/execute/dir_nnnnn etcetera |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
|
Send message Joined: 18 Dec 15 Posts: 1689 Credit: 103,909,318 RAC: 121,840 |
However, something seems to be strange (not to say "wrong") with the credit points: CMS tasks earn only about a third of what is earned for Theory tasks. How come?I am still curious what is the reason for this discrepancy. Any logical explanation(s) ? |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
However, something seems to be strange (not to say "wrong") with the credit points: CMS tasks earn only about a third of what is earned for Theory tasks. How come?I am still curious what is the reason for this discrepancy. Any logical explanation(s) ? My understanding is that LHC@Home jobs award credit based on the task CPU time, but it's been a long time since I asked about it so I may be misremembering. |
Send message Joined: 18 Dec 15 Posts: 1689 Credit: 103,909,318 RAC: 121,840 |
My understanding is that LHC@Home jobs award credit based on the task CPU timeI don't think so; here a few examples: total time --- CPU time ----- points CMS: 44,884.65 -- 36,639.44-- 423.30 46,714.42 -- 37,825.50-- 438.76 Theory: 47,088.19-- 45,886.61-- 1,849.16 44,955.45-- 43,741.28-- 1,739.19 so there seem to be other criterions in place. |
Send message Joined: 2 May 07 Posts: 2101 Credit: 159,818,488 RAC: 127,549 |
A good answer from Tullio some time ago: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=3400&postid=23814#23814 |
Send message Joined: 19 Feb 08 Posts: 708 Credit: 4,336,250 RAC: 0 |
I am getting huge amounts of credits in Milkyway@home and Asteroids@home in Science United. But I believe they are cumulative credits, that is belong to all users of those projects seen as a single user. Tullio |
Send message Joined: 18 Dec 15 Posts: 1689 Credit: 103,909,318 RAC: 121,840 |
A good answer from Tullio some time ago:Tullio said: "Credits are like money during an inflation.The more they are the less they are worth" okay, so that's the secret behind :-) |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
|
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
|
Send message Joined: 18 Dec 15 Posts: 1689 Credit: 103,909,318 RAC: 121,840 |
no new tasks available right now :-( |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
|
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
no new tasks available right now :-( Oh, blast! It's a holiday today at CERN (Ascension), so not much chance of getting a response -- possibly not until Monday if everyone takes tomorrow off as well for a long weekend. |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
OK, people are trickling back from the holiday weekend. The WMAgent has been restarted, but I'm getting errors in tasks on my home PC. Probably best to defer re-starting tasks until tomorrow to let errors propagate out of the system. I'll keep an eye on it for the next hour or two before closing myself down for the night. https://www.youtube.com/watch?v=XQ6fbsFiwWQ |
Send message Joined: 15 Jun 08 Posts: 2413 Credit: 226,502,723 RAC: 131,911 |
Thanks. Got a fresh task. So far no issues. https://lhcathome.cern.ch/lhcathome/result.php?resultid=231035328 |
Send message Joined: 29 Aug 05 Posts: 1006 Credit: 6,272,230 RAC: 352 |
Thanks. Great. I'm still having issues with my home PC but at least one work server picked up new tasks seamlessly. Hitting the sack now, see y'all tomorrow... |
©2024 CERN