Message boards : CMS Application : no new WUs available
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · 23 . . . 24 · Next

AuthorMessage
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1176
Credit: 54,887,670
RAC: 5,761
Message 49477 - Posted: 9 Feb 2024, 12:24:17 UTC - in response to Message 49476.  

[quote]I just checked the two I started 7 hours ago and this time they are still running but still stuck at
*application starting check log files* and the ones doing this before just stopped after an hour running and sent back since they didn't run the CPU run time more than the usual less than 10 minutes.....so I will just let them run and see what happens.

application starting check log files
After this message, you can do this
You can in Boincmanager for the CMS Task open the Graphic Button.
After this, your Browser opens a new page.
There are Information also for this task.


LOL I have checked thousands of my tasks doing that Axel......every single VB task I have run I watch them by doing that.
and the entire Run time 13 hours 53 min 5 sec on that page it never got past *application starting check log files*
But last month they did some update so you can't get the same version of the log with a CMS as they used to be so you had to check another tab there but not the running log tab
https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3302911
This one did finally have the info on the stderr instead of watching that other url so until the one tonight they were all doing this here and at -dev https://lhcathomedev.cern.ch/lhcathome-dev/result.php?resultid=3302801

That is the one I mentioned and it finished and sent it in BUT it still stayed at *application starting check log files*
And all the ones before that didn't actually run and were sent back after one hour runs and less than 10 minutes CPU time.
Before this started they always * Requesting an idtoken from LHC@home*
I only have one running now doing the same thing and since it is 4;20am I better wait until later to try again and see if they will actually run a job (oh and I always take snap shots of logs so I have thousands of those too)
ID: 49477 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1176
Credit: 54,887,670
RAC: 5,761
Message 49478 - Posted: 9 Feb 2024, 12:36:25 UTC

well I couldn't sleep until I went upstairs and got this current running snap shot and as I mentioned the CMS don't run this the same as it used to and the Theory tasks do still give us the running logs there.....and this is after running 13 hours and at 66.5%


goodnight
ID: 49478 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1422
Credit: 9,484,585
RAC: 1,266
Message 49522 - Posted: 12 Feb 2024, 9:42:00 UTC

CMS jobs inside BOINC-VM available again . . .
ID: 49522 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 49526 - Posted: 12 Feb 2024, 10:08:30 UTC - in response to Message 49522.  

CMS jobs inside BOINC-VM available again . . .
credentials working now?
ID: 49526 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2541
Credit: 254,608,838
RAC: 34,609
Message 49527 - Posted: 12 Feb 2024, 10:15:38 UTC - in response to Message 49526.  

credentials working now?

Looks like they do.
Got a bunch of CMS tasks all running fine and CERN Grafana shows an increasing number of running jobs since 08:12 UTC.
ID: 49527 · Report as offensive     Reply Quote
Profile Magic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1176
Credit: 54,887,670
RAC: 5,761
Message 49528 - Posted: 12 Feb 2024, 10:19:43 UTC

NICE
Good thing I am always awake after 2am
ID: 49528 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 49531 - Posted: 12 Feb 2024, 12:45:07 UTC - in response to Message 49527.  

credentials working now?

Looks like they do.
Got a bunch of CMS tasks all running fine and CERN Grafana shows an increasing number of running jobs since 08:12 UTC.
all works fine now. So let's keep our fingers crossed :-)
ID: 49531 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 49587 - Posted: 17 Feb 2024, 6:15:51 UTC

good morning Ivan,

last night, the jobs bucket got empty again :-(
ID: 49587 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1061
Credit: 7,737,455
RAC: 298
Message 49600 - Posted: 19 Feb 2024, 15:01:43 UTC - in response to Message 49587.  

good morning Ivan,

last night, the jobs bucket got empty again :-(

Sorry, I got caught out by how fast the queues were emptying. One of our script suppliers made an inadvertent reversion that meant more jobs were waiting for reconnection to Condor than recent configurations.
ID: 49600 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 850
Credit: 692,823,409
RAC: 68,497
Message 49601 - Posted: 19 Feb 2024, 18:48:42 UTC

❤️ CMS error rate dropped below 10 %, sadly Theory still over 50 %
ID: 49601 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 49722 - Posted: 7 Mar 2024, 12:12:56 UTC

queue is empty :-(
ID: 49722 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 49748 - Posted: 11 Mar 2024, 4:00:31 UTC

queue is empty :-(
ID: 49748 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1061
Credit: 7,737,455
RAC: 298
Message 49762 - Posted: 12 Mar 2024, 14:03:07 UTC - in response to Message 49748.  

queue is empty :-(

Sorry, misjudged the job queue.
ID: 49762 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 49777 - Posted: 16 Mar 2024, 6:07:57 UTC

new new tasks since last night :-(
ID: 49777 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 49778 - Posted: 16 Mar 2024, 8:12:33 UTC - in response to Message 49777.  

new new tasks since last night :-(
sorry, should read "NO new tasks ..."
ID: 49778 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1061
Credit: 7,737,455
RAC: 298
Message 49779 - Posted: 16 Mar 2024, 15:05:25 UTC - in response to Message 49778.  

new new tasks since last night :-(
sorry, should read "NO new tasks ..."

Tja, I was waiting to see if Daniele's multi-core workflow would spawn new jobs, in case my old one was holding it back. Turned out not to happen, so I'm submitting smaller job batches until we work out how to get the multi-core jobs into the system. There may be intermittent disruptions over the next few days if my sleep cycle disagrees with the job queues' needs for attention.
ID: 49779 · Report as offensive     Reply Quote
Erich56

Send message
Joined: 18 Dec 15
Posts: 1821
Credit: 118,941,546
RAC: 21,113
Message 50405 - Posted: 14 Jun 2024, 6:14:07 UTC

no jobs available since yesterday. This time, the automatic task distribution stop worked well :-)
ID: 50405 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1061
Credit: 7,737,455
RAC: 298
Message 50413 - Posted: 17 Jun 2024, 15:24:59 UTC

There seems to be a problem at CERN. Several WMAgents, including ours, are showing error status and I don't think we are generating jobs. A polite e-mail has been sent.
ID: 50413 · Report as offensive     Reply Quote
Saturn911

Send message
Joined: 3 Nov 12
Posts: 59
Credit: 142,182,531
RAC: 42,802
Message 50414 - Posted: 17 Jun 2024, 15:49:22 UTC - in response to Message 50413.  

No work to do, but this workstation has loaded hundreds of wu s just for killing them:

https://lhcathome.cern.ch/lhcathome/results.php?hostid=10698193&offset=0&show_names=0&state=0&appid=11

Would be great to limit the work for this workstation or to resend these wu s
ID: 50414 · Report as offensive     Reply Quote
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Avatar

Send message
Joined: 29 Aug 05
Posts: 1061
Credit: 7,737,455
RAC: 298
Message 50415 - Posted: 17 Jun 2024, 17:41:44 UTC - in response to Message 50413.  

There seems to be a problem at CERN. Several WMAgents, including ours, are showing error status and I don't think we are generating jobs. A polite e-mail has been sent.

Polite response:
The CMSWEB team have been upgrading cmsweb-testbed frontends to a new technology and the redirect rules are still being polished (i.e. it looks like WM is still not fully functional).This transition started last Thursday.

Sorry about that.

ID: 50415 · Report as offensive     Reply Quote
Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · 23 . . . 24 · Next

Message boards : CMS Application : no new WUs available


©2024 CERN