Theory tasks failing
Author | Message |
---|---|
Joined: 14 Jan 10 Posts: 1439 Credit: 9,624,852 RAC: 2,528 |
Theory tasks are all failing on my Win10 32-bit: https://lhcathome.cern.ch/lhcathome/results.php?hostid=10416365 Exit status: EXIT_NO_SUB_TASKS Guest Log: 10/10/17 08:29:00 **** condor_startd (condor_STARTD) pid 3709 EXITING WITH STATUS 0 Guest Log: [ERROR] No jobs were available to run. Guest Log: [INFO] Shutting Down. |
Joined: 11 Aug 11 Posts: 8 Credit: 3,629,431 RAC: 0 |
Exactly the same for me on my Win Vista 32-bit on 9 and 10 Oct. https://lhcathome.cern.ch/lhcathome/results.php?userid=456564 Hope that helps. Kind regards Geoff |
Joined: 14 Feb 14 Posts: 5 Credit: 17,818,305 RAC: 0 |
Same here. First there were some problems with Condor, then no tasks available. |
Joined: 24 Oct 04 Posts: 1184 Credit: 57,265,814 RAC: 62,893 |
I keep getting these on x64: [ERROR] Condor exited after 2373s without running a job. Each task runs for over an hour, so I used to assume they were going to run properly and I wouldn't have to keep checking every task on all of my computers. Now I can't trust them to be running, and I can't leave them set to *Allow New Tasks* only to get these *Computer Error* tasks that are not actually computer errors. I can't get much done this way, so I will have to suspend all of mine until something is done. Volunteer Mad Scientist For Life |
Joined: 14 Jan 10 Posts: 1439 Credit: 9,624,852 RAC: 2,528 |
For me Theory is working again on the platforms Win32 and Win64. |
Joined: 24 Oct 04 Posts: 1184 Credit: 57,265,814 RAC: 62,893 |
Guest Log: [ERROR] Condor exited after 57012s without running a job. More time wasted on these Theory tasks lately, and I have to start them all after 2am just to get the internet speed for the dreaded VB: https://lhcathome.cern.ch/lhcathome/results.php?userid=5472 Volunteer Mad Scientist For Life |
Joined: 15 Jun 08 Posts: 2567 Credit: 258,693,321 RAC: 119,444 |
Since this morning all new Theory VMs are failing after a few minutes. Mostly with "207 (0x000000CF) EXIT_NO_SUB_TASKS" but also with "206 (0x000000CE) EXIT_INIT_FAILURE". |
Joined: 18 Dec 15 Posts: 1838 Credit: 122,145,649 RAC: 98,711 |
same here, unfortunately :-( |
Joined: 18 Dec 15 Posts: 1838 Credit: 122,145,649 RAC: 98,711 |
For the past few hours, all Theory tasks have been failing after about half an hour with "207 (0x000000CF) EXIT_NO_SUB_TASKS". Is the WMAgent down? |
Joined: 1 Sep 04 Posts: 140 Credit: 2,579 RAC: 0 |
since a few hours, all Theory tasks fail after about half an hour with "207 (0x000000CF) EXIT_NO_SUB_TASKS" Thanks for the heads-up. I just asked our system manager (Nils) about it and he replied: "This is probably because the Condor node handling jobs for Theory is being updated for Spectre and Meltdown today. https://cern.service-now.com/service-portal/view-outage.do?n=OTG0041682 Should be back again soon. Cheers, Nils" |
Joined: 24 Oct 04 Posts: 1184 Credit: 57,265,814 RAC: 62,893 |
Hello Ben, Yes, we had the same problems with these over at LHC-dev, so I had 7 multi-core tasks crash (they did not start up), but I just got another 6 of them up and running, so maybe it was taken care of just in time (as I mentioned over there). EDIT: Well, it looks like they are still having problems. They do make it past HTCondor Ping but then crash after a few minutes with [ERROR] Condor exited after 758s without running a job. So far, after a few tries, I may have only ONE multi-core task running; the other 5 are past HTCondor Ping, but I don't trust them to run. I will watch, but I have them set to not get new ones if these fail again, and I will get back to running all those AVX tasks I still have. Volunteer Mad Scientist For Life |
Joined: 18 Dec 15 Posts: 1838 Credit: 122,145,649 RAC: 98,711 |
Same or similar problem here with the Condor server: ... guest Log: 01/22/18 18:42:57 CCBListener: connection to CCB server vccondor01.cern.ch failed ... All the tasks from this afternoon failed :-( I have had the same experience often enough in the past, also with CMS tasks: no connection to the Condor server. No idea why the Condor server causes problems so often, and no idea whether the people in charge have ever looked into it. |
Joined: 5 Nov 15 Posts: 144 Credit: 6,301,268 RAC: 0 |
The three Theory WU that are still left from late yesterday are still calculating but the new WU can get MCPlots work. Error count is near 300 on all 3 servers. Switching to backup jobs till morning. (10pm local time now). |
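
For anyone who wants to scan their own task output for the messages quoted in this thread, here is a minimal Python sketch. It is not part of BOINC or the project's tooling. The function names `classify` and `describe_exit` and the labels they return are hypothetical, the patterns are copied from the log lines posted above, and only the two exit statuses mentioned in the thread are covered.

```python
import re

# Exit statuses quoted in this thread (decimal value -> name).
# Any other status is reported as UNKNOWN by this sketch.
EXIT_STATUSES = {
    206: "EXIT_INIT_FAILURE",
    207: "EXIT_NO_SUB_TASKS",
}

# Patterns copied from guest log lines posted in this thread.
CONDOR_IDLE = re.compile(
    r"\[ERROR\] Condor exited after (\d+)s without running a job"
)
NO_JOBS = re.compile(r"\[ERROR\] No jobs were available to run")
CCB_FAILED = re.compile(r"CCBListener: connection to CCB server (\S+) failed")


def classify(line):
    """Return a (label, detail) tuple for a known failure line, else None.

    The labels are made up for this example.
    """
    match = CONDOR_IDLE.search(line)
    if match:
        # detail: seconds the VM waited before Condor gave up
        return ("condor_idle_exit", int(match.group(1)))
    if NO_JOBS.search(line):
        return ("no_jobs", None)
    match = CCB_FAILED.search(line)
    if match:
        # detail: host name of the CCB server that could not be reached
        return ("ccb_connection_failed", match.group(1))
    return None


def describe_exit(code):
    """Format an exit status as 'decimal (hex) NAME', as quoted in the thread."""
    name = EXIT_STATUSES.get(code, "UNKNOWN")
    return f"{code} (0x{code:08X}) {name}"


if __name__ == "__main__":
    sample = "Guest Log: [ERROR] Condor exited after 57012s without running a job."
    print(classify(sample))
    print(describe_exit(207))
```

Feeding each line of a task's stderr output through `classify` and counting the labels gives a quick view of whether a host is hitting the "no jobs" case or the CCB connection failure.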
©2025 CERN