Message boards : CMS Application : EXIT_NO_SUB_TASKS

Crystal Pellet
Volunteer moderator
Volunteer tester

Joined: 14 Jan 10
Posts: 1143
Credit: 6,994,740
RAC: 1,182
Message 43424 - Posted: 26 Sep 2020, 8:20:27 UTC - in response to Message 43423.  

And as I see it, 1 and 8 are pretty much (180-degree) rotationally symmetric in themselves, 2 & 5 aren't (they need a reflection as well), which leaves 6 and 9.

It wasn't that hard, Ivan, although I didn't really understand what you meant by rotational symmetry, but maybe 9 and 1 is also one.
Since you started studying in 1970, you must be somewhere between 68 and 72. Congratulations anyway. Feel good, you must be a bit younger than I am.
For your birthday I crunched 4 CMS tasks under hard conditions (stopping, starting, stopping BOINC, rebooting, very long pauses, using snapshots) and have 4 surprises for you:
All went well:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=284776453
https://lhcathome.cern.ch/lhcathome/result.php?resultid=284779479
https://lhcathome.cern.ch/lhcathome/result.php?resultid=284778797
https://lhcathome.cern.ch/lhcathome/result.php?resultid=284779528
ID: 43424
Ben Segal
Volunteer moderator
Project administrator

Joined: 1 Sep 04
Posts: 134
Credit: 2,579
RAC: 0
Message 43425 - Posted: 26 Sep 2020, 12:40:31 UTC - in response to Message 43423.  

Thanks for letting us out of our misery Ivan! Meax nearly got it with 48hex but I see the answer is 45hex.

Happy 69th !!!!
ID: 43425
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 906
Credit: 5,916,632
RAC: 1,933
Message 43457 - Posted: 30 Sep 2020, 19:39:18 UTC - in response to Message 43425.  
Last modified: 30 Sep 2020, 19:40:50 UTC

Thanks for letting us out of our misery Ivan! Meax nearly got it with 48hex but I see the answer is 45hex.

Happy 69th !!!!

Thanks again.
Meanwhile, there are problems with Ceph, S3, and our WMAgent at CERN. I've no idea whether they are inter-related; I'm not a "storage guy". Jobs are reporting as failed, in varying percentages over time, but it doesn't look like it's affecting your tasks and credits.
ID: 43457
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 906
Credit: 5,916,632
RAC: 1,933
Message 43458 - Posted: 1 Oct 2020, 11:49:24 UTC - in response to Message 43457.  

Meanwhile, there are problems with Ceph, S3, and our WMAgent at CERN. I've no idea whether they are inter-related; I'm not a "storage guy". Jobs are reporting as failed, in varying percentages over time, but it doesn't look like it's affecting your tasks and credits.

The failure rate graph is back down to normal levels so I guess the problem is fixed.
ID: 43458
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Joined: 15 Jun 08
Posts: 2100
Credit: 162,391,205
RAC: 118,222
Message 43459 - Posted: 2 Oct 2020, 13:06:15 UTC - in response to Message 43458.  

The problem is solved but now there are no subtasks in the queue.
ID: 43459
Erich56

Joined: 18 Dec 15
Posts: 1542
Credit: 52,202,684
RAC: 34,707
Message 43460 - Posted: 2 Oct 2020, 17:24:11 UTC

From what I can see now, there are subtasks (jobs) available, but no new tasks :-(
ID: 43460
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 906
Credit: 5,916,632
RAC: 1,933
Message 43461 - Posted: 2 Oct 2020, 18:11:11 UTC - in response to Message 43460.  
Last modified: 2 Oct 2020, 18:18:59 UTC

From what I can see now, there are subtasks (jobs) available, but no new tasks :-(

Not sure what went wrong. There are a few WMAgent components down -- those responsible have been notified -- one of which is accounting, so the WMStats numbers don't add up (3962 created, 0 queued, 0 pending, 29 running, 549 success, 4 failure!).
I was finally allowed back on campus this week and, as you can expect, I have over six months' worth of problems to deal with. So I didn't think to check the CMS@Home situation this afternoon. [schade/]
I submitted a new workflow just now; it remains to be seen whether it makes jobs available.
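
For anyone wanting to see how far off the WMStats figures quoted in this post are, here is a small Python sketch. It assumes (this is an assumption, not something confirmed about WMStats) that every created job should show up in exactly one of the other states; the dictionary keys are just labels taken from the post, and `unaccounted` is a made-up helper name.

```python
# Job counts as quoted in the post above; the keys are only labels.
wmstats = {
    "created": 3962,
    "queued": 0,
    "pending": 0,
    "running": 29,
    "success": 549,
    "failure": 4,
}


def unaccounted(counts):
    """Return how many created jobs do not appear in any other state.

    Assumes each created job should be counted in exactly one of the
    remaining states, which may not match how WMStats really accounts.
    """
    downstream = sum(value for key, value in counts.items() if key != "created")
    return counts["created"] - downstream


if __name__ == "__main__":
    print("jobs in other states:", wmstats["created"] - unaccounted(wmstats))
    print("jobs unaccounted for:", unaccounted(wmstats))
```

Under that assumption only 582 of the 3962 created jobs are visible in another state, which fits the remark that the accounting component is down.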
ID: 43461
Erich56

Joined: 18 Dec 15
Posts: 1542
Credit: 52,202,684
RAC: 34,707
Message 43462 - Posted: 2 Oct 2020, 18:14:25 UTC - in response to Message 43461.  

I submitted a new workflow just now; it remains to be seen whether it makes jobs available.
Ivan, thanks for your help!!!
However, the problem seems to be the lack of tasks, NOT the lack of jobs.
ID: 43462
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 906
Credit: 5,916,632
RAC: 1,933
Message 43463 - Posted: 2 Oct 2020, 18:20:52 UTC - in response to Message 43462.  
Last modified: 2 Oct 2020, 18:26:59 UTC

I submitted a new workflow just now; it remains to be seen whether it makes jobs available.
Ivan, thanks for your help!!!
However, the problem seems to be the lack of tasks, NOT the lack of jobs.

Laurence has it set up so that when no jobs are available, no new tasks are created. Give it an hour or so to see if new jobs allow 200 new tasks to be created (the server status page only updates once an hour).
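
To illustrate the rule described in this post, here is a rough Python sketch of that kind of gating. It is not the actual server code: the function name, its parameters, and the idea of topping up to a fixed buffer are all assumptions made for illustration; only the "no jobs means no new tasks" rule and the figure of 200 come from the post.

```python
def tasks_to_create(jobs_available: int, tasks_unsent: int, target: int = 200) -> int:
    """Decide how many new BOINC tasks to generate.

    Hypothetical rule: create nothing while the job queue is empty,
    otherwise top the unsent-task buffer up to `target`.
    """
    if jobs_available <= 0:
        # No jobs (subtasks) in the queue, so new tasks would only
        # end with EXIT_NO_SUB_TASKS on the volunteers' machines.
        return 0
    return max(0, target - tasks_unsent)


if __name__ == "__main__":
    print(tasks_to_create(jobs_available=0, tasks_unsent=0))
    print(tasks_to_create(jobs_available=500, tasks_unsent=150))
```

With a rule like this, an empty job queue shows up to volunteers as "no tasks available" rather than as failed tasks, which matches what was reported earlier in the thread.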
ID: 43463
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 906
Credit: 5,916,632
RAC: 1,933
Message 43464 - Posted: 2 Oct 2020, 21:04:12 UTC - in response to Message 43463.  

I've had some feedback that suggests some database has exceeded its quota. :-(
I've asked for a ticket to be logged with CERN IT; I don't have enough information to do that myself.
ID: 43464
Erich56

Joined: 18 Dec 15
Posts: 1542
Credit: 52,202,684
RAC: 34,707
Message 43465 - Posted: 3 Oct 2020, 11:33:13 UTC - in response to Message 43464.  

Since it's the weekend now, I guess it will take until some time next week for the problem to be solved?
ID: 43465
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 906
Credit: 5,916,632
RAC: 1,933
Message 43466 - Posted: 4 Oct 2020, 12:08:08 UTC - in response to Message 43465.  

Since it's the weekend now, I guess it will take until some time next week for the problem to be solved?

Yes, I don't expect it to be fixed out of office hours. :-(
ID: 43466
Crystal Pellet
Volunteer moderator
Volunteer tester

Joined: 14 Jan 10
Posts: 1143
Credit: 6,994,740
RAC: 1,182
Message 43469 - Posted: 6 Oct 2020, 7:39:44 UTC

Since yesterday afternoon the distribution of BOINC-tasks and CMS-jobs seems to be normal again.
ID: 43469
ivan
Volunteer moderator
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 29 Aug 05
Posts: 906
Credit: 5,916,632
RAC: 1,933
Message 43470 - Posted: 6 Oct 2020, 9:51:53 UTC - in response to Message 43469.  

Since yesterday afternoon the distribution of BOINC-tasks and CMS-jobs seems to be normal again.

Yes, the database team increased our quota from 2 GB to 20 GB. Let's see how long it takes to fill that up!
ID: 43470
Jim1348

Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 4,473
Message 43472 - Posted: 6 Oct 2020, 17:00:27 UTC - in response to Message 43470.  

It looks like you have gotten their attention. Very good.
I hope to keep a machine (Ryzen 3600) on it all winter.
ID: 43472
Greger

Joined: 9 Jan 15
Posts: 151
Credit: 431,596,822
RAC: 0
Message 43544 - Posted: 1 Nov 2020, 19:55:39 UTC
Last modified: 1 Nov 2020, 19:55:59 UTC

ID: 43544
NOGOOD

Joined: 18 Nov 17
Posts: 118
Credit: 43,102,300
RAC: 18,751
Message 43545 - Posted: 1 Nov 2020, 21:17:09 UTC - in response to Message 43544.  

ID: 43545
NOGOOD

Joined: 18 Nov 17
Posts: 118
Credit: 43,102,300
RAC: 18,751
Message 43571 - Posted: 6 Nov 2020, 10:30:01 UTC

No sub tasks again.
ID: 43571
Jim1348

Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 4,473
Message 43572 - Posted: 6 Nov 2020, 13:03:59 UTC - in response to Message 43571.  

I bailed out just in the nick of time, losing only one to "no sub tasks".
https://lhcathome.cern.ch/lhcathome/result.php?resultid=288717194

There were two others that failed after two hours for some sort of Condor problem:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=288717684
https://lhcathome.cern.ch/lhcathome/result.php?resultid=288716912

But I am off to Rosetta. There is a new strain going around.
ID: 43572
NOGOOD

Joined: 18 Nov 17
Posts: 118
Credit: 43,102,300
RAC: 18,751
Message 43573 - Posted: 6 Nov 2020, 13:21:15 UTC - in response to Message 43572.  

I bailed out just in the nick of time, losing only one to "no sub tasks".
https://lhcathome.cern.ch/lhcathome/result.php?resultid=288717194

There were two others that failed after two hours for some sort of Condor problem:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=288717684
https://lhcathome.cern.ch/lhcathome/result.php?resultid=288716912

But I am off to Rosetta. There is a new strain going around.

Yes, the CMS project is like a monkey with a grenade :-)
ID: 43573

©2022 CERN