Message boards :
Number crunching :
"New" project, old problem (LHCb)
Message board moderation
Author | Message |
---|---|
![]() Send message Joined: 28 Jun 06 Posts: 5 Credit: 40,125,748 RAC: 10,211 ![]() ![]() ![]() |
After moving my machines over to LHC@home I've left my settings as the defaults. So far I don't think I've crunched a single LHCb unit successfully. Theory, CMS, sixtrack no problems. LHCb fail after fail. Good old EXIT_INIT_FAILURE. It kind of seems pointless to keep this up. So should I just give up and disable LHCb again? JJ |
Send message Joined: 21 Jun 10 Posts: 38 Credit: 9,295,687 RAC: 4,045 ![]() ![]() ![]() |
jjv, Yes it is a known problem and has been reported on the "LHCb Application" topic. The virtual machine can't communicate with the HTCondor server so it waits 600+ seconds then aborts. My recommendation would be to turn it off and monitor this post https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4014&postid=27898 to see when the project admins get it fixed. |
![]() Send message Joined: 28 Jun 06 Posts: 5 Credit: 40,125,748 RAC: 10,211 ![]() ![]() ![]() |
Yeah well, it is "better" now. Not all of them fail. Current ratio for me is 18 valid vs 33 error. It seems kind of odd that my machines are running nothing but LHCb if there is no actual work to be done. 33 x 15 minutes of runtime per failed unit equals over 8 hours of wasted computation. Seriously considering disabling LHCb again. JJ |
©2023 CERN