21) Message boards : LHCb Application : Relative Efficiency of LHCb Tasks (Message 33159)
Posted 29 Nov 2017 by Profile ritterm
Post:
I'm sorry if I've missed something elsewhere in the forums, but I wanted to ask a general question about what I'm seeing with LHCb jobs. I've noticed that recent tasks run a couple of jobs, wait ten minutes, and then shutdown. This seems very inefficient compared to CMS and Theory tasks (even though those jobs can have what seems to be hours of idle time). For example, here's part of the stderr output from Task 168175998:

2017-11-29 06:14:43 (15391): Guest Log: [INFO] LHCb application starting. Check log files.
2017-11-29 06:14:43 (15391): Guest Log: [DEBUG] HTCondor ping
2017-11-29 06:14:45 (15391): Guest Log: [DEBUG] 0
2017-11-29 06:16:06 (15391): Guest Log: [INFO] New Job Starting in slot1
2017-11-29 06:16:06 (15391): Guest Log: [INFO] Condor JobID: 5268.117 in slot1
2017-11-29 06:16:26 (15391): Guest Log: [INFO] Starting pilot in slot1
2017-11-29 06:18:59 (15391): Guest Log: [INFO] Job finished in slot1 with 700.
2017-11-29 06:19:26 (15391): Guest Log: [INFO] New Job Starting in slot1
2017-11-29 06:19:26 (15391): Guest Log: [INFO] Condor JobID: 5268.181 in slot1
2017-11-29 06:19:47 (15391): Guest Log: [INFO] Starting pilot in slot1
2017-11-29 06:21:17 (15391): Guest Log: [INFO] Job finished in slot1 with 700.
2017-11-29 06:31:30 (15391): Guest Log: [INFO] Condor exited with return value N/A.
2017-11-29 06:31:30 (15391): Guest Log: [INFO] Shutting Down.
2017-11-29 06:31:30 (15391): VM Completion File Detected.
2017-11-29 06:31:30 (15391): VM Completion Message: Condor exited with return value N/A.
22) Message boards : Number crunching : -152 (0xFFFFFF68) ERR_NETOPEN and 206 (0x000000CE) EXIT_INIT_FAILURE (Message 33158)
Posted 29 Nov 2017 by Profile ritterm
Post:
still, the ERR_NETOPEN failures occur, due to no connection to Condor server:

2017-11-18 16:37:21 (2688): Guest Log: [DEBUG] Testing connection to Condor server on port 9618
2017-11-18 16:37:51 (2688): Guest Log: [DEBUG] nc: connect to vccondor01.cern.ch port 9618 (tcp) timed out: Operation now in progress
2017-11-18 16:37:51 (2688): Guest Log: [DEBUG] 1
2017-11-18 16:37:51 (2688): Guest Log: [ERROR] Could not connect to Condor server on port 9618
2017-11-18 16:37:51 (2688): Guest Log: [INFO] Shutting Down

hopefully, the people at CERN will find out one day what the problem is.

I just wanted to bump this up and report that I've been seeing bursts of these same errors recently. Maybe I should be posting this is the LHCb forum, because, for me, they've been occurring primarily, if not exclusively, on LHCb tasks.
23) Message boards : Number crunching : Tasks Fail on Linux Host (Message 31030)
Posted 24 Jun 2017 by Profile ritterm
Post:
Based on the exit code of some of my tasks at another project, a forum poster there suggested my BOINC install might have a permission problem. After going through a couple of uninstall/reinstall cycles, I'm back in business (who knows why it took more than one).

Problem solved... Thanks for the feedback.
24) Message boards : Number crunching : Tasks Fail on Linux Host (Message 31017)
Posted 24 Jun 2017 by Profile ritterm
Post:
...There should be a line in the event log saying it found VBox.

From the event log following a recent startup:

Fri 23 Jun 2017 10:40:35 PM EDT |  | VirtualBox version: 5.1.22r115126
25) Message boards : Number crunching : Tasks Fail on Linux Host (Message 31015)
Posted 24 Jun 2017 by Profile ritterm
Post:
Is the user running boinc a member of the vbox group?

"boinc" is the user running BOINC, correct? If so, it seems that it's not in the vbox group:
    mark@FrankenLinux ~ $ groups boinc
    boinc : boinc video
    mark@FrankenLinux ~ $


However, on my other Linux host currently in the middle of a Theory task, "boinc" isn't a member of that group, either.

26) Message boards : Number crunching : Tasks Fail on Linux Host (Message 30999)
Posted 24 Jun 2017 by Profile ritterm
Post:
I'm trying to get VBox tasks running on one of my Linux hosts that used to not have a problem running those tasks here. As a result of troubleshooting a hardware problem (bad RAM since removed), I wiped the HDD clean and re-installed the OS (Mint 18). I've installed VBox 5.1.22 and its extension pack, like my other hosts (including another Linux machine) which have run VBox tasks recently (though not LHC).

The three Theory tasks I've run all failed after a short time and the stderr output includes errors like NS_ERROR_FAILURE and VBOX_E_OBJECT_NOT_FOUND.

I'm hoping I'm just missing some setting I can't think of. Virtualization (SVM in my ASUS motherboard) is enabled in the BIOS. Maybe I have a corrupt installation; however, I've unistalled/re-installed VBox and the extension pack a couple of times, including rebooting between each uninstall and reinstall. So, if that's it, I'm not sure how I would correct it.
27) Message boards : Number crunching : Less boinc credits than on other projects? (Message 30507)
Posted 26 May 2017 by Profile ritterm
Post:
...is it just me or do I get much less boinc credits on lhc@home than on other projects...

Based on credits/hour, I've always found LHC credits (at least for SixTrack) to be lower compared to most other projects.
28) Message boards : News : SixTrack News - May 2017 (Message 30506)
Posted 26 May 2017 by Profile ritterm
Post:
Very good! Thanks a lot for the feedback...The more we get the impression something meaningful is being done with our computational efforts, the more support you may expect.

+1
29) Message boards : Number crunching : User Account No Showing Up on Statics Page or Search (Message 30477)
Posted 24 May 2017 by Profile ritterm
Post:
Can't see anything under Top Participant.

If you search through the Top Participants listed by Total Credit you will see yourself ranked (as of right now) at #455 (follow the link).
30) Message boards : Number crunching : User Account No Showing Up on Statics Page or Search (Message 30475)
Posted 24 May 2017 by Profile ritterm
Post:
My account is nowhere to be seen...

I see you as User ID 206546 and ranking at #455 by total credit.
31) Message boards : Number crunching : 8th BOINC Pentathlon (May 5-19) (Message 30245)
Posted 7 May 2017 by Profile ritterm
Post:
...this BOINC project was listed as a candidate for the 8th BOINC Pentathlon (May 5-19)...

And it has now been selected!

The 4th project of the BOINC Pentathlon in the discipline Swimming is:
LHC@home
32) Message boards : Number crunching : Credit Migration (Message 29367)
Posted 17 Mar 2017 by Profile ritterm
Post:
Hi folks, unfortunately I am missing 16896 hours (LHC@home 1.0)at WUProp.

Maybe I misunderstand your concern, but I don't think that would be something caused by the credit migration at the LHC@home project.

Are you sure your "LHC@home 1.0" hours at WUProp haven't been moved into your hours for "LHC@home"? At some point, the admin at WUProp merged the two. I haven't had any "LHC@home 1.0" hours for quite awhile and I don't think I lost any hours.
33) Message boards : Sixtrack Application : Inconclusive results (Message 29355)
Posted 17 Mar 2017 by Profile ritterm
Post:
My inconclusive results are building again. Granted it's not a high percentage, but most are with wingman Computer 10451497. That host has returned >1700 inconclusives in the past few days.
34) Message boards : Sixtrack Application : 260.000 WUs to send, but no handed out (Message 29071)
Posted 5 Mar 2017 by Profile ritterm
Post:
However, the current Sixtrack tasks are rather short...

I'm wondering if the opposite is true, in some cases. I just noticed that I'm getting "Not requesting tasks: don't need...job cache full". Two hosts getting this message have tasks in their queues with 6-8 hour estimated run times, which, I think, is much longer than most recent tasks.
35) Message boards : Sixtrack Application : 260.000 WUs to send, but no handed out (Message 29070)
Posted 5 Mar 2017 by Profile ritterm
Post:
However, the current Sixtrack tasks are rather short, so it might be that the quota runs full pretty quickly...

What quota is that? Is there something other than 14 in-progress tasks/core? I should have added that I'm getting "No tasks are available for SixTrack" rather than something indicating I've reached a time-related or other quota.
36) Message boards : Sixtrack Application : 260.000 WUs to send, but no handed out (Message 29063)
Posted 5 Mar 2017 by Profile ritterm
Post:
Could there be another problem with work distribution? I see 100Ks of sixtrack WUs ready to send, but I haven't been able to keep my caches full for over 24 hours.
37) Message boards : Number crunching : Credit Migration (Message 29052)
Posted 3 Mar 2017 by Profile ritterm
Post:
Sorry to hijack the thread a little, but, since Nils brought it up... ;-)

Nils Høimyr wrote:
Please make sure that the LHCb application is selected to continue to crunch LHCb tasks here...

As well as selecting "Run test applications", correct?
38) Message boards : Sixtrack Application : MacOS executable (Message 28838)
Posted 9 Feb 2017 by Profile ritterm
Post:
I've gotten a bunch of resends recently that appear to be from validation inconclusives with an apple-darwin wingman. Progress, perhaps? :-)
39) Message boards : Sixtrack Application : User Account Not Listed (Message 28821)
Posted 7 Feb 2017 by Profile ritterm
Post:
What page shows me list as number two?

I think Ivan means you are second on a list of users when he searched for "james" and some other parameters. Unfortunately, I don't think there's a way to search for yourself in the rankings of total credit or RAC.

[edit]By playing around with the offset, you can find yourself here (as of right now!) among users listed by total credit. Is that what you're looking for?[/edit]
40) Message boards : Sixtrack Application : MacOS executable (Message 28769)
Posted 2 Feb 2017 by Profile ritterm
Post:
kyrsjo wrote:
I don't know what can be done about those WUs... As I said, they might have passed if compared to another mac result. How does this affect your scores (I'm new to BOINC...)?

How it affects volunteers scores/credit depends on which tasks sent out for a workunit get validated. For me, I suspect that a third task returned by a Windows or Linux host will validate with me and I'll get credit; if that third task is from a Mac host, then it will likely validate with the other Mac host. In the latter case, my task will get marked as invalid and get reduced or no credit**. I'm not sure which because I don't remember ever having an invalid task here. Does that make sense?

** Of course, it's the science that matters... ;-)


Previous 20 · Next 20


©2024 CERN