1) Message boards : Number crunching : Local control of which subprojects run`2 (Message 37276)
Posted 8 Nov 2018 by captainjack
Post:
pls,

The app_config.xml file does not control which tasks get downloaded. All the app_config.xml file does is control the manner in which the downloaded tasks run (number concurrent, memory size, etc.). The only way to control which tasks get downloaded is by using the project preferences.

A complete description of options for the app_config.xml file can be found at the bottom of https://boinc.berkeley.edu/wiki/client_configuration

Hope that helps.
2) Questions and Answers : Preferences : Stoping delivery of new tasks don't work in LHC@home and Seti@home (Message 36701)
Posted 13 Sep 2018 by captainjack
Post:
Carlos,

In the BOINC Manager app, when looking at the tab for "Tasks", there is a Command on the left panel for "Show Active Tasks" or "Show All Tasks". If you have it set to show only active tasks, then you may have tasks that are downloaded but do not show up until they become active. Please check to see if it is set for "Show All Tasks".

Hope that helps.
3) Message boards : Theory Application : Theory and app_config ? (Message 36670)
Posted 8 Sep 2018 by captainjack
Post:
You are welcome, glad you got it working.

One other clue that you might find useful in the future, it there is any doubt about which names to use in the app_config.xml, you can find many of the names for the current version in the client_state_prev.xml file located in the BOINC data folder.

Good luck with your crunching.
4) Message boards : Theory Application : Theory and app_config ? (Message 36667)
Posted 8 Sep 2018 by captainjack
Post:
Hi Yeti,

Try the following app_config.xml

<app_config>
  <app>
    <name>Theory</name>
      <max_concurrent>1</max_concurrent>
  </app>
  <app_version>
    <app_name>Theory</app_name>
    <avg_ncpus>1.0</avg_ncpus>
    <plan_class>vbox64_mt_mcore</plan_class>
    <cmdline>--nthreads 1</cmdline>
  </app_version>
</app_config>


I just put this together and it appears to be working running theory on one thread.

The two main differences are the plan_class and the addition of a cmdline for --nthreads.

Also, in your app_config, there are several parameters that I do not find in the BOINC documentation for client configuration http://boinc.berkeley.edu/wiki/client_configuration.

I do not know how the undocumented parameters in your app_config affect the client configuration, my suggestion is to use the minimum parameters until you find something that works then add parameters.

Also, just in case you didn't know, when you add parameters to an app_config, you can activate the new parameters by clicking on "Options", "Read Config Files" in the BOINC Manager. If you need to remove parameters, it is best to shut down BOINC and start it back up with the new parameters in place.

Hope that helps, let us know how it turns out.
5) Message boards : ATLAS application : VirtualBox 5.2 (Message 36363)
Posted 9 Aug 2018 by captainjack
Post:
Machine is running Windows 10 - 1803, VB 5.2.16 and BOINC 7.12.1.

Tried running a 3-core ATLAS, 2-core Theory, and a 1-core LHCb tasks at the same time.

The Stderr.txt for the ATLAS task had the following error message:
2018-08-09 12:57:11 (1988): Error creating VirtualBox instance! rc = 0x80004002

While the tasks were processing, the machine stopped running. When I restarted it, the Event Log had this message:

8/9/2018 3:41:13 PM | LHC@home | [error] no project URL in task state file

Atlas task finished and validated, Theory and LHCb tasks are still running.

Let me know if you need more information.
6) Questions and Answers : Getting started : Issues changing email address (Message 35244)
Posted 12 May 2018 by captainjack
Post:
I can't change my email address here or at the test site either. Maybe when they fix it here, it will work there too.
7) Message boards : ATLAS application : Download failures (Message 32824)
Posted 13 Oct 2017 by captainjack
Post:
The task fetch seems to ignore the parameter for "Max # CPUs". For computer 10476963 the Max # CPUs was changed to 2, but the server keeps sending 4 core tasks. The client_state.xml says

<app_version>
<app_name>ATLAS</app_name>
<version_num>101</version_num>
<platform>windows_x86_64</platform>
<avg_ncpus>4.000000</avg_ncpus>
<max_ncpus>2.000000</max_ncpus>


Seems odd that the max_ncpus is 2, but the avg_ncpus is 4.

Computer is using the default preferences.

Please let me know if I can provide more information.
8) Message boards : Number crunching : Less boinc credits than on other projects? (Message 30502)
Posted 26 May 2017 by captainjack
Post:
RaimundD,

Just to make sure you know, WCG takes the BOINC points and multiplies them by 7 to get WCG points. If you want to know how many BOINC points you get at WCG, you can check one of the accumulator web sites like boincstats.
9) Message boards : ATLAS application : New app version 1.01 (Message 29178)
Posted 10 Mar 2017 by captainjack
Post:
Just tried one on Linux. Task ran for about 20 minutes then got this:

2017-03-10 12:49:18 (8776): Guest Log: - Last 10 lines from /home/atlas01/RunAtlas/Panda_Pilot_5904_1489171051/PandaJob_3273309522_1489171055/athena_stdout.txt -
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe.preExecute 2017-03-10 12:38:27,950 INFO Batch/grid running - command outputs will not be echoed. Logs for EVNTtoHITS are in log.EVNTtoHITS
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe.preExecute 2017-03-10 12:38:27,952 INFO Now writing wrapper for substep executor EVNTtoHITS
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe._writeAthenaWrapper 2017-03-10 12:38:27,952 INFO Valgrind not engaged
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe.preExecute 2017-03-10 12:38:27,952 INFO Athena will be executed in a subshell via ['./runwrapper.EVNTtoHITS.sh']
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe.execute 2017-03-10 12:38:27,952 INFO Starting execution of EVNTtoHITS (['./runwrapper.EVNTtoHITS.sh'])
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe.execute 2017-03-10 12:46:25,442 INFO EVNTtoHITS executor returns 65
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe.validate 2017-03-10 12:46:26,351 ERROR Validation of return code failed: Non-zero return code from EVNTtoHITS (65) (Error code 65)
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.trfExe.validate 2017-03-10 12:46:26,365 INFO Scanning logfile log.EVNTtoHITS for errors
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.transform.execute 2017-03-10 12:46:26,588 CRITICAL Transform executor raised TransformValidationException: Non-zero return code from EVNTtoHITS (65); Logfile error in log.EVNTtoHITS: "AthMpEvtLoopMgr FATAL makePool failed for AthMpEvtLoopMgr.SharedEvtQueueProvider"
2017-03-10 12:49:18 (8776): Guest Log: PyJobTransforms.transform.execute 2017-03-10 12:46:29,792 WARNING Transform now exiting early with exit code 65 (Non-zero return code from EVNTtoHITS (65); Logfile error in log.EVNTtoHITS: "AthMpEvtLoopMgr FATAL makePool failed for AthMpEvtLoopMgr.SharedEvtQueueProvider")

Task number 124796186

Let me know if you need more info.
10) Message boards : LHCb Application : Low CPU usage (Message 28087)
Posted 8 Dec 2016 by captainjack
Post:
Getting nothing but these error messages.

2016-12-08 15:01:50 (22444): Guest Log: [INFO] Job finished in slot1 with unknown exit code.


And no CPU usage.

Turning these off until I hear that they are working again.
11) Message boards : LHCb Application : Condor exited after 608s without running a job (Message 27989)
Posted 28 Nov 2016 by captainjack
Post:
Looks like it is working for me now. The task has made it past the 608 second mark and is using a full CPU thread.

Thanks for getting the image updated.

Will post again if anything changes.
12) Message boards : Number crunching : "New" project, old problem (LHCb) (Message 27971)
Posted 27 Nov 2016 by captainjack
Post:
jjv,

Yes it is a known problem and has been reported on the "LHCb Application" topic. The virtual machine can't communicate with the HTCondor server so it waits 600+ seconds then aborts. My recommendation would be to turn it off and monitor this post https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4014&postid=27898 to see when the project admins get it fixed.
13) Message boards : Sixtrack Application : Very low CPU-usage on Windows with SixTrack tasks (Message 27968)
Posted 26 Nov 2016 by captainjack
Post:
I noticed something similar on my Windows 10 machine.

When a Sixtrack task started, it used about 40% of a thread until the task was ~7% complete (~3 minutes). Then the % complete jumped back to almost 0% and it started using 100% of a thread.
14) Message boards : LHCb Application : Condor exited after 608s without running a job (Message 27951)
Posted 25 Nov 2016 by captainjack
Post:
Just tried another one and it looks like the problem still exists. Task number 108722165.

2016-11-24 19:16:39 (3560): Guest Log: [DEBUG] HTCondor ping
2016-11-24 19:16:49 (3560): Guest Log: [DEBUG] 0
2016-11-24 19:27:10 (3560): Guest Log: [ERROR] Condor exited after 627s without running a job.
2016-11-24 19:27:10 (3560): Guest Log: [INFO] Shutting Down.
2016-11-24 19:27:10 (3560): VM Completion File Detected.
2016-11-24 19:27:10 (3560): VM Completion Message: Condor exited after 627s without running a job.


Let me know if you need more information.
15) Message boards : LHCb Application : Condor exited after 608s without running a job (Message 27920)
Posted 23 Nov 2016 by captainjack
Post:
Still no response from HTCondor. That must be why the average run time for all Beauty tasks is 0.16 hours.

Time to turn this one off for a while.
16) Message boards : News : LHC@home consolidation (Message 27912)
Posted 22 Nov 2016 by captainjack
Post:
Crystal Pellet and Laurence,

Thanks for the clarification. I had completely misinterpreted the usage for the "Max # CPUs" parameter. Now I know how I will have to set up my app_config.xml.
17) Message boards : News : LHC@home consolidation (Message 27904)
Posted 22 Nov 2016 by captainjack
Post:
On the profile preferences, there is a parameter for "Max # CPUs". What is that parameter supposed to do?

One of my profiles is set for Max # jobs = 3 and Max # CPUs = 1. There are 3 tasks downloaded and all three of them are running.

Thanks for the insight.
18) Questions and Answers : Sixtrack : VirtualBox is not installed (Message 27886)
Posted 19 Nov 2016 by captainjack
Post:
CERN has decided to merge all of their volunteer projects into one umbrella project and the new merged project was previously the sixtrack project. You can read about that here: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4002&postid=27816

The new and improved project will include virtualbox projects along with the sixtrack project which in not virtualbox.

If you go to the new project web site, and go to your preferences, you can see the other projects that will be included. Many of those projects require virtualbox.

Apparently, the new BOINC server checks to see if virtualbox is installed even if you do not have any of the virtualbox projects selected.

Your question was also asked by another person in a different thread. Apparently no one has been able to figure out why BOINC checks to see if you have virtualbox installed if you do not have any virtualbox projects selected.

Hope that helps.
19) Message boards : News : LHC@home consolidation (Message 27851)
Posted 15 Nov 2016 by captainjack
Post:
Ah, now I can see the options in Windows. I was taking the old link and putting https in front of it. That gets a security certificate error. The https web site has a different address. When I linked from BOINC, it worked. I have updated my favorite link to the new address.
20) Message boards : News : LHC@home consolidation (Message 27848)
Posted 15 Nov 2016 by captainjack
Post:
When I access the web site using Ubuntu/Firefox, I can see options to let me select applications that I want to run, number of tasks to download at one time and number of tasks to run at one time for each profile.

When I access the web site using Windows 10, I do not see any of those options.

When I access the web site using Windows 10 and use the https:// address, the browser shows a Security Certificate Error.


Next 20


©2019 CERN