1) Message boards : Number crunching : Local control of which subprojects run`2 (Message 37274)
Posted 8 days ago by Gunde
Post:
max_concurrent needs to be 1 or higher. Setting it to 0 will not work.
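
As a rough illustration, here is a small sketch that builds an app_config.xml with a max_concurrent limit and refuses values below 1. The app name "sixtrack" and the helper name build_app_config are only assumptions for the example; use the application names your own client reports.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name: str, max_concurrent: int) -> str:
    """Return app_config.xml content limiting one app to max_concurrent tasks."""
    # As noted in the post, the limit has to be 1 or higher.
    if max_concurrent < 1:
        raise ValueError("max_concurrent must be 1 or higher")
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name  # hypothetical app name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # "sixtrack" is an assumed name, used here only for illustration.
    print(build_app_config("sixtrack", 2))
```

The resulting text would go into app_config.xml in the project's directory, after which the client has to re-read its config files.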
2) Message boards : LHCb Application : exit code 53 /no connection to cern.ch on port 80 (Message 37080)
Posted 23 days ago by Gunde
Post:
A few hosts lost their connections; I hope it is fixed now.
3) Message boards : LHCb Application : exit code 53 /no connection to cern.ch on port 80 (Message 37079)
Posted 23 days ago by Gunde
Post:
I am getting tasks, but on some hosts new LHCb tasks end before they reach 5 minutes.

Error code for the task: Exit status 53 (0x00000035) Unknown error code

Stderr output:
2018-10-23 20:08:09 (58259): Guest Log: [INFO] Mounting the shared directory
2018-10-23 20:08:09 (58259): Guest Log: [INFO] Shared directory mounted, enabling vboxmonitor
2018-10-23 20:08:09 (58259): Guest Log: [DEBUG] Testing network connection to cern.ch on port 80
2018-10-23 20:09:10 (58259): Guest Log: [DEBUG] nc: connect to cern.ch port 80 (tcp) timed out: Operation now in progress
2018-10-23 20:09:10 (58259): Guest Log: nc: connect to cern.ch port 80 (tcp) timed out: Operation now in progress
2018-10-23 20:09:10 (58259): Guest Log: [DEBUG] 1
2018-10-23 20:09:10 (58259): Guest Log: [ERROR] Could not connect to cern.ch on port 80
2018-10-23 20:09:10 (58259): Guest Log: [INFO] Shutting Down.
2018-10-23 20:09:10 (58259): VM Completion File Detected.
2018-10-23 20:09:10 (58259): VM Completion Message: Could not connect to cern.ch on port 80
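
The log shows the guest testing a TCP connection to cern.ch on port 80 and timing out. A similar check can be sketched from the host side as below. Note that this runs outside the VM, so it only approximates what the guest does, and the helper name can_connect and the 10-second timeout are assumptions.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 10.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers timeouts, refused connections and DNS lookup failures.
        return False


if __name__ == "__main__":
    print(can_connect("cern.ch", 80))
```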

Only some tasks are affected, so might it be a network change? I don't see any issue on my network, and so far only a few hosts are affected. Sorry that my hosts sent a lot of errors; I will stop fetching new tasks until it is solved or changed.
4) Message boards : Theory Application : jobs is empty (Message 36820)
Posted 22 Sep 2018 by Gunde
Post:
Looks like we are dry now

Exit status 207 (0x000000CF) EXIT_NO_SUB_TASKS
5) Message boards : Theory Application : New version 263.80 (Message 36804)
Posted 21 Sep 2018 by Gunde
Post:
It is a lottery, and it is strange that these tasks hand out credits so differently.

~300 up to ~32k is a big gap.
6) Message boards : Theory Application : jobs is empty (Message 36802)
Posted 21 Sep 2018 by Gunde
Post:
Looking at MCPLOTS, it has hit 0, contributed CPU time is also 0, and MCPLOTS is spending no time generating new jobs. I have seen a post about a server issue and a backup server replacement, but I wonder whether new jobs are the cause of this?
Hosts are running and get some jobs, but may not run full time and sit idle. I have checked a few tasks and the number of jobs done differs between them.

Could the project admins announce whether new jobs will be released to fill the tasks that are sent out?

Should we suspend the tasks until the batch system is back to normal? Any info and guidelines for users would be appreciated.
7) Message boards : News : File upload issues (Message 33838)
Posted 14 Jan 2018 by Gunde
Post:
LHC would probably not be able to help with your CPDN task issue.
8) Message boards : News : File upload issues (Message 33755)
Posted 9 Jan 2018 by Gunde
Post:
My host managed to upload half yesterday but is now stuck again. As it got some tasks back it started to download new ones, but as it looks now the downloads are pending, and it takes 2 minutes to download a task that has a 30-second runtime.

Just wait it out and hopefully the server will catch up.

Edit: Got most tasks uploaded, and downloads no longer have pending time.
9) Message boards : Sixtrack Application : No new WUs (Message 33491)
Posted 24 Dec 2017 by Gunde
Post:
Upload and download issues; some tasks are close to reaching their deadline now.
10) Message boards : ATLAS application : Download failures (Message 31727)
Posted 30 Jul 2017 by Gunde
Post:
Sun 30 Jul 2017 02:42:16 PM CEST | LHC@home | Temporarily failed download of jf_3536bf3e25f337041aca72316e5e0fec: transient HTTP error
Sun 30 Jul 2017 02:42:16 PM CEST | LHC@home | Backing off 00:25:41 on download of jf_3536bf3e25f337041aca72316e5e0fec
Sun 30 Jul 2017 02:42:16 PM CEST | LHC@home | Temporarily failed download of jf_d4b6ce59cac0e54eb4bddb1b2e4b43e2: transient HTTP error
Sun 30 Jul 2017 02:42:16 PM CEST | LHC@home | Backing off 00:16:41 on download of jf_d4b6ce59cac0e54eb4bddb1b2e4b43e2
Sun 30 Jul 2017 02:42:18 PM CEST | | Internet access OK - project servers may be temporarily down.

With debug:
Sun 30 Jul 2017 03:37:54 PM CEST | LHC@home | [http] HTTP_OP::init_get(): http://boincai04.cern.ch/Atlas-test/download/10d/vnPNDmwxevqnSu7Ccp2YYBZmABFKDmABFKDmXNGKDmhFLKDmFy3E7n_EVNT.11266146._002827.pool.root.1
Sun 30 Jul 2017 03:37:54 PM CEST | LHC@home | Started download of jf_3536bf3e25f337041aca72316e5e0fec
Sun 30 Jul 2017 03:37:54 PM CEST | LHC@home | [http] HTTP_OP::init_get(): http://boincai04.cern.ch/Atlas-test/download/13c/GmYNDmcofvqnSu7Ccp2YYBZmABFKDmABFKDmXNGKDmuHLKDmJIpshn_EVNT.11266146._002831.pool.root.1
Sun 30 Jul 2017 03:37:54 PM CEST | LHC@home | Started download of jf_d4b6ce59cac0e54eb4bddb1b2e4b43e2
Sun 30 Jul 2017 03:37:54 PM CEST | LHC@home | [http] [ID#1548] Info: Connection 853 seems to be dead!
Sun 30 Jul 2017 03:37:54 PM CEST | LHC@home | [http] [ID#1548] Info: Closing connection 853
Sun 30 Jul 2017 03:37:54 PM CEST | LHC@home | [http] [ID#1549] Info: Found bundle for host boincai04.cern.ch: 0x559afaf3cfe0 [serially]
Sun 30 Jul 2017 03:37:54 PM CEST | | [network_status] status: online
Sun 30 Jul 2017 03:37:55 PM CEST | LHC@home | [http] [ID#1548] Info: Trying 128.142.202.86...
Sun 30 Jul 2017 03:37:55 PM CEST | LHC@home | [http] [ID#1549] Info: Hostname was found in DNS cache
Sun 30 Jul 2017 03:37:55 PM CEST | LHC@home | [http] [ID#1549] Info: Trying 128.142.202.86...
11) Message boards : ATLAS application : all ATLAS tasks fail after about 10 minutes (Message 31709)
Posted 29 Jul 2017 by Gunde
Post:
Looks like the ATLAS servers have some issue; I am having problems downloading tasks.

For some of the tasks that are running I am not able to open the VM console; those that I could open show no events done.

Could we get a status update on what the issue could be?

(This is only related to ATLAS, and not all tasks are affected.)
12) Message boards : News : CMS@Home -- please set No New Tasks and perhaps temporarily run another project (Message 31462)
Posted 17 Jul 2017 by Gunde
Post:
Does that mean in bionic or in the LHC@Home preferences? (Run only the selected applications)

If it is in bionic, do you want us to accept no new tasks for LHC@home? Or does CMS@home have its own project you can add to bionic?
(Sorry for all the questions, I just want to make sure I'm doing the right thing.)


BOINC: Berkeley Open Infrastructure for Network Computing

bionic is a different thing.
13) Message boards : Theory Application : Tests of Theory app on LHC@home (Message 27891)
Posted 20 Nov 2016 by Gunde
Post:
Yes, I did a check now and can't find any error messages from today's tasks; it looks like they solved it.
I compared with vLHC tasks, as they use the same version, and both projects work just fine now.

So things did take another turn since yesterday:
Valid (334) · Invalid (0) · Error (387)

Start and suspend work fine here, same as at vLHC, but I noticed that a lot of tasks got postponed due to high CPU usage when other tasks started. I saw a peak that hit 100%, so they got interrupted while saving data. They resume after a restart, with no data lost.

Only one host is unable to run these tasks now. It's a Windows 10 host with a VPN on. I will suspend that host.
14) Message boards : Theory Application : Tests of Theory app on LHC@home (Message 27889)
Posted 20 Nov 2016 by Gunde
Post:
Tested the Theory application today.

Valid (56) · Invalid (0) · Error (383)

Those that got an error say:
Exit status -1073740791 (0xC0000409) STATUS_STACK_BUFFER_OVERRUN

Stderr
2016-11-20 17:02:48 (11452): VM Heartbeat file specified, but missing.
2016-11-20 17:02:48 (11452): VM Heartbeat file specified, but missing file system status. (errno = '2')
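
The negative exit status and the hex code next to it are the same 32-bit value, once read as signed and once as unsigned. A minimal sketch of the conversion, with exit_status_to_hex being a name made up for this example:

```python
def exit_status_to_hex(status: int) -> str:
    """Format an exit status as an unsigned 32-bit hex code."""
    # Masking with 0xFFFFFFFF maps a negative signed value to its
    # unsigned 32-bit (two's complement) representation.
    return f"0x{status & 0xFFFFFFFF:08X}"


if __name__ == "__main__":
    print(exit_status_to_hex(-1073740791))  # 0xC0000409
    print(exit_status_to_hex(53))           # 0x00000035
```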



©2018 CERN