1) Message boards : ATLAS application : Bad WUs? (Message 47188)
Posted 27 Aug 2022 by keputnam
Post:
May or may not apply to you, but I had similar problems a while ago

At the suggestion of someone (don't remember if it was here or my ISP Help Desk), I connected my cruncher machine via copper rather than WiFi, even though I was using 802.11ac

All of my LHC/ATLAS connection problems went away
2) Message boards : Number crunching : No Tasks Available (Message 46781)
Posted 15 May 2022 by keputnam
Post:
Thank You !

I don't remember changing that, but obviously I did

Changed that (un-selected), saved, updated the project from the client and got 40 WUs!! All with due dates of 2022-05-23

With a historical minimum run time (on this machine) of just under 2 hours, a maximum of 5.5 hours, and an average of about 3 1/2 hours, there is no way I will successfully complete 40 WUs in 7 days!
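
A rough back-of-the-envelope check of those figures, as a sketch only. It assumes tasks run strictly one at a time and that the machine is available around the clock; neither is stated in the post, and the variable names are illustrative.

```python
# Figures quoted in the post
TASKS = 40
AVG_HOURS = 3.5        # "averaging about 3 1/2 hours"
MIN_HOURS = 2.0        # "just under 2 hours" (rounded up here)
MAX_HOURS = 5.5
DEADLINE_DAYS = 7

# Assumption: one task at a time, machine on 24 hours a day
available_hours = DEADLINE_DAYS * 24

needed_avg = TASKS * AVG_HOURS
needed_min = TASKS * MIN_HOURS
needed_max = TASKS * MAX_HOURS

# Fraction of every wall-clock hour that would have to go to these tasks
share_needed = needed_avg / available_hours

print(f"Hours available: {available_hours}")
print(f"Hours needed (min/avg/max): {needed_min}/{needed_avg}/{needed_max}")
print(f"Share of wall-clock time needed at the average: {share_needed:.0%}")
```

At the average run time the batch would need roughly five sixths of all wall-clock time for the whole week, and at the maximum run time it would not fit at all, so any time given to other projects or lost to restarts pushes it past the deadline.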

The scheduler has done this to me before :-(
3) Message boards : Number crunching : No Tasks Available (Message 46779)
Posted 15 May 2022 by keputnam
Post:
I hadn't realized that about ATLAS Long, thanks I'll remove it from preferences

As for priority, LHC has the highest priority of any Project I run, and has for the last 18 months or so. Almost twice that of any other project except WCG (don't even get me started on the way their migration has been handled)

But why would Priority result in the "No tasks available" message? Isn't that more along the lines of "not highest priority"?
4) Message boards : Number crunching : No Tasks Available (Message 46778)
Posted 15 May 2022 by keputnam
Post:
Nope, no changes for LHC at all
5) Message boards : Number crunching : No Tasks Available (Message 46775)
Posted 15 May 2022 by keputnam
Post:
The last ATLAS WU I received was 3 May 2022, 0:16:37 UTC

When updating LHC I get

2022-05-15 9:42:11 AM | LHC@home | Scheduler request completed: got 0 new tasks
2022-05-15 9:42:11 AM | LHC@home | No tasks sent
2022-05-15 9:42:11 AM | LHC@home | No tasks are available for ATLAS Simulation
2022-05-15 9:42:11 AM | LHC@home | No tasks are available for ATLAS (long simulation)
2022-05-15 9:42:11 AM | LHC@home | Project requested delay of 6 seconds
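
For anyone who wants to track this over several days, here is a minimal sketch that pulls the "no tasks available" applications out of event-log text. It assumes the pipe-separated layout seen in the lines above (timestamp | project | message); the function name is made up for illustration and is not part of BOINC.

```python
LOG = """\
2022-05-15 9:42:11 AM | LHC@home | Scheduler request completed: got 0 new tasks
2022-05-15 9:42:11 AM | LHC@home | No tasks sent
2022-05-15 9:42:11 AM | LHC@home | No tasks are available for ATLAS Simulation
2022-05-15 9:42:11 AM | LHC@home | No tasks are available for ATLAS (long simulation)
2022-05-15 9:42:11 AM | LHC@home | Project requested delay of 6 seconds
"""

PREFIX = "No tasks are available for "


def unavailable_apps(log_text):
    """Return the application names the scheduler reported as having no tasks."""
    apps = []
    for line in log_text.splitlines():
        # Assumed layout: timestamp | project | message
        parts = [p.strip() for p in line.split("|", 2)]
        if len(parts) != 3:
            continue  # skip lines that do not match the assumed layout
        _timestamp, _project, message = parts
        if message.startswith(PREFIX):
            apps.append(message[len(PREFIX):])
    return apps


apps = unavailable_apps(LOG)
print(apps)
```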


The server status page currently shows

ATLAS Simulation 2548 20145
ATLAS (long simulation) 0 0

And has shown both Atlas and Atlas Long as having WUs ready to send at multiple times since this started

Any hints as to what is going on here?
6) Message boards : ATLAS application : credits for runtime, not for cputime ? (Message 46053)
Posted 13 Jan 2022 by keputnam
Post:
So if I run one core on a 3.5 GHz machine, I get the same credit as someone who runs 8 cores or 10 cores on a 3.5 GHz machine?

That is totally incomprehensible, as I'm doing MUCH less work and returning MANY fewer results

If this is indeed correct, I can allocate more cores to other projects and get the same credit in Atlas
7) Message boards : ATLAS application : Repeated computation errors (Message 45799)
Posted 6 Dec 2021 by keputnam
Post:
Thanks, guys

The uninstall/re-install seems to have cured the problem

I'm about 10 minutes from completing a WU


Thinking back, I did upgrade VBox to 6.1.30. Apparently it didn't clean up the previously installed version very well
8) Message boards : ATLAS application : Repeated computation errors (Message 45798)
Posted 6 Dec 2021 by keputnam
Post:
Thanks for the responses

No, I have not done any tweaking of VBox

I have uninstalled VBox, used ADAware uninstall cleanup, rebooted, reinstalled VBox and the extension pack, and rebooted again

Now waiting for the job queue to clear out to where the Scheduler requests another ATLAS job
9) Message boards : ATLAS application : Repeated computation errors (Message 45789)
Posted 6 Dec 2021 by keputnam
Post:
I have used E: as my BOINC Data Drive for over 8 years. Started when my system HD was a little undersized. No problems at all

BOINC and VBOX executables and VBOX VMs are all on C:
10) Message boards : ATLAS application : Repeated computation errors (Message 45787)
Posted 6 Dec 2021 by keputnam
Post:
Got a new computer recently, and for a few days, things were great

Then I started getting Computation Errors on every job I run

Last time this happened the consensus was that I had lousy internet (failure to connect to the LHC servers)



I have since upgraded my connection from 25MB to 150MB
(I can post results from several speed test sites if it will help)

Running BOINC 7.16.11
VBox 6.1.26 (+ Extension Pack)



Any assistance gratefully accepted
11) Message boards : ATLAS application : Bad WUs? (Message 45176)
Posted 3 Aug 2021 by keputnam
Post:
Already aborted the second. Doesn't surprise me that there is no log file, the WUs never got any CPU time

But a third, previously downloaded and queued has started and is responding normally
12) Message boards : ATLAS application : Bad WUs? (Message 45174)
Posted 3 Aug 2021 by keputnam
Post:
The last two ATLAS WUs that I have downloaded appear to be faulty

One ran for 3 days before I noticed that while I could access the VM, ALT-F2 had no effect at all. I aborted that one

The latest one has now been running for about an hour and exhibits the same behavior

Also, while BOINC reports the WU as running, Resource Monitor shows VBox using no CPU at all

I have re-booted and upgraded to the latest VBox version with no change


TIA for any ideas or suggestions
13) Message boards : ATLAS application : Checkpointing? (Message 43393)
Posted 22 Sep 2020 by keputnam
Post:
Can anybody tell me why a program would take regular checkpoints, and then ignore them on restart and start from scratch? For ATLAS, BOINC manager/properties shows "time since last checkpoint," which resets every two minutes

I shut down BOINC to apply some Windows service

On restart, as soon as VBox manager initializes and BOINC monitoring starts, it correctly shows a certain number of events completed

Then at some later point, it resets to zero and starts at event 1 again


???
14) Message boards : ATLAS application : Confused (Message 43309)
Posted 4 Sep 2020 by keputnam
Post:
Oh, I realize the purpose, but this has only ever happened on ATLAS as far as I can remember

And this is the third go round on the circus for me
15) Message boards : ATLAS application : Confused (Message 43308)
Posted 4 Sep 2020 by keputnam
Post:
Already have an app_config with max_concurrent set to 1
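
For reference, a minimal sketch of what such a file might contain, generated here with Python's standard library. The `app_config` / `app` / `name` / `max_concurrent` elements follow BOINC's documented app_config.xml layout; the application name "ATLAS" is an assumption (check the name your client reports), and the helper function is purely illustrative.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name, max_concurrent):
    """Build app_config.xml text limiting one application's concurrent tasks."""
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name  # assumed application name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    return ET.tostring(root, encoding="unicode")


xml_text = build_app_config("ATLAS", 1)
print(xml_text)
```

The resulting text would go in a file named app_config.xml in the project's folder under the BOINC data directory, after which the client has to re-read its config files.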

We took three power hits over about 19 hours, so that would account for three restarts

I think I did one for Windows maintenance, too




As for the aborted tasks, and ridiculous number of new tasks sent

I haven't changed relative resource share in over 6 months, nor added any new projects and the scheduler SHOULD be smart enough to not send me work I will never complete on time
16) Message boards : ATLAS application : Confused (Message 43305)
Posted 4 Sep 2020 by keputnam
Post:
Can anyone explain this to me


Run Time      CPU Time      Credit
120,886.49    140,595.50      590.26
107,254.38     36,754.30    2,149.71
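
To make the mismatch concrete, here is a small sketch that works out the ratios for the two results above. It assumes the run time and CPU time columns are in seconds; the dictionary keys are illustrative names only, and nothing here reflects how the project actually computes credit.

```python
# The two results quoted above (times assumed to be in seconds)
tasks = [
    {"run_s": 120886.49, "cpu_s": 140595.50, "credit": 590.26},
    {"run_s": 107254.38, "cpu_s": 36754.30, "credit": 2149.71},
]


def summarize(task):
    """Return CPU/run ratio and credit per hour of CPU time and of run time."""
    cpu_hours = task["cpu_s"] / 3600
    run_hours = task["run_s"] / 3600
    return {
        "cpu_per_run": task["cpu_s"] / task["run_s"],
        "credit_per_cpu_hour": task["credit"] / cpu_hours,
        "credit_per_run_hour": task["credit"] / run_hours,
    }


summaries = [summarize(t) for t in tasks]
for s in summaries:
    print(
        f"CPU/run: {s['cpu_per_run']:.2f}  "
        f"credit per CPU hour: {s['credit_per_cpu_hour']:.1f}  "
        f"credit per run hour: {s['credit_per_run_hour']:.1f}"
    )
```

The first task used more CPU time than wall-clock time yet earned far less credit per hour than the second, whichever of the two time columns is used as the basis.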



Also, I have another complaint about the Scheduler. I had five WUs cancelled by the server because I would have returned them late. Fair enough

But it sent me another 15 with an 8 day return date !

There is no way in hell I'll get through them all on time

What gives?
17) Message boards : ATLAS application : Processor Time Locks Up Elapsed Time Continues to Climb (Message 43263)
Posted 24 Aug 2020 by keputnam
Post:
Well, running copper seems to have reduced latency considerably

Your Internet speed is 29 Mbps
Latency: unloaded 23 ms, loaded 35 ms
Upload speed: 4.6 Mbps



Finished one WU after installing the CAT6, and the next is now an hour and a half into execution, crunching along quite nicely



Thanks for all the assistance, guys
18) Message boards : ATLAS application : Processor Time Locks Up Elapsed Time Continues to Climb (Message 43257)
Posted 23 Aug 2020 by keputnam
Post:
So when the task "locks up" and gets no more CPU time, as verified by Windows Resource Monitor, what am I looking for?


Current WU is at 13:48:24 and crunching happily along
19) Message boards : ATLAS application : Processor Time Locks Up Elapsed Time Continues to Climb (Message 43255)
Posted 22 Aug 2020 by keputnam
Post:
Thanks for the response

I'll get some CAT6 today or tomorrow

I only run 1 LHC task at a time, but have BOINC set to use 10 of my 12 "processors" (Hex core/MT)


The last two ATLAS tasks ran to completion, waiting for another at this point

Curiously, one of them kind of lugged along getting 1-2 sec of CPU time per 20 seconds of Elapsed, then at about 8:30 took off and ran to completion!
20) Message boards : ATLAS application : Processor Time Locks Up Elapsed Time Continues to Climb (Message 43244)
Posted 21 Aug 2020 by keputnam
Post:
Copper to the house, WiFi throughout. Contract is 24Mb. Modem/Router is a Two Wire.
Speed from desktop to the router ranges from 56 to over 120 Mbps, currently 72, which is about average. My to-do list for the day includes making a few phone calls to other ISPs

The supervisor I talked to suggested trying copper from the router to the desktop to see if it made any difference. I was going to get some the next time I was out and about. If it helps, I've then got to figure out how to permanently run it from the kitchen (middle of the house) to my den

C:\WINDOWS\system32>ping 8.8.8.8

Pinging 8.8.8.8 with 32 bytes of data:
Reply from 8.8.8.8: bytes=32 time=25ms TTL=117
Reply from 8.8.8.8: bytes=32 time=31ms TTL=117
Reply from 8.8.8.8: bytes=32 time=25ms TTL=117
Reply from 8.8.8.8: bytes=32 time=24ms TTL=117







From fast.com:
Your Internet speed is 16 Mbps
Latency: unloaded 25 ms, loaded 125 ms
Upload speed: 4.2 Mbps



