Message boards : Number crunching : Tasks stuck at 99.99% with run time of 1 day+
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
AndreyOR

Send message
Joined: 8 Dec 19
Posts: 37
Credit: 7,587,303
RAC: 192
Message 46554 - Posted: 29 Mar 2022, 21:33:14 UTC - in response to Message 46526.  

Disabling macOS time sync early in the process helped as I stopped getting those kinds of messages and the last batch of tasks just completed successfully. As I was looking into your suggestion I noticed that when setting up a VM in VBox there's an option under System/Motherboard to specify "Hardware Clock in UTC Time". It's checked by default so I unchecked it and turned time sync in macOS back on (which is the default anyway). I'm curious to see if this simpler solution will also work as changing BIOS time and updating Windows registry and making sure that VMs are set up right is a bit more involved.
ID: 46554 · Report as offensive     Reply Quote
Nuadormrac

Send message
Joined: 26 Sep 05
Posts: 85
Credit: 421,130
RAC: 0
Message 46865 - Posted: 10 Jun 2022, 15:28:32 UTC
Last modified: 10 Jun 2022, 15:29:42 UTC

I have an odd one because it's not exactly this. The task has run over a day, but the % is just 99.982% and still incrementing. Is it going to turn into one of these? Or is there a reason some tasks might proceed way slower then the average, and it's still worth completing? Most of my ATLAS tasks complete more around 4ish hours, at times down to 2+ hours...
ID: 46865 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2451
Credit: 233,896,546
RAC: 155,898
Message 46866 - Posted: 10 Jun 2022, 16:17:14 UTC - in response to Message 46865.  

You are running a mix of ATLAS/CMS/Theory.
Be so kind as to post a link to the task in question.
ID: 46866 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2159
Credit: 164,212,604
RAC: 172,275
Message 46867 - Posted: 10 Jun 2022, 18:10:22 UTC - in response to Message 46865.  

Most of my ATLAS tasks complete more around 4ish hours, at times down to 2+ hours...

When the runtime for Atlas is longer as you wrote (2-4 hours), something went wrong with this Task.
Have also a few per day went wrong, but hundreds are ok.
When you look into Boincmanager for this wrong task and see after half a hour no growing CPU-Task, you can delete this task.
ID: 46867 · Report as offensive     Reply Quote
Nuadormrac

Send message
Joined: 26 Sep 05
Posts: 85
Credit: 421,130
RAC: 0
Message 46868 - Posted: 10 Jun 2022, 22:54:16 UTC - in response to Message 46867.  
Last modified: 10 Jun 2022, 22:54:49 UTC

The task was this one

https://lhcathome.cern.ch/lhcathome/result.php?resultid=356639181

I aborted, though it had progressed up to 99.998% from what I indicated earlier, which is why i was holding off. It definitely seemed to be an outlier....
ID: 46868 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2451
Credit: 233,896,546
RAC: 155,898
Message 46871 - Posted: 11 Jun 2022, 7:12:00 UTC - in response to Message 46868.  

Since you already aborted the task the log doesn't contain useful debugging information except that it confirms it was an ATLAS task.
From another task log on the same computer:
2022-06-09 04:29:54 (246820): Setting CPU Count for VM. (7)


This computer has an i7-7700HQ CPU with 4 physical cores.
The VirtualBox experts recommend not to configure VMs with more than the available physical cores.
Hence, on this computer the limit would be 4.
You configured a 7-core VM.

See:
https://forums.virtualbox.org/viewtopic.php?f=35&t=77413
ID: 46871 · Report as offensive     Reply Quote
Toby Broom
Volunteer moderator

Send message
Joined: 27 Sep 08
Posts: 817
Credit: 663,083,581
RAC: 200,151
Message 46873 - Posted: 11 Jun 2022, 10:14:53 UTC

I do get a few per week, I don't really know how to fix it and overall its less than 1%, so I just abort them.
ID: 46873 · Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Number crunching : Tasks stuck at 99.99% with run time of 1 day+


©2024 CERN