Message boards : Theory Application : Theory Simulation task jumps to 10 Days time remaining...
Message board moderation

To post messages, you must log in.

AuthorMessage
Win Prion

Send message
Joined: 26 Nov 22
Posts: 3
Credit: 37,168
RAC: 0
Message 48508 - Posted: 7 Sep 2023, 16:50:50 UTC

I'm new to LHC. I did a quick scan of the forums and don't see this problem:

Most of the Theory Simulation tasks I've run become "infinite" runs after about 40 minutes of crunching. That is, the estimated time remaining jumps from minutes to 9 or 10 days. I assume something failed/corrupted and I just abort the tasks.

Is there a fix?

Thanks!

Win
ID: 48508 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2425
Credit: 227,510,326
RAC: 129,896
Message 48509 - Posted: 7 Sep 2023, 17:22:58 UTC - in response to Message 48508.  

You may check the stderr.txt in the slots folder.
If it looks a bit like this you should let the task run:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=399093799

Your logs show that you cancelled many tasks.
This may have confused BOINC's runtime estimation.
Beside that Theory tasks can have real runtimes between a few seconds and 10 days.
Extreme runtimes are rare but they happen and this always confuses BOINC which expects runtimes to be as close to the average as possible.
ID: 48509 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 28 Sep 04
Posts: 677
Credit: 43,766,334
RAC: 14,495
Message 48510 - Posted: 7 Sep 2023, 17:52:32 UTC - in response to Message 48508.  

I'm new to LHC. I did a quick scan of the forums and don't see this problem:

Most of the Theory Simulation tasks I've run become "infinite" runs after about 40 minutes of crunching. That is, the estimated time remaining jumps from minutes to 9 or 10 days. I assume something failed/corrupted and I just abort the tasks.

Is there a fix?

Thanks!

Win

This is normal behavior for Theory tasks, at least when running in VM. There is no actual feedback of task progress to Boinc manager, it is just using an internal estimate of task progress. And if task is not finished by original estimate the time remaining display on Boinc jumps to show maximum runtime (which is 10 days). This estimation has nothing to do with task's actual needed runtime. Only place where you can see the progress of the task is inside the virtual machine terminal window Alt+F2.
ID: 48510 · Report as offensive     Reply Quote
Win Prion

Send message
Joined: 26 Nov 22
Posts: 3
Credit: 37,168
RAC: 0
Message 48511 - Posted: 7 Sep 2023, 18:05:31 UTC - in response to Message 48509.  

Thanks.
ID: 48511 · Report as offensive     Reply Quote
Win Prion

Send message
Joined: 26 Nov 22
Posts: 3
Credit: 37,168
RAC: 0
Message 48512 - Posted: 7 Sep 2023, 18:05:41 UTC - in response to Message 48510.  

Thanks!
ID: 48512 · Report as offensive     Reply Quote
keputnam

Send message
Joined: 27 Sep 04
Posts: 102
Credit: 7,340,888
RAC: 6,218
Message 48597 - Posted: 20 Sep 2023, 22:38:28 UTC - in response to Message 48509.  

I also am seeing time remaining jump to around 10 days, and it never drops I let one run for two days, and time remaining never dropped

At the same time VBOXHEADLESS is using ZERO CPU according to Resource Monitor
ID: 48597 · Report as offensive     Reply Quote
keputnam

Send message
Joined: 27 Sep 04
Posts: 102
Credit: 7,340,888
RAC: 6,218
Message 48598 - Posted: 20 Sep 2023, 22:39:22 UTC - in response to Message 48509.  

clicking your link gives

No such task: 399093799
ID: 48598 · Report as offensive     Reply Quote
computezrmle
Volunteer moderator
Volunteer developer
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 15 Jun 08
Posts: 2425
Credit: 227,510,326
RAC: 129,896
Message 48601 - Posted: 21 Sep 2023, 11:02:48 UTC - in response to Message 48598.  

The DB deletes successful results about 2 weeks after assimilation.


Near the end of stderr.txt you should find a line like this:
2023-09-21 02:44:46 (21624): Guest Log: 02:44:46 PDT -07:00 2023-09-21: cranky: [INFO] Container 'runc' finished with status code 0.

If the status code is "0" the scientific app succeeded.


At the end of stderr.txt you should find a line like this:
02:44:53 (21624): called boinc_finish(0)

Here, status code "(0)" means the task succeeded from BOINC's perspective and you should be given credits.


2023-09-21 02:26:45 (21624): Setting Memory Size for VM. (2000MB)

This points out you are using a higher RAM setting for your VMs via app_config.xml.
That's not wrong although Theory tasks don't need it.
They run fine with the standard setting of 630 MB.
ID: 48601 · Report as offensive     Reply Quote

Message boards : Theory Application : Theory Simulation task jumps to 10 Days time remaining...


©2024 CERN