Message boards : Theory Application : Task at 100% and still running
Message board moderation

To post messages, you must log in.

AuthorMessage
S@NL - John van Gorsel

Send message
Joined: 8 Aug 11
Posts: 7
Credit: 2,748,640
RAC: 229
Message 52862 - Posted: 22 Jan 2026, 17:43:44 UTC
Last modified: 22 Jan 2026, 17:53:40 UTC

I have a task that took 26 hours to reach 100% and even at 100% it continues. The Windows task manager shows that Virtualbox still keeps a CPU core busy. The logfile that I get to see through the "show graphics" button also shows that there is still information added. This is the second time that I see this and the first time I let the file run another day but then I noticed that the running.log seemed to have reset so I aborted that task.

Current task:
https://lhcathome.cern.ch/lhcathome/result.php?resultid=431737820

Is there a graceful way to end this task? My pc has done 28 hours of productive work (although that is what it looks like) so I don't want to simply abort this one.
Any help is appreciated.

Regards, John
ID: 52862 · Report as offensive     Reply Quote
Toggleton

Send message
Joined: 4 Mar 17
Posts: 45
Credit: 12,922,881
RAC: 6,041
Message 52863 - Posted: 22 Jan 2026, 21:05:35 UTC - in response to Message 52862.  

Theory tasks can be quite long. There are threads about 6-10days long running tasks. The % in BOINC is clueless about the state in the Virtual machine. So that is no useful indicator for this Project.

If you still see "217 of 760 integrations done" or "52800 events" and counting up then the task is fine(i think most task have been 100k events)
i have not exact knowledge of virtual box tasks. did run theory native tasks so can't help with the exact location of the log file if "show graphics" window is not printing that.

Theory is always a big surprise bag. with ultra short tasks and long runner randomly. If you prefer more stable runtimes then is atlas the better project.
ID: 52863 · Report as offensive     Reply Quote
S@NL - John van Gorsel

Send message
Joined: 8 Aug 11
Posts: 7
Credit: 2,748,640
RAC: 229
Message 52864 - Posted: 22 Jan 2026, 21:28:43 UTC - in response to Message 52863.  

Thanks, I'll allow it some more time then. Its now at "Integrating 328 of 760, iteration 3"
ID: 52864 · Report as offensive     Reply Quote
Anne Havinga

Send message
Joined: 4 Mar 20
Posts: 23
Credit: 8,028,039
RAC: 7,667
Message 53008 - Posted: 9 Feb 2026, 21:08:56 UTC - in response to Message 52862.  

Like the others say, just keep it running. Some may take a long time on a not so powerful computer.
This long running task finished today.
https://lhcathome.cern.ch/lhcathome/result.php?resultid=431519777
Run time 30 days 13 hours 32 min 28 sec
CPU time 26 days 0 hours 45 min 47 sec
Credit 32,023.18
ID: 53008 · Report as offensive     Reply Quote
[TA]Assimilator1
Avatar

Send message
Joined: 29 Nov 13
Posts: 61
Credit: 4,228,708
RAC: 3,276
Message 53031 - Posted: 14 Feb 2026, 16:29:19 UTC

Yea I've had loads that have appeared to be stuck at 100% and I aborted quite a few too!
Would've been useful if the LHC guys had sent us a BOINC message warning about the long runtimes and the inaccuracy of the BOINC % for VM tasks, would've saved wasted crunching time :(.

My 'show graphics' button is greyed out, any idea why?
Team AnandTech - WCG, F@H, MW@H, Ast@H, LHC@H, R@H, CPDN, E@H.
Main rig - Ryzen 5800X, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RTX 3060 Ti 8GB, Win10 64bit
2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64
ID: 53031 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1552
Credit: 10,068,606
RAC: 604
Message 53032 - Posted: 14 Feb 2026, 17:30:39 UTC - in response to Message 53031.  

In reply to [TA]Assimilator1's message of 14 Feb 2026:
Yea I've had loads that have appeared to be stuck at 100% and I aborted quite a few too!
Would've been useful if the LHC guys had sent us a BOINC message warning about the long runtimes and the inaccuracy of the BOINC % for VM tasks, would've saved wasted crunching time :(.

My 'show graphics' button is greyed out, any idea why?
Read here: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6448&postid=53018
ID: 53032 · Report as offensive     Reply Quote
[TA]Assimilator1
Avatar

Send message
Joined: 29 Nov 13
Posts: 61
Credit: 4,228,708
RAC: 3,276
Message 53047 - Posted: 16 Feb 2026, 19:44:56 UTC - in response to Message 53032.  
Last modified: 16 Feb 2026, 19:45:27 UTC

In reply to Crystal Pellet's message of 14 Feb 2026:
In reply to [TA]Assimilator1's message of 14 Feb 2026:
Yea I've had loads that have appeared to be stuck at 100% and I aborted quite a few too!
Would've been useful if the LHC guys had sent us a BOINC message warning about the long runtimes and the inaccuracy of the BOINC % for VM tasks, would've saved wasted crunching time :(.

My 'show graphics' button is greyed out, any idea why?
Read here: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=6448&postid=53018


Thanks :), I see this for one task I have running :-

PYTHIA Warning in StringFragmentation::fragmentToJunction: bad convergence junction rest frame
PYTHIA Warning in JunctionSplitting::SplitJunPairs: parallel junction state not allowed.
PYTHIA Warning in JunctionSplitting::CheckColours: Not possible to split junctions; making new colours
PYTHIA Warning in JunctionSplitting::CheckColours: Made a gluon colour singlet; redoing colours
PYTHIA Error in StringFragmentation::fragment: stuck in joining
PYTHIA Error in Pythia::next: hadronLevel failed; try again
PYTHIA Error in MiniStringFragmentation::fragment: no 1- or 2-body state found above mass threshold
PYTHIA Warning in SimpleSpaceShower::pT2nextQCD: weight above unity
100 events processed
PYTHIA Error in SimpleSpaceShower::pT2nearThreshold: stuck in loop
PYTHIA Warning in SimpleSpaceShower::pT2nextQCD: small daughter PDF
200 events processed
PYTHIA Error in StringFragmentation::fragmentToJunction: caught in junction flavour loop
300 events processed
400 events processed
500 events processed
PYTHIA Warning in StringFragmentation::finalRegion: random axis needed to break tie
600 events processed
700 events processed
PYTHIA Warning in MiniStringFragmentation::ministring2two: random axis needed to break tie
800 events processed
900 events processed


ends :-


Pythia::next(): 19000 events have been generated
19000 events processed
19100 events processed
19200 events processed
19300 events processed
19400 events processed
19500 events processed
19600 events processed


I assume it's still running then? Is there anywhere in that file that shows progress point?
Are the warnings anything to worry about?
Team AnandTech - WCG, F@H, MW@H, Ast@H, LHC@H, R@H, CPDN, E@H.
Main rig - Ryzen 5800X, MSI B450 Gm Pro C AC, 32GB DDR4 3200, RTX 3060 Ti 8GB, Win10 64bit
2nd rig - i7 4930k @4.1 GHz, 16 GB DDR3 1866, HD 7870XT 3GB(DS), Win7 64
ID: 53047 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1552
Credit: 10,068,606
RAC: 604
Message 53048 - Posted: 16 Feb 2026, 20:41:13 UTC - in response to Message 53047.  

In reply to [TA]Assimilator1's message of 16 Feb 2026:
Thanks :), I see this for one task I have running :-

19300 events processed
19400 events processed
19500 events processed
19600 events processed


I assume it's still running then? Is there anywhere in that file that shows progress point?
Are the warnings anything to worry about?
Don't worry about warnings as long as you see the number of events processed (slowly) increasing.
In the very first line of runRivet.log you see the job parameters. The penultimate number on that first line is how many events should be processed. Mostly 100,000 events.
Be aware that the task will not survive a system restart,
ID: 53048 · Report as offensive     Reply Quote
DocLogic

Send message
Joined: 12 May 20
Posts: 6
Credit: 28,927
RAC: 353
Message 53184 - Posted: 18 Mar 2026, 14:25:46 UTC

F.Y.I.
LHC@home 100.00% Running 6d 09:04:22 Deadline 3/21/2026 7:12:12 PM Theory Simulation 302.10 (docker) Theory_2922-4882143-675_2
all other tasks have finished in hours not days...
ID: 53184 · Report as offensive     Reply Quote
Toggleton

Send message
Joined: 4 Mar 17
Posts: 45
Credit: 12,922,881
RAC: 6,041
Message 53185 - Posted: 18 Mar 2026, 16:51:07 UTC - in response to Message 53184.  

On this page you can read the input file of that long runner task. https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=239288667
runspec=boinc pp winclusive 7000 - - sherpa 2.2.9 default 1000 675"

The other user has canceled that task after 8days. hopefully you can finish the task.
As long as the task is still progressing and is writing to the logs. i think sherpa was in the past a bit crashhappy)
ID: 53185 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1552
Credit: 10,068,606
RAC: 604
Message 53186 - Posted: 18 Mar 2026, 17:08:34 UTC - in response to Message 53185.  

runspec=boinc pp winclusive 7000 - - sherpa 2.2.9 default 1000 675"
That's not looking good:
From: https://mcplots-dev.cern.ch/production.php?view=runs&rev=2922&display=fail

            run                             events  attempts success failure unknown
pp winclusive 7000 - - sherpa 2.2.9 default    0      48         0      6      42
ID: 53186 · Report as offensive     Reply Quote
DocLogic

Send message
Joined: 12 May 20
Posts: 6
Credit: 28,927
RAC: 353
Message 53220 - Posted: 20 Mar 2026, 13:56:22 UTC - in response to Message 53184.  

Now at 08d 07:20:30 and still running, Log active and other Tasks reporting there. No error messages of note. Allowing run to continue...
ID: 53220 · Report as offensive     Reply Quote
rob

Send message
Joined: 4 Mar 11
Posts: 45
Credit: 4,016,025
RAC: 116
Message 53232 - Posted: 22 Mar 2026, 14:12:27 UTC

The progress estimate of these long duration tasks is very misleading. It would appear that a whole section of the calculation is not considered in the duration calculation - for example my current task reached ~100% in about 4 hours, it is now at 15hours elapsed, but runRvet.log shows that it's at iteration 71 of 760. I assume this very slow phase starts at iteration 1, and continues to iteration 760, thus it should be a fairly simple change to include this potentially quite substantial part of the calculation into expected runtime calculation, and so give a better estimate of the time remaining.
ID: 53232 · Report as offensive     Reply Quote
DocLogic

Send message
Joined: 12 May 20
Posts: 6
Credit: 28,927
RAC: 353
Message 53261 - Posted: 24 Mar 2026, 13:53:52 UTC - in response to Message 53184.  

Ending run after 12 days....
Application
Theory Simulation 302.10 (docker)
Name
Theory_2922-4882143-675
State
Running
Received
3/11/2026 7:12:13 PM
Report deadline
3/21/2026 7:12:12 PM
Estimated computation size
3,600 GFLOPs
CPU time
11d 08:24:46
CPU time since checkpoint
---
Elapsed time
12d 04:29:33
Estimated time remaining
---
Fraction done
100.000%
Virtual memory size
2.55 MB
Working set size
95.37 MB
Directory
slots/8
Process ID
18140
Progress rate
0.360% per hour
Executable
docker_wrapper_18_windows_x86_64.exe
Application Name
Theory
Plan Class
docker
ID: 53261 · Report as offensive     Reply Quote
DocLogic

Send message
Joined: 12 May 20
Posts: 6
Credit: 28,927
RAC: 353
Message 53263 - Posted: 24 Mar 2026, 14:05:39 UTC - in response to Message 53261.  

This task is now under ERROR but continued to run after BOINC & PC shutdown and restart...
Name Theory_2922-4882143-675_2
Workunit 239288667
Created 11 Mar 2026, 21:53:12 UTC
Sent 12 Mar 2026, 1:12:12 UTC
Report deadline 23 Mar 2026, 1:12:12 UTC
Received ---
Server state Over
Outcome No reply
Client state New
Exit status 0 (0x00000000)
Computer ID 10970528
Run time 0 sec
CPU time 0 sec
Priority 0
Validate state Initial
Credit 0.00
Device peak FLOPS 5.56 GFLOPS
Application version Theory Simulation v302.10 (docker)
windows_x86_64
ID: 53263 · Report as offensive     Reply Quote

Message boards : Theory Application : Task at 100% and still running


©2026 CERN