Message boards : Theory Application : Task at 100% and still running
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profilerilian
Avatar

Send message
Joined: 12 Jul 08
Posts: 23
Credit: 941,384
RAC: 42
Message 53499 - Posted: 27 Apr 2026, 14:28:58 UTC

I have 8 tasks, still running on 100% for from 4 to 9 days

Project              % Done   Elapsed          Deadline              Status         Procs   WU name
LHC@home             100.00%  9 days 20:01:12  26-Apr-2026 02:39:34  executing      1 CPU   Theory_2922-4858714-845
LHC@home             100.00%  9 days 02:51:15  26-Apr-2026 02:39:34  executing      1 CPU   Theory_2922-4893018-844
LHC@home             100.00%  8 days 11:57:37  26-Apr-2026 02:39:34  executing      1 CPU   Theory_2922-4784078-845
LHC@home             100.00%  7 days 16:37:05  26-Apr-2026 02:39:35  executing      1 CPU   Theory_2922-4855406-845
LHC@home             100.00%  5 days 08:15:02  26-Apr-2026 02:39:35  executing      1 CPU   Theory_2922-4799558-845
LHC@home             100.00%  5 days 09:24:38  26-Apr-2026 02:39:34  executing      1 CPU   Theory_2922-4832757-845
LHC@home             100.00%  5 days 00:38:17  26-Apr-2026 02:39:35  executing      1 CPU   Theory_2922-4870278-845
LHC@home             100.00%  4 days 17:21:01  26-Apr-2026 02:39:34  executing      1 CPU   Theory_2922-4871537-845


Today they were marked in my tasks list as "Timeout - no response" for example https://lhcathome.cern.ch/lhcathome/workunit.php?wuid=240676463

I see after timeout task was picked up by other computer and finished in 30 mins ..

Should i abort or let them run?
I crunch for Ukraine
ID: 53499 · Report as offensive     Reply Quote
Michael E.

Send message
Joined: 9 Apr 23
Posts: 6
Credit: 99,895
RAC: 1,440
Message 53500 - Posted: 28 Apr 2026, 4:49:06 UTC - in response to Message 53499.  

I have a similar question @rilian.

I have two LHC Theory tasks that are taking 2 - 9 days to complete so far. The Computer is an older one running Windows 11, here: https://lhcathome.cern.ch/lhcathome/show_host_detail.php?hostid=11058658

I am using BOINC version 8.2.11 as part of BOINC testing. I obtained the Theory type using the Work unit link and the Input link. Many other tasks have completed with hours, but these two long tasks are:

Sent 19 Apr 15:41 and deadline 30 Apr 15:41, type pythia8, https://lhcathome.cern.ch/lhcathome/result.php?resultid=434988958

Sent 24 Apr 13:56 and deadline 5 May 13:56, type sherpa, https://lhcathome.cern.ch/lhcathome/result.php?resultid=434988958

The tasks have run for 7 days 8 hours (pythia8) and 2 days 7 hours (sherpa) as of 28 April 04 AM.

About 20-24 hours ago, I saw LHC disk use grow at a alarming rate so I rebooted and it seemed better. LHC disk use reached about 4 GB before reboot when viewed in BOINC Manager's Disk tab. Last night I increased BOINC disk quota because other BOINC projects had disk space problems reported in the Event Log. LHC disk usage is now at about 100 MB.

It there a way to view progress on a windows system? I saw a tail command mentioned in message forums but that seems to be a Linux command.
ID: 53500 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2304
Credit: 179,727,092
RAC: 17,509
Message 53501 - Posted: 28 Apr 2026, 5:12:20 UTC

What for info is in runRivet.log?
Are there xxxxx events processed?
A small number of Tasks from Theory would be better canceled.
For me up to five in the last few days.
Some Sherpa's are more special,
because they need days before first events line is shown.
ID: 53501 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1556
Credit: 10,101,515
RAC: 1,464
Message 53502 - Posted: 28 Apr 2026, 5:42:42 UTC - in response to Message 53500.  
Last modified: 28 Apr 2026, 5:44:46 UTC

In reply to Michael E.'s message of 28 Apr 2026:
About 20-24 hours ago, I saw LHC disk use grow at a alarming rate so I rebooted ...
When you have docker Theory's running, a reboot will kill those tasks and they will start from scratch after the restart.
The pythia8 probably will not succeed. From the attempts by others so far no one succeeded. For the sherpa job you provided the same link.

It there a way to view progress on a windows system? I saw a tail command mentioned in message forums but that seems to be a Linux command.
With Windows Powershell you can use the tail command.
Get-Content BOINC'sDataDir\slots\slotnumber\shared\runRivet.log -tail 8
ID: 53502 · Report as offensive     Reply Quote
Profilerilian
Avatar

Send message
Joined: 12 Jul 08
Posts: 23
Credit: 941,384
RAC: 42
Message 53503 - Posted: 28 Apr 2026, 13:46:57 UTC - in response to Message 53501.  

In reply to maeax's message of 28 Apr 2026:
What for info is in runRivet.log?

i have no such file in any of slots folders of LHC tasks

/var/lib/boinc/slots/8$ ls -la
total 77456
drwxrwx--x  2 boinc boinc     4096 Apr 28 00:23 .
drwxrwxr-x 11 boinc boinc     4096 Apr 27 19:27 ..
-rw-r--r--  1 boinc boinc        0 Apr 16 02:54 boinc_lockfile
-rw-r--r--  1 boinc boinc     8192 Apr 28 13:45 boinc_mmap_file
-rw-r--r--  1 boinc boinc        0 Apr 16 02:54 boinc_setup_complete
-rw-r--r--  1 boinc boinc      499 Apr 28 13:45 boinc_task_state.xml
-rw-r--r--  1 boinc boinc      474 Apr 16 02:54 Dockerfile
-rwxr-xr-x  1 boinc boinc  1209104 Apr 16 02:54 docker_wrapper
-rwxr-xr-x  1 boinc boinc    28909 Apr 16 02:54 entrypoint.sh
-rw-r--r--  1 boinc boinc     6077 Apr 28 00:23 init_data.xml
-rw-r--r--  1 boinc boinc   359841 Apr 16 02:54 input
-rw-r--r--  1 boinc boinc      148 Apr 16 02:54 job.toml
-rw-r--r--  1 boinc boinc 77666330 Apr 28 13:45 stderr.txt


contents of stderr.txt, last lines are the same:

running docker command: ps --all -f "name=boinc__lhcathome.cern.ch_lhcathome__theory_2922-4858714-845_1"
program: podman
command output:
CONTAINER ID  IMAGE                                                                         COMMAND               CREATED       STATUS      PORTS                  NAMES
af37de79728c  localhost/boinc__lhcathome.cern.ch_lhcathome__theory_2922-4858714-845:latest  /bin/sh -c ./entr...  18 hours ago  Created     0.0.0.0:56657->80/tcp  boinc__lhcathome.cern.ch_lhcathome__theory_2922-4858714-845_1
running docker command: stats --no-stream  --format "{{.CPUPerc}} {{.MemUsage}}" boinc__lhcathome.cern.ch_lhcathome__theory_2922-4858714-845_1
program: podman
command output:
0.00% 0B / 0B
invalid usage stats; using defaults


does this indicate anything ?
I crunch for Ukraine
ID: 53503 · Report as offensive     Reply Quote
Michael E.

Send message
Joined: 9 Apr 23
Posts: 6
Credit: 99,895
RAC: 1,440
Message 53504 - Posted: 28 Apr 2026, 14:52:33 UTC - in response to Message 53501.  

Thank you both @maeax and @Crystal Pellet.

@rilian, I just looked in each slot number directory for a shared directory. If you find a shared directory in .../slot/n/, the large runRivet.log file should be there. I assume the runRivet.log data exists for active tasks only?

Thank you for the Powershell tip @Crystal Pellet. Here is the data requested by @maeax:

The last lines of the C/ProgramData/BOINC/slots/1/shared contain:
PS C:\Users\muser> Get-Content C:/ProgramData/BOINC/slots\1\shared\runRivet.log -tail 12
0 90 (system) -11 0 0 0 0 0 0 0.000 0.000 0.000 13000.000 13000.000
1 2212 (p+) -12 0 0 3 0 0 0 0.000 0.000 6500.000 6500.000 0.938
2 2212 (p+) -12 0 0 4 0 0 0 0.000 0.000 -6500.000 6500.000 0.938
3 21 (g) -21 1 0 5 6 101 102 0.000 0.000 281.595 281.595 0.000
4 21 (g) -21 2 0 5 6 103 104 0.000 0.000 -0.010 0.010 0.000
5 21 g 23 3 4 0 0 101 104 0.691 0.627 258.954 258.956 0.000
6 21 g 23 3 4 0 0 103 102 -0.691 -0.627 22.630 22.650 0.000
Charge sum: 0.000 Momentum sum: 0.000 0.000 281.585 281.605 3.430

The sherpa task in slot 2 does not seem to have events yet:
PS C:\Users\muser> Get-Content C:/ProgramData/BOINC/slots\2\shared\runRivet.log -tail 8
1: (45.1301,-2.15495e-15,-2.15361e-16,45.1301)
2: (45.1301,0,0,-45.1301)
3: (3.33752e-07,5.59174e-13,-6.94771e-13,3.33752e-07)
4: (90.2603,-5.61329e-13,6.94556e-13,-3.33752e-07)
Process_Group::Differential(): Cross section is 'nan'.
Phase_Space_Integrator::AddPoint(): value = -nan. Skip.
Process_Group::Differential(): Cross section is 'nan'.
Phase_Space_Integrator::AddPoint(): value = -nan. Skip.

I will let sherpa task run until near the deadline. Should I also let PYTHIA task run for another day? I am suspending other BOINC tasks for now (I also run WCG) so the two LHC tasks both get closer to 100% CPU time (instead of 2/3).

And yes the I really appreciate the help!
ID: 53504 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2304
Credit: 179,727,092
RAC: 17,509
Message 53505 - Posted: 28 Apr 2026, 15:43:46 UTC
Last modified: 28 Apr 2026, 15:44:14 UTC

In MCPLOT you can search for this task.
[url]http://mcplots-dev.cern.ch/production.php?view=status&plots=hourly#plots [/url]
ID: 53505 · Report as offensive     Reply Quote
Pascal

Send message
Joined: 13 May 20
Posts: 64
Credit: 3,163,098
RAC: 3,250
Message 53506 - Posted: 28 Apr 2026, 18:37:33 UTC

Pour infos,les taches theory sur virtualbox dure environ 30 mn a 2 ou 3 heures maximum.c'est tres rare une tache qui dure 8 heures.on est ,2 fois sur 3,en dessous d'une heure et ça marche.Je ne sais pas ou en est theory sur docker,mais a chaque que j'ai voulu essayer,ça plantait en permanence que ce soit sous linux ou windows 10 ou windows 11.je n'ai jamais eu une tache theory qui dure une journée.En moyenne on est a 45 mn.

For your information, the theory tasks on virtualbox last about 30 minutes to 2 or 3 hours maximum. It’s very rare for a task to last 8 hours. You’re twice out of three, under an hour, and it works. I don’t know where Docker’s theory is, but every time I’ve wanted to try,It was constantly crashing, whether on Linux or Windows 10 or Windows 11. I have never had a theory task that lasts a day. On average, we are at 45 minutes.

https://lhcathome.cern.ch/lhcathome/results.php?userid=611803&offset=20&show_names=0&state=4&appid=
ID: 53506 · Report as offensive     Reply Quote
ProfileMagic Quantum Mechanic
Avatar

Send message
Joined: 24 Oct 04
Posts: 1313
Credit: 97,694,594
RAC: 106,766
Message 53507 - Posted: 28 Apr 2026, 19:41:28 UTC - in response to Message 53506.  
Last modified: 28 Apr 2026, 19:53:02 UTC

I have many VB Theory that were over 30 hours and even over 50 hours and even over 6 days and Docker version over 50 hours and even over 4 days total.
(and I save them since they tend to delete them here after a few days)

BTW the way I watch all of my Theory tasks with VB or Docker is leaving my Slots page open so I can check any as they are running to make sure they are actually running using
This PC > Windows (C:) > ProgramData > BOINC > slots > 1 > shared
windows 11
ID: 53507 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2304
Credit: 179,727,092
RAC: 17,509
Message 53508 - Posted: 29 Apr 2026, 3:53:33 UTC - in response to Message 53507.  
Last modified: 29 Apr 2026, 4:28:23 UTC

runRivet.log show atm for one running Theory-Docker-Task:
92000 events processed after 4 days and 11 hour.
===> [runRivet] Fri Apr 24 16:17:49 UTC 2026 [boinc pp jets 13000 180 - pythia8 8.244 CP1-CR1 100000 884]
This or next day Task will find the exit.
===> [runRivet] Mon Apr 27 08:22:46 UTC 2026 [boinc pp z1j 13000 - - sherpa 2.1.1 default 17000 878]
===> [runRivet] Mon Apr 27 09:42:24 UTC 2026 [boinc pp zinclusive 13000 - - sherpa 2.2.0 default 4000 878]
Those two running now 5 days!
ID: 53508 · Report as offensive     Reply Quote
Michael E.

Send message
Joined: 9 Apr 23
Posts: 6
Credit: 99,895
RAC: 1,440
Message 53510 - Posted: 1 May 2026, 15:23:19 UTC - in response to Message 53507.  

BTW the way I watch all of my Theory tasks with VB or Docker is leaving my Slots page open so I can check any as they are running to make sure they are actually running using
This PC > Windows (C:) > ProgramData > BOINC > slots > 1 > shared

That is helpful!

runRivet.log show atm for one running Theory-Docker-Task:
92000 events processed after 4 days and 11 hour.
===> [runRivet] Fri Apr 24 16:17:49 UTC 2026 [boinc pp jets 13000 180 - pythia8 8.244 CP1-CR1 100000 884]
This or next day Task will find the exit.
===> [runRivet] Mon Apr 27 08:22:46 UTC 2026 [boinc pp z1j 13000 - - sherpa 2.1.1 default 17000 878]
===> [runRivet] Mon Apr 27 09:42:24 UTC 2026 [boinc pp zinclusive 13000 - - sherpa 2.2.0 default 4000 878]
Those two running now 5 days!

Very helpful. The LHC Theory Docker tasks that ran very long on my old PC were also pythia8 and sherpa.

I have a general question. I use 2 Win 11 PCs with Docker, one almost 10 (!) years old and the other about two years old. The long multi-day tasks seem to occur mostly on the older one so far. Is the LHC Theory app optimized for certain instruction sets? If so, I should run most Theory tasks on the newer PC and fewer on the old one.
ID: 53510 · Report as offensive     Reply Quote
Crystal Pellet
Volunteer moderator
Volunteer tester

Send message
Joined: 14 Jan 10
Posts: 1556
Credit: 10,101,515
RAC: 1,464
Message 53511 - Posted: 1 May 2026, 15:59:13 UTC - in response to Message 53510.  

In reply to Michael E.'s message of 1 May 2026:
I have a general question. I use 2 Win 11 PCs with Docker, one almost 10 (!) years old and the other about two years old. The long multi-day tasks seem to occur mostly on the older one so far. Is the LHC Theory app optimized for certain instruction sets? If so, I should run most Theory tasks on the newer PC and fewer on the old one.
No, the Theory tasks you get is totally random.
ID: 53511 · Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Theory Application : Task at 100% and still running


©2026 CERN