Message boards : Theory Application : Long running task, how to proceed?
wolfbert

Joined: 29 May 23
Posts: 2
Credit: 146,244
RAC: 2,660
Message 49964 - Posted: 18 Apr 2024, 7:36:55 UTC

I'm currently running a task that is on its third attempt.

Name Theory_2687-2600407-1156
created 18 Mar 2024, 23:43:41 UTC

Task | Computer | Sent | Time reported or deadline | Status | Run time (sec) | CPU time (sec) | Credit | Application
407982525 | 10687016 | 19 Mar 2024, 2:54:05 UTC | 1 Apr 2024, 19:55:02 UTC | Error while computing | 829,530.39 | 812,599.90 | --- | Theory Simulation v300.10 (vbox64_theory)
408420018 | 10837826 | 31 Mar 2024, 2:20:51 UTC | 11 Apr 2024, 0:48:43 UTC | Error while computing | 870,322.35 | 859,686.90 | --- | Theory Simulation v300.10 (vbox64_theory)
408756665 | 10830856 | 11 Apr 2024, 14:35:42 UTC | 22 Apr 2024, 14:35:42 UTC | In progress | --- | --- | --- | Theory Simulation v300.10 (vbox64_theory)

The first two errors may have been caused by failed restarts, but I'm no expert in log analysis.
Currently, the task has been running for over 4 CPU days and is at 43%. I'll probably miss the deadline. I've had a look at the VM status as described in a related thread, but can't interpret the data. There doesn't seem to be progress.

There are two PYTHIA warnings, and the last lines of the output are:
700 events processed
dumping histograms...
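
To check progress without reading the whole log by eye, the event counter can be pulled out with a short script. This is only a rough sketch: it assumes the log reports progress in lines of the form "N events processed", as in the two lines quoted above, and the function name is made up for illustration.

```python
import re

def last_event_count(log_text):
    """Return the last 'N events processed' count in the log text, or None."""
    # Assumption: progress lines look like "700 events processed".
    counts = [int(m.group(1))
              for m in re.finditer(r"(\d+) events processed", log_text)]
    return counts[-1] if counts else None

# The two lines quoted from the VM output in this post.
sample = "700 events processed\ndumping histograms...\n"
print(last_event_count(sample))
```

Running it twice a few hours apart and comparing the two numbers would show whether the job is still advancing.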

Is it worth continuing, or what else could I check? Any help appreciated.
Crystal Pellet
Volunteer moderator
Volunteer tester

Joined: 14 Jan 10
Posts: 1302
Credit: 8,662,418
RAC: 7,341
Message 49966 - Posted: 18 Apr 2024, 9:47:18 UTC - in response to Message 49964.  

This is the info of the job you're running: pp mb-inelastic 7000 - - pythia8 8.302 vincia-default 100000 1156
That means that 100000 events should be processed and you have done only 700 so far in 4 days.
Unless you have restarted the VM (several times), it's time to abort the task.
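
For a rough idea of how far off that is, the numbers above can be extrapolated. This is a back-of-the-envelope sketch, not a tool from the project: it assumes the event total is the second-to-last field of the job line (it is 100000 here) and that the event rate stays constant, which real jobs need not do. The function names are hypothetical.

```python
def total_events(job_line):
    """Read the event total from a job description line.

    Assumption: the total is the second-to-last whitespace-separated field.
    """
    return int(job_line.split()[-2])

def projected_total_days(events_done, events_total, elapsed_days):
    """Extrapolate the total run time, assuming a constant event rate."""
    return elapsed_days * events_total / events_done

job = "pp mb-inelastic 7000 - - pythia8 8.302 vincia-default 100000 1156"
total = total_events(job)
days = projected_total_days(events_done=700, events_total=total, elapsed_days=4)
print(f"{total} events at the current rate: about {days:.0f} days")
```

With 700 events after 4 days this comes out at well over a year, far beyond the 22 Apr 2024 deadline.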
wolfbert

Joined: 29 May 23
Posts: 2
Credit: 146,244
RAC: 2,660
Message 49967 - Posted: 18 Apr 2024, 12:58:01 UTC - in response to Message 49966.  

Thanks for looking into that. In the meantime I've compared the VM output of this task with that of other tasks I have running (they log 30,000-50,000 events after a few hours), so, sadly, I have aborted this task.
©2024 CERN