21) Message boards : ATLAS application : Very long tasks in the queue (Message 29710)
Posted 29 Mar 2017 by BRG
Post:
Has anyone encountered, or had experience with, any of the WUs with TaskID=11016767?
<snip>
I seem to have acquired four of them and the BOINC estimated completion time has hit the roof at 13hr28m.

So I bumped one of them to the front of the queue ... and it bombed after only 10mins run time with a crazy looking output log https://lhcathome.cern.ch/lhcathome/result.php?resultid=129069334.

Either someone was having fun when they coded it or else my machine has a nasty stutter and has severely mangled the content of that file! Anyway, back to the "regular" ones for now and I'll get around to the remaining long ones some time tomorrow.

I have a failed task too, with a similarly weird-looking log file, but with a different task ID:

https://lhcathome.cern.ch/lhcathome/result.php?resultid=129219392


My ws is also listed under your WU with an issue (which is what brought me to this thread). I have had a lot of tasks come back with a validate error over the last 24 hrs, so many that I have since stopped running tasks on LHC.

Any task I have downloaded that is more than 4-ish hrs of work seems to run for anything up to an hour, but nowhere near the 1.xx days it is due to run. Those that have run all come back with a validate error.
22) Message boards : ATLAS application : Console monitoring (Message 29607)
Posted 25 Mar 2017 by BRG
Post:
I found 4 is the most that's worthwhile with the old Project site.

Now I just run 1 core as I put more memory in my computers, however I still have to cap at 15 simultaneous when mixed with other task/projects

With the "old" ATLAS Project, on my bigger machine (6+6HT cores, 32GB RAM) I ran 3 tasks with 3 cores each (besides 2 GPUGRID tasks).
Now, after the change, I tried 4 tasks 2 cores ea., everything going fine, too. Using total 10 cores instead of 11 (out of 12 available) seems to make the system run a little faster and smoother.
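
For anyone wanting to check the core counts quoted above, here is a minimal sketch of the arithmetic. It assumes each GPUGRID task reserves one CPU core, which is not stated in the post but is the only value that reproduces the 11 and 10 totals. The function name `cores_in_use` is made up for illustration.

```python
def cores_in_use(cpu_tasks, cores_per_task, gpu_tasks, cores_per_gpu_task=1):
    """Total CPU cores occupied by multi-core CPU tasks plus GPU tasks.

    cores_per_gpu_task=1 is an assumption, not something BOINC guarantees.
    """
    return cpu_tasks * cores_per_task + gpu_tasks * cores_per_gpu_task


# Old setup from the post: 3 ATLAS tasks x 3 cores, plus 2 GPUGRID tasks
old_setup = cores_in_use(cpu_tasks=3, cores_per_task=3, gpu_tasks=2)

# New setup from the post: 4 ATLAS tasks x 2 cores, plus 2 GPUGRID tasks
new_setup = cores_in_use(cpu_tasks=4, cores_per_task=2, gpu_tasks=2)

print(old_setup, new_setup)
```

Under that assumption the new setup runs one more ATLAS task while leaving two of the 12 logical cores free instead of one.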


Interesting. I have been running with 4 since then and it's been faultless; however, if I can get away with 2 cores per WU and double production, that would be even better :)

My worry was that I don't have great core speed. I will try 2 cores overnight and see what happens tomorrow.
23) Message boards : ATLAS application : Console monitoring (Message 29599)
Posted 24 Mar 2017 by BRG
Post:
I found 4 is the most that's worthwhile with the old Project site.

Now I just run 1 core as I put more memory in my computers, however I still have to cap at 15 simultaneous when mixed with other task/projects


I don't know if I am getting new or old tasks, to be honest, Toby, but I will set tasks to 4 cores and see how that goes.

I used to run ATLAS tasks too, but since the merge I've no idea! I don't even know if my points came over!
24) Message boards : ATLAS application : Console monitoring (Message 29581)
Posted 24 Mar 2017 by BRG
Post:
Is there a way to do this in Windows (W10)? I've got an ATLAS WU running currently and would love to know if it's died after all this time!

It should.

What is your problem?


The WU died after 13 hrs :( It was running on 22 cores but was only really using 6 or 7.

I was reading other posts on the forums, and they mentioned changing max CPUs to a lower number, so I selected 8 (right or wrong?). I've got 22 LHCb tasks now, so I will sit and watch these for 16 hrs :)
25) Message boards : ATLAS application : Console monitoring (Message 29578)
Posted 24 Mar 2017 by BRG
Post:
Is there a way to do this in Windows (W10)? I've got an ATLAS WU running currently and would love to know if it's died after all this time!
26) Message boards : LHCb Application : Memory usage of LHCb tasks (Message 28717)
Posted 29 Jan 2017 by BRG
Post:
I too have the same issue: 24 cores but 32 GB of RAM. I have to set, say, 8 LHCb tasks to run and then use the remainder of my cores to run SixTrack...