Message boards :
ATLAS application :
Console monitoring
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
Send message Joined: 9 Dec 14 Posts: 202 Credit: 2,533,390 RAC: 0 ![]() ![]() |
not always does the console seem to work: try to klick into the console, hit enter once and maybe you see some output. but not sure if this works. |
![]() ![]() Send message Joined: 2 Sep 04 Posts: 421 Credit: 105,348,871 RAC: 66,381 ![]() ![]() ![]() |
Erich65 wrote: What does this mean? Is the task broken? Is the console broken? Is the VM broken? Go to my Checklist V3 and check Number 16 Scenario E ![]() Supporting BOINC, a great concept ! |
Send message Joined: 18 Dec 15 Posts: 1346 Credit: 25,237,270 RAC: 23,999 ![]() ![]() |
Thanks, Yeti, for your advice. However, when I came back home lateron, I saw that the task had finished properly. So maybe something was wrong only with the console GUI, or whatever. |
![]() ![]() Send message Joined: 2 Sep 04 Posts: 421 Credit: 105,348,871 RAC: 66,381 ![]() ![]() ![]() |
|
Send message Joined: 13 May 14 Posts: 330 Credit: 11,395,681 RAC: 3,487 ![]() ![]() ![]() |
The output is added to the previous output so that's why you see repetition (notice that the timestamps are the same). I will try to flush the screen each time before printing the output. |
![]() Send message Joined: 15 Jun 08 Posts: 1668 Credit: 98,933,305 RAC: 86,472 ![]() ![]() ![]() |
@David Cameron The console output works and is much better than to have nothing. Can you add the total number of WU events? Perhaps like: ... Event nr. 5/100 took ... |
Send message Joined: 20 Mar 15 Posts: 7 Credit: 164,703,496 RAC: 312,809 ![]() ![]() ![]() |
It seems like that the new information output produces also a lot of mails. Every time when a event is processed you are doing some grep on the events, but the location is wrong, so the postmaster is sending everytime a mail. Subject: Cron <root@localhost> grep -h "Event nr" /home/atlas01/RunAtlas/Panda_Pilot_*/PandaJob_*/athenaMP-workers-EVNTtoHITS-sim/worker_*/AthenaMP.log|sort > /dev/tty2 grep: /home/atlas01/RunAtlas/Panda_Pilot_*/PandaJob_*/athenaMP-workers-EVNTtoHITS-sim/worker_*/AthenaMP.log: No such file or directory It would be great if that could be fixed! |
Send message Joined: 13 May 14 Posts: 330 Credit: 11,395,681 RAC: 3,487 ![]() ![]() ![]() |
This is fixed now (the fix will be propagated to new tasks in a few hours). The errors should only happen at the start of the task before the log is started, did you see it during the whole task? |
Send message Joined: 20 Mar 15 Posts: 7 Credit: 164,703,496 RAC: 312,809 ![]() ![]() ![]() |
Thank you David! Unfortunately i saw it during the whole task, on each PC, so there where a huge amount of E-Mails. I will check if it is fine now. |
![]() Send message Joined: 28 Sep 04 Posts: 505 Credit: 26,556,740 RAC: 13,412 ![]() ![]() ![]() |
The TOP (Alt+F3) does not work anymore. It stopped working when I changed to running with a single core. ![]() |
![]() ![]() Send message Joined: 2 Sep 04 Posts: 421 Credit: 105,348,871 RAC: 66,381 ![]() ![]() ![]() |
|
![]() Send message Joined: 28 Sep 04 Posts: 505 Credit: 26,556,740 RAC: 13,412 ![]() ![]() ![]() |
Yes, it was working about a week ago when it was announced but not at the moment. With Alt+F3 I now get the same screen as with Alt+F1 ![]() |
Send message Joined: 13 May 14 Posts: 330 Credit: 11,395,681 RAC: 3,487 ![]() ![]() ![]() |
Well it was kind of half-working a week ago, but my latest attempts to make it work fully stopped it working completely. The problem seems to be running a persistent command with sudo (root permission is needed to write to the console) inside a script run as a normal user. It works for other LHC projects because as I understand they run bootstrap scripts as root. I will keep trying to find a way to make it work. |
Send message Joined: 28 Sep 17 Posts: 4 Credit: 451,660 RAC: 0 ![]() ![]() |
So I'm trying to figure out why my LHC tasks in linux seem to run so slow. Atlas simulation 1.01 (vbox64_mt_mcore_atlas) has been running for almost 13 hours now and is currently at 99.994%. However it is progressing very slowly at this point. alt+f2 does nothing in the vm console, only alt+f1 and alt+f6 switch to some other information. I'm using Ubuntu, I have 16gb RAM and i5-6500 as cpu. My cpu usage is currently set at 99%, however 3 cores are used by LHC and 1 core is used by 3 Einstein WUs at the same time. Also LHC priority is half of normal. Task manager shows 5gb of 15.6gb used and 1 core is running at 100% while other three are usually slacking around 10-25%. |
Send message Joined: 28 Sep 17 Posts: 4 Credit: 451,660 RAC: 0 ![]() ![]() |
It seems I can't edit my own previous post? Anyway I cancelled my previous task and started a new one. Now alt+f2 and alt+f3 works in the VM console and it seems that the cores are being used for only around 0.3%. This new task has been going at it for 14 hours now and CPU time is only 33 minutes. Looks like it happens to every ATLAS WU I get. I will try to look around the forum for a solution, if there is one.. EDIT: I already added app_config.xml for the task, currently I tell it to use 3 cores and ram limit is set to 8 gb. Any other suggestions for the xml? app_config.xml: <?xml version="1.0"?> <app_config> <project_max_concurrent>3</project_max_concurrent> <app> <name>ATLAS</name> <max_concurrent>1</max_concurrent> </app> <app_version> <app_name>ATLAS</app_name> <avg_ncpus>3.000000</avg_ncpus> <plan_class>vbox64_mt_mcore_atlas</plan_class> <cmdline>--memory_size_mb 8000</cmdline> </app_version> </app_config> |
![]() ![]() Send message Joined: 2 Sep 04 Posts: 421 Credit: 105,348,871 RAC: 66,381 ![]() ![]() ![]() |
I will try to look around the forum for a solution, if there is one.. Take a walk through my checklist: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4161&postid=29359#29359 ![]() Supporting BOINC, a great concept ! |
Send message Joined: 28 Sep 17 Posts: 4 Credit: 451,660 RAC: 0 ![]() ![]() |
Yeti, thanks! Resetting lhc@home project seemed to do the trick, alt+f3 showed no much activity in first 10 mins but then 3 cores started crunching hard at 100%. :) Hopefully it will stay that way when I resume other projects as well, for now I will try to run a few ATLAS WUs in a row. A lot of stuff going on in alt+F2 too. |
©2021 CERN