Message boards :
ATLAS application :
Console monitoring
Author | Message |
---|---|
Send message Joined: 9 Dec 14 Posts: 202 Credit: 2,533,875 RAC: 0 |
The console does not always seem to work. Try clicking into the console and pressing Enter once; you may then see some output. I'm not sure whether this works, though. |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 198,139,853 RAC: 86,453 |
Erich65 wrote: "What does this mean? Is the task broken? Is the console broken? Is the VM broken?" Go to my Checklist V3 and check number 16, scenario E. Supporting BOINC, a great concept! |
Send message Joined: 18 Dec 15 Posts: 1742 Credit: 114,935,562 RAC: 93,825 |
Thanks, Yeti, for your advice. However, when I came back home later on, I saw that the task had finished properly. So maybe something was wrong only with the console GUI, or whatever. |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 198,139,853 RAC: 86,453 |
|
Send message Joined: 13 May 14 Posts: 387 Credit: 15,314,184 RAC: 0 |
The output is added to the previous output so that's why you see repetition (notice that the timestamps are the same). I will try to flush the screen each time before printing the output. |
Send message Joined: 15 Jun 08 Posts: 2491 Credit: 247,732,328 RAC: 123,643 |
@David Cameron The console output works and is much better than having nothing. Can you add the total number of WU events? Perhaps like: ... Event nr. 5/100 took ... |
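The two ideas above (clearing the screen before each redraw so that old lines do not pile up, and showing the event count against a total) could be sketched roughly as follows. This is only an illustration in Python, not the script the project actually runs: `render_progress`, the `total_events` parameter, and the sample lines are hypothetical. The only detail taken from the thread is that event lines contain the text "Event nr".

```python
# ANSI escape sequence: clear the whole screen, then move the cursor to the top-left.
CLEAR_SCREEN = "\033[2J\033[H"


def render_progress(log_lines, total_events):
    """Build one full screen of output: clear first, then a counter, then the event lines."""
    # Keep only the lines that report a processed event.
    event_lines = [line for line in log_lines if "Event nr" in line]
    header = f"Events processed: {len(event_lines)}/{total_events}"
    return CLEAR_SCREEN + "\n".join([header, *sorted(event_lines)]) + "\n"


if __name__ == "__main__":
    # Hypothetical sample lines; the real AthenaMP.log format is not shown in this thread.
    sample = ["setup done", "Event nr. 2 took 31 s", "Event nr. 1 took 30 s"]
    print(render_progress(sample, total_events=100), end="")
```

Because every redraw starts with the clear sequence, the console shows only the latest snapshot instead of the same timestamps repeated.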
Send message Joined: 20 Mar 15 Posts: 7 Credit: 1,137,208,630 RAC: 267,770 |
It seems that the new information output also produces a lot of mail. Every time an event is processed you run a grep on the events, but the location is wrong, so the postmaster sends a mail every time. Subject: Cron <root@localhost> grep -h "Event nr" /home/atlas01/RunAtlas/Panda_Pilot_*/PandaJob_*/athenaMP-workers-EVNTtoHITS-sim/worker_*/AthenaMP.log|sort > /dev/tty2 grep: /home/atlas01/RunAtlas/Panda_Pilot_*/PandaJob_*/athenaMP-workers-EVNTtoHITS-sim/worker_*/AthenaMP.log: No such file or directory It would be great if that could be fixed! |
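The error text suggests that the wildcard pattern matched nothing, so the shell handed it to grep literally; grep then complained on stderr, and cron typically mails any output a job produces. One way to avoid this is to expand the pattern first and quietly do nothing when no log exists yet. Below is a minimal Python sketch of that idea. `collect_event_lines` is a hypothetical helper, and this is not the fix that was actually deployed.

```python
import glob
import os
import tempfile


def collect_event_lines(pattern, needle="Event nr"):
    """Return sorted lines containing `needle` from all files matching `pattern`.

    glob.glob returns an empty list when nothing matches, so a task that has
    not created its logs yet yields an empty result instead of an error.
    """
    lines = []
    for path in sorted(glob.glob(pattern)):
        try:
            with open(path, errors="replace") as handle:
                lines.extend(line.rstrip("\n") for line in handle if needle in line)
        except OSError:
            # The file vanished or is unreadable; skip it quietly.
            continue
    return sorted(lines)


if __name__ == "__main__":
    # Demonstration in a throwaway directory (the layout here is made up).
    with tempfile.TemporaryDirectory() as base:
        pattern = os.path.join(base, "worker_*", "AthenaMP.log")
        print(collect_event_lines(pattern))  # no logs yet, so an empty list
```

The same effect can be had in a shell script by testing for the files before calling grep; the point is only that a missing log should produce no output at all.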
Send message Joined: 13 May 14 Posts: 387 Credit: 15,314,184 RAC: 0 |
This is fixed now (the fix will be propagated to new tasks in a few hours). The errors should only happen at the start of the task, before the log is started. Did you see them during the whole task? |
Send message Joined: 20 Mar 15 Posts: 7 Credit: 1,137,208,630 RAC: 267,770 |
Thank you, David! Unfortunately I saw it during the whole task, on each PC, so there were a huge number of e-mails. I will check whether it is fine now. |
Send message Joined: 28 Sep 04 Posts: 707 Credit: 47,271,317 RAC: 28,705 |
The TOP (Alt+F3) does not work anymore. It stopped working when I changed to running with a single core. |
Send message Joined: 2 Sep 04 Posts: 455 Credit: 198,139,853 RAC: 86,453 |
|
Send message Joined: 28 Sep 04 Posts: 707 Credit: 47,271,317 RAC: 28,705 |
Yes, it was working about a week ago when it was announced, but it is not at the moment. With Alt+F3 I now get the same screen as with Alt+F1. |
Send message Joined: 13 May 14 Posts: 387 Credit: 15,314,184 RAC: 0 |
Well it was kind of half-working a week ago, but my latest attempts to make it work fully stopped it working completely. The problem seems to be running a persistent command with sudo (root permission is needed to write to the console) inside a script run as a normal user. It works for other LHC projects because as I understand they run bootstrap scripts as root. I will keep trying to find a way to make it work. |
Send message Joined: 28 Sep 17 Posts: 4 Credit: 451,660 RAC: 0 |
I'm trying to figure out why my LHC tasks on Linux seem to run so slowly. ATLAS simulation 1.01 (vbox64_mt_mcore_atlas) has been running for almost 13 hours and is currently at 99.994%, but it is progressing very slowly at this point. Alt+F2 does nothing in the VM console; only Alt+F1 and Alt+F6 switch to other information. I'm using Ubuntu with 16 GB of RAM and an i5-6500 CPU. My CPU usage is currently set to 99%; 3 cores are used by LHC and 1 core is shared by 3 Einstein WUs. LHC's priority is also half of normal. The task manager shows 5 GB of 15.6 GB in use, with 1 core running at 100% while the other three usually sit at around 10-25%. |
Send message Joined: 28 Sep 17 Posts: 4 Credit: 451,660 RAC: 0 |
It seems I can't edit my own previous post. Anyway, I cancelled my previous task and started a new one. Now Alt+F2 and Alt+F3 work in the VM console, and it seems that the cores are only around 0.3% busy. This new task has been running for 14 hours and its CPU time is only 33 minutes. It looks like this happens to every ATLAS WU I get. I will try to look around the forum for a solution, if there is one. EDIT: I have already added an app_config.xml for the task; currently I tell it to use 3 cores and the RAM limit is set to 8 GB. Any other suggestions for the XML? app_config.xml: <?xml version="1.0"?> <app_config> <project_max_concurrent>3</project_max_concurrent> <app> <name>ATLAS</name> <max_concurrent>1</max_concurrent> </app> <app_version> <app_name>ATLAS</app_name> <avg_ncpus>3.000000</avg_ncpus> <plan_class>vbox64_mt_mcore_atlas</plan_class> <cmdline>--memory_size_mb 8000</cmdline> </app_version> </app_config> |
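Before restarting the client it can help to confirm that such a file is well-formed and says what was intended. Here is a small Python sketch using only the standard library. The XML is the one posted above; `summarize_app_config` is a hypothetical helper and not part of BOINC.

```python
import xml.etree.ElementTree as ET

APP_CONFIG = """<?xml version="1.0"?>
<app_config>
  <project_max_concurrent>3</project_max_concurrent>
  <app>
    <name>ATLAS</name>
    <max_concurrent>1</max_concurrent>
  </app>
  <app_version>
    <app_name>ATLAS</app_name>
    <avg_ncpus>3.000000</avg_ncpus>
    <plan_class>vbox64_mt_mcore_atlas</plan_class>
    <cmdline>--memory_size_mb 8000</cmdline>
  </app_version>
</app_config>"""


def summarize_app_config(xml_text):
    """Parse an app_config.xml string into (app_name, plan_class, ncpus, cmdline) tuples."""
    root = ET.fromstring(xml_text)  # raises ET.ParseError if the XML is malformed
    summary = []
    for version in root.findall("app_version"):
        summary.append((
            version.findtext("app_name"),
            version.findtext("plan_class"),
            float(version.findtext("avg_ncpus", default="0")),
            version.findtext("cmdline", default=""),
        ))
    return summary


if __name__ == "__main__":
    for entry in summarize_app_config(APP_CONFIG):
        print(entry)
```

A parse error here would point to a typo in the file rather than a problem with the task itself.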
Send message Joined: 2 Sep 04 Posts: 455 Credit: 198,139,853 RAC: 86,453 |
"I will try to look around the forum for a solution, if there is one." Take a walk through my checklist: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4161&postid=29359#29359 Supporting BOINC, a great concept! |
Send message Joined: 28 Sep 17 Posts: 4 Credit: 451,660 RAC: 0 |
Yeti, thanks! Resetting the LHC@home project seemed to do the trick: Alt+F3 showed not much activity in the first 10 minutes, but then 3 cores started crunching hard at 100%. :) Hopefully it will stay that way when I resume other projects as well; for now I will try to run a few ATLAS WUs in a row. A lot is going on in Alt+F2 too. |
Send message Joined: 17 Sep 04 Posts: 104 Credit: 32,671,485 RAC: 5,160 |
"We have added some information on the processed events in ATLAS tasks on consoles inside the VM." I can't get this to work. Is this still a valid procedure? Thanks. Regards, Bob P. |
Send message Joined: 15 Jun 08 Posts: 2491 Credit: 247,732,328 RAC: 123,643 |
The appearance of console 2 (press ALT + F2) has completely changed. It presents the data from the same log that you see in the screenshot but in a different manner. On console 3 (press ALT + F3) you will see the output of the "top" command of the running VM. Both commands run at low priority to leave the majority of CPU cycles for the scientific app. Hence, it may take a couple of minutes before the VM's basic setup has finished and the required components are available. |
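Running a monitoring command at low priority, as described above, can be done on Linux with the standard `nice` utility. The following is a hedged Python sketch: `run_at_low_priority` is a hypothetical helper, and the VM's actual startup scripts are not shown in this thread.

```python
import os
import subprocess
import sys


def run_at_low_priority(cmd):
    """Run `cmd` via `nice -n 19` (lowest scheduling priority) and return its stdout."""
    result = subprocess.run(
        ["nice", "-n", "19", *cmd],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # Ask a child Python process to report its own niceness and compare with ours.
    child = [sys.executable, "-c", "import os; print(os.nice(0))"]
    print("parent niceness:", os.nice(0))
    print("child niceness:", run_at_low_priority(child).strip())
```

A niced process only gets the CPU time the scientific app leaves over, which fits the observation that the consoles can take a few minutes to show anything.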
©2024 CERN