Message boards : ATLAS application : ATLAS native app
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

AuthorMessage
David Cameron
Project administrator
Project developer
Project scientist

Send message
Joined: 13 May 14
Posts: 387
Credit: 15,314,184
RAC: 0
Message 32561 - Posted: 29 Sep 2017, 11:26:49 UTC - in response to Message 32534.  

How can I avoid getting native tasks? I have SuSE Linux.


You can deselect "Run test applications" in your preferences to avoid getting native tasks.
ID: 32561 · Report as offensive     Reply Quote
tullio

Send message
Joined: 19 Feb 08
Posts: 708
Credit: 4,336,250
RAC: 0
Message 32573 - Posted: 30 Sep 2017, 16:39:40 UTC - in response to Message 32561.  

Yes, but I would lose all sixtracktest. Atlas tasks and SixTrack are the only tasks not failing on my 3 computers, a Windows 10 PC with 22 GB RAM and two Linux boxen with SuSE Linux. I check all Atlas task results for the HITS file, Some tasks on the Windows PC are completed and validated but produce no HITS file On the Linux computers, since they have only 8 GB RAM I run single core tasks, .2 on the Windows 10 PC.
Tullio
ID: 32573 · Report as offensive     Reply Quote
gyllic

Send message
Joined: 9 Dec 14
Posts: 202
Credit: 2,533,875
RAC: 0
Message 32622 - Posted: 4 Oct 2017, 7:06:45 UTC

ID: 32622 · Report as offensive     Reply Quote
Juha

Send message
Joined: 22 Mar 17
Posts: 30
Credit: 360,676
RAC: 0
Message 32630 - Posted: 4 Oct 2017, 14:59:42 UTC - in response to Message 32622.  

The startup script runs these commands to check CVMFS:

cvmfs_config probe
cvmfs_config stat atlas.cern.ch
cvmfs_config stat atlas-condb.cern.ch


The first command is not checked if it succeeds or fails. If the second or third command fails for any reason you get the "CVMFS not found" error. I don't know why exactly the connection to atlas.cern.ch would fail but the connection to atlas-condb.cern.ch succeeds.

There could be some clues in syslog. I get messages like these for all CVMFS repositories the app accesses:

Oct  3 17:08:23 mint cvmfs2: (atlas.cern.ch) failed to resolve IP addresses for ca-proxy.cern.ch (4 - unknown host name)
Oct  3 17:08:23 mint cvmfs2: (atlas.cern.ch) geographic order of servers retrieved from cernvmfs.gridpp.rl.ac.uk
Oct  3 17:08:32 mint cvmfs2: (atlas.cern.ch) CernVM-FS: linking /cvmfs/atlas.cern.ch to repository atlas.cern.ch



For completeness, the other checks the script runs are:

singularity --version
singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/images/singularity/x86_64-slc6.img hostname


These are run on non SL6 hosts. If they fail the error message is "Singularity is not installed" for the first one and "Singularity isnt working..." for the second.
ID: 32630 · Report as offensive     Reply Quote
gyllic

Send message
Joined: 9 Dec 14
Posts: 202
Credit: 2,533,875
RAC: 0
Message 32638 - Posted: 5 Oct 2017, 13:22:15 UTC - in response to Message 32630.  

thanks for the information and tips.
unfortunatly i havent seen any hints in the syslogs for why its failing sometimes. But i will look deeper into them once i have more time.
ID: 32638 · Report as offensive     Reply Quote
gyllic

Send message
Joined: 9 Dec 14
Posts: 202
Credit: 2,533,875
RAC: 0
Message 32934 - Posted: 30 Oct 2017, 11:54:47 UTC

is anyone running successfully native tasks on ubuntu?
according to this post it should work:
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4396&postid=32273

but this post tells a different story:
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4396&postid=32335

since these posts are rather old, and a new version has been release (2.52), does anyone have experience with running native tasks on ubuntu or other linux distros except then CentOS and SL?
ID: 32934 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 32935 - Posted: 30 Oct 2017, 14:58:10 UTC - in response to Message 32934.  

is anyone running successfully native tasks on ubuntu?

They don't send any to my Ubuntu machine, so I can't test them. It appears that they simply are not ready for Ubuntu yet, but they have not made that clear.
ID: 32935 · Report as offensive     Reply Quote
Juha

Send message
Joined: 22 Mar 17
Posts: 30
Credit: 360,676
RAC: 0
Message 32937 - Posted: 30 Oct 2017, 17:53:10 UTC - in response to Message 32934.  

Back when I was running the native app on Mint I had changed BOINC to report kernel version as 3.13.0-123-generic.not.really.sl6. I changed the kernel version back to what it really is two or three weeks ago and haven't seen any native tasks since then.
ID: 32937 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 33015 - Posted: 7 Nov 2017, 18:52:16 UTC - in response to Message 32937.  

Back when I was running the native app on Mint I had changed BOINC to report kernel version as 3.13.0-123-generic.not.really.sl6. I changed the kernel version back to what it really is two or three weeks ago and haven't seen any native tasks since then.

If it worked OK on Mint, they should be able to get it to work on Ubuntu. Their silence on the subject is deafening. Have they given up on it?
ID: 33015 · Report as offensive     Reply Quote
David Cameron
Project administrator
Project developer
Project scientist

Send message
Joined: 13 May 14
Posts: 387
Credit: 15,314,184
RAC: 0
Message 33022 - Posted: 8 Nov 2017, 20:53:16 UTC - in response to Message 33015.  

Back when I was running the native app on Mint I had changed BOINC to report kernel version as 3.13.0-123-generic.not.really.sl6. I changed the kernel version back to what it really is two or three weeks ago and haven't seen any native tasks since then.

If it worked OK on Mint, they should be able to get it to work on Ubuntu. Their silence on the subject is deafening. Have they given up on it?


Definitely not! The silence is more due to the fact that it is working extremely well on the currently supported Linux flavours. One issue we still have is that suspending a WU doesn't seem to work, the WU keeps on running even if BOINC thinks it is suspended. On the server-like hosts with RHEL 6 or 7 we currently run on this isn't really a problem since the tasks run continuously in the background, but people with Ubuntu on their laptops for example probably need the suspend to work. It seems like there is a big appetite among Ubuntu users to try it so we'll put some effort into fixing the issue.
ID: 33022 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 33024 - Posted: 8 Nov 2017, 22:10:33 UTC - in response to Message 33022.  

One issue we still have is that suspending a WU doesn't seem to work, the WU keeps on running even if BOINC thinks it is suspended. On the server-like hosts with RHEL 6 or 7 we currently run on this isn't really a problem since the tasks run continuously in the background, but people with Ubuntu on their laptops for example probably need the suspend to work. It seems like there is a big appetite among Ubuntu users to try it so we'll put some effort into fixing the issue.

OK, thanks for the update. I run my Ubuntu machines 24/7 and they are dedicated to BOINC work, so it should not be an issue for me. I would like the increased efficiency and whatever other benefits (smaller memory?) accrue when you get it working.
ID: 33024 · Report as offensive     Reply Quote
David Cameron
Project administrator
Project developer
Project scientist

Send message
Joined: 13 May 14
Posts: 387
Credit: 15,314,184
RAC: 0
Message 33036 - Posted: 9 Nov 2017, 19:47:35 UTC - in response to Message 33024.  

One of the other main benefits is that the CVMFS cache is shared among all the tasks, so you don't have to download as much data for each WU.

Out of interest I checked results from the last week and I was quite surprised that almost half come from native WU!

App version Total results
1.01 x86_64-apple-darwin [vbox64_mt_mcore_atlas] 1591
2.52 x86_64-pc-linux-gnu [native_mt] 54512
1.01 x86_64-pc-linux-gnu [vbox64_mt_mcore_atlas] 14063
1.01 windows_x86_64 [vbox64_mt_mcore_atlas] 56143
ID: 33036 · Report as offensive     Reply Quote
maeax

Send message
Joined: 2 May 07
Posts: 2071
Credit: 156,179,378
RAC: 105,536
Message 33056 - Posted: 13 Nov 2017, 9:26:37 UTC - in response to Message 33036.  


App version Total results
1.01 x86_64-apple-darwin [vbox64_mt_mcore_atlas] 1591
2.52 x86_64-pc-linux-gnu [native_mt] 54512
1.01 x86_64-pc-linux-gnu [vbox64_mt_mcore_atlas] 14063
1.01 windows_x86_64 [vbox64_mt_mcore_atlas] 56143


At the moment we have 120k used from 200k for the last tasks for Atlas!
It smelt down very quickly ;-)
ID: 33056 · Report as offensive     Reply Quote
Gougou

Send message
Joined: 15 Jun 08
Posts: 4
Credit: 2,270,521
RAC: 0
Message 33428 - Posted: 17 Dec 2017, 21:42:11 UTC

Bonsoir !

I received my first ATLAS native app WUs yesterday but it failed with "CVMFS not found".
So, i installed CVMFS from CERN https://cernvm.cern.ch/portal/filesystem/downloads but it failed again.
Example : https://lhcathome.cern.ch/lhcathome/result.php?resultid=169250409

After that, i read this thread (should have start by that !) and saw there's an install script but it's for RH, i'm on Debian.
I modified it and now :
$ cvmfs_config probe
Probing /cvmfs/atlas.cern.ch... OK
Probing /cvmfs/atlas-condb.cern.ch... OK
Probing /cvmfs/grid.cern.ch... OK

Seems good !!! :)
I can share the script if you want/need it.

But now i've no more ATLAS native WU to test... Is there a way to test everything is good now before i receive a new one ? Is it possible to run the ATLAS binaries outsite BOINC client ?

Thanks !
ID: 33428 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 33429 - Posted: 17 Dec 2017, 21:50:41 UTC - in response to Message 33428.  

I received my first ATLAS native app WUs yesterday but it failed with "CVMFS not found".

In addition to CVMFS, you need to install Singularity. This works for me on Ubuntu 16.04, with the latest version being Singularity 2.4.2:

First, check the latest version on GitHub: https://github.com/singularityware/singularity/releases
And substitute that for "$VERSION"
So $VERSION=2.4.2 (as of 6 December 2017)

wget https://github.com/singularityware/singularity/releases/download/$VERSION/singularity-$VERSION.tar.gz
=> wget https://github.com/singularityware/singularity/releases/download/2.4.2/singularity-2.4.2.tar.gz

tar xvf singularity-$VERSION.tar.gz
=> tar xvf singularity-2.4.2.tar.gz

cd singularity-$VERSION
=> cd singularity-2.4.2

./configure --prefix=/usr/local
make
sudo make install

To check the version installed: singularity --version
To check the usage: singularity --help


Then, I would reboot and try again.
ID: 33429 · Report as offensive     Reply Quote
Gougou

Send message
Joined: 15 Jun 08
Posts: 4
Credit: 2,270,521
RAC: 0
Message 33431 - Posted: 17 Dec 2017, 22:11:08 UTC - in response to Message 33429.  

Thanks Jim.
Yes, i installed Singularity too but from Debian repositories. The actual version is 2.3.2 on testing, is that version ok or do i need to backport from unstable (v2.4.1) ?

To test it, do i need to wait a new WU or is there a way to execute the ATLAS binaries as a "standalone" app ?
ID: 33431 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Nov 14
Posts: 602
Credit: 24,371,321
RAC: 0
Message 33432 - Posted: 17 Dec 2017, 22:48:31 UTC - in response to Message 33431.  

The actual version is 2.3.2 on testing, is that version ok or do i need to backport from unstable (v2.4.1) ?

To test it, do i need to wait a new WU or is there a way to execute the ATLAS binaries as a "standalone" app ?

I expect 2.3.2 is OK; there have been several upgrades recently that probably don't affect our use. But I think 2.4.2 is stable if you want to try it; I am having no problems with it. There should not be much difference between Debian and Ubuntu, though I am no expert on the subject. Probably the main issue is the Linux kernel; I am on 4.10.0.42.

I think if both CVMFS and Singularity check out OK on their tests, that should be enough. I don't know how to test it further on ATLAS except to download it.
ID: 33432 · Report as offensive     Reply Quote
Gougou

Send message
Joined: 15 Jun 08
Posts: 4
Credit: 2,270,521
RAC: 0
Message 33444 - Posted: 19 Dec 2017, 21:55:52 UTC - in response to Message 33432.  

Hello

New WU, new failure :/ https://lhcathome.cern.ch/lhcathome/result.php?resultid=169659259
But i made some progress ! CVMFS is ok now but Singularity terminated with an error :
Checking for CVMFS
CVMFS is installed
OS:cat: /etc/redhat-release: No such file or directory

This is not SLC6, need to run with Singularity....
Checking Singularity...
Singularity is installed
copy /var/lib/boinc-client/slots/3/shared/RTE.tar.gz
copy /var/lib/boinc-client/slots/3/shared/start_atlas.sh
copy /var/lib/boinc-client/slots/3/shared/input.tar.gz
copy /var/lib/boinc-client/slots/3/shared/ATLAS.root_0
export ATHENA_PROC_NUMBER=7;start atlas job with PandaID=3748950343
Testing the function of Singularity...

Singularity isnt working...


The script "ATLAS_run_atlas_2.52_x86_64-pc-linux-gnu" seems too failed with that command : "singularity exec -B /cvmfs /cvmfs/atlas.cern.ch/repo/images/singularity/x86_64-slc6.img hostname".
When i execute it on a terminal, i get that result :
WARNING: Not mounting requested bind point (already mounted in container): /cvmfs
Segmentation fault


Any idea ?
ID: 33444 · Report as offensive     Reply Quote
Gougou

Send message
Joined: 15 Jun 08
Posts: 4
Credit: 2,270,521
RAC: 0
Message 33445 - Posted: 19 Dec 2017, 22:52:26 UTC - in response to Message 33444.  

Auto-reply... Segfault could be solved by passing the variable "vsyscall=emulate" to the kernel. http://singularity.lbl.gov/faq#segfault-on-bootstrap-of-centos-image

Waiting for next WU to test...
ID: 33445 · Report as offensive     Reply Quote
AuxRx

Send message
Joined: 16 Sep 17
Posts: 100
Credit: 1,618,469
RAC: 0
Message 33448 - Posted: 20 Dec 2017, 8:33:14 UTC - in response to Message 33445.  

I always wonder why everyone is having such a hard time setting up their systems for the native app. I used Boinc to get accustomed to Linux (Ubuntu in my case) and it worked like a charm (once I figured out why the scripts wouldn't run).

I used singularity-container from NeuroDebian repositories, if that helps.
ID: 33448 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

Message boards : ATLAS application : ATLAS native app


©2024 CERN