Message boards : Sixtrack Application : AVX/AVX2 support
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 425
Credit: 118,206,264
RAC: 161,174
Message 28616 - Posted: 22 Jan 2017, 20:48:56 UTC

Hello Eric,

Is it possible to add AVX support to the app? It seems like it could offer a significant improvement in floating-point calculations.

Thanks
ID: 28616
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 843
Credit: 1,578,195
RAC: 665
Message 28628 - Posted: 23 Jan 2017, 8:21:05 UTC - in response to Message 28616.  

Thanks, noted, will do soonest, now that I can finally produce Windows executables again. Eric.
ID: 28628
kyrsjo
Project administrator
Project developer
Project tester
Project scientist

Joined: 23 Jan 17
Posts: 29
Credit: 315,185
RAC: 1,045
Message 30639 - Posted: 5 Jun 2017, 14:04:26 UTC

The test execs have an AVX version. From our initial testing the gain is rather small :( But let's see what happens when they are run on more machines...
ID: 30639
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 425
Credit: 118,206,264
RAC: 161,174
Message 30643 - Posted: 5 Jun 2017, 19:24:16 UTC

Sorry then, I assumed that this would be a good candidate; from these slides it seems like it would need to vectorize well:

https://twiki.cern.ch/twiki/bin/view/Main/VinInn

http://docplayer.net/22964956-Haswell-conundrum-avx-or-not-avx.html
ID: 30643
MAGIC Quantum Mechanic

Joined: 24 Oct 04
Posts: 630
Credit: 17,787,336
RAC: 17,755
Message 30645 - Posted: 6 Jun 2017, 4:02:16 UTC - in response to Message 30643.  

Sorry then, I assumed that this would be a good candidate, from these slides seems like it would need to vectorize well

https://twiki.cern.ch/twiki/bin/view/Main/VinInn

http://docplayer.net/22964956-Haswell-conundrum-avx-or-not-avx.html


Yeah, I was wondering about that myself since I have a few Sandy Bridge machines.

https://twiki.cern.ch/twiki/bin/view/LCG/VIAVXBenchMarks
ID: 30645
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 843
Credit: 1,578,195
RAC: 665
Message 30653 - Posted: 6 Jun 2017, 11:00:36 UTC

I think Kyrre was misunderstood here. SSE2/SSE3 (or pni) gives about a factor of
2 speedup and should always be used when available, which I understand is
not always the case. I'll check with him, as I think he means AVX doesn't give much more.
Overall, vectorisation/pipelining gives a factor of 2 speedup for SixTrack.
(The code was vectorised many years ago for the Cray and IBM VF.) Eric.
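
As a rough illustration of "use SSE2/SSE3 when available", the sketch below picks the most capable instruction set that a host reports. It assumes Linux-style flag names, as found on the `flags` line of /proc/cpuinfo, where SSE3 is listed as `pni`. The function name `best_variant`, the variant labels, and the preference order are hypothetical and are not taken from the SixTrack or BOINC code.

```python
# Hypothetical helper: choose a build variant from CPU feature flags.
# Flag names follow the Linux /proc/cpuinfo convention (SSE3 appears as "pni").
PREFERENCE = ["avx", "pni", "sse2"]  # most capable first; the order is an assumption


def best_variant(flags_line: str) -> str:
    """Return the most capable supported variant, or "generic" if none match."""
    # Split into whole tokens so that "avx2" is never mistaken for "avx".
    flags = set(flags_line.lower().split())
    for name in PREFERENCE:
        if name in flags:
            # Report SSE3 under its usual name rather than the kernel's "pni".
            return "sse3" if name == "pni" else name
    return "generic"


if __name__ == "__main__":
    print(best_variant("fpu sse sse2 pni ssse3 avx"))
    print(best_variant("fpu sse sse2"))
```

Passing the flags in as a string keeps the sketch independent of the operating system; on Linux the string could come from the `flags` line of /proc/cpuinfo.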
ID: 30653
maeax

Joined: 2 May 07
Posts: 405
Credit: 13,676,254
RAC: 20,641
Message 30657 - Posted: 6 Jun 2017, 11:57:10 UTC - in response to Message 30653.  
Last modified: 6 Jun 2017, 11:57:40 UTC

(The code was vectorised many years ago for the Cray and IBM VF.) Eric.

We are so sad not to have such a computer at home ;-).
Or is that no longer true, given the FLOPS of today's computers?
ID: 30657
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 843
Credit: 1,578,195
RAC: 665
Message 30660 - Posted: 6 Jun 2017, 12:23:46 UTC - in response to Message 30657.  

Well I think there is a CRAY in Lausanne Museum :-)
Today even Cray is using massive parallelisation, with
a multitude of "cheap" processors as we predicted many
years ago! Mark you I am sure they are doing it very
professionally in the great Cray tradition. Eric.
ID: 30660
MAGIC Quantum Mechanic

Joined: 24 Oct 04
Posts: 630
Credit: 17,787,336
RAC: 17,755
Message 30665 - Posted: 6 Jun 2017, 16:46:13 UTC

These SixTrack tasks remind me of 2004, with tasks that run about 60 seconds or less... When you watch them they look like a pni error, but when you check they are valid... I did have a few that actually ran over 7 hours, though.

Not much to see in the stderr output, but imagine running those multi-core.
Volunteer Mad Scientist For Life
ID: 30665
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 843
Credit: 1,578,195
RAC: 665
Message 30668 - Posted: 6 Jun 2017, 18:07:03 UTC - in response to Message 30665.  

Well they may or may not be valid. We are looking very hard at this.
It is just very complicated to rerun every "short" task and check whether it
is valid or not! But we are trying. Eric.
ID: 30668
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 843
Credit: 1,578,195
RAC: 665
Message 30669 - Posted: 6 Jun 2017, 18:11:41 UTC - in response to Message 30668.  

...and sorry, I forgot to say that the files you downloaded may no longer be
on the server, and we may not be able to access the user's files either. Hence I
am concentrating on tasks/WUs with name wzero_jtbb2cm1.......
If you ever spot a null (zero-length) fort.10 I'd be very interested. Eric.
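
For anyone who wants to look for such files, here is a minimal sketch that lists zero-length files named fort.10 under a directory. The function name `empty_fort10_files` is hypothetical, and which directory to scan is left to the reader; it is assumed to be wherever the task's output files are kept on the host.

```python
import os
from typing import List


def empty_fort10_files(root: str) -> List[str]:
    """Return the paths of all zero-length files named fort.10 under root."""
    hits = []
    for dirpath, _dirnames, filenames in os.walk(root):
        if "fort.10" in filenames:
            path = os.path.join(dirpath, "fort.10")
            # A size of 0 bytes is the "null" result described above.
            if os.path.getsize(path) == 0:
                hits.append(path)
    return sorted(hits)


if __name__ == "__main__":
    # Scan the current directory by default; adjust the path as needed.
    for path in empty_fort10_files("."):
        print(path)
```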
ID: 30669
maeax

Joined: 2 May 07
Posts: 405
Credit: 13,676,254
RAC: 20,641
Message 30681 - Posted: 7 Jun 2017, 6:15:34 UTC
Last modified: 7 Jun 2017, 6:16:14 UTC

The finished ones are waiting for the second task to finish. All ran for more than 40,000 seconds and are named wzero_jtbb2cm1...
https://lhcathome.cern.ch/lhcathome/results.php?userid=75468&offset=0&show_names=0&state=2&appid=1
ID: 30681
Erich56

Joined: 18 Dec 15
Posts: 754
Credit: 5,266,511
RAC: 8,171
Message 30682 - Posted: 7 Jun 2017, 6:34:38 UTC - in response to Message 30681.  

ID: 30682
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 843
Credit: 1,578,195
RAC: 665
Message 30776 - Posted: 14 Jun 2017, 8:48:56 UTC

Well this is actually about the "null" empty results.
I have identified them and there are several different
causes, platform independent. This will be fixed in our
upcoming release. Eric.
ID: 30776
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 425
Credit: 118,206,264
RAC: 161,174
Message 30781 - Posted: 14 Jun 2017, 16:59:10 UTC

Great news
ID: 30781
Toby Broom
Volunteer moderator

Joined: 27 Sep 08
Posts: 425
Credit: 118,206,264
RAC: 161,174
Message 31523 - Posted: 20 Jul 2017, 19:57:30 UTC

Since the new Skylake chips have AVX-512, is this something that the app would benefit from?
ID: 31523
James Molson
Project administrator
Project developer
Project tester
Project scientist

Joined: 25 Jan 17
Posts: 21
Credit: 1,151,096
RAC: 87
Message 31526 - Posted: 20 Jul 2017, 20:41:38 UTC - in response to Message 31523.  

yes
ID: 31526
Eric Mcintosh
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 12 Jul 11
Posts: 843
Credit: 1,578,195
RAC: 665
Message 31618 - Posted: 25 Jul 2017, 4:28:48 UTC - in response to Message 31523.  

Yes, but we can't use FMA because of different rounding and numeric portability. Eric.

(Since the new Skylake chips have AVX-512, is this something that the app would benefit from?)
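
To make the rounding point concrete, here is a small sketch of why a fused multiply-add can give a different answer from a separate multiply and add. It does not use hardware FMA: the fused result is emulated with exact rational arithmetic followed by a single rounding, which is the behaviour that defines FMA. The function names and the input values are chosen purely for illustration.

```python
from fractions import Fraction


def mul_add_unfused(a: float, b: float, c: float) -> float:
    # Two roundings: the product is rounded to a double, then the sum is.
    return a * b + c


def mul_add_fused(a: float, b: float, c: float) -> float:
    # Emulated FMA: exact rational arithmetic, then one rounding to a double.
    return float(Fraction(a) * Fraction(b) + Fraction(c))


if __name__ == "__main__":
    a = 1.0 + 2.0 ** -30       # a*a = 1 + 2**-29 + 2**-60 exactly
    c = -(1.0 + 2.0 ** -29)    # cancels everything except the 2**-60 term
    print(mul_add_unfused(a, a, c))  # the 2**-60 term is lost when a*a is rounded
    print(mul_add_fused(a, a, c))    # the 2**-60 term survives the single rounding
```

With these inputs the unfused version returns exactly zero, while the fused version returns 2**-60. Both are legitimate floating-point results, but they are not bit-identical, which is the portability concern when results from different hosts have to agree.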
ID: 31618



©2018 CERN