Message boards :
Number crunching :
GPU advertised for LHC, but they don't do it?
Message board moderation
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Send message Joined: 15 Nov 14 Posts: 602 Credit: 24,371,321 RAC: 0 |
For some curious reason, my Radeon RX 570 does well on MilkyWay (Win7 64-bit), even though it is merely OK, not great on DP. https://milkyway.cs.rpi.edu/milkyway/results.php?hostid=803662 So there are various factors involved. We will really need to try out a card and see how it does. |
Send message Joined: 24 Oct 04 Posts: 1182 Credit: 55,533,758 RAC: 48,280 |
The Einstein Project is the place to go if you want to run GPU tasks. Or find out anything about running them every day from the experts. |
Send message Joined: 7 May 08 Posts: 222 Credit: 1,575,053 RAC: 6 |
We are still in the process of developing SixTrackLib, validating the physics and tackle the integration into the main SixTrack. Our time scale is still probably a year or two to expose the functionality into LHC@Home, but we already performing simulations using SixTrackLib in GPU for specific problems. That's great! If you need help to debug gpu app.... (i think you will use lhc-dev project) :-P |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,839,385 RAC: 17,038 |
Then how come there are projects which only use ATI but not Nvidea? There aren't, but there ones that don't list Intel, but do list AMD, which is strange if OpenCL works on both. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,839,385 RAC: 17,038 |
2nd hand cards are great for 64 bit - The R9 280X is the best I've found. £60 2nd hand, and it does 1024 GFLOPS at 64bit (that's 1:4 ratio - older cards tended to have more double precision). More expensive ones are faster, but you're better buying several of these instead for the same price and getting even more computing power. I'm going to experiment with PCI express splitters at some point when I get more cards - it could be possible to get about 20 GPUs connected to one motherboard - I know bitcoin miners have done it. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,839,385 RAC: 17,038 |
Is it true that you can do a 64bit calculation using two or more 32bit floating point units on the card? What's the performance hit if you did? |
Send message Joined: 7 Jan 07 Posts: 41 Credit: 16,102,983 RAC: 1 |
Is it true that you can do a 64bit calculation using two or more 32bit floating point units on the card? What's the performance hit if you did? Typically reported performance is ~40% of FP32 performance, which is an order of madnitude better than the 1/24 (~4%) FP64:FP32 performance available in almost all consumer-grade GPUs. Source HNY 2020 ! |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,839,385 RAC: 17,038 |
Is it true that you can do a 64bit calculation using two or more 32bit floating point units on the card? What's the performance hit if you did? Interesting. I tend to buy up 2nd hand cards from back when they had a better ratio. Still, mine is 1:4, so 25%. So I wonder why Milkyway uses double precision? According to what you said, they'd get more calculations out of even my card. |
Send message Joined: 7 Jan 07 Posts: 41 Credit: 16,102,983 RAC: 1 |
Interesting. I tend to buy up 2nd hand cards from back when they had a better ratio. Still, mine is 1:4, so 25%. So I wonder why Milkyway uses double precision? According to what you said, they'd get more calculations out of even my card. It's all about the needed precision to do the maths, no rounding errors, no overflow, etc ... A reading to have some clues : Floating Point's not Real |
Send message Joined: 11 Apr 17 Posts: 39 Credit: 7,735,161 RAC: 0 |
Well, 64 bit floats still give you rounding errors but they tend to be way out there past the decimal points. :) |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,839,385 RAC: 17,038 |
Well, 64 bit floats still give you rounding errors but they tend to be way out there past the decimal points. :) Can more calculations at FP32 not be done to make the accuracy better? On cards with stupidly low ratios like 1:24, Even doing 10 calculations at FP32 would be faster than 1 at FP64. |
Send message Joined: 15 Jun 08 Posts: 2549 Credit: 255,376,114 RAC: 62,993 |
https://docs.oracle.com/cd/E19957-01/806-3568/ncg_goldberg.html This might be a good starting point to understand why it is not a good idea to work with reduced precision. |
Send message Joined: 12 Aug 06 Posts: 429 Credit: 10,839,385 RAC: 17,038 |
https://docs.oracle.com/cd/E19957-01/806-3568/ncg_goldberg.html I know reduced precision is bad. I don't have enough maths skills to understand any papers on it. But can't more calculations be done at FP32 to make it more accurate? Even if it requires several, it would still be faster than FP64 on most cards. |
Send message Joined: 7 May 08 Posts: 222 Credit: 1,575,053 RAC: 6 |
As some of you argued, we are focusing on using OpenCL 1.2 because it allows to run on AMD, Intel and Nvidia GPUs, although is less advanced than other options (OpenCL 1.2 dates back to 2011). You can use incoming OpenCl 3.0, that is OpenCl 1.2 plus some extras.... |
Send message Joined: 7 May 08 Posts: 222 Credit: 1,575,053 RAC: 6 |
Seems that the development of sixtrack code is freezed...latest modify 15 Jul, latest version 19 Dec 2019... |
Send message Joined: 4 Apr 19 Posts: 31 Credit: 4,423,972 RAC: 0 |
https://docs.oracle.com/cd/E19957-01/806-3568/ncg_goldberg.html One of the features of Sixtrack is that it always try to produce the same result across all platforms they support. In order to do that the scientists even added brackets to expressions to prohibit the compiler from freely choose a mathematically equivalent computation. You can see scientific publications of Sixtrack and LHC@home for stories like that. Now here comes GPU. Not only it cannot reproduce the bit-exact result as CPU, but must operate at reduced precision for best performance. I'd doubt if scientists would ever accept it. |
Send message Joined: 7 May 08 Posts: 222 Credit: 1,575,053 RAC: 6 |
Now here comes GPU. Not only it cannot reproduce the bit-exact result as CPU, but must operate at reduced precision for best performance. I'd doubt if scientists would ever accept it. I don't know. In the past they have often spoken about gpu in sixtrack (i also spoke with Mcintosh) and seems that they are very interested in gpu calculation. But it depends on them... |
Send message Joined: 7 May 08 Posts: 222 Credit: 1,575,053 RAC: 6 |
Interesting video about gpu and LHC |
Send message Joined: 7 May 08 Posts: 222 Credit: 1,575,053 RAC: 6 |
AMD released first gpu with over 10Tflops FP64 (and 23 Tflops FP32). AMD MI100 |
Send message Joined: 1 Sep 04 Posts: 140 Credit: 2,579 RAC: 0 |
Yes, very interesting but nothing to do with Sixtrack or LHC@home. This describes the LHCb online triggering system. Interesting video about gpu and LHC |
©2025 CERN