AMD better than NVIDIA in these tests
The Chips and Cheese portal has conducted a new study comparing the memory latency between the latest graphics cards from AMD and NVIDIA. The results obtained show a certain advantage of the RDNA 2 architecture, which showed a lower memory latency compared to Ampere solutions. However, we would like to point out that this feature is not directly related to the performance achievable in the most common applications.Like processors, modern GPUs exploit a hierarchical system of multilevel cache. Ampere relies on a more traditional configuration with an L1 and L2 cache, while RDNA 2 (also known as Big Navi), on the other hand, features L0, L1, L2 and Infinity Cache, which essentially acts as an L3 cache. The two architectures use different designs, so it is interesting to see which one is capable of offering higher performance in this area.
To measure memory latency, Chips and Cheese used a methodology normally applied to processors. In fact, benchmark pointer chasing in OpenCL was used to evaluate both cache and latency, comparing the Radeon RX 6800 XT and GeForce RTX 3090 models.
In general, RDNA 2 recorded memory latency lower than Ampere on all levels. The memory latency of RDNA 2 was not very different from that of Ampere, even though the data had to traverse multiple levels of cache to reach the destination. Transferring information from L1 to L2 cache over Ampere takes over 100 ns. Instead, only 66ns are needed to go from L0 to L2 in RDNA 2, regardless of whether there is an L1 cache between them.
Chip and Cheese concluded by stating that many clock cycles are required to transfer information along the GA102 GPU due to its large size (628mm²). The portal believes that RDNA 2's low-latency L2 and L3 caches offer an advantage over Ampere in small workloads. This may explain why RDNA 2 graphics cards excel at lower resolution games.
Do you need a Windows 10 Pro license for your brand new gaming PC? On Amazon it is available for a few euros, do not miss it.
AMD Big Navi gets the better of Nvidia Ampere in GPU memory latency test
AMD seems to have the upper hand compared to Nvidia when it comes to memory latency performance with current-gen graphics cards.
This conclusion was drawn by tech site Chips and Cheese which ran comparisons between Big Navi and Ampere, using pointer chasing benchmarks (written in OpenCL) to assess memory latency in nanoseconds (ns) with AMD’s RX 6800 XT pitted against Nvidia’s RTX 3090.
And the results were interesting to say the least, when you consider that the Big Navi card has more cache layers to travel through on the journey to memory – yet it manages to match the Ampere GPU.
Nvidia runs with a straightforward scheme of two levels of cache – L1 and L2 – cache being the tiny amounts of very fast on-board storage (much faster than the actual video RAM) right there on the chip (just as is the case with CPUs).
With Big Navi, however, AMD does things very differently, using multiple layers of cache: L0, L1, L2 and Infinity Cache (which is effectively L3). Meaning there are more levels to pass through, as we mentioned, so you’d think that this could slow things down more in terms of latency – but that’s not the case.
In fact, AMD’s cache is impressively quick indeed, with latency being low through all those multiple layers, whereas Nvidia has a high L2 latency – 100ns, compared to around 66ns for AMD from L0 to L2. The upshot of all this is that the overall result is pretty much a dead heat.
(Image credit: Chips and Cheese)As Chips and Cheese observes: “Amazingly, RDNA 2’s [Big Navi] VRAM latency is about the same as Ampere’s, even though RDNA 2 is checking two more levels of cache on the way to memory.”
The site points out that AMD’s speedy L2 and L3 caches with their very low latency could give Big Navi GPUs the edge in less demanding workloads, and that this might explain why RX 6000 graphics cards do very well at lower resolutions (where the GPU isn’t being pushed nearly as hard).
Certainly, that’s a tentative conclusion, and we need to be careful around reading too much into any one single test. Furthermore, it’s unclear how any such memory latency advantage might translate into real-world performance in games anyway.
Chips and Cheese also notes that memory latency with CPUs is far quicker, of course, but that comparatively “GDDR6 latency itself isn’t so bad” on an overall level.
Today’s best graphics card deals
Via Tom’s Hardware