Author

Topic: Why does AMD cards perform so much better than NVIDIA cards at mining? (Read 6852 times)

hero member
Activity: 602
Merit: 500
This is similar to why F@H and other scientific (floating point operation) endeavours crush on NVidia and suck on AMD.
full member
Activity: 168
Merit: 100
Quote
Firstly, AMD designs GPUs with many simple ALUs/shaders (VLIW design) that run at a relatively low frequency clock (typically 1120-3200 ALUs at 625-900 MHz), whereas Nvidia's microarchitecture consists of fewer more complex ALUs and tries to compensate with a higher shader clock (typically 448-1024 ALUs at 1150-1544 MHz). Because of this VLIW vs. non-VLIW difference, Nvidia uses up more square millimeters of die space per ALU, hence can pack fewer of them per chip, and they hit the frequency wall sooner than AMD which prevents them from increasing the clock high enough to match or surpass AMD's performance. This translates to a raw ALU performance advantage for AMD:

    * AMD Radeon HD 6990: 3072 ALUs x 830 MHz = 2550 billion 32-bit instruction per second
    * Nvidia GTX 590: 1024 ALUs x 1214 MHz = 1243 billion 32-bit instruction per second

This approximate 2x-3x performance difference exists across the entire range of AMD and Nvidia GPUs. It is noticeably visible in all ALU-bound GPGPU workloads such as Bitcoin, password bruteforcers, etc.

Secondly, another difference favoring Bitcoin mining on AMD GPUs instead of Nvidia's is that the mining algorithm is based on SHA-256, which makes heavy use of the 32-bit integer right rotate operation. This operation can be implemented as a single hardware instruction on AMD GPUs, but requires three separate hardware instructions to be emulated on Nvidia GPUs (2 shifts + 1 add). This alone gives AMD another 1.7x performance advantage (~1900 instructions instead of ~3250 to execute the SHA-256 compression function).

Combined together, these 2 factors make AMD GPUs overall 3x-5x faster when mining Bitcoins.

It's the AMD architecture--it has more stream processors than Nvidia's architecture
member
Activity: 112
Merit: 10
amd is using v5 streaming units, nvida v1. thats the point.
full member
Activity: 131
Merit: 100
The opensource sdk allows ati to perform much better.
No, it has to do with the architecture of the card.  Stream processors vs more complex operations like vertex processing and geometric calcs.

Simply put ATI cards have the right architecture to fit the very small and specific thing btc needs to do.
legendary
Activity: 1806
Merit: 1003
The opensource sdk allows ati to perform much better.

How hard it is to create the SDK? I imagine NVIDIA must have noticed that AMD cards are selling out, and is trying to create a similar SDK
hero member
Activity: 711
Merit: 500
The opensource sdk allows ati to perform much better.
legendary
Activity: 1806
Merit: 1003
What is the reason behind this phenomenon? why does the top of the line NVIDIA cards perform almost 10 times worse than the top of the line AMD cards in terms of mining?
Jump to: