Pages:
Author

Topic: Want legit 7970 testing/benchmarking and tuning for cgminer and Diablominer? (Read 19825 times)

legendary
Activity: 1162
Merit: 1000
DiabloMiner author
I find it funny how so many people criticised giving you a 7970 to do diablominer tuning to and now you have delivered a miner that like EVERYONE is using for their own gain.

Yeah, and now cgminer also has the kernel among the ones it supports. I think I delivered on the promise as best as possible, although I'm still looking for new ways of beating the compiler.

To be fair, AMD's GCN compiler (not the same as the 5xxx/68xx/69xx compiler(s), seems to be ground up brand new) is doing a pretty good job, I haven't been able to find any useful optimizations after I did that two week brutal death march of tuning for 2.6.

I think all in all, the community benefited from their investment.
legendary
Activity: 1190
Merit: 1000
www.bitcointrading.com
I find it funny how so many people criticised giving you a 7970 to do diablominer tuning to and now you have delivered a miner that like EVERYONE is using for their own gain.
legendary
Activity: 1162
Merit: 1000
DiabloMiner author
That'd be depressing if all I can get is another 2%.

Yeah, it kinda takes the fun out of optimizing if all you can get is 2%.

Any idea if there's a GCN feature that we can exploit for more performance like fixed function hardware or new instructions/amd-specific extensions? I initially thought that using size 16 vectors would help, thinking that the GCN SIMD cores were akin to x64 SIMD registers, but it seems that the card exploits its SIMD cores by running a scalar instruction on 16 threads at once.

I'm trying to get the damned Kernel Analyzer to work in either Win in VM or in Wine, both is a no go, has to have a real running copy of the drivers, and the native Linux version locks up soon as DM calls an a CL function.

Don't even bother until the release a new version because Kernel Analyzer won't even correctly list the kernel stats for tahiti. Running the profiler via command line does give some useful output however, and I have .il, .cl, .isa  and a comma separated value file of a profile run with -v 1 from my 7970 if you're interested.

I'm more interested in it for generic 2.6 optimization. I'm hoping I can get 2.6 running as fast as 2.5 on my 5850 and _IF_ I can get it running as fast as 2.1, I can abandon 2.1 support (but I don't see how 2.6 could get that fast)
newbie
Activity: 43
Merit: 0
That'd be depressing if all I can get is another 2%.

Yeah, it kinda takes the fun out of optimizing if all you can get is 2%.

Any idea if there's a GCN feature that we can exploit for more performance like fixed function hardware or new instructions/amd-specific extensions? I initially thought that using size 16 vectors would help, thinking that the GCN SIMD cores were akin to x64 SIMD registers, but it seems that the card exploits its SIMD cores by running a scalar instruction on 16 threads at once.

I'm trying to get the damned Kernel Analyzer to work in either Win in VM or in Wine, both is a no go, has to have a real running copy of the drivers, and the native Linux version locks up soon as DM calls an a CL function.

Don't even bother until the release a new version because Kernel Analyzer won't even correctly list the kernel stats for tahiti. Running the profiler via command line does give some useful output however, and I have .il, .cl, .isa  and a comma separated value file of a profile run with -v 1 from my 7970 if you're interested.
newbie
Activity: 43
Merit: 0
rjk
sr. member
Activity: 448
Merit: 250
1ngldh
Curious about how much room for optimization there is, I made some calculations to see how close Diablominer is to the maximum theoretical performance of the 7970 (see here) and unless I'm missing something in my assumptions I found it to already be within ~2% of the maximum theoretical performance.

Your link had a link to https://bitcointalk.org/index?topic=7964.msg550288#msg550288 which server dies on.  Can you find a link to original 3375 ops calculation.


Add .php after index.
donator
Activity: 1218
Merit: 1079
Gerald Davis
Curious about how much room for optimization there is, I made some calculations to see how close Diablominer is to the maximum theoretical performance of the 7970 (see here) and unless I'm missing something in my assumptions I found it to already be within ~2% of the maximum theoretical performance.

Your link had a link to https://bitcointalk.org/index?topic=7964.msg550288#msg550288 which server dies on.  Can you find a link to original 3375 ops calculation.

legendary
Activity: 1162
Merit: 1000
DiabloMiner author
Curious about how much room for optimization there is, I made some calculations to see how close Diablominer is to the maximum theoretical performance of the 7970 (see here) and unless I'm missing something in my assumptions I found it to already be within ~2% of the maximum theoretical performance.

That'd be depressing if all I can get is another 2%. I'm trying to get the damned Kernel Analyzer to work in either Win in VM or in Wine, both is a no go, has to have a real running copy of the drivers, and the native Linux version locks up soon as DM calls an a CL function.

Damnit AMD
donator
Activity: 362
Merit: 250
I finally had the time to install a pair of Gigabyte 7970s last night.

I tried a few of the settings in the other thread and also found the best I could get out of cgminer was around 475Mh/s per GPU.

I downloaded diablo miner, oc'd the cores to 1100mhz on stock voltage and dropped the memory down to 600mhz.  Started up and bam, 1300Mh/s.   It was very late and I had no time to do any tinkering, but 650Mh/s per gpu out of the box on stock voltage with a nice oc is pretty awesome.
newbie
Activity: 43
Merit: 0
Curious about how much room for optimization there is, I made some calculations to see how close Diablominer is to the maximum theoretical performance of the 7970 (see here) and unless I'm missing something in my assumptions I found it to already be within ~2% of the maximum theoretical performance.

Edit: Fixed link.
full member
Activity: 131
Merit: 100
I recommend you guys click the link to my thread and not rely on the quoted info from it as I keep updating the results.

legendary
Activity: 922
Merit: 1003
Efficiency-wise, it's awesome!  But $1/MH sucks Sad

Agreed. There are better mining solutions out there. No one will be swarming to get 7970s for mining unless someone pulls a rabbit out of that GCN architecture.
hero member
Activity: 896
Merit: 1000
Seal Cub Clubbing Club
Efficiency-wise, it's awesome!  But $1/MH sucks Sad
member
Activity: 75
Merit: 10
This(!):

Quote
Three 7970's:

System Idle: 290 watts
Mining: 925/1375mhz, 1.17v, 630 watts
Mining: 925/340mhz, 1.17v, 625 watts
Mining: 925/340mhz, 880mv, 360 watts
Mining: 925/340mhz, 865mv, 345 watts

~1650mh/s

Single 7970:
System Idle: 290 watts
Mining: 925/1375mhz, 1.17v, 210 watts
Mining: 925/340mhz, 1.17v, 208 watts
Mining: 925/340mhz, 880mv, 120 watts
Mining: 925/340mhz, 865mv, 115 watts

550mh/s
legendary
Activity: 1162
Merit: 1000
DiabloMiner author
DiabloD3

You should take a look on this results: https://bitcointalksearch.org/topic/3x7970-mining-results-57410

They are really impressive and there will be an improvement on scripts and everything.

A photo of 3 7970s in their naked and shiny glory? Let me save that to my porn folder....
member
Activity: 75
Merit: 10
DiabloD3

You should take a look on this results: https://bitcointalksearch.org/topic/3x7970-mining-results-57410

They are really impressive and there will be an improvement on scripts and everything.
legendary
Activity: 1162
Merit: 1000
DiabloMiner author
I wish I had stuck with assembly programming.. I totally rocked assembler on my Apple ][e  Smiley
 
But alas, life took me a different direction.. I'm thinking had I stuck with it, I could be squeezing some more performance out of a miner by redoing it in assembly. Smiley

The CPU miners are written heavily in assembly in some areas just to get maximum speed.
hero member
Activity: 609
Merit: 500
I wish I had stuck with assembly programming.. I totally rocked assembler on my Apple ][e  Smiley
 
But alas, life took me a different direction.. I'm thinking had I stuck with it, I could be squeezing some more performance out of a miner by redoing it in assembly. Smiley
legendary
Activity: 1162
Merit: 1000
DiabloMiner author
BREAKING NEWS: I now have all the donations needed to grab that 7970, I should be ordering it within the next week give or take.

The last donation was big enough to help me cover that gap, and I think this wonderful person wishes to remain anonymous due to the size of the donation.

Gee, wish I got free stuff.

Spend hundreds of unpaid hours messing with internals of GPU programming (I guess at this point DiabloD3 knows parts of the ATI hardware better than the AMD engineers). Write one of the most used miners and a mining core that's used in other very popular miners as well. Also be supportive of it - answer questions, test bugs, do fixes and continue improving it until you squeeze another 1% performance, and another 1%, and another 1%. All this makes money for the miners using it and you're not getting paid.

Then you will get "free" stuff.

Not better than the engineers, not even close. The problem is, theres stuff that just isn't documented on how Radeons work, and most of what I know is either ArtForz or I threw shit at the Radeon to see what would stick, and then mixed in some moon dust from there.

The GCN, though? Lemons.
legendary
Activity: 2212
Merit: 1001
BREAKING NEWS: I now have all the donations needed to grab that 7970, I should be ordering it within the next week give or take.

The last donation was big enough to help me cover that gap, and I think this wonderful person wishes to remain anonymous due to the size of the donation.

Gee, wish I got free stuff.

No one pays ME in gum Cry
 Grin Cheesy Grin Cheesy Grin Cheesy Grin Cheesy
Looking forward to see what you come up with Diablo  Wink
Pages:
Jump to: