Pages:
Author

Topic: More MH/s from Your GPU? (Read 1566 times)

legendary
Activity: 1554
Merit: 1222
brb keeping up with the Kardashians
August 13, 2011, 10:17:16 AM
#22
HD 5850
1020MHz / 355MHz
BFI_INT VECTORS FASTLOOP=FALSE AGGRESSION=11 WORKSIZE=256 -k phatk
418 MH/s

HD 5870
1040MHz / 355MHz
BFI_INT VECTORS FASTLOOP=FALSE AGGRESSION=11 WORKSIZE=256 -k phatk
470-474 MH/s

Done with the Phatk 3% kernel mod, YMMV


full member
Activity: 392
Merit: 100
August 13, 2011, 09:19:20 AM
#21
I realized that many people make a mistake overclicking their cards not properly or setting up guiminers wrong. What I'm about to say was checked on 5870, 5850 and 5830. Every time there was a few % more mh/s when I used those settings.

Try this settings:

1. Core Clock - that's up to You.
2. Memory clock - 340MHz ( not less not more )
3. Extra flags: -v -w256
4. Assign all cards to one core.
5. No cross fire

5870,5850, 5830 seem to work as fast as they can on memory down clocked to 340Mhz - much faster. Anything below or abow is slower.

Examples:

XFX Hd 5870.

Core - 950
Memory - 340
Voltage - stock
Extra flags: -v -w256
CPU - Phenom x2 550 down clocked to 800MHz

GUIMiner - 433-438Mh/s each

Regards

XFX HD 5850
865/300 = 350m/h
865/340 = 352.30m/h

Thx  Wink
jr. member
Activity: 54
Merit: 5
August 13, 2011, 05:36:53 AM
#20
I just realised I was lying earlier. I set the clocking back to 915/340 because the 920/240 setting kept freezing my OS.

anyway, tried your suggestion:
921/307 => 422 MHash/s
920/340 => 423 MHash/s

But no stable hashing possible.
newbie
Activity: 42
Merit: 0
August 12, 2011, 06:21:05 AM
#19
Yeah I tested this too (5870) and most remarkably, the power comsumption of my system went down 50W.

I didn't spend too much time testing settings though, but I am running 920/340 now with lower power usage and cooler GPU temps. Did not touch the voltage though.

-k phatk DEVICE=0 VECTORS AGGRESSION=10 BFI_INT WORKSIZE=256

=> 419 MHash/s

Thanx alot for pointing this memory clockrate thing out, dear mister thread poster!

Could you try 921/307 and see what do you get? Cheesy
jr. member
Activity: 54
Merit: 5
August 12, 2011, 06:12:44 AM
#18
Yeah I tested this too (5870) and most remarkably, the power comsumption of my system went down 50W.

I didn't spend too much time testing settings though, but I am running 920/340 now with lower power usage and cooler GPU temps. Did not touch the voltage though.

-k phatk DEVICE=0 VECTORS AGGRESSION=10 BFI_INT WORKSIZE=256

=> 419 MHash/s

Thanx alot for pointing this memory clockrate thing out, dear mister thread poster!
newbie
Activity: 42
Merit: 0
August 12, 2011, 03:14:20 AM
#17
Along the lines of workgroup size, using phoenix 1.5 and phatk 2.2, the difference on either of my 5830's as far as Mh/s change when using either WORKGROUP=256 or WORKGROUP=128 is negligible to about 5 MH/s, with 256 being faster. I hear that 128 is faster with 68XX series cards, or cards with higher memory clock settings, but I don't have the cards to verify.

If higher memory clock is the key, then we could test by changing memory clocks.

Switching to 900/1200 on my 5870
With 256 I get 391 MH/s
With 128 I get 398 MH/s

Dropping down to 900/900
with 256 I get 392 MH/s
with 128 I get 397 MH/s

Dropping back to 900/300
With 256 I get 405 MH/s
With 128 I get 399 MH/s

So it seems correct, higher memory clocks make w128 better than w256. But a really low memory clock gives better performance overall, maybe only for the 58xx cards.
newbie
Activity: 9
Merit: 0
August 12, 2011, 12:37:07 AM
#16
Quote
So if the card can support larger workgroups, more elements can be operated on with the same data, hence faster in cases where this can be leveraged.

Yet, and I am aware you did not explicitly say so, just want to point it out though, larger workgroups do not necessarily mean more Hash/s! Checked this on my card.

That's why I had the "where it can be leveraged..." part Cheesy

It also depends on other parameters. I would suspect on certain combination of memory/core speeds, a larger workgroup may take fractionally longer to load, which may then be amplified by latencies mismatches and processing time. It's like half a cycle delay means a full cycle wait on the next op kind of thing.

However on both my cards (5850,/5870), 256 is faster than 128. Which cards are you using at what clocks?

If your core clock isn't a multiple of your memory clock, try adjusting them to be. e.g. on my 5850 with 927/309 I get 377MH/s but with 927/340, I get 372 instead. Try it out for yourself, don't just trust numbers the rest of us put out because there are always differences in each of our systems.

I really appreciate this suggestion of using a multiple of your memory clock. I've been trying to find a "sweet spot" for my two Sapphire 5830's for a while and didn't even think to do that. I've been running them at 935/300 (stock voltage) (getting around 303MH/s) and just switched it to 936/312. Once my mining pool comes back online, I'll test it out and keep tweaking. Thanks for the suggestion!

Along the lines of workgroup size, using phoenix 1.5 and phatk 2.2, the difference on either of my 5830's as far as Mh/s change when using either WORKGROUP=256 or WORKGROUP=128 is negligible to about 5 MH/s, with 256 being faster. I hear that 128 is faster with 68XX series cards, or cards with higher memory clock settings, but I don't have the cards to verify.
newbie
Activity: 42
Merit: 0
August 11, 2011, 12:47:09 PM
#15
Quote
So if the card can support larger workgroups, more elements can be operated on with the same data, hence faster in cases where this can be leveraged.

Yet, and I am aware you did not explicitly say so, just want to point it out though, larger workgroups do not necessarily mean more Hash/s! Checked this on my card.

That's why I had the "where it can be leveraged..." part Cheesy

It also depends on other parameters. I would suspect on certain combination of memory/core speeds, a larger workgroup may take fractionally longer to load, which may then be amplified by latencies mismatches and processing time. It's like half a cycle delay means a full cycle wait on the next op kind of thing.

However on both my cards (5850,/5870), 256 is faster than 128. Which cards are you using at what clocks?

If your core clock isn't a multiple of your memory clock, try adjusting them to be. e.g. on my 5850 with 927/309 I get 377MH/s but with 927/340, I get 372 instead. Try it out for yourself, don't just trust numbers the rest of us put out because there are always differences in each of our systems.
newbie
Activity: 7
Merit: 0
August 11, 2011, 10:17:14 AM
#14
"larger workgroups do not necessarily mean more Hash/s!"

That can be true but usually is not.

Try -w256 with 340MHz memory, that's the fastest and the most energy efficient for me.

"I am mining with few 5870, using MSI afterburner to overlock them. I tried to underlock memory clock to 340 to check performance, but unfortunately I was not able to set it, because MSI afterburner let me to underclock memory clock just to minimum at 600Mhz. Anyone else expierencing this issue?"

Try ACOLBF 1.75 You can download it here:

https://bitcointalksearch.org/topic/aoclbf-was-phoenix-rising-phoenixpoclbm-gui-with-built-in-overclock-tool-10264

Working for me. Can underclock down to 200MHz memory but still with workgroup 256bit, 340MHz is the fastest.


jr. member
Activity: 54
Merit: 5
August 11, 2011, 09:17:36 AM
#13
Quote
So if the card can support larger workgroups, more elements can be operated on with the same data, hence faster in cases where this can be leveraged.

Yet, and I am aware you did not explicitly say so, just want to point it out though, larger workgroups do not necessarily mean more Hash/s! Checked this on my card.
hero member
Activity: 927
Merit: 1000
฿itcoin ฿itcoin ฿itcoin
August 11, 2011, 09:17:21 AM
#12
I am mining with few 5870, using MSI afterburner to overlock them. I tried to underlock memory clock to 340 to check performance, but unfortunately I was not able to set it, because MSI afterburner let me to underclock memory clock just to minimum at 600Mhz. Anyone else expierencing this issue?
Previously, I have used Sapphire trixx software, I managed to underclock my cards to 300Mhz with no problems, but with that software cards didn't work stable, and sometimes even locked up. MSI Afterburner has no issues with locking up, so I prefer to use it.
See this guide on how to enable unofficial overclocking modes in MSI After Burner & bypass the limitations: https://bitcointalksearch.org/topic/m.229511
newbie
Activity: 42
Merit: 0
August 11, 2011, 09:00:44 AM
#11
The question remains about the power consumption. I am looking for software to monitor it. And soon I'll try to play a bit with memory clocks.

This cannot be accurately monitored by software.


Quote
Also I am curious, how to explain technically difference between extra flag parameter of -w 128 and -w 256, and impact to performance for 58xx and 5970 cards?

Workgroup is an openCL parameter, very simplistically and likely not technically correctly, a workgroup of 128 works on 128 elements at once with all 128 able to share memory. So if the card can support larger workgroups, more elements can be operated on with the same data, hence faster in cases where this can be leveraged. Max workgroups is limited by the hardware, in steps of 32/64 it seems. And 256 seems to be the max in the 58/59 series.

hero member
Activity: 686
Merit: 500
Bitbuy
August 11, 2011, 07:16:05 AM
#10
Try Phoenix 1.50 with the phat switched and the improved kernel by Dia. Should yield even higher hashing rates!
jr. member
Activity: 54
Merit: 5
August 11, 2011, 06:29:11 AM
#9
Quote
Also I am curious, how to explain technically difference between extra flag parameter of -w 128 and -w 256, and impact to performance for 58xx and 5970 cards?

I am, too! I guess it has to do with the way that things are scheduled on the stream processors?

Anyway, can u say something about the powersaving-effects of underclocking the memory?
member
Activity: 72
Merit: 10
August 11, 2011, 06:14:26 AM
#8
I found something on that issue with setting memory clock. The cause of it probably is that I use Catalyst 11.7, and ass opposed here bitcointalk.org/index.php?topic=33611.0 not all multi GPU configurations work properly. So I'll wait for Catalyst 11.8, because even at the moment memory clocks are set at 600Mhz for 5870 and 500Mhz for 5970, I am satisfied with the temperatures and performance.


The question remains about the power consumption. I am looking for software to monitor it. And soon I'll try to play a bit with memory clocks.


Also I am curious, how to explain technically difference between extra flag parameter of -w 128 and -w 256, and impact to performance for 58xx and 5970 cards?
newbie
Activity: 42
Merit: 0
August 11, 2011, 05:20:39 AM
#7
I am mining with few 5870, using MSI afterburner to overlock them. I tried to underlock memory clock to 340 to check performance, but unfortunately I was not able to set it, because MSI afterburner let me to underclock memory clock just to minimum at 600Mhz. Anyone else expierencing this issue?
Previously, I have used Sapphire trixx software, I managed to underclock my cards to 300Mhz with no problems, but with that software cards didn't work stable, and sometimes even locked up. MSI Afterburner has no issues with locking up, so I prefer to use it.

Some cards just don't work well at certain speeds, my 5870 just goes crazy at 400Mhz but is ok at 390 and 418 for example. So you might want to try with Trixx but use say 310MHz instead. The exact clockspeed of the memory doesn't really matter, it's mainly to reduce the power wasted. 3xx was just a figure that came about as a nice 1/3 divider of a common 9xx ~10xx Mhz clock some people were originally clocking at.
newbie
Activity: 42
Merit: 0
August 11, 2011, 05:17:08 AM
#6
Note that this only applies to dedicated mining rigs.

If you're using your multi-GPU rig for other purposes, then the memory clock on the GPU you're using for actual displays must be set higher or else you may experience flickering and other onscreen oddities. This is likely due to a UVD bug.
member
Activity: 72
Merit: 10
August 11, 2011, 05:16:34 AM
#5
I am mining with few 5870, using MSI afterburner to overlock them. I tried to underlock memory clock to 340 to check performance, but unfortunately I was not able to set it, because MSI afterburner let me to underclock memory clock just to minimum at 600Mhz. Anyone else expierencing this issue?
Previously, I have used Sapphire trixx software, I managed to underclock my cards to 300Mhz with no problems, but with that software cards didn't work stable, and sometimes even locked up. MSI Afterburner has no issues with locking up, so I prefer to use it.
newbie
Activity: 12
Merit: 0
August 11, 2011, 05:12:18 AM
#4
Any advice on fan settings?

Do you run the fans temperature controlled or at a steady speed?

Personally I have been running the fans at 70% steady speed the whole summer and kept temperatures decent on my 5850s. But I haven't been in that room... loud! Smiley



newbie
Activity: 7
Merit: 0
August 11, 2011, 12:20:28 AM
#3
Yes I have but without touching voltage.

2 XFX Hd 5870.

Core - 950
Memory - 340
Voltage - stock
Extra flags: -v -w256
CPU - Phenom x2 550 down clocked to 800MHz - 24W power consumption

Total consuption of my PC is 450W

around 870Mh/s

2x Sapphire HD 5850 Extreme

Core - 875
Memory - 340
Voltage - stock
Extra flags: -v -w256
CPU - Phenom x2 550 down clocked to 800MHz - 24W power consumption

Around 690MH/s

Total consuption of my PC is 320W
Pages:
Jump to: