Author

Topic: CGMINER optimization procedure for HD6870 (Read 15162 times)

newbie
Activity: 5
Merit: 0
June 23, 2013, 08:23:40 PM
#8
first of all a big thanks to armodar for the guide. all the settings worked a treat with my 6870.

I don't have a dedicated mining rig, so I played around with the settings that would give me a decent khash rate while watching videos.

The following seems to give me the best hash rate (~190) while not affecting the video playback quality:
Intensity 11 
thread concurrency 6720
worksize 64
memory 900 mHz

I will do some more testing to see if I can push the rate up to ~250.

Does anyone have any success overclocking core clock on 6870 from 950?

I seem have the same problem. mine is doesn't seem to change from 920. I've set auto-gpu and gpu-engine 900-1000 and it still stays at 920.
legendary
Activity: 1498
Merit: 1000
Does anyone have any success overclocking core clock on 6870 from 950?
newbie
Activity: 25
Merit: 0
August 23, 2012, 08:07:38 AM
#6
kh/s?? You should get mh/s! If you are getting kh/s something is wrong.

I run a Sapphire 6870 FleX 1GB DDR5 card and after having some hiccups I went onwards to fully optimize and balance my LTC(please mind i said LTC) generation.
newbie
Activity: 43
Merit: 0
August 23, 2012, 07:08:17 AM
#5
just to say i get very similar results with my 6870, just to add i can take the memory clock down to around 625mhz on my XFX card and 700mhz on my MSI card. there is a small hashrate loss but i think it if worth it for the cooler more energy efficient overall result
legendary
Activity: 1148
Merit: 1008
If you want to walk on water, get out of the boat
August 23, 2012, 06:27:37 AM
#4
kh/s?? You should get mh/s! If you are getting kh/s something is wrong.
newbie
Activity: 25
Merit: 0
August 23, 2012, 06:25:47 AM
#3
Thank you for the observation...I changed it to VLIW5.

For some reason I was fairly convinced it was the other one, anyways, thanks Smiley
newbie
Activity: 25
Merit: 0
August 21, 2012, 08:12:01 PM
#2
Reserved.
newbie
Activity: 25
Merit: 0
August 21, 2012, 08:10:35 PM
#1
Hello everyone,

because our beloved mods don't get that some people just dont want to talk much and only post when needed (yeah, it's a paradox isn't it - not to want to post junk on forums) I will post in here, when actually this should go to the cgminers out there (in the mining software sub-forum)

I run a Sapphire 6870 FleX 1GB DDR5 card and after having some hiccups I went onwards to fully optimize and balance my LTC(please mind i said LTC) generation.

My OS is Win 7 Ultimate 64 Bit with 4 GB of system RAM.

Because this is also my main PC, i cannot run cgminer all the time at the same settings, so I basically need 3:

  • Sleep profile - low noise, close to 70C (I am cautious) but as high hash rate as possible
  • Desktop profile - noise not important - close to 78C - high hash - but running in "I dont even notice something is running" mode
  • Performance mode - noise absolutely not important (but i don't want my fan to die on me so no more than 85%) - temp not higher than 78C and...you guessed it...max hash rates! - when I'm away from home


Okay, now I know that max noise of the fan i can call silent is 53% so for the sleep profile thats max.

I have 1120 shaders and cgminer help says i should use 4x shaders for vliw5 so 4480....

And now my testing (done by letting the values stabilize on the same pool - so same difficulty):

thread concurrency 8192 - worksize 256  

Intensity 17: 287.9 kH/s -- 82% fan speed -- 76.0C (target 78C)
Intensity 16: 282.0 kH/s -- 81% fan speed -- 76.0C (target 78C)
Intensity 15: 271.8 kH/s -- 81% fan speed -- 75.5C (target 78C)

thread concurrency 8192 - worksize 128  

Intensity 17: 299.0 kH/s -- 85% fan speed -- 76.5C (target 78C) -FASTEST config that works so far
Intensity 16: 296.7 kH/s -- 84% fan speed -- 76.5C (target 78C)
Intensity 15: 286.7 kH/s -- 84% fan speed -- 76.0C (target 78C)

thread concurrency 4480 - worksize 256  

Intensity 17: 288.7 kH/s -- 80% fan speed -- 77.0C (target 78C) - FASTER and cooler than 8192 w256 at same Intensity
Intensity 16: 282.7 kH/s -- 79% fan speed -- 76.5C (target 78C) - FASTER and cooler than 8192 w256 at same Intensity
Intensity 15: 272.6 kH/s -- 78% fan speed -- 76.0C (target 78C) - FASTER and cooler than 8192 w256 at same Intensity

thread concurrency 4480 - worksize 128  

Intensity 17: 301.7 kH/s -- 81% fan speed -- 75.0C (target 78C) - HOLY CRAP - fast and cool but look below
Intensity 16: 298.7 kH/s -- 81% fan speed -- 75.0C (target 78C) - HOLY CRAP - fast and cool but look below
Intensity 15: 287.9 kH/s -- 77% fan speed -- 75.5C (target 78C) - HOLY CRAP - fast and cool but look below

you would expect it to be faster and cooler than the 8192 w128 if you look at the worksize-thread concurrency comparison, and IT IS...but

YOU GET HARDWARE ERRORS. Howcome? don't ask me....ckolivas or someone who understands it...please enlighten me...

Howcome at higher worksizes (256) everything works smoothly and at smaller,easier workloads (128) you get hardware errors i dont know.
What to do to get rid of them..considering that at tc 4480 everything is a wee bit faster and LOTS cooler.

UPDATE:

- new settings tested with very good results - sadly temp wise is not comparable because of much hotter environment
- also decided to completely skip worksize 256 because of past results

thread concurrency 5632 - worksize 128

Intensity 17: 301.8 kH/s - Faster than any previous config at same intensity
Intensity 16: 298.8 kH/s - Faster than any previous config at same intensity
Intensity 15: 287.9 kH/s - Faster than any previous config at same intensity

UPDATE 2:

After extensive reading of this thread: https://bitcointalksearch.org/topic/attn-litecoin-gpu-miners-scrypt-support-for-cgminer-92887
And after Tittiez's discoveries: https://bitcointalksearch.org/topic/attn-litecoin-gpu-miners-scrypt-support-for-cgminer-92887.msg1045832#msg1045832
I started to understand a little more the correlation between thread concurrency and worksize so here are my (partial) discoveries:

1. at the same worksize, decreasing thread concurrency increases speed - at some point you will reach the limit of your hardware (in my case that was tc 4480 w128)
2. at smaller worksizes you need a higher level of thread concurrency to keep yourself over the limit of the hardware (DO NOT FORGET 1.)
3. thread concurrency works best if it's a multiple of the shader count of your card (thus for me that means 1120, 2240, 3360, 4480, 5600, 6720, 7840 - 8960 will result in buffer error, need to test some more)
4. it seems that smaller worksizes increase hashing speed (the concept resembles to a funnel: if you try to send too much through a funnel it will overflow and will make it harder for the liquid to get through, yet if you get to the point where you just barely send enough to let it flow unhindered, you will get a far higher rate of fluid through that funnel)

Considering what I found out, here are my newest results

thread concurrency 6720 - worksize 64 - memory 900 mHz (default:1050 mHz)
(tc 5600 gives HW errors see points 2 and 3 and FOR THIS CARD lower memory speed means lower temps and about 1 kH/s extra)

Intensity 17: 302.1 kH/s -- 74% fan speed -- 77.5C (target 78C) - Even faster and cooler than ever before!
Intensity 16: 297.8 kH/s -- 74% fan speed -- 77.0C (target 78C)
Intensity 15: 292.8 kH/s -- 74% fan speed -- 77.0C (target 78C) - Even faster and cooler than ever before!

NOTE: stale shares were never my concern, with any previous config I never got more than 1.5%-2%, YET for some reason WITH THIS CONFIG ABOVE I get a superb speed and temp AND only ~0.5% stales (so about 1.5% faster speed than these speeds, by submitting more valid shares)

The quest for the best optimized 6870 continues....
Jump to: