Pages:
Author

Topic: Custom RAM Timings for GPU's with GDDR5 - DOWNLOAD LINKS - UPDATED - page 40. (Read 155485 times)

sr. member
Activity: 652
Merit: 266
I just figured out ohgodadecode has the wrong structure for MISC_TIMING.  Here's a Hynix 1625 strap that shows it: 999000000000000022559D0010DE7B4480551312B78C450A004C0601750414206A8900A00200312 018112D34A42A3816

Here's how I think it should be:
Code:
typedef struct _SEQ_MISC_TIMING_FORMAT
{
    uint32_t TRP_WRA : 7;
    uint32_t TRP_RDA : 7;
    uint32_t TRP : 6;
    uint32_t TRFC : 9;
    uint32_t Pad0 : 3;
} SEQ_MISC_TIMING_FORMAT;

I think the way it was defined is probably correct for R9 series cards, not Rx.

Here's the correct structure, it was stated earlier in niko2004x's post.
RX
Quote
struct {
        uint32_t trp_wra : 7 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad0  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;

R9
Quote
struct {
        uint32_t trp_wra : 6 ;
        uint32_t m1Pad0  : 2 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;
sr. member
Activity: 588
Merit: 251
I just figured out ohgodadecode has the wrong structure for MISC_TIMING.  Here's a Hynix 1625 strap that shows it: 999000000000000022559D0010DE7B4480551312B78C450A004C0601750414206A8900A00200312 018112D34A42A3816

Here's how I think it should be:
Code:
typedef struct _SEQ_MISC_TIMING_FORMAT
{
    uint32_t TRP_WRA : 7;
    uint32_t TRP_RDA : 7;
    uint32_t TRP : 6;
    uint32_t TRFC : 9;
    uint32_t Pad0 : 3;
} SEQ_MISC_TIMING_FORMAT;

I think the way it was defined is probably correct for R9 series cards, not Rx.
sr. member
Activity: 588
Merit: 251
Tried 1200/2100 on my Samsung 480 - 30.47MH/s; doesn't matter if I use 4.10 or 4.11rc3.

Thanks for the info; saved me the trouble of testing 4.11.

As for the speed, is that with your Polaris-tuned kernel or stock sgminer-gm 5.5.5?  I still can't get over 29.3 at 1212/2100 (same speed as 1167/2100).
The card is capable of >30, since I get ~30.1 from Claymore 8.1 (1167/2100).


Does over 30 on both. Timings are pretty good, though.

I'm using worksize 192, gpu-threads 2, and xi1024.  You using anything different?
sr. member
Activity: 588
Merit: 251

My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.

doesn't work on my power color red dragon rx480 8gm samsung mem.
windows driver crash when start ETH, both 2050 and 2100 mem clock.
works good on 2100 clock with 1750 (1625?) strap from original rom
777000000000000022339D00CECD593980551111AE8A84080048C6006C0014206A8900A00200312 0140F262B88252F15

My strap is for K4G4.  Likely different mode registers than your 8G Samsung.
sr. member
Activity: 588
Merit: 251
Tried 1200/2100 on my Samsung 480 - 30.47MH/s; doesn't matter if I use 4.10 or 4.11rc3.

Thanks for the info; saved me the trouble of testing 4.11.

As for the speed, is that with your Polaris-tuned kernel or stock sgminer-gm 5.5.5?  I still can't get over 29.3 at 1212/2100 (same speed as 1167/2100).
The card is capable of >30, since I get ~30.1 from Claymore 8.1 (1167/2100).
sr. member
Activity: 652
Merit: 266

It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA


Thank you, that sounds absolutely correct! I didn't know( I confess ) that DPM voltage is determined by ASIC quality(much to learn, obviously). That sure clarifies all.

You are very welcome.  BTW- take a look a AIDA - you can print off the actual voltage at each DPM for each individual GPU - it can give you all kinds of great info in addition to that
It will be a bit difficult Smiley I rarely use windows and if I do it is for 1 GPu only. I will
later see on my hynix the output of aida indeed.
legendary
Activity: 980
Merit: 1001
aka "whocares"

It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA


Thank you, that sounds absolutely correct! I didn't know( I confess ) that DPM voltage is determined by ASIC quality(much to learn, obviously). That sure clarifies all.

You are very welcome.  BTW- take a look a AIDA - you can print off the actual voltage at each DPM for each individual GPU - it can give you all kinds of great info in addition to that
sr. member
Activity: 652
Merit: 266

It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA


Thank you, that sounds absolutely correct! I didn't know( I confess ) that DPM voltage is determined by ASIC quality(much to learn, obviously). That sure clarifies all.
legendary
Activity: 1510
Merit: 1003

My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.

doesn't work on my power color red dragon rx480 8gm samsung mem.
windows driver crash when start ETH, both 2050 and 2100 mem clock.
works good on 2100 clock with 1750 (1625?) strap from original rom
777000000000000022339D00CECD593980551111AE8A84080048C6006C0014206A8900A00200312 0140F262B88252F15
legendary
Activity: 980
Merit: 1001
aka "whocares"

It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA

sr. member
Activity: 588
Merit: 251

My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.


What core clock?

That's with core not locked, and Powerplay seems to settle around 1200.  If I lock the core at 1167, I get the same speed.


What do you get with same core and 2150?

The memclock seems to have a hard limit of 2100 on the Sapphire ref-style 470.
https://bitcointalksearch.org/topic/m.18298250


I have one, and it doesn't - what's the VBIOS part number?

Code:
Adapter  0    (BN=01, DN=00, PCIID=67DF1002, SSID=E349174B)
    Asic Family        :  Ellesmere
    Flash Type         :  M25P40/c    (512 KB)
    Product Name       :  D00003 Polaris10 PRO A1 GDDR5 128Mx32 4GB 300e/300m
    Bios Config File   :  349P05SG.U41
    Bios P/N           :  113-349PS-U41
    Bios Version       :  015.050.000.000.000000
    Bios Date          :  09/30/16 02:30
    ROM Image Type     :  Hybrid Images
    ROM Image Details  :
        Image[0]: Size(59392 Bytes), Type(Legacy Image)
        Image[1]: Size(59392 Bytes), Type(EFI Image)
sr. member
Activity: 588
Merit: 251

My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.


What core clock?

That's with core not locked, and Powerplay seems to settle around 1200.  If I lock the core at 1167, I get the same speed.


What do you get with same core and 2150?

The memclock seems to have a hard limit of 2100 on the Sapphire ref-style 470.
https://bitcointalksearch.org/topic/m.18298250
sr. member
Activity: 588
Merit: 251

My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.


What core clock?

That's with core not locked, and Powerplay seems to settle around 1200.  If I lock the core at 1167, I get the same speed.
sr. member
Activity: 588
Merit: 251
Given the low values for 32AW, I had guessed that it was a multiple of FAW.  Just in case it wasn't I had been zeroing both in my first experiments with custom straps.  Today I did a test and found out 32AW does make a difference even when FAW=0.


What strap has given you the best hash rate with the rx470?

My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.
member
Activity: 129
Merit: 10
Given the low values for 32AW, I had guessed that it was a multiple of FAW.  Just in case it wasn't I had been zeroing both in my first experiments with custom straps.  Today I did a test and found out 32AW does make a difference even when FAW=0.


What strap has given you the best hash rate with the rx470?

depends what type of memory it has
member
Activity: 68
Merit: 10
Given the low values for 32AW, I had guessed that it was a multiple of FAW.  Just in case it wasn't I had been zeroing both in my first experiments with custom straps.  Today I did a test and found out 32AW does make a difference even when FAW=0.


What strap has given you the best hash rate with the rx470?
member
Activity: 129
Merit: 10

EDIT: @dallase
I have my hynix 30.4 at 2080, it does 30.5 on 2100.
Quote
GPU #0: Ellesmere, 4096 MB available, 36 compute units
ETH - Total Speed: 30.430 Mh/s, Total Shares: 1106, Rejected: 0, Time: 43:16
ETH: GPU0 30.430 Mh/s
Incorrect ETH shares: none

Thats on windows though...

And what core clock speed?   I see about +0.7Mh increasing core by 100mhz

Core 1145

So I assume you can break 31 @ core 1250 then?  



1220/2150 = 31+ on linux Wink


This is a screenshot of 1165/2150 on an old kernel 4.10 with old drivers, same settings on 4.11 and latest amdgpupro drivers gets me 31+ with these clocks.


Cool.  Still on 4.4.0 and 16.40 here. I may have to upgrade!

sr. member
Activity: 588
Merit: 251
Given the low values for 32AW, I had guessed that it was a multiple of FAW.  Just in case it wasn't I had been zeroing both in my first experiments with custom straps.  Today I did a test and found out 32AW does make a difference even when FAW=0.
sr. member
Activity: 652
Merit: 266

EDIT: @dallase
I have my hynix 30.4 at 2080, it does 30.5 on 2100.
Quote
GPU #0: Ellesmere, 4096 MB available, 36 compute units
ETH - Total Speed: 30.430 Mh/s, Total Shares: 1106, Rejected: 0, Time: 43:16
ETH: GPU0 30.430 Mh/s
Incorrect ETH shares: none

Thats on windows though...

And what core clock speed?   I see about +0.7Mh increasing core by 100mhz

Core 1145

So I assume you can break 31 @ core 1250 then? 


It depends...sometimes raising core might not work as expected, besides I like my gpus cold and powersaving.
When this timing passes 72h barrier I will do additional tests.
legendary
Activity: 1050
Merit: 1293
Huh?

EDIT: @dallase
I have my hynix 30.4 at 2080, it does 30.5 on 2100.
Quote
GPU #0: Ellesmere, 4096 MB available, 36 compute units
ETH - Total Speed: 30.430 Mh/s, Total Shares: 1106, Rejected: 0, Time: 43:16
ETH: GPU0 30.430 Mh/s
Incorrect ETH shares: none

Thats on windows though...

And what core clock speed?   I see about +0.7Mh increasing core by 100mhz

Core 1145

So I assume you can break 31 @ core 1250 then?  



1220/2150 = 31+ on linux Wink


This is a screenshot of 1165/2150 on an old kernel 4.10 with old drivers, same settings on 4.11 and latest amdgpupro drivers gets me 31+ with these clocks.

Pages:
Jump to: