Pages:
Author

Topic: Custom RAM Timings for GPU's with GDDR5 - DOWNLOAD LINKS - UPDATED - page 48. (Read 155485 times)

legendary
Activity: 980
Merit: 1001
aka "whocares"
WOW - thats a bold statement -- I would suggest doing a little more research before making wild statements like that, The Stilt is a well known and respected guy in the AMD world.   Huh

In regards to the other comments, welcome to the club, several of us (myself, wolf0 etc) have put in countless hours

Respect is irrelevant.
I said 'Out of historic context it is hard to tell who copied who.'
It is not bold, it just states what either vendors used Stilt work or other way around.
Who done it is irrelevant (at least for me), changed values are not.

Respect maybe irrelevant to you but not to me.  I respect those that put in the work.

Ultimately what matters on this forum is if the work yields a result.  I am hitting 530+ mining ZEC with a Fury X and a modified BIOS way under stock voltages.  There isn't a GDDR5 based card that can touch the numbers HBM cards can put up.  Obviously it isn't only the HBM but it certainly doesn't hurt
member
Activity: 126
Merit: 10
WOW - thats a bold statement -- I would suggest doing a little more research before making wild statements like that, The Stilt is a well known and respected guy in the AMD world.   Huh

In regards to the other comments, welcome to the club, several of us (myself, wolf0 etc) have put in countless hours

Respect is irrelevant.
I said 'Out of historic context it is hard to tell who copied who.'
It is not bold, it just states that either vendors used Stilt work or other way around.
Who done it is irrelevant (at least for me), changed values are not.
legendary
Activity: 980
Merit: 1001
aka "whocares"
It is even more fun playing with HBM.  Just look at the 100 strap and compare it to the 400,500,600 straps and it looks like it would be easy.  Well it is not but it is fun to play with regardless.

Well, there is only one variant of timings table for HBM compared to >100 variants of timings table for GDDR5.
Not much data to do science here.


1 Variant of what? I think I missed something. Wink

I see your point, but that is exactly why there is MORE science.  I do not use the hundreds of GDDR5 timings to write my straps because I do not want to Reinvent The Wheel.  I am looking to make new straps that accomplish something better than the originals.

And I did use hundreds of GDDR5 timings strings to statistically discover layout of registers and their parts.

Take The Stilts timings, the impact his timings had on mining are unprecedented and they were not based on other straps, they were from his own testing.  I think he said it took him 2+ weeks to write 1 strap

I am not exactly sure about Stilt work.
As I stated before (there are some post of mine in this thread) there are some vendors timings which differs from Stilt ones only by few values.
Out of historic context it is hard to tell who copied who.


WOW - thats a bold statement -- I would suggest doing a little more research before making wild statements like that, The Stilt is a well known and respected guy in the AMD world.   Huh

In regards to the other comments, welcome to the club, several of us (myself, wolf0 etc) have put in countless hours
member
Activity: 126
Merit: 10
It is even more fun playing with HBM.  Just look at the 100 strap and compare it to the 400,500,600 straps and it looks like it would be easy.  Well it is not but it is fun to play with regardless.

Well, there is only one variant of timings table for HBM compared to >100 variants of timings table for GDDR5.
Not much data to do science here.


1 Variant of what? I think I missed something. Wink

I see your point, but that is exactly why there is MORE science.  I do not use the hundreds of GDDR5 timings to write my straps because I do not want to Reinvent The Wheel.  I am looking to make new straps that accomplish something better than the originals.

And I did use hundreds of GDDR5 timings strings to statistically discover layout of registers and their parts.

Take The Stilts timings, the impact his timings had on mining are unprecedented and they were not based on other straps, they were from his own testing.  I think he said it took him 2+ weeks to write 1 strap

I am not exactly sure about Stilt work.
As I stated before (there are some post of mine in this thread) there are some vendors timings which differs from Stilt ones only by few values.
Out of historic context it is hard to tell who copied who.
legendary
Activity: 980
Merit: 1001
aka "whocares"
It is even more fun playing with HBM.  Just look at the 100 strap and compare it to the 400,500,600 straps and it looks like it would be easy.  Well it is not but it is fun to play with regardless.

Well, there is only one variant of timings table for HBM compared to >100 variants of timings table for GDDR5.
Not much data to do science here.


1 Variant of what? I think I missed something. Wink

I see your point, but that is exactly why there is MORE science.  I do not use the hundreds of GDDR5 timings to write my straps because I do not want to Reinvent The Wheel.  I am looking to make new straps that accomplish something better than the originals.

Take The Stilts timings, the impact his timings had on mining are unprecedented and they were not based on other straps, they were from his own testing.  I think he said it took him 2+ weeks to write 1 strap
member
Activity: 126
Merit: 10
It is even more fun playing with HBM.  Just look at the 100 strap and compare it to the 400,500,600 straps and it looks like it would be easy.  Well it is not but it is fun to play with regardless.

Well, there is only one variant of timings table for HBM compared to >100 variants of timings table for GDDR5.
Not much data to do science here.
legendary
Activity: 980
Merit: 1001
aka "whocares"
It is even more fun playing with HBM.  Just look at the 100 strap and compare it to the 400,500,600 straps and it looks like it would be easy.  Well it is not but it is fun to play with regardless.
sr. member
Activity: 588
Merit: 251
tFAW = 0  
Really, do I read this right??



Elpida... you can change some random things and it will still work...

In comparison with Samsung, Elpida runs on 3 cycles, which is interesting Smiley

Anyhow, tFAW 0, i'd suggest to try it, you'll see Smiley works fine.

+ they clock like beasts Smiley

Greetings

I finally got a chance to try tFAW = 0 on my Sapphire Rx470 with Samsung RAM.  Works fine at 2000 with the 1625 strap (normally tFAW = 10).  No change in mining performance though.  Maybe if I try tRRD = 5 (instead of 6)...
full member
Activity: 199
Merit: 108
Look, I'm really not that interesting. Promise.
Aah.
And the profits go to the drain.
Here is my version https://github.com/niko2004x/atom_timing_editor.

Well, to be fair, you're all swinging your dicks around like helicopters and acting like you're gods; it's about time someone levels the playing field.

Thank you for sharing. Your code is very clean! Very nice job.

People will still pay you to do this for them - there are many people out there who won't be able to use these tools. All this does is provide the ones who have the knowledge, the information. There are many people who will pay for the convenience of a tighter timing.
member
Activity: 126
Merit: 10
Aah.
And the profits go to the drain.
Here is my version https://github.com/niko2004x/atom_timing_editor.

EDIT: There are different decoders for preRX(starting from HD7xxx) and RX series.
Not sure if they are right but they give consistent values for Elpida EDW4032BABG and Hynix H5GC4H24AJR in cards of different generations.
sr. member
Activity: 588
Merit: 251
Hello all,

Wolf0 and I have, today, released OhGodATool, OhGodADecode and OhGodACsumFixer.

Currently, they are without barebones documentation - I don't have the time right now with work, but once I do have a spare moment, I will update it.

You can download OhGodATool here: https://github.com/OhGodACompany/OhGodATool/releases/
You can download OhGodADecode here: https://github.com/OhGodACompany/OhGodADecode/releases/
You can download OhGodACsumFixer here: https://github.com/OhGodACompany/OhGodACsumFixer/releases

Enjoy.

Thanks.  Where did you find the updated MC_SEQ_MISC_TIMING?
full member
Activity: 199
Merit: 108
Look, I'm really not that interesting. Promise.
Hello all,

Wolf0 and I have, today, released OhGodATool, OhGodADecode and OhGodACsumFixer.

Currently, they are without barebones documentation - I don't have the time right now with work, but once I do have a spare moment, I will update it.

You can download OhGodATool here: https://github.com/OhGodACompany/OhGodATool/releases/
You can download OhGodADecode here: https://github.com/OhGodACompany/OhGodADecode/releases/
You can download OhGodACsumFixer here: https://github.com/OhGodACompany/OhGodACsumFixer/releases

Enjoy.

Thanks for the links--- my old school spreadsheet works but that will be a lot easier Wink

btw- does the CheckSum work on linux --- I do not have any windoze machines

Yes, it does. Releases has it precompiled for you.
legendary
Activity: 980
Merit: 1001
aka "whocares"
Hello all,

Wolf0 and I have, today, released OhGodATool, OhGodADecode and OhGodACsumFixer.

Currently, they are without barebones documentation - I don't have the time right now with work, but once I do have a spare moment, I will update it.

You can download OhGodATool here: https://github.com/OhGodACompany/OhGodATool/releases/
You can download OhGodADecode here: https://github.com/OhGodACompany/OhGodADecode/releases/
You can download OhGodACsumFixer here: https://github.com/OhGodACompany/OhGodACsumFixer/releases

Enjoy.

Thanks for the links--- my old school spreadsheet works but that will be a lot easier Wink

btw- does the CheckSum work on linux --- I do not have any windoze machines
full member
Activity: 199
Merit: 108
Look, I'm really not that interesting. Promise.
Hello all,

Wolf0 and I have, today, released OhGodATool, OhGodADecode and OhGodACsumFixer.

Currently, they are without barebones documentation - I don't have the time right now with work, but once I do have a spare moment, I will update it.

You can download OhGodATool here: https://github.com/OhGodACompany/OhGodATool/releases/
You can download OhGodADecode here: https://github.com/OhGodACompany/OhGodADecode/releases/
You can download OhGodACsumFixer here: https://github.com/OhGodACompany/OhGodACsumFixer/releases

Enjoy.
legendary
Activity: 980
Merit: 1001
aka "whocares"
So my first try at a custom strap didn't work (GPU crashed almost immediately when mining ETH).
custom 1900: 1500RAS, 1625CAS, MISC2, & ARB
777000000000000022CC1C00AD515A3ED0570F15B98CA50A004AE7001C0714207A8900A00300000 01B11353F922A3217

A straight copy of the 1625 strap to 2000 works fine, while the 1500 strap gave errors even at 1900.  I tried taking the 1900 strap, RAS from the 1500, and CAS, MISC2 & ARB2 from the 1625 strap and using it for the 2000 strap.


Most of the time those type of adjustments will crash or yield no benefit.  The timings that are affected by a change must be changed as well to compensate.

Secondly, tRAS has little affect on anything and is mostly changed to compensate for changes in other timings
sr. member
Activity: 588
Merit: 251
Not quite - they tell you part of the story - but look at MISC1, for example :3

Yes, I'm intentionally not using the mask for tRP_WRA and RDA since there is data in the straps outside the mask.
I suppose I could add unknown fields (i.e. uk1, uk2).
sr. member
Activity: 588
Merit: 251
So my first try at a custom strap didn't work (GPU crashed almost immediately when mining ETH).
custom 1900: 1500RAS, 1625CAS, MISC2, & ARB
777000000000000022CC1C00AD515A3ED0570F15B98CA50A004AE7001C0714207A8900A00300000 01B11353F922A3217

A straight copy of the 1625 strap to 2000 works fine, while the 1500 strap gave errors even at 1900.  I tried taking the 1900 strap, RAS from the 1500, and CAS, MISC2 & ARB2 from the 1625 strap and using it for the 2000 strap.
sr. member
Activity: 588
Merit: 251
Not quite - they tell you part of the story - but look at MISC1, for example :3
I've been understanding of how much you want to brag, because finding this information is hard work, but not everyone has the resources you do, through me.

Hmmm... sounds like someone with contacts inside AMD.
full member
Activity: 199
Merit: 108
Look, I'm really not that interesting. Promise.
I see at least a couple people have written strap decoding programs, but I can't find publicly released.  I was going to write one and release it publicly, but I figured if someone else has already written one...


So it's not as simple as using atombios.h to dump the fields in ATOM_MEMORY_TIMING_FORMAT_V2.
https://raw.githubusercontent.com/torvalds/linux/master/drivers/gpu/drm/radeon/atombios.h

Straps for GCN cards are 52 bytes long (3 bytes for memory clock, 1 byte for memory type, 48 bytes for strap), but sizeof(ATOM_MEMORY_TIMING_FORMAT_V2) = 40 bytes.

So is it just a matter of old-fashioned reverse engineering?  i.e. looking at different straps and reading through GDDR5 data sheets to figure out the strap offsets for different values?


Hah, you don't know the format and you're going to make a public tool? Your threats are like skate park swimming pools - empty Tongue

Looks like you haven't read the rest of the thread.  It took less than an hour to figure it out from the Linux drm code.


Not quite - they tell you part of the story - but look at MISC1, for example :3

Can you stop being a dick? You should be kind and considerate and thankful that people are working hard to document and unlock how all this works - knowledge should be shared, and distributed freely. This kind of optimization isn't just valuable to mining; it's valuable to a lot of operations (including scientific research, which requires a lot of compute power, and benefits quite heavily from this kind of optimization).

I've been understanding of how much you want to brag, because finding this information is hard work, but not everyone has the resources you do, through me.

nerdralph - you're doing really well. Keep it up. I'm really proud of you.

I didn't need to use all those resources - I decoded it a lot through trial and error, and public knowledge. But sure, I'll stop.

Except you did. And yes, you did a lot of the hard work, but so did many before you, including me. There's no need to act like you're a god; and there's no need for this pissing contest. Thankfully nerdralph will probably be able to take all of my work and do something useful with it, because I sure as hell don't have the time right now.

EDIT: nerdralph, I want to make this public. I don't have the time to compile all of this information in a nice, easy to use way, though. Can you please help me, if I provide you with the assistance? I'm really short on time, and this is something I am passionate about.
full member
Activity: 199
Merit: 108
Look, I'm really not that interesting. Promise.
I see at least a couple people have written strap decoding programs, but I can't find publicly released.  I was going to write one and release it publicly, but I figured if someone else has already written one...


So it's not as simple as using atombios.h to dump the fields in ATOM_MEMORY_TIMING_FORMAT_V2.
https://raw.githubusercontent.com/torvalds/linux/master/drivers/gpu/drm/radeon/atombios.h

Straps for GCN cards are 52 bytes long (3 bytes for memory clock, 1 byte for memory type, 48 bytes for strap), but sizeof(ATOM_MEMORY_TIMING_FORMAT_V2) = 40 bytes.

So is it just a matter of old-fashioned reverse engineering?  i.e. looking at different straps and reading through GDDR5 data sheets to figure out the strap offsets for different values?


Hah, you don't know the format and you're going to make a public tool? Your threats are like skate park swimming pools - empty Tongue

Looks like you haven't read the rest of the thread.  It took less than an hour to figure it out from the Linux drm code.


Not quite - they tell you part of the story - but look at MISC1, for example :3

Can you stop being a dick? You should be kind and considerate and thankful that people are working hard to document and unlock how all this works - knowledge should be shared, and distributed freely. This kind of optimization isn't just valuable to mining; it's valuable to a lot of operations (including scientific research, which requires a lot of compute power, and benefits quite heavily from this kind of optimization).

I've been understanding of how much you want to brag, because finding this information is hard work, but not everyone has the resources you do, through me.

nerdralph - you're doing really well. Keep it up. I'm really proud of you. I'm going to give you some help via PM.
Pages:
Jump to: