
Topic: ATTN Litecoin GPU Miners - Scrypt support for cgminer - page 16. (Read 175855 times)

-ck
legendary
Activity: 4088
Merit: 1631
Ruu \o/
Okay so I've had an extensive discussion with mtrlt about the code and I've done a lot of debugging and I've learnt a lot.

First of all, the values you can safely plug in on Linux are NOT compatible with what you can plug in on Windows. There are different restrictions on the allocatable memory depending on the driver/OS combination. Therefore you cannot compare results between the two.

Second, there IS a MEANINGFUL upper limit to aggression or in this case, intensity. It is the point where 2 to the power of the intensity reaches the number of concurrent threads.

Eg concurrent threads of 8192 has an upper limit of 13 intensity because 2^13 is 8192. You CAN go over this value, but you are absolutely guaranteed to start producing invalid results. How many invalid results you get for the potential rise in hashrate is highly hardware dependent.
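
To make that rule concrete, here is a minimal Python sketch. The function name max_intensity is made up for illustration and is not anything in cgminer; it just finds the smallest power of 2 that reaches the concurrency.

```python
def max_intensity(thread_concurrency: int) -> int:
    """Smallest exponent i such that 2**i >= thread_concurrency."""
    if thread_concurrency < 1:
        raise ValueError("thread concurrency must be positive")
    # (n - 1).bit_length() equals ceil(log2(n)) for n >= 1, without float rounding
    return (thread_concurrency - 1).bit_length()


print(max_intensity(8192))  # 13, the example from the post
```

Anything above the returned value is where invalid results should be expected to start.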

The previous release code did no boundary checking or any testing of the device. I have now updated the git tree to test just how much memory it can allocate and it will now AUTOMATICALLY TUNE to the maximum values that are likely to work. I suggest you start it in debug mode with -D to see what it reports as the concurrent threads, and then find the largest multiple of the number of shaders in the device that fits under that value. Eg a 6950 has 134217728 bytes of max memory, which works out to a concurrency of 2048, but it only has 1408 shaders, so setting concurrent_threads to 1408 will likely make it faster.
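
The arithmetic behind the 6950 example can be sketched roughly like this. Big assumption here: each thread needs 128 bytes per stored scratchpad entry, with 1024 entries divided by the lookup gap. That reproduces the 2048 figure above, but it is not copied from the cgminer source, and both function names are hypothetical.

```python
def max_concurrency(max_alloc_bytes: int, lookup_gap: int = 2) -> int:
    # Assumption: 128 bytes per stored scratchpad entry, and a lookup gap
    # of g keeps only every g-th of the 1024 entries (rounded up).
    entries = -(-1024 // lookup_gap)  # ceiling division
    return max_alloc_bytes // (128 * entries)


def round_to_shaders(concurrency: int, shaders: int) -> int:
    # Largest multiple of the shader count that does not exceed concurrency.
    if concurrency < shaders:
        return concurrency
    return concurrency - concurrency % shaders


limit = max_concurrency(134217728)  # 6950 max memory figure from the post
print(limit, round_to_shaders(limit, 1408))
```

With a limit of 2048 and the 800 processing elements of a 5770, round_to_shaders gives 1600.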

Changing lookup_gap has 2 effects. The larger it is, the higher you can go with thread_concurrency. However, speed also is dependent on architecture design, and virtually all GPUs are fastest at a gap of 2. If you choose a custom gap without choosing a thread concurrency, cgminer will choose the concurrency for you. If you don't choose a gap, it will select 2 for you.

About GPU threads: You should run as many as you can start without cgminer crashing or failing. They do NOT correlate with shaders, compute units, ram or anything else as any meaningful multiple or anything like that.

Now finally, and you can believe me or not on this, but reaper sends work to the GPU WITHOUT CHECKING if it was accepted, and gets the return buffer WITHOUT CHECKING if it actually did any work, and then adds the number of hashes it would have expected the GPU to do with that work sent to it. This means that when you start with lots of threads, some of them may not even be doing anything. Or if you've set some borderline invalid values, it will appear to be working fine, report back a big hashrate, but generate fewer valid shares. So I implore you to check the share generation rate and pretty much ignore the reported hashrate when comparing notes. Remember that cgminer AND reaper use virtually identical kernels so should hash at virtually identical rates.
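
One way to compare notes along those lines is by accepted shares per minute rather than by the displayed hashrate. A rough sketch only: the helper name share_stats is made up, and the two runs and their numbers are invented purely for illustration.

```python
def share_stats(accepted: int, invalid: int, minutes: float):
    """Return (accepted shares per minute, fraction of shares that were invalid)."""
    total = accepted + invalid
    invalid_ratio = invalid / total if total else 0.0
    return accepted / minutes, invalid_ratio


# Hypothetical runs: (label, accepted, invalid, minutes). Not real measurements.
runs = [("intensity 13", 120, 2, 60.0), ("intensity 15", 110, 40, 60.0)]

for label, acc, bad, mins in runs:
    rate, ratio = share_stats(acc, bad, mins)
    print(f"{label}: {rate:.2f} accepted/min, {ratio:.1%} invalid")

# Pick the setting by accepted shares per minute, not by reported hashrate.
best = max(runs, key=lambda r: share_stats(r[1], r[2], r[3])[0])
```

Run each setting for the same length of time against the same pool, otherwise the share counts are not comparable.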

Summary: Start cgminer without setting worksize, vectors, lookup gap or thread concurrency, but in debug mode with -D -T (I made this example up, not sure what it really is)

 [2012-07-23 21:07:18] GPU 0: selecting lookup gap of 2                    
 [2012-07-23 21:07:18] GPU 0: selecting thread concurrency of 2048    

Then if you're on a 5770, you can google that it has 800 processing elements, so pick the highest multiple of that while staying under the thread concurrency, so 1600. The nearest power of 2 is 2048, and 2^11 is 2048, so an intensity of 11.

Give that a go and let's see what happens. I expect different results on windows and linux. Use this table as a guide for what multiples to make concurrent threads.

Code:
GPU  Processing Elements
7750 512
7770 640
7850 1024
7870 1280
7950 1792
7970 2048

6850 960
6870 1120
6950 1408
6970 1536
6990 (6970x2)

6570 480
6670 480
6790 800

6450 160

5670 400
5750 720
5770 800
5830 1120
5850 1440
5870 1600
5970 (5870x2)
hero member
Activity: 686
Merit: 500
To anyone with a 5770: A core overclock of up to around 40 MHz is the sweet spot to gain a few kh/s; any further and the kh/s decreases. (At least for me)

Example:
875 MHz gets me 202.5kh/s
885 MHz gets me 205.5kh/s
900 MHz gets me 180kh/s

Seems to be the same way with memory too. Actually, exactly the same.

So more tweaking has pumped 452kh/s out of my two cards (6850, 5770).

Edit:

I'll make a new build with the new commits when I wake up. I like staying updated. :P
hero member
Activity: 686
Merit: 500
Tittiez also helped me and nearly solved it for me so I appreciate that too :)

No problem. :)
legendary
Activity: 4592
Merit: 1851
Linux since 1997 RedHat 4
Ya I bumped it because no one had replied to it and was wondering...

That's what you do when a thread gets no replies ::)


No.
newbie
Activity: 28
Merit: 0
Azrael_PT yes you did help me, thank you :)

It wasn't how I fixed it but I appreciate it nonetheless :)

Tittiez also helped me and nearly solved it for me so I appreciate that too :)

Thanks again guys. Sorry if it came across harsh, I didn't mean it like that. In all honesty, the first post wasn't being snarky, well, at least I wasn't INTENDING it to be..
legendary
Activity: 1736
Merit: 1006
win32, includes commits 94c94d6 and 1711b4e: http://www.mediafire.com/?18wth7mby8evcbb

Includes the .dll files, tested and works fine for me.

Thanks, appreciate it.
newbie
Activity: 28
Merit: 0
Ya I bumped it because no one had replied to it and was wondering...

That's what you do when a thread gets no replies ::)

full member
Activity: 190
Merit: 100
Par Pari Refertur
I did nothing of the sort kano..

I asked for an answer none of you could give me one...

I solved it myself..

It's not my fault you couldn't tell me the answer..

Hum...

I think I helped you, but I don't care. :P
legendary
Activity: 1736
Merit: 1006
Kano, have some beers or something. Anything that'll chill you out.
legendary
Activity: 4592
Merit: 1851
Linux since 1997 RedHat 4
I did nothing of the sort kano..

I asked for an answer none of you could give me one...

I solved it myself..

It's not my fault you couldn't tell me the answer..
No, your first 2 posts here were: you asked a question then posted 35 minutes later asking why no one had answered it yet ...
coz you hadn't bothered to read anything in this thread.
newbie
Activity: 28
Merit: 0
How is it trolling when I solve it myself it's fact that you act like you are the smartest person here when I solved it myself , so how fuckin smart are you clown shoes?
newbie
Activity: 28
Merit: 0
I did nothing of the sort kano..

I asked for an answer none of you could give me one...

I solved it myself..

It's not my fault you couldn't tell me the answer..
legendary
Activity: 1736
Merit: 1006


We got it the first time - you prefer reaper - good ... go use it and be happy elsewhere.


What's wrong with you? Why are you so antagonistic? I'm comparing notes - after all it's the reason this thread exists - TO IMPROVE LITECOIN MINING EG REAPER.

Gonna keep getting up in my grill, knuckle-dragger? And who's we? Do you have multiple personality disorder?

Good grief.
legendary
Activity: 4592
Merit: 1851
Linux since 1997 RedHat 4
...

I thought the 6670 was what got around 100 MH/s, geez I'm trying to help. Like I said, try other concurrencies.
You're wasting your time on the troll noob.
He's not listening and taking out his forum frustrations on you.

As I said (and he's also just said) he's expecting you to give him the answers and not bother to fucking look for himself.
Ignore him.
-ck
legendary
Activity: 4088
Merit: 1631
Ruu \o/
...and the signal to noise ratio drops to the usual levels :-\
newbie
Activity: 28
Merit: 0
not at all the 6670 is 150 MH/s

the 6570 is 89-98 MH/s (89 Low / 98 Max)

and sorry Tittiez I'm not in a very good fuckin mood the amount of bullshit I've had to put up with the last 2 fucking weeks..

and I ask for help and don't fuckin get it I just fix it myself so stfu for failing so hard kano..
hero member
Activity: 686
Merit: 500
And actually it's not normal..

50 KH/s is half of what a 6570 can get...

I'm not fuckin stupid I know the difference in scrypt/SHA256

If 6570 Bitcoin Hashrate = 100 MH/s then Litecoin Hashrate = +/- 100 KH/s

Is everyone on this forum a fucking troll..

Go back under your bridge scrub nub..

I thought the 6670 was what got around 100 MH/s, geez I'm trying to help. Like I said, try other concurrencies.
full member
Activity: 190
Merit: 100
Par Pari Refertur
And actually it's not normal..

50 KH/s is half of what a 6570 can get...

I'm not fuckin stupid I know the difference in scrypt/SHA256

If 6570 Bitcoin Hashrate = 100 MH/s then Litecoin Hashrate = +/- 100 KH/s

Is everyone on this forum a fucking troll..

Go back under your bridge scrub nub..

Well, with my Nvidia 310M I get ~3.6 MH/s mining bitcoins and ~5 kH/s mining litecoins, so I don't think what you said is totally correct, but I could be wrong.

And please leave other matters to their appropriate thread. Please don't go off topic.

PS: Only 100 MH/s with a 6570? I have a friend that gets 120+ MH/s with BAMT. With OC, forgot that. :P
newbie
Activity: 28
Merit: 0
And actually it's not normal..

50 KH/s is half of what a 6570 can get...

I'm not fuckin stupid I know the difference in scrypt/SHA256

If 6570 Bitcoin Hashrate = 100 MH/s then Litecoin Hashrate = +/- 100 KH/s

Is everyone on this forum a fucking troll..

Go back under your bridge scrub nub..
newbie
Activity: 28
Merit: 0
Too many truths for you?