OFFICIAL CGMINER mining software thread for linux/win/osx/mips/arm/r-pi 4.11.0 - page 648.

DeathAndTaxes

donator

Activity: 1218

Merit: 1080

Gerald Davis

Quote from: dlasher on February 16, 2012, 09:31:16 PM

Quote from: -ck on February 15, 2012, 10:34:19 PM

cgminer sets clocks all back to default on exit... if it exits cleanly, and of course on windows it's a miracle when it does.

Sadly this is not the case.

windows machine, pair of 6950's, set to 850/1300 for normal operation, in cgminer they are set to 700-880/300... when cgminer exits it leaves the cards at 880/300.

even updated to 12.1 drivers, both 2.4 and 2.6 SDK.

Is it exiting or crashing?

dlasher

sr. member

Activity: 467

Merit: 250

Quote from: -ck on February 15, 2012, 10:34:19 PM

cgminer sets clocks all back to default on exit... if it exits cleanly, and of course on windows it's a miracle when it does.

Sadly this is not the case.

windows machine, pair of 6950's, set to 850/1300 for normal operation, in cgminer they are set to 700-880/300... when cgminer exits it leaves the cards at 880/300.

even updated to 12.1 drivers, both 2.4 and 2.6 SDK.

kano

legendary

Activity: 4634

Merit: 1851

Linux since 1997 RedHat 4

Quote from: jjshabadoo on February 16, 2012, 08:54:24 PM

...
fan 100 ...
...

Hmm don't expect that to last forever ...

jjshabadoo

hero member

Activity: 535

Merit: 500

I second a thread just for cgminer settings and what cards are being used.

I use I,9 and then just set memclock, fan and engine in my bash script for linuxcoin.

I'm still struggling to get my 5970's stable.

fan 100 engine 820 and me 410 seem best for me, but of course for wattage, I'd love a lower memclock.

Should I be using I 8, is that my issue? also i am on cgminer 2.0.7 because there is no way in hell i can do anything in linuxcoin without screwing it up and hate the full ubuntu OS, too difficult to work with IMO.

We need a new stripped down version of linuxcoin or call it whatever, just for mining and maybe built around optimal settings/set-up for cgminer. Seems BAMT doesn't supoort or work with cgminer.

My farm is up to 8 rigs 10 5970's, 8 5870's and 1 6990. I plan on building a water-cooled 4x5970 with 1500 watt psu. I have the stuff, but want to make sure I can get it set up as fast and stable as possible as it will be in a semi-remote location.

I know I need to figure out how to use ssh, etc. but I'd also like to have the best sdk, drivers, etc. along with some other folks settings for engine and memclock.

Can we start a cgminer optimization thread with some guides? I'll throw in a bounty of 10btc if it has a set up guide and an optimal settings database.

-ck

legendary

Activity: 4088

Merit: 1631

Ruu \o/

Okay so I tested the fastest diablominer has to offer on 7970 and current cgminer is 1.5 MHash faster with defaults, so I'm pleased

. I guess I should keep working on my kernel Wink

-ck

legendary

Activity: 4088

Merit: 1631

Ruu \o/

Quote from: Diapolo on February 16, 2012, 10:11:16 AM

Con, maybe we can talk about integrating global offset parameter support into CGMINER?

Take a short look at http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueNDRangeKernel.html and the global_work_offset parameter. All that has to be taken into consideration from the kernel-side is in DiaKGCN. OpenCL 1.1 detection is in your code, too, which is needed, but I can't do the other required changes without a compiler.

In short, the nonce-base is not supplied via the base parameter, if GOFFSET is enabled, but instead via the global_work_offset parameter and used via the global work-item ID in the kernel. This saves a few instructions and can give us a small boost.

Yes I like that idea. But 8 and 16 vectors perform shithouse followed by appalling so it's not worth pursuing those.

kano

legendary

Activity: 4634

Merit: 1851

Linux since 1997 RedHat 4

Quote from: DeathAndTaxes on February 16, 2012, 10:02:33 AM

c) cgminer stops after 5000 shares (and script detects it)

Or

--sched-stop `date -d "+6 hours" +%H:%M`

... since shares are random

And 6 hours for a 1GH/s is approximately 5029 shares

Frizz23

hero member

Activity: 1162

Merit: 500

Suggestion: Would it be possible to maintain on the first page of this tread a list of all the AMD graphics cards with the optimum settings for cgminer?

It's kind of cumbersome to keep reading through all those pages (202) at the moment - so it would be nice to have this information in one central place.

Diapolo

hero member

Activity: 772

Merit: 500

Con, maybe we can talk about integrating global offset parameter support into CGMINER?

Take a short look at http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueNDRangeKernel.html and the global_work_offset parameter. All that has to be taken into consideration from the kernel-side is in DiaKGCN. OpenCL 1.1 detection is in your code, too, which is needed, but I can't do the other required changes without a compiler.

In short, the nonce-base is not supplied via the base parameter, if GOFFSET is enabled, but instead via the global_work_offset parameter and used via the global work-item ID in the kernel. This saves a few instructions and can give us a small boost.

Dia

bulanula

hero member

Activity: 518

Merit: 500

Quote from: DeathAndTaxes

I would love to see an expanded version of this that goes down to 100 (or lower) memclock using latest version of cgminer, latest drivers, and SDK 2.1.

Anyone have any ideas how to automate a loop of cgminer.
I would imagine the easiest would be something like.

setup config file w/ shares: "5000", static memclock, static coreclock, worksize defined, vectors defined.

script
a) modifies config file to next parameter set to be tested (i.e. V2, W256, memclock = 180)
b) starts cgminer
c) cgminer stops after 5000 shares (and script detects it)
d) capture and save final avg hashrate, rejects, errors, etc.
e) goto a

I would be happy to run something like this on one of my rigs. Once we get results for one core clock we could run it at another speed.

Between 5870s & 5970s I figure coreclocks of 725 (5970 stock), 800, 825 (5870 stock), 850, 875, 900, 925, 950, 1000 (watercooled

) would be useful.

Doing it manually would be a nightmare but if someone can make some semi-automated tool I would be happy to leave it running on one of my rigs. Once we get data for SDK 2.1 we could compare it to SDK 2.4, 2.5, 2.6. It is possible that some parameters do better on different version of SDK (as an example. maybe SDK 2.1 is best w/ high work, low vectors, low memclock but SDK 2.5 is better at smaller work, high vectors, "normalish" memclock, etc).

Yeah. You can say that again.

I would have done it but sadly my router died on me and I am still trying to fix it ...

IMHO one needs to test just 2.1 against 2.4 and ranges from below 300 for memory and above 900 for core.

DeathAndTaxes

donator

Activity: 1218

Merit: 1080

Gerald Davis

Quote from: Vbs on February 16, 2012, 04:57:59 AM

Gonna link again the graphic I like so much Grin

(from phatk thread)

I would love to see an expanded version of this that goes down to 100 (or lower) memclock using latest version of cgminer, latest drivers, and SDK 2.1.

Anyone have any ideas how to automate a loop of cgminer.
I would imagine the easiest would be something like.

setup config file w/ shares: "5000", static memclock, static coreclock, worksize defined, vectors defined.

script
a) modifies config file to next parameter set to be tested (i.e. V2, W256, memclock = 180)
b) starts cgminer
c) cgminer stops after 5000 shares (and script detects it)
d) capture and save final avg hashrate, rejects, errors, etc.
e) goto a

I would be happy to run something like this on one of my rigs. Once we get the complete result set for one core clock we could run it at another core speed. Between 5870s & 5970s I figure coreclocks of 725 (5970 stock), 800, 825 (5870 stock), 850, 875, 900, 925, 950, 1000 (watercooled

) would be useful.

Doing it manually would be a nightmare but if someone can make some semi-automated tool I would be happy to leave it running on one of my rigs. Once we get data for SDK 2.1 we could compare it to SDK 2.4, 2.5, 2.6. It is possible that some parameters do better on different versions of SDK (as an example maybe SDK 2.1 is best w/ high work, low vectors, low memclock but SDK 2.5 is better at smaller work, high vectors, "normalish" memclock, etc).

Diapolo

hero member

Activity: 772

Merit: 500

Quote from: -ck on February 16, 2012, 06:29:52 AM

Quote from: Diapolo on February 16, 2012, 06:26:23 AM

Quote from: -ck on February 16, 2012, 06:20:01 AM

You are but 1 Mhash off my current poclbm kernel with that Wink

Diakgcn 716.5 versus ck-poclbm 717.5

Sounds not too bad Cheesy

. I will try -w 128 and compare results on my machine ... will report back, which one is faster for me at default clocks and on Win7 x64. I'm happy now!

There is something unusual about it running diakgcn and the hashrate appears to be more unstable, rising and falling more so it takes a while to get a reasonable grasp for what the hashrate really is. Since you're running 2 vectors, it effectively makes the hashrate update half as often as running my poclbm kernel since I use no vectors. Either way, the hashrates are really close.

I still have some problems with that Git-stuff, but it's such a great tool, wow ... it would be nice to default diakgcn to -v 2 and it seems -w 256 is better for me by ~0,5 MH/s Cheesy

but I can live with the default being 128 ^^.

-ck

legendary

Activity: 4088

Merit: 1631

Ruu \o/

Quote from: Diapolo on February 16, 2012, 06:26:23 AM

Quote from: -ck on February 16, 2012, 06:20:01 AM

You are but 1 Mhash off my current poclbm kernel with that Wink

Diakgcn 716.5 versus ck-poclbm 717.5

Sounds not too bad Cheesy

. I will try -w 128 and compare results on my machine ... will report back, which one is faster for me at default clocks and on Win7 x64. I'm happy now!

There is something unusual about it running diakgcn and the hashrate appears to be more unstable, rising and falling more so it takes a while to get a reasonable grasp for what the hashrate really is. Since you're running 2 vectors, it effectively makes the hashrate update half as often as running my poclbm kernel since I use no vectors. Either way, the hashrates are really close.

Diapolo

hero member

Activity: 772

Merit: 500

Quote from: -ck on February 16, 2012, 06:20:01 AM

Quote from: Diapolo on February 16, 2012, 05:52:38 AM

Quote from: -ck on February 16, 2012, 05:28:08 AM

Quote from: Diapolo on February 16, 2012, 05:27:15 AM

Quote from: -ck on February 16, 2012, 01:51:19 AM

With Diapolo's help we finally got the diakgcn kernel working on cgminer. I've just committed code to the git tree which makes it work. Alas at the same engine and clock speeds on the 7970, diakgcn gives me 699 MHash while my customised kernel gives me 717 MHash. But now that it's working, he may be able to tweak it further...

Did you use -v 2 with DiaKGCN for your test? I'm not sure why, but it has always been faster on Phoenix to use it with -v 2.

Dia

Tried it, much slower.

Did you use the code from the latest commits, here on Windows it's definitely faster with -v 2.

-I 9 -k diakgcn -d 0 -v 1 -w 256: ~524 MH/s
-I 9 -k diakgcn -d 0 -v 2 -w 256: ~539 MH/s

Indeed it is faster now, and with -v 2, it is fastest allow cgminer to choose worksize which is ends up being 128 - it queries the "preferred worksize" and divides that by number of vectors.

You are but 1 Mhash off my current poclbm kernel with that Wink

Diakgcn 716.5 versus ck-poclbm 717.5

Sounds not too bad Cheesy

. I will try -w 128 and compare results on my machine ... will report back, which one is faster for me at default clocks and on Win7 x64. I'm happy now!

Dia

-ck

legendary

Activity: 4088

Merit: 1631

Ruu \o/

Quote from: Diapolo on February 16, 2012, 05:52:38 AM

Quote from: -ck on February 16, 2012, 05:28:08 AM

Quote from: Diapolo on February 16, 2012, 05:27:15 AM

Quote from: -ck on February 16, 2012, 01:51:19 AM

With Diapolo's help we finally got the diakgcn kernel working on cgminer. I've just committed code to the git tree which makes it work. Alas at the same engine and clock speeds on the 7970, diakgcn gives me 699 MHash while my customised kernel gives me 717 MHash. But now that it's working, he may be able to tweak it further...

Did you use -v 2 with DiaKGCN for your test? I'm not sure why, but it has always been faster on Phoenix to use it with -v 2.

Dia

Tried it, much slower.

Did you use the code from the latest commits, here on Windows it's definitely faster with -v 2.

-I 9 -k diakgcn -d 0 -v 1 -w 256: ~524 MH/s
-I 9 -k diakgcn -d 0 -v 2 -w 256: ~539 MH/s

Indeed it is faster now, and with -v 2, it is fastest allow cgminer to choose worksize which is ends up being 128 - it queries the "preferred worksize" and divides that by number of vectors.

You are but 1 Mhash off my current poclbm kernel with that Wink

Diakgcn 716.5 versus ck-poclbm 717.5

-ck

legendary

Activity: 4088

Merit: 1631

Ruu \o/

Quote from: Diapolo on February 16, 2012, 05:55:46 AM

Quote from: -ck on February 16, 2012, 05:28:47 AM

Quote from: Diapolo on February 16, 2012, 05:21:04 AM

Thanks for your help with integrating DiaKGCN Con!

And thank you for the code. Now what advantage is there to reordering the variables passed to the kernel? In commit d86a38d1e75090e1ffb9df9e68aa13b1c8dcf9ec you shuffled arguments which appear to be mostly cosmetic. Would that be right?

That's true for the most part when reordering kernel-arguments to be in line with usage of them in the code, but I think it's part of a clean code, don't you think so?

So you created a new DiaKGCN branch, will this last so I can switch my local repo to that one

?

Dia

Yes, please do, and if things go well, we can have a new default kernel for GCN Wink

Diapolo

hero member

Activity: 772

Merit: 500

Quote from: -ck on February 16, 2012, 05:28:47 AM

Quote from: Diapolo on February 16, 2012, 05:21:04 AM

Thanks for your help with integrating DiaKGCN Con!

And thank you for the code. Now what advantage is there to reordering the variables passed to the kernel? In commit d86a38d1e75090e1ffb9df9e68aa13b1c8dcf9ec you shuffled arguments which appear to be mostly cosmetic. Would that be right?

That's true for the most part when reordering kernel-arguments to be in line with usage of them in the code, but I think it's part of a clean code, don't you think so?

So you created a new DiaKGCN branch, will this last so I can switch my local repo to that one

?

Dia

Diapolo

hero member

Activity: 772

Merit: 500

Quote from: -ck on February 16, 2012, 05:28:08 AM

Quote from: Diapolo on February 16, 2012, 05:27:15 AM

Quote from: -ck on February 16, 2012, 01:51:19 AM

With Diapolo's help we finally got the diakgcn kernel working on cgminer. I've just committed code to the git tree which makes it work. Alas at the same engine and clock speeds on the 7970, diakgcn gives me 699 MHash while my customised kernel gives me 717 MHash. But now that it's working, he may be able to tweak it further...

Did you use -v 2 with DiaKGCN for your test? I'm not sure why, but it has always been faster on Phoenix to use it with -v 2.

Dia

Tried it, much slower.

Did you use the code from the latest commits, here on Windows it's definitely faster with -v 2.

-I 9 -k diakgcn -d 0 -v 1 -w 256: ~524 MH/s
-I 9 -k diakgcn -d 0 -v 2 -w 256: ~539 MH/s

Dia

-ck

legendary

Activity: 4088

Merit: 1631

Ruu \o/

Quote from: Diapolo on February 16, 2012, 05:21:04 AM

Thanks for your help with integrating DiaKGCN Con!

And thank you for the code. Now what advantage is there to reordering the variables passed to the kernel? In commit d86a38d1e75090e1ffb9df9e68aa13b1c8dcf9ec you shuffled arguments which appear to be mostly cosmetic. Would that be right?

-ck

legendary

Activity: 4088

Merit: 1631

Ruu \o/

Quote from: Diapolo on February 16, 2012, 05:27:15 AM

Quote from: -ck on February 16, 2012, 01:51:19 AM

With Diapolo's help we finally got the diakgcn kernel working on cgminer. I've just committed code to the git tree which makes it work. Alas at the same engine and clock speeds on the 7970, diakgcn gives me 699 MHash while my customised kernel gives me 717 MHash. But now that it's working, he may be able to tweak it further...

Did you use -v 2 with DiaKGCN for your test? I'm not sure why, but it has always been faster on Phoenix to use it with -v 2.

Dia

Tried it, much slower.

Topic: OFFICIAL CGMINER mining software thread for linux/win/osx/mips/arm/r-pi 4.11.0 - page 648. (Read 5806103 times)