Author

Topic: [ANN] cudaMiner & ccMiner CUDA based mining applications [Windows/Linux/MacOSX] - page 1065. (Read 3426921 times)

newbie
Activity: 28
Merit: 0
I'm quite aware of the normal autotuning, what I want to try is force it to autotune with another kernel and see if there could be any gains like you found, albeit unstable.
Problem is, with capital letters it's just "unknown", with non-capital the program just crashes, even with "f", which is the correct one for my card..
So it does pick a configuration when you use (capital) F, but you don't know which? If so, use the -D flag for debugging mode. It will give you more information about the auto-tuning process.
member
Activity: 98
Merit: 10
Are you starting from a config that autotune is detecting? Try whatever it finds, then work from that. If just the -l F#x# that autotune chooses doesn't work (and you're using a lowercase L, and you have an updated cudaminer version), you'll need help from cbuchner.
newbie
Activity: 43
Merit: 0
I'm quite aware of the normal autotuning, what I want to try is force it to autotune with another kernel and see if there could be any gains like you found, albeit unstable.
Problem is, with capital letters it's just "unknown", with non-capital the program just crashes, even with "f", which is the correct one for my card..
member
Activity: 98
Merit: 10
How do you make it autotune for an specific generation btw? Would like to experiment like blackraven here above. Am I just missing something very obvious here or is it just broken? All kinds of combinations I can think of end up in either crashes, even when using the correct Fermi config (-l f?) which is my card, or just "unknow operator"

Skimming this thread earlier, there's a lot of mentions of
-l F
-l K
for autotune with a selected kernel, leaving out the specifics (not sure what the #x# represents). Maybe you need a capital F. If it doesn't work, just try -l auto and it'll choose a kernel for you. It says which one it chose in the command prompt as a full descriptor like
K98x2
. You might be trying to use a kernel that's not supported on your card, and using autotune will give you a starting point.
newbie
Activity: 43
Merit: 0
How do you make it autotune for an specific generation btw? Would like to experiment like blackraven here above. Am I just missing something very obvious here or is it just broken? All kinds of combinations I can think of end up in either crashes, even when using the correct Fermi config (-l f?) which is my card, or just "unknow operator"
member
Activity: 98
Merit: 10
I ran into a weird issue trying the Titan kernel on my GTX670. It was reporting almost 400khash, but every share was being invalidated by the CPU, besides otherwise acting fairly normally. Maybe there's an improvement to be made somewhere that could let Kepler achieve these kinds of khash rates.

For anyone looking for a config for their GTX670, I'm at 180 khash with these settings. If someone has something better, I'm all ears.
-i 1 -l K98x2 -C 2 -m 1
newbie
Activity: 8
Merit: 0
nice, i was at 1280 mhz core 3005 mem with config k94x2 on 182 k/hs. With the new build core 1280 kh/s and mem 3150 i now get 190 kh/s.
Cant seem to push it more, the driver crashes. Only at 69% TDP though and just 60 degrees temp.
Got a GTX-760. No nightly drops btw.

I usually run 1280 for games but I decided to push it further. 1306 typically crashes on games pretty quickly. I've got the MSI version and it's my first piece of hardware from that company. I'm really happy with this card. I don't run heat in my apartment so with inaudible fan < 50%, I'm getting around 55C. If I run 100% fan I'm seeing mid 40C. Ambient is probably 20C or less.  Grin My last card was water cooled but that turned out to be a huge hassle so I went with an aftermarket-OEM-supplied air cooler instead.

I would really prefer to run on Fedora 20 instead of Windows 7 which is where I do most of my multi media from my PC. Anyone been able to compile cudaminer for use on Fedora 20?
full member
Activity: 182
Merit: 100
Anyone using a GTX 760? I've got flags -i 0 -l k96x2 and I've managed to hit 200Khash/s give or take 2. Much better than 100 when I first started a few days ago. Any other configs with better results?

Also, is it normal for the Khash/s rate to vary? I'm hitting 201 while I am using my computer, then when I leave it at night I see dips down to 155Khash/s and everywhere in between. What gives?

Overclock is 1306Mhz core, 3375Mhz memory although I can hit 3402 but I saw no performance increase from the 3402Mhz on memory. Does cudaminer benefit from maintaining a specific ratio like the AMD cards do?

nice, i was at 1280 mhz core 3005 mem with config k94x2 on 182 k/hs. With the new build core 1280 kh/s and mem 3150 i now get 190 kh/s.
Cant seem to push it more, the driver crashes. Only at 69% TDP though and just 60 degrees temp.
Got a MSI GTX-760 TF. System watt uses under full gpu load is only 250w, so while not an AMD card in quite in the green.

No nightly drops btw.
member
Activity: 106
Merit: 10
CryptoBeggar
anyone tried this on linux ?
source files arent  compatible with unix format and contain ^M

I am running cudaminer on Arch linux from the AUR repository (cudaminer-git) and it works just fine.

There is a dependency issue with the latest update, though. The newest version apparently depends on CUDA 5.0 instead of 5.5 and 5.0 depends on an older version of gcc than the one used in the Arch repositories. I should note that the package is not broken, just outdated. Hopefully it will be fixed soon.


sweet im on Arch too Smiley
newbie
Activity: 28
Merit: 0
anyone tried this on linux ?
source files arent  compatible with unix format and contain ^M

I am running cudaminer on Arch linux from the AUR repository (cudaminer-git) and it works just fine.

There is a dependency issue with the latest update, though. The newest version apparently depends on CUDA 5.0 instead of 5.5 and 5.0 depends on an older version of gcc than the one used in the Arch repositories. I should note that the package is not broken, just outdated. Hopefully it will be fixed soon.
member
Activity: 106
Merit: 10
CryptoBeggar
anyone tried this on linux ?
source files arent  compatible with unix format and contain ^M
newbie
Activity: 8
Merit: 0
Anyone using a GTX 760? I've got flags -i 0 -l k96x2 and I've managed to hit 200Khash/s give or take 2. Much better than 100 when I first started a few days ago. Any other configs with better results?

Also, is it normal for the Khash/s rate to vary? I'm hitting 201 while I am using my computer, then when I leave it at night I see dips down to 155Khash/s and everywhere in between. What gives?

Overclock is 1306Mhz core, 3375Mhz memory although I can hit 3402 but I saw no performance increase from the 3402Mhz on memory. Does cudaminer benefit from maintaining a specific ratio like the AMD cards do?
member
Activity: 85
Merit: 10
Decided to try auto tune with the new build.

T24x20 GTX 780 438 kH/s
newbie
Activity: 19
Merit: 0
you guys could try building the spinlock kernel for sm_20 to check if you'll be getting your previous speeds back when running it on a Fermi device


Still crashed cicc when sm_20, went ok with sm_13. Testing against 2013-12-01, 2013-12-07 spinlock is about 5 - 6 khash/s faster.

With fermi kernel 2013-12-01 with the added offset is around 233, 2013-12-07 is about 228. I tested 2013-12-07 with both your precompiled exe and self built from source, results are identical.

Id like to see feedback from more fermi owners. In particular compute 2.0 fermis and even more particular 448 core gtx560ti.

Well i have a pallit one anyway. Cheapest i could find but that shouldn't cause my results to be totally different to yours. I don't think its a problem with os or drivers because i dual boot 32 bit debian and get the same results there.
Although i cant really see it, maybe my weird ddr3 + fsb + pcie 1.1 abomination is causing it.
hero member
Activity: 756
Merit: 502
you guys could try building the spinlock kernel for sm_20 to check if you'll be getting your previous speeds back when running it on a Fermi device
newbie
Activity: 19
Merit: 0
I posted a 2013-12-07 version. Most notable difference is that the Fermi kernel now targets the Compute 2.0 architecture explicitly. Expect maybe a very small boost in hash rates. For me it went up from 228->235 with the cudaminer x64 version for my Compute 2.0 card.
I'm using GTX480s and I just tried the latest build:
- previous build with K15x16 - 246 KH/s
- current build with F15x16 - ~220 KH/s
- current build with K15x16 - says it requires 3.0 and doesn't validate

other config: -c 0, -i 0, it is second card, so I use the interactive on the other one.

During the last few days I tried using Fxx*xx on both GTX480 and GTX570 and I always get much lower khs compared to using Kxx*xx

I have also noticed a drop in performance since switching to the ulong2 method in fermi. Most noticable in x64. Also, this maybe of no concern of yours as you don't really support cuda 5.5 but, the spinlock kernel crashes the cuda 5.5 cicc compiler when compiling for compute 3.0.
full member
Activity: 196
Merit: 100
GTX 650Ti
Cudaminer x64 1 december - 91kh
Cudaminer x64 7 december - 94kh
-i 0 -l 56x2 -C 2
Thanx!
But... May be exist more effective parameters?
full member
Activity: 196
Merit: 100
I posted a 2013-12-07 version. Most notable difference is that the Fermi kernel now targets the Compute 2.0 architecture explicitly. Expect maybe a very small boost in hash rates. For me it went up from 228->235 with the cudaminer x64 version for my Compute 2.0 card.
I'm using GTX480s and I just tried the latest build:
- previous build with K15x16 - 246 KH/s
- current build with F15x16 - ~220 KH/s
- current build with K15x16 - says it requires 3.0 and doesn't validate

other config: -c 0, -i 0, it is second card, so I use the interactive on the other one.

During the last few days I tried using Fxx*xx on both GTX480 and GTX570 and I always get much lower khs compared to using Kxx*xx
member
Activity: 85
Merit: 10
My script in the bat file is as follows:

Code:
cudaminer --url=stratum+tcp://litecoinpool.org:3333  -i 0 -l auto --userpass=u:p

Try -l K42x6 instead of -l auto.
hero member
Activity: 756
Merit: 502
I posted a 2013-12-07 version. Most notable difference is that the Fermi kernel now targets the Compute 2.0 architecture explicitly. Expect maybe a very small boost in hash rates. For me it went up from 228->235 with the cudaminer x64 version for my Compute 2.0 card.

Next work item on my TODO list are:

-make cudaminer suitable for >1 MHash/s operations, even with weak single or dual core CPUs
 by doing the required SHA256 hashing (for scrypt) also on the GPU

-finish and operate my 1.5 MHash/s build (3x GTX 780Ti in one mainboard)

Jump to: