[ANN] cudaMiner & ccMiner CUDA based mining applications [Windows/Linux/MacOSX] - page 1101.

mhps

hero member

Activity: 516

Merit: 500

CAT.EX Exchange

Quote from: 420 on May 03, 2013, 02:45:18 AM

Quote from: nst6563 on April 20, 2013, 08:50:19 PM

Quote from: 420 on April 20, 2013, 08:27:34 PM

What speed should I get with a GTX 460M

or a GTX 680

check here
https://docs.google.com/spreadsheet/ccc?key=0AjMqJzI7_dCvdG9fZFN1Vjd0WkFOZmtlejltd0JXbmc#gid=1

that should give you an idea

yay 185khash with 680 not overclocked.

now how do I get the miner to connect to COINOTRON?

0.02BTC reward!

Hi this is how I do -- just tried and it works

download http://www.wemineltc.com/files/stratum_proxy.exe
Put it in your cudaminer directory.
Note that there are different versions on the net.

Run this to start proxy

Code:

stratum_proxy.exe -o coinotron.com -p 3334 -pa scrypt -nm

Run this to mine

Code:

cudaminer -o http://127.0.0.1:8332 -O username:passwd -d 1 -l 112x8 -i 0 -C 2 --no-longpoll

You should change the "-d 1 -l 112x8 -i 0 -C 2 " to suit your card. Use -D to find out the best -l parameters.

If it works 1HxvAEC4nj37hyenz9DwsyrMMnkwzpbzno

dell00

newbie

Activity: 28

Merit: 0

cannot download the software. Anyone can give an aternative downloading site?

420

hero member

Activity: 756

Merit: 500

Quote from: tacotime on May 03, 2013, 02:50:24 AM

Quote from: 420 on May 03, 2013, 02:45:18 AM

yay 185khash with 680 not overclocked.

now how do I get the miner to connect to COINOTRON?

0.02BTC reward!

get mining_proxy.exe from ltcmine.ru
mining_proxy.exe -pa scrypt -o coinotron.com -p 3334
connect cudaminer to localhost port 8332, use normal miner password

HTTP request failed: Failed connect to localhost:8332
json_rpc_call failed

my file:
cudaminer.exe -o localhost:8322 -O user:pass

PROXY:
mining_proxy.exe -pa scrypt -o coinotron.com -p 3334

tacotime

legendary

Activity: 1484

Merit: 1005

Quote from: 420 on May 03, 2013, 02:45:18 AM

Quote from: nst6563 on April 20, 2013, 08:50:19 PM

Quote from: 420 on April 20, 2013, 08:27:34 PM

What speed should I get with a GTX 460M

or a GTX 680

check here
https://docs.google.com/spreadsheet/ccc?key=0AjMqJzI7_dCvdG9fZFN1Vjd0WkFOZmtlejltd0JXbmc#gid=1

that should give you an idea

yay 185khash with 680 not overclocked.

now how do I get the miner to connect to COINOTRON?

0.02BTC reward!

get mining_proxy.exe from ltcmine.ru
mining_proxy.exe -pa scrypt -o coinotron.com -p 3334
connect cudaminer to localhost port 8332, use normal miner password

420

hero member

Activity: 756

Merit: 500

Quote from: nst6563 on April 20, 2013, 08:50:19 PM

Quote from: 420 on April 20, 2013, 08:27:34 PM

What speed should I get with a GTX 460M

or a GTX 680

check here
https://docs.google.com/spreadsheet/ccc?key=0AjMqJzI7_dCvdG9fZFN1Vjd0WkFOZmtlejltd0JXbmc#gid=1

that should give you an idea

yay 185khash with 680 not overclocked.

now how do I get the miner to connect to COINOTRON?

0.02BTC reward!

mhps

hero member

Activity: 516

Merit: 500

CAT.EX Exchange

Quote from: mhps on May 01, 2013, 02:13:11 AM

So my question is disabling longpoll a recommended practice where it is an option?

From here it does seem longpolling is an extra protocol that doesn't bring any benefit for using local stratum proxy: https://mining.bitcoin.cz/stratum-mining

TiWu

newbie

Activity: 25

Merit: 0

Quote from: statdude on May 02, 2013, 03:46:12 PM

Help, how can I slow this down on my GTX 570 - it's running over 90 degrees...

My command line:

cudaminer -o http://ltc.crypdough.com:9332/ -u user -p pass -i 0

I use MSI afterburner on my (non-MSI) GTX 570. It allows you to redraw the automatic slope for fan speed.
You can find it here: http://event.msi.com/vga/afterburner/

Trough settings-> fan you can see the graph and adapt it.
I've set 4 point for my card:

30% at 30°
55% at 60°
85% at 80°
100% at 100° (but the Club3D GTX570 I have only goes up to 85%)

I set the fan update speed to 5000ms and a temp hysteresis of 1° to avoid constant 'flapping' of the fan speed.

This is running 24/7 at around 70° at 70% fan, with these options in cudaminer: -i 0 -C 2 -l 30x8, giving me around 226 kh/s.
Hope this helps!

edit: BTW, I run this GPU dedicated for mining, my desktop is using the internal graphics of my i5. If you are using your desktop on the card, I suggest using the -i 1 option.

cbuchner1

hero member

Activity: 756

Merit: 502

Quote from: statdude on May 02, 2013, 03:46:12 PM

Help, how can I slow this down on my GTX 570 - it's running over 90 degrees...

For one, -i 1 helps a little (you lose some 25% of hashing speed on fast cards)

Or use a tool called nvidia-inspector

-Set the fan to Manual and higher Speeds
-clock down the card's core speed and possibly undervolt a bit.

statdude

legendary

Activity: 1498

Merit: 1000

Help, how can I slow this down on my GTX 570 - it's running over 90 degrees...

My command line:

cudaminer -o http://ltc.crypdough.com:9332/ -u user -p pass -i 0

cbuchner1

hero member

Activity: 756

Merit: 502

Quote from: narayan on May 02, 2013, 11:45:20 AM

Anyone had any luck with the GTX 260, I had an old one lying around which I chucked in a test machine but can't get more than 20kh out of it.

I get 44 kHash with Windows 7, cudaminer Version older than 2013-04-30 using

-i0 -l S27x3

It should work a bit faster on Windows XP or Linux

Christian

Stoneysilence

member

Activity: 104

Merit: 10

Quote from: narayan on May 02, 2013, 11:45:20 AM

Anyone had any luck with the GTX 260, I had an old one lying around which I chucked in a test machine but can't get more than 20kh out of it.

Looking at the stats for other cards I am guessing 20kh is probably about right. My 460M in my laptop is about 50kh, in the google doc from page 1 there is a 210 and it gets about 6kh.

narayan

member

Activity: 98

Merit: 10

I do not sell Bitcoins. I sell SHA256(SHA256()).

Anyone had any luck with the GTX 260, I had an old one lying around which I chucked in a test machine but can't get more than 20kh out of it.

cbuchner1

hero member

Activity: 756

Merit: 502

I am currently a bit concerned because large block configurations like 56x8 or 64x4 cause results to not validate on both of my 560Ti cards. Even with texture cache turned off. I wonder what I broke and when that happened. UPDATE: okay, this must be the memory requirement - my 560 cards aren't blessed with 3Gigs of VRAM.

CumpsD

sr. member

Activity: 476

Merit: 250

Quote from: cbuchner1 on May 02, 2013, 06:45:29 AM

When using the autotuned setting from non-interactive mode in interactive mode, you get higher hash rates. The CPU will still pause inbetween, but the computation takes longer. So the ratio of computation vs. pause is higher -> more hashes/sec at increased lag.

Makes sense. What I tried was the other way around actually. Using the autotuned setting from interactive in non-interactive. (Just for testing, now I've moved on to the more correct settings)

cbuchner1

hero member

Activity: 756

Merit: 502

When using the autotuned setting from non-interactive mode in interactive mode, you get higher hash rates. The CPU will still pause inbetween, but the computation takes longer. So the ratio of computation vs. pause is higher -> more hashes/sec at increased lag.

CumpsD

sr. member

Activity: 476

Merit: 250

That explains the difference

Using the interactive auto-tuned setting with not-interactive (so without letting it re-tune) had a nice effect of increased rates, without much lag.

I guess it choose a small kernel which completes fast, but then doesn't pause the CPU.

cbuchner1

hero member

Activity: 756

Merit: 502

Quote from: CumpsD on May 02, 2013, 05:35:58 AM

Big difference in kernel selection between interactive and non-interactive (S-kernel)

interactive picks smaller kernels that complete within 1/30th of a second. And then the CPU pauses for a millisecond (actually twice per hash computation, once inbetween the A and B kernels and once after the B kernel), giving you about 60 frames per second for doing other stuff, like watching movies.

CumpsD

sr. member

Activity: 476

Merit: 250

I've added a lot of results for a EVGA GTX 680 to the Google Docs.

Big difference in kernel selection between interactive and non-interactive (S-kernel)

Thanks for this miner!

Amph

legendary

Activity: 3248

Merit: 1070

Quote from: termhn on May 01, 2013, 05:29:11 PM

Quote from: Amph on May 01, 2013, 04:27:18 PM

it's strange, the program start only with bitcoin, but not with litecoin...
with litecoin it give me an error

This is meant only to work with litecoin not bitcoin

yeah i know, this is why it's strange, maybe can be that i connect to stratum, using p2pool?

Quote from: Lacan82 on May 01, 2013, 04:54:03 PM

Quote from: Amph on May 01, 2013, 04:27:18 PM

it's strange, the program start only with bitcoin, but not with litecoin...
with litecoin it give me an error

What command are you trying to execute? are drivers up to date?

i'm using p2pool,
i run litecoin client, then p2pool with "--net litecoin", and then cudaminer, but it say fail to execute or something, and then restart in 15 sec

KnowBuddy

member

Activity: 69

Merit: 10

Quote from: SubNoize on May 01, 2013, 07:56:15 PM

Quote from: shatter on May 01, 2013, 06:31:07 AM

This is written in the Readme, hope it helps:

Currently there is just one prefix, which is "S". Later releases may
see the introduction of more kernel variants with using other letters.

Examples:

e.g. S27x3 is a launch configuration that works well on GTX 260
   28x4 is a launch configuration that works on Geforce GTX 460
   290x2 is a launch configuration that works on Geforce GTX 660Ti

You should wait through autotune to see what kernel is found best for
your current hardware configuration.

The choice between Non-Titan and Titan CUDA kernels is automatically
made based on your device's compute capability. Titans cost around
a thousand dollars, so you probably don't have one.

Prefix | Non-Titan | Titan
-------------------------------------------------------
| low shared memory | default kernel
   | optimized kernel | with funnel shifter
   | |
   S | spinlock kernel | spinlock kernel
   | for Kepler GPUs | with funnel shifter

Can anyone explain like i'm 5? If my card used to autotune to the default kernal and is now auto tuning to the spinlock kernel will it actually make a difference?

Here's a quote from Christian which goes into more detail about the -l flag format that helped me understand it much better than the README:

Quote from: cbuchner1 on April 14, 2013, 06:30:15 AM

Quote from: cancrena on April 14, 2013, 04:02:52 AM

Quote from: SubNoize on April 14, 2013, 03:03:16 AM

Can someone explain this "64x2" "S27x3,28x4" thing to me or point me in the right direction on reading up on it??

I have a 580gtx and i'm trying to figure out the best set up

well i did not figure out the meaning, but if you run it and let it autotune, it will choose automatically what's the best one (then you can add the flag in the batch file, like -l 112x2 for me)

112x2 means it throws 112 blocks at CUDA, and each consists of 2 warps. A warp is a group of 32 threads.

So in total it computes 112*2*32 = 7168 hashes in parallel in a single CUDA kernel launch.

And because the scrypt scratchpad is 131072 bytes long, this would consume 7168*131072 bytes of memory
on the card. That's about 917 MB.

So, as I understand it...

The kernel selection options (by autotune or -l flag) prior to the 2013-04-30 release were of the format:

"S" OR ""(no value)] - where S is optimized for older devices (compute capability < 2.0?) and no value is for all other devices
#b - # of blocks
x
#w - # of warps (groups of 32 threads)

And the kernel selection options (by autotune or -l flag) for the 2013-04-30 release onward are of the format:

"S" OR ""(no value)] - where S is optimized for Kepler devices (some GTX 6xx GPUs) and no value is for all other devices
#b - # of blocks
x
#w - # of warps (groups of 32 threads)

An important consideration is that #b x #w x 32 x 131072 should be less than the RAM (in bytes) on the card in question.

I'm not really sure how to break it down to explain-like-I'm-five levels without resorting to some MS paint-esque diagramming, which probably would do more harm than good.

And the answer to your question is... maybe. I have a Kepler mobile card but I saw equivalent performance when switching on the S flag at the same blocks and warps, and texture caching settings. I haven't had a chance to run any extensive testing or autotuning yet though.

Topic: [ANN] cudaMiner & ccMiner CUDA based mining applications [Windows/Linux/MacOSX] - page 1101. (Read 3426918 times)