Author

Topic: [ANN] cudaMiner & ccMiner CUDA based mining applications [Windows/Linux/MacOSX] - page 724. (Read 3426922 times)

newbie
Activity: 34
Merit: 0
I can't seem to compile the latest commit for blake and I'm not familiar with how to handle nvcc errors like the following:

Code:
nvcc -g -O2 -Xptxas "-abi=no -v" -arch=compute_10 --maxrregcount=64 --ptxas-options=-v  -o blake.o -c blake.cu

### Assertion failure at line 1917 of ../../be/cg/cgemit.cxx:
### Compiler Error in file /tmp/tmpxft_000010b1_00000000-9_blake.cpp3.i during Assembly phase:
### operand 0 is not a register
nvopencc INTERNAL ERROR: /usr/lib/nvidia-cuda-toolkit/lib//be returned non-zero status 1
hero member
Activity: 812
Merit: 1000
but how much of them can support the cpu itself ?

well using -H 2 the cpu load should be tiny, at 10% per card you could manage 9
I was not really thinking in terms of cpu load but rather of an intrinsic limitation (like the number of sata 3 port)

I'm not sure, depends on the driver and board bios as well as the electrical connecions on each slot.
legendary
Activity: 1400
Merit: 1050
but how much of them can support the cpu itself ?

well using -H 2 the cpu load should be tiny, at 10% per card you could manage 9
I was not really thinking in terms of cpu load but rather of an intrinsic limitation (like the number of sata 3 port)
sr. member
Activity: 350
Merit: 250
but how much of them can support the cpu itself ?

well using -H 2 the cpu load should be tiny, at 10% per card you could manage 9
legendary
Activity: 1400
Merit: 1050
but how much of them can support the cpu itself ?
hero member
Activity: 812
Merit: 1000
You can get a 4x multiplier card here:

http://www.aliexpress.com/item/PCI-e-PCIe-Express1X-to-4port-1X-multiplier-switch-riser-card-adapter-cable-for-diy-bitcoin/1444567810.html

The only manufacturer that does that. Probably the guy from GPU coin is having some kind of deal with this manufacturer to get 8x multiplier.
I think that 4x is enough, it will save you from investing in a lot of computers.

As of now it is the single manufacturer in China that offers such things.

Good luck in your mining adventures !

Yeah, really expensive...if that guy can get em for half the price, it might be worth it.

Nvidia driver supports up to 16 gpus per rig AFAIK.
full member
Activity: 210
Merit: 100
You can get a 4x multiplier card here:

http://www.aliexpress.com/item/PCI-e-PCIe-Express1X-to-4port-1X-multiplier-switch-riser-card-adapter-cable-for-diy-bitcoin/1444567810.html

The only manufacturer that does that. Probably the guy from GPU coin is having some kind of deal with this manufacturer to get 8x multiplier.
I think that 4x is enough, it will save you from investing in a lot of computers.

As of now it is the single manufacturer in China that offers such things.

Good luck in your mining adventures !
full member
Activity: 168
Merit: 100
I discover in my bios, that it was possible the change the latency of the pci-e.
It is set by default at 32 something, but it can be increased.
Would there be an interest in increasing (or reducing) the latency when using risers or several pci cards ?

probably not unless youre running 1x
member
Activity: 70
Merit: 10
Been away from the thread for a while. What's the average hashrate for scrypt on a 750ti atm? I'm getting ~300 on mine.

Same.

300 is pretty good, Some are seeing as high as 340 with heavy OC and as low as 280.
full member
Activity: 140
Merit: 100
Been away from the thread for a while. What's the average hashrate for scrypt on a 750ti atm? I'm getting ~300 on mine.

Same.
newbie
Activity: 27
Merit: 0
Been away from the thread for a while. What's the average hashrate for scrypt on a 750ti atm? I'm getting ~300 on mine.
member
Activity: 70
Merit: 10
My Gainward 750 Ti arrived today. My mining rig has now GPUs from 3 different vendors and it is working under Win 8 x64 Smiley
I have the onboard Intel 4600 (with connected display), a AMD Radeon 6950 1 GB for mining (395 kh/s stock clock, scrypt) and the new NVIDIA 750 Ti (290 kh/s stock clock, scrypt).
Driver installation was straight forward. Only downside to not having a display connected to either card, I can't use CCC or Nvidias control panel. If I would want to overclock I have to first connect a display.
You can do it with either MSI AFterburner, EVGA Precision and Nvidia Inspector (but yes the nvidia control panel does not open).
I am thinking also about adding a 750ti (my psu should still have 50watt to spare even with the two big cards)

Have you tried setting up a dummy plug for this? You just need a DVI-VGA adapter and 3x50-150 ohm resistors to create one. Instructions are literally all over the interwebz.
legendary
Activity: 1400
Merit: 1050
I discover in my bios, that it was possible the change the latency of the pci-e.
It is set by default at 32 something, but it can be increased.
Would there be an interest in increasing (or reducing) the latency when using risers or several pci cards ?
legendary
Activity: 1400
Merit: 1050
My Gainward 750 Ti arrived today. My mining rig has now GPUs from 3 different vendors and it is working under Win 8 x64 Smiley
I have the onboard Intel 4600 (with connected display), a AMD Radeon 6950 1 GB for mining (395 kh/s stock clock, scrypt) and the new NVIDIA 750 Ti (290 kh/s stock clock, scrypt).
Driver installation was straight forward. Only downside to not having a display connected to either card, I can't use CCC or Nvidias control panel. If I would want to overclock I have to first connect a display.
You can do it with either MSI AFterburner, EVGA Precision and Nvidia Inspector (but yes the nvidia control panel does not open).
I am thinking also about adding a 750ti (my psu should still have 50watt to spare even with the two big cards)
member
Activity: 106
Merit: 10
My Gainward 750 Ti arrived today. My mining rig has now GPUs from 3 different vendors and it is working under Win 8 x64 Smiley
I have the onboard Intel 4600 (with connected display), a AMD Radeon 6950 1 GB for mining (395 kh/s stock clock, scrypt) and the new NVIDIA 750 Ti (290 kh/s stock clock, scrypt).
Driver installation was straight forward. Only downside to not having a display connected to either card, I can't use CCC or Nvidias control panel. If I would want to overclock I have to first connect a display.
hero member
Activity: 644
Merit: 500
Will the CUDA 6 release have any impact on mining?

so far it's a negative impact. So CUDA 6 RC is a no-go. There are serious
performance regressions for some kernels, and no additional gain for code
compiled specifically for the 750 Ti (Maxwell).

Christian


Isn't CUDA 6 about unified memory, ie host and device can share literally the same variable in the code? If they do, then there would be constant memcpy going on, which would slow down anything pre-Maxwell as Maxwell is the first to have hardware support for that.

Though I could be way off. Been a while since I've used CUDA.
hero member
Activity: 756
Merit: 502
ok, I have recompiled, this time, I don't get that many share. The only one I got (in 3 runs) was rejected.

I recompile again, just to make sure everything was ok at that level, but it really take a long take to get a share...

ok, I guess I will have to start some pool mining myself and do some more validation.  but maybe not tonight Wink

it's bed time.

Christian
hero member
Activity: 756
Merit: 502

interesting: a CPU miner with all the algorithms the author could find: https://github.com/KaSt/AllMiner

This could serve as a reference for future cudaminer improvements, maybe.

Christian
legendary
Activity: 1400
Merit: 1050

I just tried on one pool Blakecoin EU1 pool, I get only boo's.
the hashrate would be max 130Mhash, I just checked with the big amd electron's sucker (which doesn't seem to use that much on it),
 ouch 3.25GHash/s

this is the code from the blakecoin CPU miner... so the line on the bottom applies to Blake256.
I fixed this in cudaminer now.

Code:
if (opt_algo == ALGO_SCRYPT)
diff_to_target(work->target, sctx->job.diff / 65536.0);
else
diff_to_target(work->target, sctx->job.diff);

About the hashing speed: This is a straight port of the CPU code running mostly unoptimized in CUDA.  Bad speed is kind of
expected, although 50 MHash/s is a start...

If hashes are still getting rejected now, there must be another reason (maybe endianness problems....)

Christian


ok, I have recompiled, this time, I don't get that many share. The only one I got (in 3 runs) was rejected.

I recompile again, just to make sure everything was ok at that level, but it really take a long take to get a share...
hero member
Activity: 756
Merit: 502

I just tried on one pool Blakecoin EU1 pool, I get only boo's.
the hashrate would be max 130Mhash, I just checked with the big amd electron's sucker (which doesn't seem to use that much on it),
 ouch 3.25GHash/s

this is the code from the blakecoin CPU miner... so the line on the bottom applies to Blake256.
I fixed this in cudaminer now.

Code:
if (opt_algo == ALGO_SCRYPT)
diff_to_target(work->target, sctx->job.diff / 65536.0);
else
diff_to_target(work->target, sctx->job.diff);

About the hashing speed: This is a straight port of the CPU code running mostly unoptimized in CUDA.  Bad speed is kind of
expected, although 50 MHash/s is a start...

If hashes are still getting rejected now, there must be another reason (maybe endianness problems....)

Christian

Jump to: