Topic: [ANN] cudaMiner & ccMiner CUDA based mining applications [Windows/Linux/MacOSX] - page 758. (Read 3426922 times)

legendary
Activity: 1400
Merit: 1050
I've received a bit of an optimization from nVidia engineers for the t and k kernels used in Yacoin. It will add an extra kHash/s to 780, Titan and 780Ti devices for N-factor 14 coins.  I am going to try this tonight, and maybe push an update if I can confirm the gains.

Seems these guys are doing my job now, lol ;)

Christian

Could this also work for MRC? (Meaning at lower N-factors, in particular 10 and 11, assuming it works at N=14.)
Something strange I noticed: at N=9 the hashrates of the GTX 780 Ti and the R9 were similar, but
at N=10 the R9 does at least 100 khash/s more than the GTX 780 Ti.
(As N increased, I was expecting the GTX 780 Ti to beat the AMD card.)
hero member
Activity: 812
Merit: 1000
I've received a bit of an optimization from nVidia engineers for the t and k kernels used in Yacoin. It will add an extra kHash/s to 780, Titan and 780Ti devices for N-factor 14 coins.  I am going to try this tonight, and maybe push an update if I can confirm the gains.

Seems these guys are doing my job now, lol ;)

Christian


Amazing news! Hopefully that gets me in the 7kh/s range with my 780 :p (one can dream eh)
Wonder what it'll do for Maxwell.
hero member
Activity: 756
Merit: 502
I've received a bit of an optimization from nVidia engineers for the t and k kernels used in Yacoin. It will add an extra kHash/s to 780, Titan and 780Ti devices for N-factor 14 coins.  I am going to try this tonight, and maybe push an update if I can confirm the gains.

Seems these guys are doing my job now, lol ;)

Christian
hero member
Activity: 676
Merit: 500
Try T15x24 and open a google chrome window with some flashgame.

You don't need to run anything other than open chrome.  Any page will do.

This chrome trick is only for linux, yes?
No it is for windows too.
hero member
Activity: 676
Merit: 500
Try T15x24 and open a google chrome window with some flashgame.

If that was for me, I got the cpu does not validate.

On a side note it took about 15 times of restarting the bat file to get the card back to 99% gpu load. Even trying the flash thing did not work.
As I saw, it is a scrypt coin, and with 15x24 I get better results. Remove everything except -H 2 -i 0 -l T15x24. Can you post your actual overclock readings? 290 is good, but for scrypt I get 320 when overclocked.
newbie
Activity: 21
Merit: 0
Try T15x24 and open a google chrome window with some flashgame.

You don't need to run anything other than open chrome.  Any page will do.

This chrome trick is only for linux, yes?
newbie
Activity: 21
Merit: 0
Is anyone running multiple nVidia cards on risers?

I am looking into running 5-6 cards per rig, but I think someone earlier in this thread mentioned that cudaMiner still has some trouble with high traffic, so risers could be a problem. (1x to 16x risers, 5 or 6 per rig)

I noticed a severe slowdown from 340 khash to ~280 khash when going to a 1 to 16 riser. At Christian's suggestion I used the -H 2 switch, which got me back up to the 320 khash level, but nothing I do gets me back to 340. I will be getting 16 to 16 risers in tomorrow for testing this weekend. I'm not sure if anyone has done any serious testing of the 1 to 16 risers with NVidia cards and cudaminer; I will try with the cards I have (750 Ti, 680, 780 Ti), but it will have to wait till this weekend. I will post what I find. If anyone has any special requests, just post 'em here. I also have a motherboard with 4 x1 and 3 x16 slots coming in tomorrow.


I'm running into the same issue.

Setup:
Intel Pentium G3220
Asus Z78-Pro
4GB RAM
1kW Seasonic Modular PSU
7x x1-x16 Riser Assemblies
2x Zotac 750 Ti, Reference
5x ASUS 750 Ti, OC

So far I've only messed with the Zotac reference cards.

I have OC'd CPU+135,VRAM+700; Modified VBIOS to lift TDP limit.

On an x16 slot, 320-330 kH/s (with the -H 2 flag). On an x1-x16 riser assembly, ~280 kH/s (with the -H 1 flag).

Using the -H 2 flag on an x1 riser causes unusually low performance. I believe the traffic needed to offload work between the CPU and the GPU is saturating the x1 link, artificially limiting the card. I suspect that even with -H 2 we're seeing some limitation from either the way cudaminer works or the x1 bandwidth.

My theory is that the only fix for a riser use-case is to utilize x16-x16 riser assemblies. Otherwise, you're throwing away ~50kH/s per card attached to a x1 riser. As an electrical design engineer, this hurts me.
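
A quick back-of-the-envelope check of the figures above, as a minimal Python sketch. The per-card rates (320-330 kH/s on x16, ~280 kH/s on an x1 riser) and the seven-card count come from the post; assuming every card loses as much as the tested Zotac reference card is a simplification, not a measurement.

```python
# Hash rates reported above for one Zotac 750 Ti, in kH/s.
X16_RATE_RANGE = (320, 330)   # direct x16 slot, -H 2
X1_RISER_RATE = 280           # x1-x16 riser, -H 1 (approximate)
CARDS_ON_RISERS = 7           # assumption: all seven cards lose the same amount


def riser_loss(x16_rate, x1_rate, cards):
    """Return (loss per card, loss per rig, loss as a fraction of the x16 rate)."""
    per_card = x16_rate - x1_rate
    return per_card, per_card * cards, per_card / x16_rate


for rate in X16_RATE_RANGE:
    per_card, per_rig, fraction = riser_loss(rate, X1_RISER_RATE, CARDS_ON_RISERS)
    print(f"x16 at {rate} kH/s: {per_card} kH/s lost per card, "
          f"{per_rig} kH/s per rig ({fraction:.1%})")
```

Under those assumptions the x1 risers cost roughly 40-50 kH/s per card, or about 280-350 kH/s across a seven-card rig.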

sr. member
Activity: 791
Merit: 271
This is personal
Try T15x24 and open a google chrome window with some flashgame.

You don't need to run anything other than open chrome.  Any page will do.
legendary
Activity: 1400
Merit: 1000
Try T15x24 and open a google chrome window with some flashgame.

If that was for me, I got the cpu does not validate.

On a side note it took about 15 times of restarting the bat file to get the card back to 99% gpu load. Even trying the flash thing did not work.
hero member
Activity: 676
Merit: 500
Try T15x24 and open a google chrome window with some flashgame.
legendary
Activity: 1400
Merit: 1000
Hey Christian/board

Just wanted to say that I finally got the 750ti mining Black Coin. Only took about 2 days of playing around and googling. But I have it up to 290 kh/s with an overclock of 105 on core and 265 on memory. I still have not unlocked the watts from 38.5 to 60. Running 57° with fan at 65%.
 
Using -d 1 -m 1 -l T5x24 -i 0 -H 2. Again, I did not come up with that, just googled.

Can't wait to see what you can do with yours.
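
For anyone scripting their launch instead of editing a bat file by hand, here is a minimal Python sketch that assembles the settings quoted above into a command line. The flags and values are copied verbatim from the post; the executable name `cudaminer` is an assumption, and the pool URL and credentials are deliberately left out.

```python
import shlex

# Flags and values exactly as quoted in the post above.
SETTINGS = {
    "-d": "1",
    "-m": "1",
    "-l": "T5x24",
    "-i": "0",
    "-H": "2",
}


def build_command(executable, settings):
    """Flatten a flag -> value mapping into an argument list, keeping the order."""
    args = [executable]
    for flag, value in settings.items():
        args += [flag, value]
    return args


# "cudaminer" is a placeholder name; adjust it to the binary you actually run.
command = build_command("cudaminer", SETTINGS)
print(shlex.join(command))
```

Keeping the settings in one mapping makes it easy to try another launch configuration, such as the T15x24 suggested elsewhere in this thread, by changing a single value.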
jr. member
Activity: 48
Merit: 10
Any ideas? 

Yes.  The chipset only supports 4 PCIe 1x slots off the socket - it delegates any more to the northbridge, which may behave differently/slower possibly?

see the chipset diagram http://cdn.overclock.net/f/f3/f30d12f6_AMD_A85X.jpeg
from
http://www.overclock.net/t/1459052/amd-a10-7850-preview-by-flank3r


Makes sense.  I have a Z77 board here I can test out but I don't have a CPU cooler right now. 

I tried pushing the clock a little bit (was set to auto, and it was listed as 1.2, so I assumed 1.2v is the default, pushed to 1.25..). 

Really curious as to why this works with 6 AMD GPUs and not 6 of these NVIDIA cards... :/

So the board I have is A88X, codenamed Bolton-D4.  You linked to the A85X Hudson D4 chipset.  I'm unable to find a similar diagram for the A88X.

http://en.wikipedia.org/wiki/Comparison_of_AMD_chipsets#Fusion_controller_hubs_.28FCH.29
jr. member
Activity: 48
Merit: 10
Any ideas? 

Yes.  The chipset only supports 4 PCIe 1x slots off the socket - it delegates any more to the northbridge, which may behave differently/slower possibly?

see the chipset diagram http://cdn.overclock.net/f/f3/f30d12f6_AMD_A85X.jpeg
from
http://www.overclock.net/t/1459052/amd-a10-7850-preview-by-flank3r


Makes sense.  I have a Z77 board here I can test out but I don't have a CPU cooler right now. 

I tried pushing the clock a little bit (was set to auto, and it was listed as 1.2, so I assumed 1.2v is the default, pushed to 1.25..). 

Really curious as to why this works with 6 AMD GPUs and not 6 of these NVIDIA cards... :/
hero member
Activity: 693
Merit: 500
Any ideas? 

Yes.  The chipset only supports 4 PCIe 1x slots off the socket - it delegates any more to the northbridge, which may behave differently/slower possibly?

see the chipset diagram http://cdn.overclock.net/f/f3/f30d12f6_AMD_A85X.jpeg
from
http://www.overclock.net/t/1459052/amd-a10-7850-preview-by-flank3r
sr. member
Activity: 280
Merit: 250
Tried bumping up the northbridge/chipset voltage ?  If the mobo supports it that is..
jr. member
Activity: 48
Merit: 10
Still doesn't explain why the first 4 cards are legit, the 5th is half ass, and the 6th is non existent.  They are all recognized, powered on, and spinning.  I'm writing something to see if I can send some work to the 6th card via cuda.

yeah, the RF engineer in me is clueless about that. :-)

oh wait... idea: the 5th and 6th PCIe slots are farthest away from the northbridge/CPU though. Now add the USB cable length...

I studied communications engineering, but I mainly focus on protocols, software stacks, simulation of radio systems. My HF/RF knowledge is pretty rusty :-)

The troubleshooter in me says to take out the middle two cards (middle PCIe slots) and see how many cards you see.  If it's 4, it may just have to do with the number of cards being seen by the system.  Maybe the BIOS runs out of memory trying to address that many cards.  Make sure all the features you don't need are turned off in BIOS... all the fun stuff :)


I think you're right about the BIOS running out of memory trying to address all the cards, but I'm still seeing low hash on the last PCI-E slot.  I removed *all* the other cards and am seeing 200khash there.  In the first slot it's 265khash at stock speeds, which matches what I'm seeing on the rest of the cards.

  • With 1 card in the first slot we're hashing @ full speed 265khash
  • With 1 card in the last mobo slot we're hashing at 175-200, after 5min it's settled at 175khash.  This is the SAME card going from 1st to last slot
  • So first 4 cards *always* work.
  • The 5th card is lower hash (half, maybe less...)
  • 6th card recognizes but doesn't hash...

I've used this same board as I've previously said with 6 x R9 280X GPUs all hashing at full, overclocked speed.  I'm also using ALL powered risers.

Here's some hardware/software details:

Mobo:  GIGABYTE GA-F2A88X-UP4 FM2+
GPU:  6 x MSI GAMING N750Ti TF 2GD5/OC (http://www.msi.com/product/vga/N750Ti_TF_2GD5OC.html)
PSU:  Corsair AX760i
CPU:  AMD A4-5300
RAM: 2x4GB modules
OS:  Xubuntu 64bit (previous post has my step by step configuration software wise)

With the 6 x 280X I had a 1500W PSU and a 1000W PSU rigged up together.  It hashed for weeks with no hiccups.  I'm really at a loss as to why this setup wouldn't work.  PCI-E was set to auto or 2x in BIOS, I never tried 1x.  

Any ideas?  
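
One thing that might help narrow this down on Linux is comparing the PCIe link width each card is capable of with what it actually negotiated, as reported on the `LnkCap` and `LnkSta` lines of `lspci -vv`. Below is a minimal Python sketch of such a check. The sample text is a hypothetical excerpt written in the style of `lspci -vv` output, not taken from the rig above, and the parser assumes addresses without a domain prefix.

```python
import re

# Hypothetical excerpt in the style of `lspci -vv`; not output from the rig above.
SAMPLE = """\
01:00.0 VGA compatible controller: NVIDIA Corporation GM107 [GeForce GTX 750 Ti]
\t\tLnkCap:\tPort #0, Speed 8GT/s, Width x16
\t\tLnkSta:\tSpeed 2.5GT/s, Width x16
06:00.0 VGA compatible controller: NVIDIA Corporation GM107 [GeForce GTX 750 Ti]
\t\tLnkCap:\tPort #0, Speed 8GT/s, Width x16
\t\tLnkSta:\tSpeed 2.5GT/s, Width x1
"""

# Device header lines start at column 0 with a bus:device.function address.
DEVICE = re.compile(r"^([0-9a-f]{2}:[0-9a-f]{2}\.[0-9a-f]) ")
# Capability (LnkCap) and negotiated status (LnkSta) lines carry "Width xN".
LINK = re.compile(r"(LnkCap|LnkSta):.*?Width x(\d+)")


def link_widths(text):
    """Map each PCI address to {'LnkCap': lanes, 'LnkSta': lanes}."""
    widths, current = {}, None
    for line in text.splitlines():
        device = DEVICE.match(line)
        if device:
            current = device.group(1)
            widths[current] = {}
            continue
        link = LINK.search(line)
        if link and current is not None:
            widths[current][link.group(1)] = int(link.group(2))
    return widths


for address, w in link_widths(SAMPLE).items():
    print(address, f"capable x{w.get('LnkCap')}, negotiated x{w.get('LnkSta')}")
```

On x1 risers every card should report a negotiated width of x1, so a result like this would not by itself explain the slow 5th card or the dead 6th one. It would at least show whether the last slots negotiate a link differently from the first four.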
full member
Activity: 210
Merit: 100
Has anyone tried out the GTX 750 (non-Ti)? They are selling for as low as $120 and would be viable if they don't perform much worse than the 750 Ti, since the Ti is approaching $179 here in Canada.
full member
Activity: 182
Merit: 100
Quote from: ManIkWeet
It'd appease the hardcore miners, because I have no problem opening Google Chrome :p

oops, did not know that speccy is a Windows app. :-)  does speccy show a change in perf levels when opening Chrome?
No, it doesn't show anything more than the screenshot, it seems to be some static value that it reads from the card somehow.
full member
Activity: 173
Merit: 100
Still doesn't explain why the first 4 cards are legit, the 5th is half ass, and the 6th is non existent.  They are all recognized, powered on, and spinning.  I'm writing something to see if I can send some work to the 6th card via cuda.

yeah, the RF engineer in me is clueless about that. :-)

oh wait... idea: the 5th and 6th PCIe slots are farthest away from the northbridge/CPU though. Now add the USB cable length...

I studied communications engineering, but I mainly focus on protocols, software stacks, simulation of radio systems. My HF/RF knowledge is pretty rusty :-)

I have no need for a protocol developer! What I really need is a developer that understands the binary language of moisture vaporators.

;) :D
hero member
Activity: 756
Merit: 502
Quote from: ManIkWeet
It'd appease the hardcore miners, because I have no problem opening Google Chrome :p

oops, did not know that speccy is a Windows app. :-)  does speccy show a change in perf levels when opening Chrome?