Pages:
Author

Topic: Nvidia GPU Mining Problems - page 3. (Read 7002 times)

legendary
Activity: 3164
Merit: 1003
August 04, 2016, 01:07:35 PM
#88
Ok after setting the clocks to 1178 the 970's mined at 1417mhz core clock still.  Shocked
Something is setting or alternating  the core clocks.
And the cards are in the gaming mode... going to change them to oc mode.
legendary
Activity: 3164
Merit: 1003
August 04, 2016, 12:43:47 PM
#87
Found out that the 2x  980ti's and the 2x  970's are in gaming mode ... they have dual bios.
I may have to use OC GURU  to get them into over clock mode.
Can someone plz check to see what mode your 970 gtx is in plz.
legendary
Activity: 3164
Merit: 1003
August 04, 2016, 12:12:00 PM
#86
Adding the 2nd 970
Msi afterburner doesn't recognized it on graphics monitor but does on the control panel.

Mining for 2 minutes  then crashes with the 2nd image below.
Now going to set the clocks.






maybe you are pushing just a bit too hard? also whats your cpu/ram combo? I found that running lbry and ethereum side by side (i mixed amd and nvidia cards) can take up a lot of the system resources, including ~3GB of ram.
Thx  but no I'm not pushing the clocks higher ... something else is.
I'm now setting the clocks to normal 1064 for the 980ti's and 1178 for the 970's.
I wonder if ccminer is doing this?
member
Activity: 70
Merit: 10
August 04, 2016, 11:19:46 AM
#85
Adding the 2nd 970
Msi afterburner doesn't recognized it on graphics monitor but does on the control panel.

Mining for 2 minutes  then crashes with the 2nd image below.
Now going to set the clocks.






maybe you are pushing just a bit too hard? also whats your cpu/ram combo? I found that running lbry and ethereum side by side (i mixed amd and nvidia cards) can take up a lot of the system resources, including ~3GB of ram.
legendary
Activity: 3164
Merit: 1003
August 04, 2016, 09:26:38 AM
#84
Adding the 2nd 970
Msi afterburner doesn't recognized it on graphics monitor but does on the control panel.

Mining for 2 minutes  then crashes with the 2nd image below.
Now going to set the clocks.




legendary
Activity: 3164
Merit: 1003
August 04, 2016, 09:16:14 AM
#83
For a reference before I change clocks.



legendary
Activity: 3164
Merit: 1003
August 04, 2016, 09:08:02 AM
#82
Also:   Any special edition cards such as "OC" or "SSC" should never be overclocked.   The manufacturers have already done the legwork for us for the max reliability/stability on their card build.   Keep this well in mind.....   I never overclock a SSC or OC edition card.

This card      http://www.newegg.com/Product/Product.aspx?Item=N82E16814125684
and something is oc'ing it   an app IDK
So this maybe one of the main problems. Clocks are at 1393 right now. p2 state.
legendary
Activity: 3164
Merit: 1003
August 04, 2016, 09:01:36 AM
#81
This is the problem with installing more than one GPU type per system:

You'd have to do a nvidia driver "clean install" for the device's specific driver to fix that issue.....  Which would remove the other versions of the driver for the other cards........

Any chance on getting the 2 970's in their own box by themselves to test this?
No  but the driver I use is made for all cards JK.
I think we got the problem down to 2 issues now.
One windows 8.1 can only handle so many high end cards like someone I know tried to put 6 980ti on his rig...it can't be done. He got 5 but with lunix I think.
So if I had that many cards and did windows 8.1 the max maybe 4 cards...but it maybe msi afterburner seeing only 4 cards.
2nd I'm working on right now is the clocks are most definitely wrong on the 970's.
Going to set clocks to 1178 like their suppose to be with 2 970 in the system and see if that works.
If not going to delete all monitor app and try again.
legendary
Activity: 1848
Merit: 1166
My AR-15 ID's itself as a toaster. Want breakfast?
August 04, 2016, 12:26:49 AM
#80
Also:   Any special edition cards such as "OC" or "SSC" should never be overclocked.   The manufacturers have already done the legwork for us for the max reliability/stability on their card build.   Keep this well in mind.....   I never overclock a SSC or OC edition card.

probably why the "non-oc" versions can easily match or beat the OC cards wen actually tuned manually. my strix cards (base model) can exceed the specs of the OC version (~8% faster for a $50 premium)


agreed.  my GTX980 is a strix.

I just added a EVGA 960 SSC 2Gb card to my machine that has the Asus GTX 980.  Still solid on lbry, went from 154Mh to 248Mh.   Happy.  Driver install was painless.  Though on the X58 motherboard for some reason I can't use the PCI port when running two video cards.   I may try stacking the cards up (not on the optimal config) in different slots to see if I can get a PCI card to work with both of these video cards.   I use this machine day in and day out and it mines 24/7 weather I play games or not.


Been pretty solid so far.  Lets see how this mix of video cards treats me in the near future.
member
Activity: 70
Merit: 10
August 03, 2016, 11:59:50 PM
#79
Also:   Any special edition cards such as "OC" or "SSC" should never be overclocked.   The manufacturers have already done the legwork for us for the max reliability/stability on their card build.   Keep this well in mind.....   I never overclock a SSC or OC edition card.

probably why the "non-oc" versions can easily match or beat the OC cards wen actually tuned manually. my strix cards (base model) can exceed the specs of the OC version (~8% faster for a $50 premium)
legendary
Activity: 1848
Merit: 1166
My AR-15 ID's itself as a toaster. Want breakfast?
August 03, 2016, 09:46:36 PM
#78
Also:   Any special edition cards such as "OC" or "SSC" should never be overclocked.   The manufacturers have already done the legwork for us for the max reliability/stability on their card build.   Keep this well in mind.....   I never overclock a SSC or OC edition card.
legendary
Activity: 1848
Merit: 1166
My AR-15 ID's itself as a toaster. Want breakfast?
August 03, 2016, 09:44:40 PM
#77
This is the problem with installing more than one GPU type per system:

You'd have to do a nvidia driver "clean install" for the device's specific driver to fix that issue.....  Which would remove the other versions of the driver for the other cards........

Any chance on getting the 2 970's in their own box by themselves to test this?
legendary
Activity: 3164
Merit: 1003
August 03, 2016, 10:06:42 AM
#76
Ok On my 970's g1 trying now to add the 2nd one.
msi afterburner display wont show the extra 970 but it is recognized on the control panel.
And the clocks mining in the p2 normal state are showing 1400 core clock...so something is overclocking them causing them to crash? Going to delete all monitoring software to see if the clocks go to normal.
Also going to manually set the clocks to 1178 mhz where they are supposed to be if necessary.
Even though the rig has been mining for days now and no crashes without the extra card.
2 days ago I tried the extra card in and clock default to oc'ing 1413 to high.
With the extra card it will mine for 3 minutes exactly then crashes the drivers.
Reboot and mine and 3 minutes drivers crash.
Gigabyte says to download OC GURU and try the other bios non gaming to see if that helps.
Be back asap with the results.

This is what happens sometimes on a fast reboot, it is the 970 1st card 2nd card not installed.
After rebooting a 2nd time this doesn't happen.
I think I need to take fast boot off on my asrock 81 btc pro to normal boot.

This is the first 970 crash below.



This is with 5 cards installed 5th card not recognized by windows.


 
legendary
Activity: 3164
Merit: 1003
August 02, 2016, 03:48:57 PM
#75
https://www.youtube.com/watch?v=gS1hyzkVk5w

Just saw this.  Still need to watch, but maybe might be something u want to look at given the title....
Yes I can get money I believe ... have 2 970 ... thx
legendary
Activity: 1848
Merit: 1166
My AR-15 ID's itself as a toaster. Want breakfast?
July 30, 2016, 03:19:52 PM
#74
https://www.youtube.com/watch?v=gS1hyzkVk5w

Just saw this.  Still need to watch, but maybe might be something u want to look at given the title....
legendary
Activity: 1848
Merit: 1166
My AR-15 ID's itself as a toaster. Want breakfast?
July 30, 2016, 03:00:10 PM
#73
Glad to help.

Ive noticed most people reporting issues have been with the middle model cards (970, etc)... so I tend to stay away from them myself.

Its almost as if the 70 model is a batch of high end chips that all the cuda cores didnt work on or something, so they make that die work at a lower rate with less cores and not loose manufactured but failed high end product....

Cant wait to get up to SF and pick up that 960 off the shelf and get it back to work.   We took that duck 960 machine apart to build out a system for a customer and the second video card had been sitting waiting for its new home.   He planned to go 10 series on the next machine, so its not a big deal for him to trade me 2 power supplies for the 1 card.
legendary
Activity: 3164
Merit: 1003
July 30, 2016, 07:40:45 AM
#72
legendary
Activity: 1848
Merit: 1166
My AR-15 ID's itself as a toaster. Want breakfast?
July 28, 2016, 08:17:21 PM
#71
When the psu was shutting off I was trying to draw 1085 watts on a continuous 1300 psu which really equals 1200 available.

Yep;  that's what I was thinking.   When you start drawing too much, internal voltage drops and amperage turns up.  This causes more heat in the components and a snowball effect ensues.

My comment about several 50A rails was literal, and is a pretty close and very realistic situation.  When I give these values, i'm not joking.... They are coming from my Coolmax 1600w.    I know the math says there's ~1920W of just 12v power available (E/I*R) cause the sticker says 110A and 50A of 12V power:  But the sticker, shows that its only allowing 1600W total from the PSU; with a max of this, and a max of that on each designated supply line (12V#1 and 12V#2 are stickered to be a max draw of 1560W even though the math of E/I*R says more).  

I have seen a few people burn down S7's and S7 power supplies because the the two sets of PCIE plug rails were only capable of 110A total when they were under the appearance of being both separate circuits;  Yet the power supply said there was another ~40A of overhead. (1560W of 12V between the two is available total if you trust whats on the sticker)... This is not an 1800W power supply;  but it allows for a total of 1600W to be drawn from it across all voltages in a combination which allows a massive; nearly 1600w of +12V usage in theory.  The problem is 12v rail 1 only supplies PCIE (110A), and 12v rail 2 only supplies the motherboard plug, CPU plugs, and accessories such as SATA power and the like. You would never know this is true unless you took it apart.  One of my coolmax PSU's are from a person that burned down an S7 drawing too much from it.   I had to open it up, and hack-job-rewire the modular plugs in the back of the PSU to bypass and cut out the old melted PCIE cable plugs and fix the remaining good ones to have a good usable PSU again.   I will also rig up some PCIE plug adaptors to utilize the FDD/HDD power ports, and the motherboard connectors as well.

To finish:
You not only have to obey the sticker, but you have to use a little extra sense when trying to figure it all out.    Don't ever assume any one thing is correct unless its totally verified Wink
legendary
Activity: 3164
Merit: 1003
July 28, 2016, 11:45:59 AM
#70
 I hope you are figuring your power availability per rig @ 120-160% expected draw?  If I ran a 4 card machine I would be for sure running a 1600w power supply..... 

I wonder if you are drawing too much +12V off the same rail that supplies the processor and are causing this all to happen.

I have seen many strange configurations once opening up power supplies and seeing what is tapping which available rail.  Many PC power supplies have 3-4 independent +12V power supply circuits at roughly 50A each...  give or take....    As far as knowing how they are distributed...... you have to open the power supply often to know the real truth...   


I dont really agree with the above (per-se):

1) I agree with giving about 20% headroom, but a 1600W PSU for 4 cards isnt necesary (unless they are 290/390 cards or some other varient that would be drawing 300w/card). the rx480 or gtx1070 you could run 6 cards with a 1200-1300W PSU just fine.

2) most quality power supplies have a single 12V rail, and the ones wit multiple rails normally are more like 2-3 rails at 30A each (3x30Ax12V=1080W)  Your suggested 3x50Ax12V PSU would be a 1800W+ beast

3) you DONT need to (or want to) open up your PSU and start poking around. youll void the warranty, risk damage, and waste your time. any half-decent power suply will have the power rating and rail ratings marked on it and also on its packaging. If not, use google.

pretty much any PSU that is gold-rated and costs >$100 should be a single 12V rail thats rated at about 95% of the actual PSU specification.

for example, the corsair ax1200 has 1202W on a single 12V rail: http://www.corsair.com/en/professional-series-gold-ax1200-80-plus-gold-certified-fully-modular-power-supply  (click on the technical specs tab)
When the psu was shutting off I was trying to draw 1085 watts on a continuous 1300 psu which really equals 1200 available.
legendary
Activity: 3164
Merit: 1003
July 28, 2016, 11:40:01 AM
#69
AFAIK=as far as I know

If its a temperature problem, theres a system level component that's having an issue I would think.

Run a motherboard monitor and CPU monitor.... but a voltage issue could still be the case.   I hope you are figuring your power availability per rig @ 120-160% expected draw?  If I ran a 4 card machine I would be for sure running a 1600w power supply.....  

I wonder if you are drawing too much +12V off the same rail that supplies the processor and are causing this all to happen.

I have seen many strange configurations once opening up power supplies and seeing what is tapping which available rail.  Many PC power supplies have 3-4 independent +12V power supply circuits at roughly 50A each...  give or take....    As far as knowing how they are distributed...... you have to open the power supply often to know the real truth...    

You dont happen to be CPU mining at the same time are you?
oooo on the overloaded rail... yes I did...now fixed.
That was when the psu was shutting off.
No cpu mining at all I have to use    --cpu-affinity 1 --cpu-priority 0
Pages:
Jump to: