Pages:
Author

Topic: Nvidia GPU Mining Problems - page 6. (Read 7002 times)

hero member
Activity: 724
Merit: 500
July 17, 2016, 12:33:48 PM
#28
i would recommend to install linux.. try with a separate hard drive as you are bound to have problems the first time.. but there are guides on installing the driers, cuda and all that.. my linux rigs run for months without crashing

apart from that you can try booting in safe mode, i had a 6x750ti rig running windows7 that would (surprisingly) mine in safe mode without any issues. otherwise it would crash every few hours for different reasons.  however in other rigs using windows 2012 hashing in safe mode does not work
legendary
Activity: 1470
Merit: 1114
July 17, 2016, 12:17:30 PM
#27
You're juggling too many things at once, start isolating cards. If you have multiple faults you have to seperate them.

You seem to have an issue triggerred by heat, you need to find out which card it is and you need to monitor
the GPU temperatures to confirm it's temp related.


That's what I was doing... now im mining one algo only and so far no crashes. Smiley
Room temp 86 f.
I'm also posting extra info for my own notes.

OK, so your following the algo switching lead. Following up with that, has it failed imediately after a reboot,
ie the first algo starts up but crashes after a few minutes? Or does it run fine for a while algo switching and not
crashing until suddenly it crashes? Monitor the GPU temperatures, not the room remperatures, to see if there
is a correlation.

You will also need to identify which card is crashing. If you don't want to test one card at a time you can test minus 1
card at a time, remove one card, test, resinstall, remove a different card, repeat.
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 11:51:26 AM
#26
You're juggling too many things at once, start isolating cards. If you have multiple faults you have to seperate them.

You seem to have an issue triggerred by heat, you need to find out which card it is and you need to monitor
the GPU temperatures to confirm it's temp related.


That's what I was doing... now im mining one algo only and so far no crashes. Smiley
Room temp 86 f.
I'm also posting extra info for my own notes.
Thx joblo for your help.... much appreciated.  Smiley
legendary
Activity: 1470
Merit: 1114
July 17, 2016, 10:56:53 AM
#25
You're juggling too many things at once, start isolating cards. If you have multiple faults you have to seperate them.

You seem to have an issue triggerred by heat, you need to find out which card it is and you need to monitor
the GPU temperatures to confirm it's temp related.

legendary
Activity: 3164
Merit: 1003
July 17, 2016, 10:35:01 AM
#24
That will happen no matter what ccminer im using.
83 f  28c temp room.
Second cause of crash.
Got it captured finally. 970gtx card 2 device 2.
That cards clocks default mining p2 state 1413 should be 1117.
Now my 2nd 970gtx when I tried it.... clocks the same thing so I did an rma an that ....so when I got the rma 3rd card 970gtx the clocks the same thing 1413.
Their supposed to be about 1178 core but always show 1413 core no oc'ing.
 


legendary
Activity: 3164
Merit: 1003
July 17, 2016, 10:13:40 AM
#23
Now 970 gtx is not recognized and sometimes the 980ti too on fast reboot.. need to reboot twice to be recognized.
This doesn't happen at lower temp if it is related to temp.  room temp  83 f  28c .
On that crash wattage went down to normal 92 f to change algo's.
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 09:57:46 AM
#22
There are 2 maybe 3 things making it crash.....I think I got one of them.... proving by upping the cool down time.
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 09:46:47 AM
#21
why you still have 3 different card in the same rig, do you really need the 750? not saying that that is the problem(well it might be) but i find it wacky
The 750ti is fine amph  need all the hash I can get... thx
If we get this fixed then I'll add the other 970 gtx sitting in its box.
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 09:45:40 AM
#20
Ok mining fine then as it went to switch algo's it went through it's cool down cycle I have set to 12 seconds ...needed to get from p2 state to p8 state ..then crashed.
Now I have at the wall watt meter and during this stage it should go down to 92 watts but didn't.. it went to 252 watts not mining  humm.
Going to up that time to 20 and give that a try.
This is not the crash that was happening.
The display driver never works after that need a reboot.

legendary
Activity: 3248
Merit: 1070
July 17, 2016, 09:29:45 AM
#19
why you still have 3 different card in the same rig, do you really need the 750? not saying that that is the problem(well it might be) but i find it wacky
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 09:22:57 AM
#18
Worked for 3 minutes then crash.... apparently algo's don't matter.
This is before crash....rebooting again turning box fan up to full to help cool.




legendary
Activity: 3164
Merit: 1003
July 17, 2016, 09:10:20 AM
#17
Ok first crash 5 hours later at 78f.

Just to be clear, does it always start mining then crash due to access blocked or does it ever crash on ccminer startup?
Does it crash if you run a single algo without ever swicthing?
Have you been monitoring GPU temps?
It starts mining like normal..then crashes after about 3 to 5 minutes after mining.
Rebooting it now.
legendary
Activity: 1470
Merit: 1114
July 17, 2016, 08:56:35 AM
#16
Ok first crash 5 hours later at 78f.

Just to be clear, does it always start mining then crash due to access blocked or does it ever crash on ccminer startup?
Does it crash if you run a single algo without ever swicthing?
Have you been monitoring GPU temps?
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 08:51:34 AM
#15
Ok first crash 5 hours later at 25c.
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 06:31:08 AM
#14
In 6 hours it should start crashing as the temperatures rise.. summer here and a heat wave.
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 06:28:11 AM
#13
since minercontrol is being used to launch the apps, have you tried running MC under administrator elevation to see if it helps?

It's a strange problem.... I can't think of what would cause HAL to not let your app access the video card...  The only time I had a problem with miner apps not accessing the video card myself was when there was a driver updated by windows automatically and it wasn't the nvidia package with CUDA updates.

Joblo is spot on;  test all hardware one at a time.  I know it seems annoying and it takes a lot of time, but you will find out where your issue resides.  If it is on a specific algo;  try trimming the intensity back a touch, at least .5 if not 1 from what its at currently.  Since Temp is an issue, I would consider this for sure.

I know you definitely need the individual driver packages for each model of card you are running to be installed.    In the past I have heard of people running into bugs and issues when they are mixing and matching types/generations of cards, and they were always able to resolve by having each card type on a separate rig.  This way drivers are unified and the system OS is simpler.
Yes I did try MC under administrative. And it crashed within 3 minutes .. but running at 32c in the room  .. right now it is very cool in the room..room temp 1 degree less and running great.
I think we can start looking at msi afterburner maybe for the problem .. not sure. Maybe the app fan control ? Not sure yet.
Thx
legendary
Activity: 3164
Merit: 1003
July 17, 2016, 06:20:45 AM
#12
Right now it is room temperature. It seems to be so much more stable. Today it will go up to and beyond 32c. At that point it won't mine for more then a few minutes but it should and was 4 days straight without a crash one time about a week ago.
I'm mining muti algo switching with Minercontrol.
And right now everything is fine but as the temperature rises it should start crashing.
I will be doing things as suggested as soon as possible.

Keep notes of what algos fail to start to see if it's algo specific. Also try to characterize the failures in more detail.
Does it ever fail on the first algo or does it switch a few times before it fails? Once you have a failure does it then
fail consistently. How does it recover, do you reboot?


No real specific algo..... I have to reboot.....  it doesn't fail on first algo meaning it will mine that algo for a few minutes then crash..but when temperatures rise. Right now it is room temp and running fine through the algo's as it gets hotter it should start crashing.
legendary
Activity: 1848
Merit: 1166
My AR-15 ID's itself as a toaster. Want breakfast?
July 17, 2016, 06:13:52 AM
#11
since minercontrol is being used to launch the apps, have you tried running MC under administrator elevation to see if it helps?

It's a strange problem.... I can't think of what would cause HAL to not let your app access the video card...  The only time I had a problem with miner apps not accessing the video card myself was when there was a driver updated by windows automatically and it wasn't the nvidia package with CUDA updates.

Joblo is spot on;  test all hardware one at a time.  I know it seems annoying and it takes a lot of time, but you will find out where your issue resides.  If it is on a specific algo;  try trimming the intensity back a touch, at least .5 if not 1 from what its at currently.  Since Temp is an issue, I would consider this for sure.

I know you definitely need the individual driver packages for each model of card you are running to be installed.    In the past I have heard of people running into bugs and issues when they are mixing and matching types/generations of cards, and they were always able to resolve by having each card type on a separate rig.  This way drivers are unified and the system OS is simpler.
legendary
Activity: 1470
Merit: 1114
July 17, 2016, 06:05:13 AM
#10
Right now it is room temperature. It seems to be so much more stable. Today it will go up to and beyond 32c. At that point it won't mine for more then a few minutes but it should and was 4 days straight without a crash one time about a week ago.
I'm mining muti algo switching with Minercontrol.
And right now everything is fine but as the temperature rises it should start crashing.
I will be doing things as suggested as soon as possible.

Keep notes of what algos fail to start to see if it's algo specific. Also try to characterize the failures in more detail.
Does it ever fail on the first algo or does it switch a few times before it fails? Once you have a failure does it then
fail consistently. How does it recover, do you reboot?

legendary
Activity: 3164
Merit: 1003
July 17, 2016, 06:02:01 AM
#9
To add to the above post, make sure all of your system is stock clock. CPU and GPU's.  even a minor instability in either can cause havok in a mining rig.
Yes I turned all oc'ing off.  Thank you
Pages:
Jump to: