Topic: VIDEO_TDR_FAILURE (Read 211 times)

newbie
Activity: 182
Merit: 0
March 15, 2018, 02:39:40 PM
#16
Again, I'm using only Nvidias on that rig.

On a hunch, I moved my PCIe extender from one port to another. It's been working well for an hour now.
full member
Activity: 588
Merit: 101
March 13, 2018, 11:03:36 AM
#15
Can it be caused by a driver issue?

If not, bad riser(s)?

Bad power supply filtering???



EDIT: It occurs on my 12x 1080 Ti rig

This problem is caused by driver issues, you're absolutely right. I usually reinstall the drivers in safe mode or replace the atikmdag.sys/atikmpag.sys files (that works for ATI/AMD graphics cards).
newbie
Activity: 182
Merit: 0
March 10, 2018, 05:54:46 PM
#14
I had the same problem and spent 3 weeks trying to figure it out. I solved it by running the full Nvidia driver installer instead of unzipping the package and going through Device Manager > Update driver > Browse. Nvidia cards seem to need the full install, not just the bare driver itself. You can untick all the crap, just leave DRIVER ticked.
That's what I've done every time :-(

Sometimes, the miner stops.
Sometimes, I get a BSOD.
Sometimes, I get this:

https://image.ibb.co/gjquf7/Screenshot_20180310_212518.png


Will check if it's always the same GPU.
newbie
Activity: 45
Merit: 0
March 10, 2018, 04:24:17 PM
#13
Either bad risers or drivers.
newbie
Activity: 210
Merit: 0
March 10, 2018, 04:18:55 PM
#12
Can it be caused by a driver issue?

If not, bad riser(s)?

Bad power supply filtering???



EDIT: It occurs on my 12x 1080 Ti rig

I had the same problem and spent 3 weeks trying to figure it out. I solved it by running the full Nvidia driver installer instead of unzipping the package and going through Device Manager > Update driver > Browse. Nvidia cards seem to need the full install, not just the bare driver itself. You can untick all the crap, just leave DRIVER ticked.
newbie
Activity: 182
Merit: 0
March 10, 2018, 09:52:54 AM
#11
Probably bad overclocking settings. When this happens you usually have to lower clocks (both core and memory).
Well, I'm not crazy with my O/C:
As for the O/C, I'm currently at a safe +140 core, +160 mem at 85 TDP


Are you using the same brand/type of 1080ti's?
Yes, they're all the same Gigabyte
full member
Activity: 280
Merit: 102
March 10, 2018, 06:24:11 AM
#10
Probably bad overclocking settings. When this happens you usually have to lower clocks (both core and memory).
Are you using the same brand/type of 1080ti's?
newbie
Activity: 182
Merit: 0
March 10, 2018, 06:19:07 AM
#9
I've been running 6 of them the whole night, gonna try with 3 more and see.
sr. member
Activity: 847
Merit: 383
March 09, 2018, 05:48:09 PM
#8
Pull one card at a time until the problem goes away.
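For anyone who wants to keep track of the process, here is a rough sketch of that elimination in Python. It assumes only one card (or its riser) is at fault. `find_suspect_card` and the `is_stable` callback are made-up names for illustration; in practice "is_stable" is you running the miner for a few hours and watching for a TDR or BSOD.

```python
def find_suspect_card(cards, is_stable):
    """Pull cards one at a time until the rig runs cleanly.

    cards: labels of the cards currently installed (any naming you like).
    is_stable: hypothetical callback that takes the list of installed cards
        and returns True if the rig ran without a TDR/BSOD.
    Returns the last card pulled before the rig became stable, or None if
    the rig was already stable with every card installed.
    """
    installed = list(cards)
    if is_stable(installed):
        return None  # nothing to hunt for
    while installed:
        pulled = installed.pop()  # take out one more card
        if is_stable(installed):
            return pulled  # removing this one made the problem go away
    return None
```

Note that the card it points at may be fine: the riser, cable or slot it was sitting on is just as likely, so swap those before blaming the GPU.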
newbie
Activity: 182
Merit: 0
March 09, 2018, 05:31:36 PM
#7
I've reinstalled Windows once.
The drivers once more.
Rechecked the motherboard BIOS settings.

Is there a utility that would show me what's happening at the GPU level? Memory errors, bus usage, that kind of stuff?
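One option on Nvidia cards is `nvidia-smi`, which ships with the driver. Below is a minimal Python sketch that polls it once and returns per-GPU temperature, power draw, clocks and the current PCIe link generation and width. It assumes `nvidia-smi` is on your PATH (on Windows you may need to give the full path); `parse_gpu_csv` and `query_gpus` are just names picked for this example. It reports the PCIe link state rather than bus usage, and as far as I know it won't show memory errors on these cards.

```python
import csv
import io
import subprocess

# Fields requested from nvidia-smi; run `nvidia-smi --help-query-gpu` for the full list.
FIELDS = [
    "index",
    "name",
    "temperature.gpu",
    "power.draw",
    "clocks.sm",
    "clocks.mem",
    "pcie.link.gen.current",
    "pcie.link.width.current",
]


def parse_gpu_csv(text):
    """Turn nvidia-smi CSV output (no header row) into a list of dicts keyed by FIELDS."""
    rows = []
    for record in csv.reader(io.StringIO(text), skipinitialspace=True):
        if not record:
            continue  # skip blank lines
        rows.append(dict(zip(FIELDS, (value.strip() for value in record))))
    return rows


def query_gpus():
    """Run nvidia-smi once and return one dict per GPU it reports."""
    cmd = [
        "nvidia-smi",
        "--query-gpu=" + ",".join(FIELDS),
        "--format=csv,noheader,nounits",
    ]
    out = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
    return parse_gpu_csv(out)
```

Calling `query_gpus()` every few seconds and appending the result to a file gives you a record of what each card was doing right before a crash, for example whether one card dropped off the list or changed its link generation.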
full member
Activity: 259
Merit: 108
March 09, 2018, 03:59:37 PM
#6
In my experience that blue screen can mean ANYTHING, from BIOS settings to bad drivers to some Windows settings. I will even switch the cards to different PCIe lanes.

Sometimes, after days of getting nowhere, I'll just reinstall Windows and make sure I have the BIOS settings correct before starting.
legendary
Activity: 1510
Merit: 1003
March 09, 2018, 03:34:04 PM
#5
I'm having this issue sometimes with my 5-Nvidia mini rig with an Asus ROG Strix X370-F and a Ryzen 5 1600 CPU. It seems the mobo can't keep all these PCIe lanes running under full load. Setting all PCIe slots to Gen 1 helps a bit, but not completely.
newbie
Activity: 182
Merit: 0
March 09, 2018, 03:18:08 PM
#4
Perhaps that's your issue: you must either reduce the memclock, decrease the undervolt, or increase the power limit.

BIOS is the last thing I would check (maybe your timings could be wrong too, but only think about that when everything else has been tried).
I modified the opening post, adding the information about the cards: they're nvidias (my AMD rig runs fine).


As for the O/C, I'm currently at a safe +140 core, +160 mem at 85 TDP
sr. member
Activity: 847
Merit: 383
March 09, 2018, 02:06:27 PM
#3
Drivers first, obviously. I normally just reload the machine, as I can clone it fast. If that's not it, then it's normally a bad riser. If that doesn't fix it, the card just can't handle being overclocked.
newbie
Activity: 19
Merit: 0
March 09, 2018, 02:02:08 PM
#2
It has always happened when I pushed the cards' memclock too far.

Perhaps that's your issue: you must either reduce the memclock, decrease the undervolt, or increase the power limit.

BIOS is the last thing I would check (maybe your timings could be wrong too, but only think about that when everything else has been tried).
newbie
Activity: 182
Merit: 0
March 09, 2018, 02:00:28 PM
#1
Can it be caused by a driver issue?

If not, bad riser(s)?

Bad power supply filtering???



EDIT: It occurs on my 12x 1080 Ti rig