This is ridiculous. I have a bunch of identical XFX RX 570s. I am using Ubuntu 16 with Claymore 9.5.
I am using Windows 10 to test for mem errors and check voltages. I have Wattman and Trixx (got rid of Afterburner). I have GPUz to monitor voltages.
Wattman seems to have a mind of it's own and sometimes reads the BIOS voltages I set and other times just flat out ignores them. Using the default auto-incremental voltages sometimes it runs at 1.025v but then on other cards it will run at 900mv (which is great). However, if I swap cards and it decides to run at 1.025v and then I swap back to a card that was just previously running at 900v, suddenly that 900v card will now run at 1.025v. I have reset to default everywhere to try and force the BIOS settings. Even manually setting the card to 925v, the Wattman will show 925, but if I reset the card back to default, Wattman still shows 925 but the power usage is much higher.
I know this makes no sense to read, but how do you all manage undervolting in LINUX? I have a kill-a-watt and most of my cards will run at 120W, but it's seems totally random at whatever the last read on Windows is, affects how it runs on Linux.
I'm using a modifed Anarok Powersave BIOS and slightly lowering the MemClock as it wasn't running stable. He does lower the TDP/TDC settings, which seems to work well, but again, once windows gets involved everything is totally random. I don't get why sometimes it runs at 1.025v and others at 900mv when the settings aren't changing....
I'm just so confused....
Pretty certain these are driver issues. I've got a 290x and a 295x2 in a mining rig. After reboots, I find that sometimes I am unable to get one or the other to the same overclock settings/consistency without modifying the power limit or core voltage further.
EG:
1100/1300 @ -45 vcore/+0 power will work for my 295x2 except for the rare times it boots and I have to do -55 +5
The same thing occurs on my 290x. I haven't been mining for long, this problem did not occur when I was mining only on my 290x. After I added in my 295x2, this started. That was also when I moved the hardware to its own dedicated rig and thusly changed everything... could be something in that hotpot of variables. Still though, I have a pretty high level of confidence it's the drivers.