Probably have them in crossfire. Someone said you had to disable it before you reboot after installing drivers in the registry. Also NEVER auto fan control. Set them to 3000 rpm minium. Even if the GPU is reporting 60c the HBM could be 80c+ so you need the fans to be running pretty high to over compensate.
Thank you for your reply
Don't think they are in Crossfire as that option is not available after the fourth card is installed. And before I purchased the two 56s the registry edit you speak of did not prevent them from restarting in crossfire. I will dig into this more tomorrow and see if I can see the crossfire indicated somewhere other than AMD Settings.
I have ran the cards thru the temps from 50 to 70 and HBM is always 8 to 13 higher than indicated core at full load. This doesn't change as long as load is constant. And since we all run them at full load it is a fairly reliable means to control. The one card that runs at 13 will be getting a teardown so I can check contact of cooling device. I will add a silver shim if I think it will help make better contact. I am comfortable with that cards HBM at 73.
I thought perhaps that that card was the offending card with the compute errors. Thinking maybe it didn't like the 73 and my inability to positively for certain match the GPUx IDs in Cast to a physical GPU I ran them all with HMB temps from 45 to 70 by setting a corresponding core temp in auto and then a two hour run with all fans at 4900. Errors don't appear to get worse or better.
My hash rate drops from 1960 to 16xx after some period of time. I haven't been able to pinpoint a cause yet. I run TeamViewer as well but the hash rate drop seems random.
Are you running any temperature monitors (something that can monitor HBM temp, like HWiNFO64)? I have a family member who had this happen today on his rig, and I'm curious if it's temp related.
Note that Cast only monitors the
GPU core temp, which with my cards runs anywhere from 8 - 11° C cooler than HBM. I'm sure it differs by card, but I've read elsewhere that HBM can start to throttle at 80°.
Also NEVER auto fan control. Set them to 3000 rpm minium.
As above, my experience has been different, especially based on HWiNFO64 monitoring. I have a minimum of 2K just so there's no need to ramp up too much, but I also have external cooling and should be factored in.
Opening or changing GPUs in HWiNFO64 will cause hash to drop to 1.4-1.6k per card requiring one of the methods to restore then detailed in the previous posts. Disabling and Enabling is the fastest I have found.
What can be done is open an instance of HWiNFO64 for each GPU and select correct gnu in each instance. then disable and enable. This is also true for GPUz. OverdriveNTool is the only one I use that doesn't require the D/E procedures.
Thanks again for each of your replies.