Hello gents, I have a feature request to make TRM even better. Similar to "--temp_limit=TEMP", please add config setting for a Navi memory temperature limit. I have a few 5700's that sometimes push upper 90's for mem temp, even with fans at 100%. Would be awesome if TRM did gradual intensity ramp down to stay under the temp limit.
Alternatively, if the existing --temp_limit would accept a list to customize per GPU, could (indirectly) keep Navi mem temps roughly under a target.
Hi! I can add it to the list. It would be better from an efficiency perspective to adjust clocks instead though, but I get your point. There are also more advanced tuning options available for Navi not available in the standard Overdrive API, you need to work with the powerplay table directly. On Windows this is doable with the tools from Igor's lab, Linux is a little more annoying.
For example, lowering MVDD (mem voltage) from 1.35 to 1.3 and VDDCI to what the gpu can handle but still remain stable will probably shave off -4-5C on your mem temp. Silicon lottery involved, need to test yourself to find what your specific gpus can handle. Moreover, lowering SoC voltage while you're at it will also shave off a few watts of power and help keep overall temps under control.