Ohh, is that what he's doing? No wonder it seems to have very hard-to-find sweet spots. Tuning to profiling measurements and doing the parallel processing through asm optimizations is very effective under load, yet less scalable and stable under various conditions than a 'clean' implementation, meaning proper asm optimizations (not fine-tuned for specific results) that can benefit from compiler target optimizations as well. But then he uses his own custom environment ...
EDIT: Looks like with various timings and clocks [increasing the core clock should then generally be a good solution for the 24mh drops] it can also produce an "interlaced-like" fragmented memory load [the mem read is done, but the filled-in stuff isn't :)] and thus power draw inconsistencies. I'll check by adding more RAM though, so maybe not. AMD blockchain and the new API are not very nice. Is there an update we should expect from him in a few weeks?