joblo, regarding compiling, is there anything up against replacing the windows compiles with "-march=" by compiles with just the extensions added (-msse2/-maes/-mavx/-mavx2) like nicehash does it for their compiles? would boil the resulting bins down to essentially 3 or 4 and the right one can easily be identified
cheers
My intent is to do something similar but continue to compile on architecture boundaries instead of cherry picking features.
If I ignore the manufacturer differences there are 4 levels I want to support: SSE2, +AES, +AVX, +AVX2. Unfortunately
Westmere doesn't have its own compile arch defined but is SSE4.1+AES. I am targetting redundant builds one at a time.
For AMD I'm relying on your advice of what special AMD builds would be useful, meaning there isn't a compatible Intel
build with the same critical features. I would also need to be able to build it which could be an issue.
For now I'm targetting btver1. Any reason to keep it?