https://en.wikipedia.org/wiki/Intel_SHA_extensions
Intel Goldmont chips (server-market Atom) and Ice Lake. (I haven't used it on Ice Lake, but it's finally reported there.) Intel has been pre-announcing it on architectures going back to Skylake, then failing to deliver.
Anything AMD Zen, Zen+, or Zen 2 (so all the Threadripper and EPYC parts), which is what all of Bitcoin's development using SHA-NI has been done on.
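If you'd rather ask the CPU than trust a support list, here is a minimal sketch of a runtime check. The SHA extensions are reported in CPUID leaf 7, subleaf 0, EBX bit 29. The helper names `ebx_has_sha` and `cpu_has_sha_ni` are made up for illustration, and the sketch assumes a GCC or Clang toolchain that ships `<cpuid.h>`; it is not taken from any particular project.

```c
#include <stdint.h>

#if defined(__x86_64__) || defined(__i386__)
#include <cpuid.h>   /* GCC/Clang helper header (toolchain assumption) */
#endif

/* CPUID.(EAX=7,ECX=0):EBX bit 29 reports the SHA extensions. */
#define SHA_EBX_BIT 29u

/* Hypothetical helper: test the SHA bit in an EBX value from leaf 7. */
static int ebx_has_sha(uint32_t ebx) {
    return (int)((ebx >> SHA_EBX_BIT) & 1u);
}

/* Hypothetical helper: 1 if the CPU reports SHA-NI, else 0.
 * Always returns 0 on non-x86 builds. */
static int cpu_has_sha_ni(void) {
#if defined(__x86_64__) || defined(__i386__)
    unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;
    /* __get_cpuid_count returns 0 if leaf 7 is not supported. */
    if (!__get_cpuid_count(7, 0, &eax, &ebx, &ecx, &edx))
        return 0;
    return ebx_has_sha(ebx);
#else
    return 0;
#endif
}
```

Call `cpu_has_sha_ni()` once at startup and pick the SHA-NI code path only when it returns 1, falling back to a generic implementation otherwise.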
The instruction latency of SHA-NI is such that you're still better off interleaving independent processing of several messages... but even without that it's much faster than anything else, except maybe a super-wide, many-message AVX-512 version.