https://github.com/ghostlander/nsgminer/releases/tag/nsgminer-v0.9.0
Wolf0, your kernel distributed by NiceHash with their miner is also well done, could use an idea or two out of it. One of my primary concerns was to get rid of scratch register usage to run more than one wavefront concurrently, but FastKDF seems to be too complicated to fit VRegs and SRegs alone. Although I've cut the scratch reg usage down by half which also helps.
Thanks for the compilation mate!
But I am getting these speeds with R9 280Xs using NSGminer (Win7, 64-bit, Catalyst 14.6):
Not really an improvement compared to what I'm getting with sgminer5-2-1-general:
Any suggestions?
5.2.1 general uses my code. You'll want different settings for his - also, intensity can't be tuned well on his, it's very coarse-grained.
As you can see partly in the top of the screenshots the settings are very different. For NSGminer I use the same command as stated in the OP: nsgminer --neoscrypt -g 1 -w 128 -I 16 (so not using the engine 1000 and memory 1500 option).
And for sgminer I use: sgminer.exe --algorithm neoscrypt --nfactor 10 --xintensity 2 --thread-concurrency 8192 --gpu-threads 2
When using 2 threads and an intensity of 13 with NSGminer a GPU immediately hangs, which is very unusual for R9 280Xs with these settings.
So for now I'll keep using your code, Wolf0 !
Nfactor is useless with mine - mine doesn't use TC, either. Feel free to omit them.