Thanks for testing.
log from 1.09, --xintensity 64
in 1.15 you should multiply the intensity with 4. Run with --xintensity 256. 1.09 is using a cpu intensive thread scheduling that give 1-2% faster hashrate on low intensiities. We could add a --cpu-mining parameter in the next release.
1.15, without --xintensity
The cuda 11.4 build is faster on the rtx cards. It's not only a different build, it's a different kernel.
Pascal cards is faster in cuda 11.2, rtx card faster on 11.4.
I get around 1MHASH more on the gtx 1070 with the 11.2 kernel.
3. GPUs order in status and configuration for --lock-cclock mismatch. Had to use GPU-Z to see real gpu order.
this is what I replied on github in the same issue:
By default, CUDA orders the GPUs by computing power. GPU:0 will be the fastest GPU on your host
If you set CUDA_DEVICE_ORDER='PCI_BUS_ID' then CUDA orders your GPU depending on how you set up your machine meaning that GPU:0 will be the GPU on your first PCI-E lane.
on windows :
set CUDA_DEVICE_ORDER='PCI_BUS_ID'
to see the order run:
tbminer.exe --list-devices
[moderator's note: consecutive posts merged]