Quote from: Schleicher on October 19, 2014, 05:07:14 PM
I managed to increase the quark and nist5 speed a little bit.
Source code:
https://github.com/KlausT/ccminer
simd512 seems to be 10% faster.
My stock 750TI is approaching 2700 without overclock. and 3000 with oclock.
This is without the faster Keccak (created by nvidia) in cudaminer.
These are the optimized numbers so far.
Blake xxx (not done)
skein 1.5%
BMW 60%
jh512 4.5%
keccac 1%
cubehash: 7.5%
shavite: 3.6%
simd512: 9,2%
fuge: 4,70%
hamsi: 6.97%
shabal: 22%
wirlpool:1.87%
echo: 5.5%
luffa: 0.4%
3 coders have contributed to the new speedup.
The sourcecode will be checked into the blakecoin fork by Epsylon3
https://bitcointalksearch.org/topic/ann-ccminer-23-opensource-gpl-tpruvot-770064
That uint2 keccak was done by mtrlt, if I'm not mistaken.