I can't believed it stayed cpu only for 2 months...
*lol* ...but to late... amd is already there.
edit: and whirlpool seems not that slow on amd - 50mhash per r9 270
100MH/s+ on 270X.
wolf - we can never know whether you are quoting YOUR miner / optimizations - or the standard that is available to the public ...
so this figure you have quoted mate - yours or public? ...
#crysx
Mine - he pointed out it wasn't slow on AMD; he's right.
damn ...
and how to get hold of your one? with the appropriate settings? ...
#crysx
You know the answer to that. But, anyway, I'm working on something more epic.
The CUDA and OpenCL code for Whirlpool consists of lookups into huge tables - which sucks for the GPU; that's CPU code. Even with my current code, I've noticed beyond a certain point, it doesn't matter how high I clock, because it's stalling on memory accesses. Those tables have
so got to go away.
I have gotten the reference implementation down in C - surprisingly hard, seeing as it appears there's no code anywhere for it. This consists of mostly the block cipher W that was created with Whirlpool, which is based on AES - and I know AES backwards and forwards. Small issue - it's got a 2048 byte table for the multiplication, then a 256 byte Sbox.
I took the 2048 byte table used for the multiplies and reduced it to one 8-byte table by doing them manually - then I got rid of that by inlining them as constants. The S-box I split into its parts - three S-boxes containing 16 entries of 4 bits each, and bitsliced them. Does valid hashes so far, but I have a bit further to go before it's really GPU-ready.
wow - so you have been a VERY busy lil wolfie then ... damn ...
so when will you expected final implementation come? ...
btw - pm for the 'you know the answer to that' situation with your idea of how that can be done ...
just trying to make the farm work THAT MUCH better - and that requires optimizations ... sooo - pm me please with what needs to be done on my end to get it organized ...
btw - the completion of the exchange from amd to nvidia is almost complete with the farm - so i can still run / test the optimizations with the gigabyte 280x oc cards left ( 16 of them currently ) ... once those are gone - the farm will be nothing but gigabyte 750ti oc lp cards ...
hence the reason for my interest in what / when / where / how / and how much ...
#crysx