Thanks.
I see that HD6xxx cards require whole kernel optimization, not only sieve For compare, R9 290:
square 320 bits: 40.524ms (3312.055M ops/sec)
multiply 320 bits: 49.931ms (2688.064M ops/sec)
square 352 bits: 48.891ms (2745.244M ops/sec)
multiply 352 bits: 60.490ms (2218.842M ops/sec)
Fermat tests 320 bits: 38.432ms (3.410M ops/sec)
Fermat tests 352 bits: 47.956ms (2.733M ops/sec)
*** hashmod benchmark ***
MHash per second: 550.495
Hash per iteration: 37.938 (0.000452 %)
Average hash multiplier size: 30.712
*** sieve (check) benchmark ***
* [OK] found candidates by CPU: 8077 by GPU: 8082
* [OK] invalid candidates: 0
* [OK] CPU/GPU candidates difference: 0
*** sieve (performance) benchmark ***
* scan speed: 94.933 G
* iteration time: 5.799ms
* candidates per second: 1282165.428
* candidates per iteration: 7435.11 (2711.41 320bit, 4723.70 352bit)
* 320bit/352bit ratio: 0.574/1
I also intrest in benchmarks of old GeForce GTX 6xx/7xx cards, 980Ti and Fury X
No problem. I can make benchmark with gtx 750ti...
750ti
found platform[0] name = 'NVIDIA CUDA'
Found 4 devices
Using device 0 as GPU 0
Using device 1 as GPU 1
Using device 2 as GPU 2
Using device 3 as GPU 3
Compiling ...
Source: 236815 bytes
binsize = 1550311 bytes
GeForce GTX 750 Ti; 5 compute units
square 320 bits: 180.000ms (745.654M ops/sec)
square 320 bits: 180.000ms (745.654M ops/sec)
multiply 320 bits: 255.000ms (526.344M ops/sec)
square 352 bits: 210.000ms (639.132M ops/sec)
multiply 352 bits: 260.000ms (516.222M ops/sec)
Fermat tests 320 bits: 230.000ms (0.570M ops/sec)
Fermat tests 352 bits: 285.001ms (0.460M ops/sec)
*** hashmod benchmark ***
MHash per second: 139.810
Hash per iteration: 36.734 (0.000438 %)
Average hash multiplier size: 30.654
*** sieve (check) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836
*** sieve (performance) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836
GeForce GTX 750 Ti; 5 compute units
square 320 bits: 180.001ms (745.650M ops/sec)
square 320 bits: 180.000ms (745.654M ops/sec)
multiply 320 bits: 257.002ms (522.244M ops/sec)
square 352 bits: 210.000ms (639.132M ops/sec)
multiply 352 bits: 265.001ms (506.480M ops/sec)
Fermat tests 320 bits: 235.001ms (0.558M ops/sec)
Fermat tests 352 bits: 277.002ms (0.473M ops/sec)
*** hashmod benchmark ***
MHash per second: 139.194
Hash per iteration: 38.797 (0.000462 %)
Average hash multiplier size: 30.667
*** sieve (check) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836
*** sieve (performance) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836
GeForce GTX 750 Ti; 5 compute units
square 320 bits: 180.000ms (745.654M ops/sec)
square 320 bits: 180.000ms (745.654M ops/sec)
multiply 320 bits: 250.000ms (536.871M ops/sec)
square 352 bits: 212.001ms (633.100M ops/sec)
multiply 352 bits: 262.001ms (512.279M ops/sec)
Fermat tests 320 bits: 230.000ms (0.570M ops/sec)
Fermat tests 352 bits: 280.000ms (0.468M ops/sec)
*** hashmod benchmark ***
MHash per second: 140.985
Hash per iteration: 36.391 (0.000434 %)
Average hash multiplier size: 30.745
*** sieve (check) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836
*** sieve (performance) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836
GeForce GTX 750 Ti; 5 compute units
square 320 bits: 180.000ms (745.654M ops/sec)
square 320 bits: 180.000ms (745.654M ops/sec)
multiply 320 bits: 255.001ms (526.342M ops/sec)
square 352 bits: 215.000ms (624.269M ops/sec)
multiply 352 bits: 265.001ms (506.480M ops/sec)
Fermat tests 320 bits: 232.004ms (0.565M ops/sec)
Fermat tests 352 bits: 277.002ms (0.473M ops/sec)
*** hashmod benchmark ***
MHash per second: 139.194
Hash per iteration: 37.313 (0.000445 %)
Average hash multiplier size: 30.561
*** sieve (check) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836
*** sieve (performance) benchmark ***
OpenCL error: -54 at /HDD/build/projects/xpmclient/xpmclient/benchmarks.cpp:836