I tested with driver version 332.17, since I've heard there is reduced performance with the latest drivers.
You might want to try -C 2 for scrypt:2048; I found it helped performance quite a bit.
With -C 2 I get "the 'T' kernel ignores the texture cache argument"
+1 for -C 2 with scrypt, and please try higher -L values: 3, 4 and maybe higher. Together with autotuning that could net you more than 4 kh/s (I hope) :p
thanks!
Autotune results with different lookup values
-L 3
Scrypt-jane with -L4 [2014-02-20 06:16:09] GPU #1: 2950.30 hash/s with configuration t20x6
[2014-02-20 06:29:35] GPU #0: using launch configuration t5x16
[2014-02-20 06:29:36] GPU #0: GeForce GTX 750 Ti, 2.98 khash/s
[2014-02-20 06:29:36] Total: 2.98 khash/s
[2014-02-20 06:29:41] GPU #0: GeForce GTX 750 Ti, 3.12 khash/s
[2014-02-20 06:29:41] Total: 3.12 khash/s
[2014-02-20 06:29:46] GPU #0: GeForce GTX 750 Ti, 3.12 khash/s
[2014-02-20 06:29:46] Total: 3.12 khash/s
[2014-02-20 06:29:51] GPU #0: GeForce GTX 750 Ti, 3.12 khash/s
-L 4
[2014-02-20 06:16:09] GPU #0: 2950.30 hash/s with configuration t20x6
[2014-02-20 06:16:09] GPU #0: using launch configuration t20x6
[2014-02-20 06:16:11] GPU #0: GeForce GTX 750 Ti, 2.88 khash/s
[2014-02-20 06:16:11] Total: 2.88 khash/s
[2014-02-20 06:16:16] GPU #0: GeForce GTX 750 Ti, 2.76 khash/s
[2014-02-20 06:16:16] Total: 2.76 khash/s
-L 5
[2014-02-20 06:48:03] GPU #1: using launch configuration t20x7
[2014-02-20 06:48:05] GPU #1: GeForce GTX 750 Ti, 2.26 khash/s
[2014-02-20 06:48:05] Total: 2.26 khash/s
[2014-02-20 06:48:10] GPU #1: GeForce GTX 750 Ti, 2.38 khash/s
[2014-02-20 06:48:58] Total: 2.38 khash/s
[2014-02-20 06:49:03] GPU #1: GeForce GTX 750 Ti, 2.46 khash/s
[2014-02-20 06:49:03] Total: 2.46 khash/s
-L 6 ends with cudaminer crashing and this scrolling past:
[2014-02-20 06:58:35] GPU #0: cudaError 30 (unknown error) calling 'cudaFree(h_V[thr_id][MAXWARPS[thr_id]]-h_V_extra[thr_id][MAXWARPS[thr_id]])' (D:/Christian/Documents/Visual Studio 2010/Projects/CudaMiner/salsa_kernel.cu line 875)
[2014-02-20 06:58:35] GPU #0: cudaError 30 (unknown error) calling 'cudaMalloc((void **) &tmp, mem_size)' (D:/Christian/Documents/Visual Studio 2010/Projects/CudaMiner/salsa_kernel.cu line 251)
[2014-02-20 06:58:35] GPU #0: cudaError 30 (unknown error) calling 'cudaMalloc((void **) &tmp, mem_size)' (D:/Christian/Documents/Visual Studio 2010/Projects/CudaMiner/salsa_kernel.cu line 252)
[2014-02-20 06:58:35] GPU #0: cudaError 30 (unknown error) calling 'cudaMalloc((void **) &tmp, mem_size)' (D:/Christian/Documents/Visual Studio 2010/Projects/CudaMiner/salsa_kernel.cu line 253)
[2014-02-20 06:58:35] GPU #0: cudaError 30 (unknown error) calling 'cudaMalloc((void **) &tmp, mem_size)' (D:/Christian/Documents/Visual Studio 2010/Projects/CudaMiner/salsa_kernel.cu line 254)
[2014-02-20 06:58:35] GPU #0: cudaError 30 (unknown error) calling 'cudaHostAlloc((void **) &tmp, state_size, cudaHostAllocDefault)' (D:/Christian/Documents/Visual Studio 2010/Projects/CudaMiner/salsa_kernel.cu line 257)
[2014-02-20 06:58:35] GPU #0: cudaError 30 (unknown error) calling 'cudaHostAlloc((void **) &tmp, state_size, cudaHostAllocDefault)' (D:/Christian/Documents/Visual Studio 2010/Projects/CudaMiner/salsa_kernel.cu line 258)
...snip...
EDIT: The -L 5 results were taken after I put my GTX 780 Ti back in the system (so it could be used for BF4 during the day), since I apparently forgot to save the single-card results. That is why the 750 Ti shows up as GPU #1 in that run.
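If anyone wants to compare runs without eyeballing the logs, here is a rough Python sketch that averages the per-GPU khash/s lines from a pasted log. It assumes the line format shown in the logs above; the function name `average_khash` and the regex are my own and not part of cudaminer. The sample text is the per-GPU lines from the -L 3 run.

```python
import re
from statistics import mean

# Matches per-GPU rate lines such as:
#   [2014-02-20 06:29:41] GPU #0: GeForce GTX 750 Ti, 3.12 khash/s
# "Total:" lines and autotune "hash/s with configuration" lines do not match.
RATE_LINE = re.compile(r"GPU #(\d+): .*?, ([\d.]+) khash/s")


def average_khash(log_text):
    """Return {gpu_index: mean khash/s} over all per-GPU rate lines."""
    rates = {}
    for line in log_text.splitlines():
        match = RATE_LINE.search(line)
        if match:
            gpu = int(match.group(1))
            rates.setdefault(gpu, []).append(float(match.group(2)))
    return {gpu: mean(values) for gpu, values in rates.items()}


# Per-GPU lines copied from the -L 3 run above.
L3_LOG = """\
[2014-02-20 06:29:36] GPU #0: GeForce GTX 750 Ti, 2.98 khash/s
[2014-02-20 06:29:36] Total: 2.98 khash/s
[2014-02-20 06:29:41] GPU #0: GeForce GTX 750 Ti, 3.12 khash/s
[2014-02-20 06:29:46] GPU #0: GeForce GTX 750 Ti, 3.12 khash/s
[2014-02-20 06:29:51] GPU #0: GeForce GTX 750 Ti, 3.12 khash/s
"""

if __name__ == "__main__":
    for gpu, rate in average_khash(L3_LOG).items():
        print(f"GPU #{gpu}: {rate:.2f} khash/s average")
```

The first reading after startup is included in the average, so short logs like these will be skewed a little by warm-up; drop the first sample per GPU if that matters.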