Pages:
Author

Topic: What speed are your getting CPU mining TENEBRIX? - page 3. (Read 13645 times)

sr. member
Activity: 330
Merit: 397
what minerd startup options do you use ?

minerd --userpass 1:1 --url http://127.0.0.1:8697
full member
Activity: 168
Merit: 100
what minerd startup options do you use ?

Maybe 4way? ;-)
member
Activity: 112
Merit: 11
Hillariously voracious
What miner are you using, and what box config ?

Miner is the one here: https://github.com/Lolcust/Tenebrix-miner

My box is just a simple macbook with 2 CPUs and no modifications (except the fact that it's running Xubuntu).

what minerd startup options do you use ?
sr. member
Activity: 330
Merit: 397
What miner are you using, and what box config ?

Miner is the one here: https://github.com/Lolcust/Tenebrix-miner

My box is just a simple macbook with 2 CPUs and no modifications (except the fact that it's running Xubuntu).
member
Activity: 112
Merit: 11
Hillariously voracious


Huh

What miner are you using, and what box config ?
sr. member
Activity: 330
Merit: 397
hero member
Activity: 518
Merit: 500
So, on a 2600K with 8 threads I should get what !?

~3.8 kh/s per core

If I get my 2600K to do around 30 khash/s I would feel EPIC !
full member
Activity: 210
Merit: 100
Every little bit help!

Intel users will likely gain more from cranking Uncore than AMD with cpu-nb, just because the AMD doesn't have to use as much of it's L3 cache.
legendary
Activity: 1484
Merit: 1005
AMD users:  Try turning up the CPU-NB (CPU NorthBridge) speed, that's your L3 cache+memory controller.  It may help.

I get about a 2% gain from having my NB at 2600mhz
legendary
Activity: 1484
Merit: 1005
So, on a 2600K with 8 threads I should get what !?

~3.8 kh/s per core
full member
Activity: 210
Merit: 100
More than what you got without compiling AVX and such in.
hero member
Activity: 518
Merit: 500
For SandyBridge CPUs (corei7 with four digit model numbers).

git the package.
./configure
(screw the flags)
gedit Makefile
find "CFLAGS = "
Change that line to: CFLAGS = -march=native -O3 -Wall -msse2 -msse3 -msse4.1 -msse4.2 -msse4 -mavx
Brought my performance from 2.6kh/s per core (running one thread per core) to 3.66-3.7kh/s per core.

Tested with 6 threads, +2 kh/s in total.
HT works now!
Just needed some hardcore flags, that's all.

So, on a 2600K with 8 threads I should get what !?
full member
Activity: 210
Merit: 100
I've managed to get vaguely close, but not really close enough, their (stupid, IMO) decision to gimp L2 is costing us badly.

It really is rather like nvidia vs ATI.
nvidia has focused on floating point performance, in Folding@home and other floating point stuff they stomp ATI.
ATI (AMD, whatever) has focused on integer performance.  Along comes the integer based Bitcoin, and holy crap does ATI compute like mad.


Except in this case AMD spent 6 years doing minor improvements to a core that came out in 2005 (2004?  Long ago) and spent the money earned from stomping the crap out of P4s on CEO paychecks.  Intel meanwhile dumped money into R&D like mad and made the old P3 into the Core2, then with a nice performance lead they kept pouring money into R&D.
AMD meanwhile got left in the dust, axed the dumbshit CEO and started a crash program to get caught up, part of that program was to sacrifice some money in the form of CPU die size to get a little bit of performance, to get a little bit closer to intel.
It hurts their profit margins (which are crap, roughly speaking), but the die size spent on "excessive" L2 cache has paid off big time for Scrypt type CPU mining.
Go figure.
Bulldozer they're cranking the L2 up somewhat as it's on 32nm instead of 45nm and doesn't cost quite as badly, plus more L2 really does help performance, even with a ton of L3 floating around.


Anyway, if you're an intel user enable as many optimization flags as your CPU supports and that'll help somewhat, as well higher core speed obviously (and not obviously, as L3 speed = core speed on sandybridge chips).
If you're on a 1156 or 1366 (three digit core i 7 model numbers), crank up the "uncore" speed, as that's your L3 and L3 speed is the big bottleneck here.  It also has the memory controller in it.
Ram speed seems to help a bit too, though not a ton.

AMD users:  Try turning up the CPU-NB (CPU NorthBridge) speed, that's your L3 cache+memory controller.  It may help.


EDIT:
In theory, -mavx    can replace the entire -msseX mess, but in theory -march=native should do this whole lump automatically, and it doesn't.
full member
Activity: 210
Merit: 100
For SandyBridge CPUs (corei7 with four digit model numbers).

git the package.
./configure
(screw the flags)
gedit Makefile
find "CFLAGS = "
Change that line to: CFLAGS = -march=native -O3 -Wall -msse2 -msse3 -msse4.1 -msse4.2 -msse4 -mavx
Brought my performance from 2.6kh/s per core (running one thread per core) to 3.66-3.7kh/s per core.

Tested with 6 threads, +2 kh/s in total.
HT works now!
Just needed some hardcore flags, that's all.
hero member
Activity: 518
Merit: 500
Guys, why no Intel love !?
legendary
Activity: 1484
Merit: 1005
Not sure what's going on there (aka "can't reproduce issue").
My only guess is it's possibly related to removing the sha256 algos, but... that was even before I started doing the compilers job for scrypt.
Not sure what to do other than general hints along the lines of "start with a clean tree, CFLAGS="-whatever" ./configure; make"
Hrrrm... I guess you could revert the sha256 removal or drop the new scrypt.c into Tenebrix-miner and see if that also causes the same issues.

There is no scrypt.c in your latest released source of 1.0.2...  I'm looking at the tar.gz and it's missing it.
What release? The releases are done by Lolcust, I only keep a git at https://github.com/ArtForz/cpuminer

Oh, I guess it's not a release then... you can get a compressed file of all the contents by clicking on "Downloads" link on the top right of your page ( https://github.com/ArtForz/cpuminer ), but for some strange reason the latest one is missing files.  Might be a github bug.
sr. member
Activity: 406
Merit: 257
That's pretty wild, faster than the 1055T if that's with hyperthreading on.  The Intel people in this thread are going to want to know what your configure options and compiler was for that.

Nope, that's with HT off.  Measurably less heat & power with HT off compared to on, and less than 1% difference in performance.  Latest ArtForz scrypt.c loses about 15% compared to the 2.85 high watermark with the same compiler & options on the previous version, FYI.  That's some very impressive hand optimization for a particular architecture.

Unfortunately even though I'm 2.8% of the tenebrix network I'm only pulling in about 1 BTC/day with the two i7s I have mining this (which still outperforms my 5830s by a factor of 4 in terms of power/$) so it'll be a while till I feel like buying an x6 phenom to try the same compiler/options/minerd version bake-off with.
Sad to hear that, as the current HEAD also improves speeds for K10 even when compiled with older gcc versions to near gcc-4.6.1 levels
Guess this one will have to stay in non-official state for now (looks like it also produces rather crap asm when compiled for 32 bit targets...).
full member
Activity: 154
Merit: 100
That's pretty wild, faster than the 1055T if that's with hyperthreading on.  The Intel people in this thread are going to want to know what your configure options and compiler was for that.

Nope, that's with HT off.  Measurably less heat & power with HT off compared to on, and less than 1% difference in performance.  Latest ArtForz scrypt.c loses about 15% compared to the 2.85 high watermark with the same compiler & options on the previous version, FYI.  That's some very impressive hand optimization for a particular architecture.

Unfortunately even though I'm 2.8% of the tenebrix network I'm only pulling in about 1 BTC/day with the two i7s I have mining this (which still outperforms my 5830s by a factor of 4 in terms of power/$) so it'll be a while till I feel like buying an x6 phenom to try the same compiler/options/minerd version bake-off with.
legendary
Activity: 889
Merit: 1000
Bitcoin calls me an Orphan
Anyone get a cpu miner to work on Solaris?

Yeah I know its a shot in the dark!
sr. member
Activity: 406
Merit: 257
Not sure what's going on there (aka "can't reproduce issue").
My only guess is it's possibly related to removing the sha256 algos, but... that was even before I started doing the compilers job for scrypt.
Not sure what to do other than general hints along the lines of "start with a clean tree, CFLAGS="-whatever" ./configure; make"
Hrrrm... I guess you could revert the sha256 removal or drop the new scrypt.c into Tenebrix-miner and see if that also causes the same issues.

There is no scrypt.c in your latest released source of 1.0.2...  I'm looking at the tar.gz and it's missing it.
What release? The releases are done by Lolcust, I only keep a git at https://github.com/ArtForz/cpuminer
Pages:
Jump to: