[XPM] [ANN] Primecoin High Performance | HP14 released! - page 90.

ReCat

sr. member

Activity: 406

Merit: 250

Mined four blocks in the past 24-ish hours.

Holy. Shit.

(Win 2008, HP4, specs down in siggy)

1l1l11ll1l

legendary

Activity: 1274

Merit: 1000

Quote from: n4ru on July 19, 2013, 10:38:32 AM

Just mined two back to back blocks. Awesome.

Are you running linux? Any particular settings you'd like to share? Wink

n4ru

sr. member

Activity: 350

Merit: 250

Just mined two back to back blocks. Awesome.

ReCat

sr. member

Activity: 406

Merit: 250

Hum. Strange. I've been finding more blocks than any of my friends. O_o maybe i'm having a crazy unreal luck-streak.

mikaelh

sr. member

Activity: 301

Merit: 250

Quote from: ReCat on July 19, 2013, 10:23:50 AM

Quote from: mikaelh on July 19, 2013, 10:10:51 AM

Quote from: ReCat on July 19, 2013, 10:02:21 AM

Quote from: mikaelh on July 19, 2013, 09:51:54 AM

I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.

It's 12MB of L2 cache, not L3 cache. Would having this much L2 cache help any significant amount?

Not really. Larger caches are also slower.

ReCat

sr. member

Activity: 406

Merit: 250

Quote from: mikaelh on July 19, 2013, 10:10:51 AM

Quote from: ReCat on July 19, 2013, 10:02:21 AM

Quote from: mikaelh on July 19, 2013, 09:51:54 AM

I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.

It's 12MB of L2 cache, not L3 cache. Would having this much L2 cache help any significant amount?

Koooooj

member

Activity: 75

Merit: 10

Quote from: mikaelh on July 19, 2013, 04:29:58 AM

Quote from: Koooooj on July 18, 2013, 06:51:10 PM

Just wanted to say great miner! I feel like this is probably the most optimized miner on the market right now. Looking over the source code of the original miner and yours I think I see a way to make the miner (potentially much) faster from a mathematical standpoint rather than a code-optimization standpoint. My optimization centers around the Primorial and the loop that occurs in main.cpp on line 4622.

Before getting into the code, it is important to realize why a primorial is helpful (if you already understand this, skip this paragraph). With numbers on the order of 2²⁵⁶ there is about a 1 in 177 chance that a random number is prime. If you select 8 random numbers, then, the odds of all of them being prime is about 1 in 1 quintillion--impractically low. If you limit your search to only odd numbers, though, the odds shoot up tremendously. Further limiting your search to numbers not divisible by 3, 5, 7, and so on can cause the odds of finding a prime number to become much, much better. If the hash value is divisible by lots of these small numbers then multiples of that hash ± 1 will not be divisible by any of those numbers. Thus, it is convenient to use a hash that is already a multiple of 2 * 3 * 5 * 7 * ... as it will produce far more primes than another hash.

As I understand the aforementioned loop, the program is searching for a hash that is divisible by a primorial. Each iteration of this loop requires a hash to be generated as it increments the nonce value. In the present form the primorail is of degree 7 (line 4579: static const unsigned int nPrimorialHashFactor = 7). I suspect that this value is carefully chosen and it's probably ideal with the way that it is written. However, I think there's an additional step that can be added to make the process much faster. Increasing the degree of the primorial is incredibly expensive as it gets larger, since adding the 8th prime requires 19 times as many hashes to be checked; the 9th prime requires 23 times as many, and so on. There is another way, though.

Prime origins are required to be in the form O = H * N where O is the origin, H is the hash, and N is any integer; H is selected to be divisible by p#7 (the primorial shown above). If we extend this to O = H * N * P₂ where P₂ is 19 * 23 * 29 * ... * 51--a product of primes starting after the last prime used in the existing search--then the checked origin is still valid (an integer times an integer is still an integer). This grants the benefits of searching with a higher degree primorial while not requiring a longer search for a working hash.

Nothing is free, though, as this method inflates the size of the primes to be checked. If the fast modular exponentiation is implemented through the same method as is used on the Fermat Primality Test Wikipedia page then the algorithmic efficiency is O(log²(n) * log(log(n)) * log(log(log(n))) ). There should be some sweet spot of how large of a primorial to use where the increased frequency of primes more than offsets the extra time required for the fast modular exponentiation. It's possible that the sweet spot is 0 extra primes, but I think it's worth looking into.

Definitely a great. I haven't had the time to study the code from a mathematical perspective, so this is helping me understand the code better.

Unfortunately it seems that Sunny King was ahead of you here. The code is already doing what you proposed. It uses something called a round primorial which is dynamically adjusted. You can see these round primorials in debug.log if you enable the debugging options -debug -printmining. A fixed multiplier is calculated through division by the first primorial used to choose the hash value. This fixed multiplier corresponds to your P₂.

I think some improvements could be made to the dynamic adjustment system. It seems to be picking lots of different values.

Darn. I guess Sunny was on his game! At any rate, I hope this at least opens a new door for additional tuning of the miner in the wild. Perhaps a system that does away with the dynamic tuning in favor of a conf file/command line parameter. I don't have the time to crawl through the source code right now--friend is getting married--but I'll take a look when I get back.

xyzzy099

legendary

Activity: 1066

Merit: 1098

Quote from: mikaelh on July 19, 2013, 10:10:51 AM

Quote from: ReCat on July 19, 2013, 10:02:21 AM

Quote from: mikaelh on July 19, 2013, 09:51:54 AM

I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.

Running hp4 on a CPU with a 10M L3 cache, I found that if I set the sievesize any larger than 4M, the program would crash within seconds of startup. What should be the maximum sievesize?

I am using hp5 now, but I have not checked to see if it still crashes.

mikaelh

sr. member

Activity: 301

Merit: 250

Quote from: ReCat on July 19, 2013, 10:02:21 AM

Quote from: mikaelh on July 19, 2013, 09:51:54 AM

I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.

ReCat

sr. member

Activity: 406

Merit: 250

Quote from: mikaelh on July 19, 2013, 09:51:54 AM

I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

mikaelh

sr. member

Activity: 301

Merit: 250

I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

ReCat

sr. member

Activity: 406

Merit: 250

I just mined 3 blocks in a row... holy crap.

ig0tik3d

legendary

Activity: 1246

Merit: 1000

Quote from: nmersulypnem on July 19, 2013, 08:41:54 AM

Does anyone have a guide for building on Windows. I can't get the static libraries to bind using MingW.

https://bitcointalksearch.org/topic/building-headless-bitcoin-and-bitcoin-qt-on-windows-149479

nmersulypnem

full member

Activity: 238

Merit: 100

Does anyone have a guide for building on Windows. I can't get the static libraries to bind using MingW.

Joe13

newbie

Activity: 18

Merit: 0

Nah, tryed it once with gen=0 and then another time with gen=1 but i never maintained to get a block...
That is why i am asking if the conf file is important for the mining, if yes how must it be configured then....
thx

Krusher33

newbie

Activity: 37

Merit: 0

Quote from: Joe13 on July 19, 2013, 05:06:32 AM

Sorry i am new to all this ...
But there is one thing i dont get.
I tryed to run it with a primecoin.conf file and changed parameters like rpcuser ... rpcpassword ... gen=0 gen=1 and also with rpcport=9910 rpcallowip=127.0.0.1 with server=1 and daemon=1.
I did not receive any block in earlier stages ... difficulty was past 8 to this stage ...
as i deleted the conf file and run the wallet without it i received two blocks in two hours ...
How can that be ... Luck ? I was using hp3 ...
So my question is is the conf file needed or not ??

Thx

Sounds like coincidence.

But why did you have both gen=0 and gen=1 in there?

bitrich

member

Activity: 109

Merit: 10

running i5 at 2000pps. got about 5% more pps from hp4 to hp 5. Only found 2 blocks since release of first client. got them in the first few days, havent had one since on solo. ypool sucks had a second pc pool mining with ypool way too many server rejected shares.

ivanlabrie

hero member

Activity: 812

Merit: 1000

Day one, 1400pps, 8 core nehalem rig running HP5.
0 blocks. Cheesy

jammertr

member

Activity: 100

Merit: 10

to be honest .... almost 24 hours

Quote from: jammertr on July 19, 2013, 07:22:27 AM

same here 3 pc setup, 4000/3500/1700 pps all on hp5 no luck more than 24 hours

jammertr

member

Activity: 100

Merit: 10

same here 3 pc setup, 4000/3500/1700 pps all on hp5 no luck more than 24 hours

Topic: [XPM] [ANN] Primecoin High Performance | HP14 released! - page 90. (Read 397658 times)