Pages:
Author

Topic: [XPM] [ANN] Primecoin High Performance | HP14 released! - page 90. (Read 397658 times)

sr. member
Activity: 406
Merit: 250


Mined four blocks in the past 24-ish hours.

Holy. Shit.

(Win 2008, HP4, specs down in siggy)
legendary
Activity: 1274
Merit: 1000
Just mined two back to back blocks. Awesome.

Are you running linux? Any particular settings you'd like to share?  Wink
sr. member
Activity: 350
Merit: 250
Just mined two back to back blocks. Awesome.
sr. member
Activity: 406
Merit: 250
Hum. Strange. I've been finding more blocks than any of my friends. O_o maybe i'm having a crazy unreal luck-streak.
sr. member
Activity: 301
Merit: 250
I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.

It's 12MB of L2 cache, not L3 cache. Would having this much L2 cache help any significant amount?

Not really. Larger caches are also slower.
sr. member
Activity: 406
Merit: 250
I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.

It's 12MB of L2 cache, not L3 cache. Would having this much L2 cache help any significant amount?
member
Activity: 75
Merit: 10
Just wanted to say great miner!  I feel like this is probably the most optimized miner on the market right now.  Looking over the source code of the original miner and yours I think I see a way to make the miner (potentially much) faster from a mathematical standpoint rather than a code-optimization standpoint.  My optimization centers around the Primorial and the loop that occurs in main.cpp on line 4622.

Before getting into the code, it is important to realize why a primorial is helpful (if you already understand this, skip this paragraph).  With numbers on the order of 2256 there is about a 1 in 177 chance that a random number is prime.  If you select 8 random numbers, then, the odds of all of them being prime is about 1 in 1 quintillion--impractically low.  If you limit your search to only odd numbers, though, the odds shoot up tremendously.  Further limiting your search to numbers not divisible by 3, 5, 7, and so on can cause the odds of finding a prime number to become much, much better.  If the hash value is divisible by lots of these small numbers then multiples of that hash ± 1 will not be divisible by any of those numbers.  Thus, it is convenient to use a hash that is already a multiple of 2 * 3 * 5 * 7 * ... as it will produce far more primes than another hash.

As I understand the aforementioned loop, the program is searching for a hash that is divisible by a primorial.  Each iteration of this loop requires a hash to be generated as it increments the nonce value.  In the present form the primorail is of degree 7 (line 4579: static const unsigned int nPrimorialHashFactor = 7).  I suspect that this value is carefully chosen and it's probably ideal with the way that it is written.  However, I think there's an additional step that can be added to make the process much faster.  Increasing the degree of the primorial is incredibly expensive as it gets larger, since adding the 8th prime requires 19 times as many hashes to be checked; the 9th prime requires 23 times as many, and so on.  There is another way, though.

Prime origins are required to be in the form O = H * N where O is the origin, H is the hash, and N is any integer; H is selected to be divisible by p#7 (the primorial shown above).  If we extend this to O = H * N * P2 where P2 is 19 * 23 * 29 * ... * 51--a product of primes starting after the last prime used in the existing search--then the checked origin is still valid (an integer times an integer is still an integer).  This grants the benefits of searching with a higher degree primorial while not requiring a longer search for a working hash.  

Nothing is free, though, as this method inflates the size of the primes to be checked.  If the fast modular exponentiation is implemented through the same method as is used on the Fermat Primality Test Wikipedia page then the algorithmic efficiency is O(log2(n) * log(log(n)) * log(log(log(n))) ).  There should be some sweet spot of how large of a primorial to use where the increased frequency of primes more than offsets the extra time required for the fast modular exponentiation.  It's possible that the sweet spot is 0 extra primes, but I think it's worth looking into.

Definitely a great. I haven't had the time to study the code from a mathematical perspective, so this is helping me understand the code better. Smiley

Unfortunately it seems that Sunny King was ahead of you here. The code is already doing what you proposed. It uses something called a round primorial which is dynamically adjusted. You can see these round primorials in debug.log if you enable the debugging options -debug -printmining. A fixed multiplier is calculated through division by the first primorial used to choose the hash value. This fixed multiplier corresponds to your P2.

I think some improvements could be made to the dynamic adjustment system. It seems to be picking lots of different values.

Darn.  I guess Sunny was on his game!  At any rate, I hope this at least opens a new door for additional tuning of the miner in the wild.  Perhaps a system that does away with the dynamic tuning in favor of a conf file/command line parameter.  I don't have the time to crawl through the source code right now--friend is getting married--but I'll take a look when I get back.
legendary
Activity: 1066
Merit: 1098
I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.

Running hp4 on a CPU with a 10M L3 cache, I found that if I set the sievesize any larger than 4M, the program would crash within seconds of startup.  What should be the maximum sievesize?

I am using hp5 now, but I have not checked to see if it still crashes.


sr. member
Activity: 301
Merit: 250
I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!

Well, it might run a tiny bit faster if you have a large L3 cache. I wrote most of the code to run fast using the L1 and L2 caches, so the L3 cache is actually not that important. But of course it never hurts to have a big L3 cache. Wink

GPU architecture also has a big impact.
sr. member
Activity: 406
Merit: 250
I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.

Mikaelh. I have a question for you.

My CPU's have 12MB of cache each. Is that advantageous for primecoin mining? I have noticed I have been getting significantly higher primespersec than most of my friends when we use larger seivesizes, even when they have newer processors!
sr. member
Activity: 301
Merit: 250
I posted a new compilation guide for Linux:
https://bitcointalksearch.org/topic/xpm-primecoin-high-performance-linux-compilation-guide-259022

It shows you how to compile your own libgmp and everything else.
sr. member
Activity: 406
Merit: 250
I just mined 3 blocks in a row... holy crap.

legendary
Activity: 1246
Merit: 1000
full member
Activity: 238
Merit: 100
Does anyone have a guide for building on Windows.  I can't get the static libraries to bind using MingW.
newbie
Activity: 18
Merit: 0
Nah, tryed it once with gen=0 and then another time with gen=1 but i never maintained to get a block...
That is why i am asking if the conf file is important for the mining, if yes how must it be configured then....
thx
newbie
Activity: 37
Merit: 0
Sorry i am new to all this ...
But there is one thing i dont get.
I tryed to run it with a primecoin.conf file and changed parameters like rpcuser ... rpcpassword ... gen=0 gen=1 and also with rpcport=9910 rpcallowip=127.0.0.1 with server=1 and daemon=1.
I did not receive any block in earlier stages ... difficulty was past 8 to this stage ...
as i deleted the conf file and run the wallet without it i received two blocks in two hours ...
How can that be ... Luck ? I was using hp3 ...
So my question is is the conf file needed or not ??

Thx

Sounds like coincidence.

But why did you have both gen=0 and gen=1 in there?
member
Activity: 109
Merit: 10
running i5 at 2000pps. got about 5% more pps from hp4 to hp 5. Only found 2 blocks since release of first client. got them in the first few days, havent had one since on solo. ypool sucks had a second pc pool mining with ypool way too many server rejected shares.
hero member
Activity: 812
Merit: 1000
Day one, 1400pps, 8 core nehalem rig running HP5.
0 blocks. Cheesy
member
Activity: 100
Merit: 10
to be honest .... almost 24 hours

same here 3 pc setup,  4000/3500/1700 pps  all on hp5  no luck more than 24 hours
member
Activity: 100
Merit: 10
same here 3 pc setup,  4000/3500/1700 pps  all on hp5  no luck more than 24 hours
Pages:
Jump to: