Pages:
Author

Topic: RANDOM-X on XEON... CACHE, FREQ'S OR CORES? - page 2. (Read 528 times)

full member
Activity: 241
Merit: 100
To Hash or not to Hash, that's what the question
February 08, 2020, 04:35:43 PM
#11
Quote
-exploiting 'xeon v3 turbo hack' can boost mining performance easily 20-30%.
- never heard of this, can you point me?

I got my e5 2070 v3 installed. temps are good. I Get 5800hs out of both. Mem runs on two channels, bought more sticks, will get them next week installed, curious if 4 channels will do any good...
Comparing to two E5 2665 V2, the v3's are less powerful. I get 6200-6300Hs out of V2s
legendary
Activity: 1176
Merit: 1015
February 08, 2020, 03:38:51 PM
#10
Excellent info, very informative. I don't have any Xeons but it was still quite interesting.

Thanks for identifying the 256K L2 dependency, I wasn't previously aware of it.

To make matters worse latest xeons have more than enough of L2 but guess what... some L3 is missing.

To be on topic, XEON... CACHE, FREQ'S OR CORES?, perfect answer is FREQ x CORES without HT.

OT: Most cpu-algos are not L2 limited, must Do Your Own Research.


legendary
Activity: 1470
Merit: 1114
February 08, 2020, 12:53:47 PM
#9

Had a chance to test many different v3 xeons and compare randomx performance to specs, some observations:


Excellent info, very informative. I don't have any Xeons but it was still quite interesting.

Thanks for identifying the 256K L2 dependency, I wasn't previously aware of it.

With improvements to turbo boost, boosting more cores, the benefits of mining with
N/2 threads is increased.
legendary
Activity: 1176
Merit: 1015
February 08, 2020, 05:25:02 AM
#8
- this is very reasonable, where did you get numbers from?

i have found locally 2670 v3, will install soon and see what they are capable of

Had a chance to test many different v3 xeons and compare randomx performance to specs, some observations:

-algo needs 256Kb L2 for every thread so hyperthreading is basically useless, disable it to save power or use cpu affinity to bind threads to 'real' cores.
-turbo bins, models with highest all core turbo are the best.
-tdp, on higher core count models after optimal tweaking you start hitting the tdp wall and throttling starts, negative vcore offset helps to a point where cpu crashes due to too low voltage.
-xeons run really cool, with a decent cooler heat is never a problem.
-exploiting 'xeon v3 turbo hack' can boost mining performance easily 20-30%.

What I forgot to try was some REALLY tight memory timings like with ryzens, v3 supports only DDR4-2133 and depending on how many memory sticks you have it runs on 1-4 way mode.

For example, looking at 2670 v3 I see it is a 12c/24t cpu with tdp of 120w, 2.3GHz base clock and turbo 3/3/3/3/3/3/3/4/5/6/8/8 meaning it will run on 2.6-3.1 GHz depending on how many threads you are using. 12 threads @2.6GHz will end up somewhere in 5k range.

Most likely you are limited also on your INTEL S2600CW2SR motherboard and the lack of tuning options and/ or possibility to use cheap ES/QS processors.

Do a search for 'List_of_Intel_Haswell-based_Xeon_microprocessors', really nice listing which clearly shows the most important specs for mining.



full member
Activity: 241
Merit: 100
To Hash or not to Hash, that's what the question
February 07, 2020, 04:45:35 PM
#7
Quote
2x 2696v3 ($800?) 36 threads @2.8GHz gives you 18k consuming 290w
- this is a bit too much of $ in...

Quote
2x 4627v3 ($100?) 20 threads @3GHz gives you 10k consuming 260w
- this is very reasonable, where did you get numbers from?

i have found locally 2670 v3, will install soon and see what they are capable of
legendary
Activity: 1176
Merit: 1015
February 06, 2020, 03:23:50 PM
#6
v3 and v4 Xeons are L2 limited on randomx no matter how much L3 they have.

2x 4627v3 ($100?) 20 threads @3GHz gives you 10k consuming 260w.

2x 2696v3 ($800?) 36 threads @2.8GHz gives you 18k consuming 290w.

To compare, L3 limited ryzen 3600 ($200) 12 threads @4.2GHz gives 7k consuming 100w.
legendary
Activity: 1470
Merit: 1114
February 05, 2020, 02:45:04 PM
#5
Adding some detail to phillipma's reply...

so for example E5 2670 V3 ( 12core 30mb 2,3ghz) vs E7 8893 (4core 45mb 3,2ghz)  
2670 should be better right?

devide core into cache.

so   30/12 = 2.5   this works  since the 2.5 is bigger then 2 by just a bit

joblo> this is a good match.

and

45/4 = 11.25   this works less since 11.25 is way bigger then   2

you can set threads used lower

if  you have say 40mb and 32 cores that is

40/32 = 1.25  this is under 2

so you would set this to

20 of the 32 cores

joblo> you have more cache than the  cores can use, need morecore with this much cache


and get

40/20 = 2

get cheap cpu's with big cache


A ratio of 2MB cache / core(thread) is optimum.
legendary
Activity: 4256
Merit: 8551
'The right to privacy matters'
February 05, 2020, 02:14:28 PM
#4
so for example E5 2670 V3 ( 12core 30mb 2,3ghz) vs E7 8893 (4core 45mb 3,2ghz)  
2670 should be better right?

devide core into cache.

so   30/12 = 2.5   this works  since the 2.5 is bigger then 2 by just a bit


and

45/4 = 11.25   this works less since 11.25 is way bigger then   2


you can set threads used lower

if  you have say 40mb and 32 cores that is

40/32 = 1.25  this is under 2

so you would set this to

20 of the 32 cores


and get

40/20 = 2

get cheap cpu's with big cache
full member
Activity: 241
Merit: 100
To Hash or not to Hash, that's what the question
February 05, 2020, 01:57:37 PM
#3
so for example E5 2670 V3 ( 12core 30mb 2,3ghz) vs E7 8893 (4core 45mb 3,2ghz)  
2670 should be better right?
legendary
Activity: 1470
Merit: 1114
February 04, 2020, 09:47:11 PM
#2
1. cache / cores

I believe RandomX use the same cache as the old Cryptonight, 2 MB/thread.
You only need as many cores as you have cache to support them.

2. Frequency

3. Cores, see 1.
full member
Activity: 241
Merit: 100
To Hash or not to Hash, that's what the question
February 04, 2020, 09:37:26 PM
#1
INTEL S2600CW2SR with dual 2011 sockets that support Xeon V3 and V4 cpus.Trying to figure out what is the best watt/$ processor for this board to run XMRIG? Would CACHE be the first priority then NUMBER OF CORES and then FREQUENCY or how should i prioritize factors involved>?
Board supports up to 140W per socket.
Thanks
Pages:
Jump to: