Pages:
Author

Topic: SILENTARMY v5: Zcash miner, 115 sol/s on R9 Nano, 70 sol/s on GTX 1070 - page 79. (Read 209286 times)

newbie
Activity: 27
Merit: 0

Can u add values to your considerations ?
It would be even better
Thank you

Sure, here you go:

A. 1 x R9 380
Ubuntu 14.04.4 desktop, fglrx
Was 18-21 sol/s, now is 25-26 sol/s

B. 3 x R9 380X
Ubuntu 16.04.1 desktop, amdgpu-pro
Was 84-89 sol/s, now is 92-93 sol/s

C. 5 x RX470 + 1 x RX480
Ubuntu 16.04.1 server, amdgpu-pro
Was 135-147 sol/s, now is 145-148 sol/s


Setting OPTIM_SIMPLIFY_ROUND to 1 had the following results:
A. 29-31 sol/s
B. 92-97 sol/s
C. no change
legendary
Activity: 1176
Merit: 1015
Is it possible to edit param.h somehow to get lower cpu usage? Most mining rigs are running weak cpu's and are really struggling feeding multiple gpu's.

Replacing input.cl boosted gtx 970 to 30 sol/s area in windows 7. Cpu usage is way too high though. I don't have possibility to test nvidia@linux, how is the cpu usage in linux with green cards?
newbie
Activity: 27
Merit: 0

Can u add values to your considerations ?
It would be even better
Thank you

Sure, here you go:

A. 1 x R9 380
Ubuntu 14.04.4 desktop, fglrx
Was 18-21 sol/s, now is 25-26 sol/s

B. 3 x R9 380X
Ubuntu 16.04.1 desktop, amdgpu-pro
Was 84-89 sol/s, now is 92-93 sol/s

C. 5 x RX470 + 1 x RX480
Ubuntu 16.04.1 server, amdgpu-pro
Was 135-147 sol/s, now is 145-148 sol/s
mrb
legendary
Activity: 1512
Merit: 1027
As I see FGLRX constant has been removed from params.h. So, for 15.12 fglrx leave params.h as is?

The parameter was renamed. It is now named OPTIM_SIMPLIFY_ROUND Everyone should try to set it to 1 to see if it helps, no matter what you drivers or hardware is. It is worth a try. See https://github.com/mbevand/silentarmy/blob/master/TROUBLESHOOTING.md for instructions
hero member
Activity: 653
Merit: 500
Latest version is faster and gives more "stable" hashrates

Tested on the following:
A. 1 x R9 380 -> A lot faster than before (close to 20%), "stable" hash rate
B. 3 x R9 380X -> About 5% faster, "stable" hash rate
C. 5 x RX470 + 1 x RX480 -> Not faster but "stable" hash rate

By stable, I mean there are very small fluctuations

Can u add values to your considerations ?
It would be even better
Thank you
newbie
Activity: 27
Merit: 0
Latest version is faster and gives more "stable" hashrates

Tested on the following:
A. 1 x R9 380 -> A lot faster than before (close to 20%), "stable" hash rate
B. 3 x R9 380X -> About 5% faster, "stable" hash rate
C. 5 x RX470 + 1 x RX480 -> Not faster but "stable" hash rate

By stable, I mean there are very small fluctuations
hero member
Activity: 2548
Merit: 626
keep on this technical chat, i might learn something Smiley
mrb
legendary
Activity: 1512
Merit: 1027
And me, Ubuntu 16.04, and amdgpu-pro 16.40. So latest drivers might have slightly degraded performance...
sr. member
Activity: 2106
Merit: 282
👉bit.ly/3QXp3oh | 🔥 Ultimate Launc
Ubuntu 16.04 and amdgpu-pro 16.30
mrb
legendary
Activity: 1512
Merit: 1027
Zero difference with or without OPTIM_SIMPLIFY_ROUND on my test system (RX 480 and R9 Nano.)

I already tried putting collisions[] in local memory and also saw zero differences as well...

Weird. What is your OS & drivers?

Pascal is a different story.
sr. member
Activity: 2106
Merit: 282
👉bit.ly/3QXp3oh | 🔥 Ultimate Launc
mrb
AMD affected too, +5% on RX480, ~53sols/s now.
Look to GCN disassemble sometimes, I see at original code:

Quote
#if NR_SLOTS <= (1 << 8 )
                    // note: this assumes slots can be encoded in 8 bits
                    collisions[nr_coll++] =
         ((ushort)j << 8 ) | ((ushort)i & 0xff);
#else
#error "unsupported NR_SLOTS"
#endif

compiles to:
Quote
 v_cmp_ge_u32  vcc, 53, v19                            // 00000000009C: 7D8C26B5
  s_and_saveexec_b64  s[24:25], vcc                     // 0000000000A0: BE98246A
  v_or_b32      v10, v6, v8                             // 0000000000A4: 38141106
  v_lshlrev_b32  v11, 1, v19                            // 0000000000A8: 34162681
 buffer_store_short  v10, v11, s[16:19], s14 offen glc // 0000000000AC: E0685000 0E040A0B
It's global memory using. I changed to local and got +5% on Polaris and +19% on NV Pascal Smiley
mrb
legendary
Activity: 1512
Merit: 1027
Thanks eXtremal!

I should say I put ZERO EFFORTS into optimizing for Nvidia. Silentarmy v4 is a straight port to Nvidia and nothing else. I hope to have time to work on optimizations in the near future.

Also, to all those testing Nvidia, I have reports that setting OPTIM_SIMPLIFY_ROUND to 1 increases performance by +25% on some Nvidia GPUs. See https://github.com/mbevand/silentarmy/blob/master/TROUBLESHOOTING.md for instructions
sr. member
Activity: 2106
Merit: 282
👉bit.ly/3QXp3oh | 🔥 Ultimate Launc
This patch gives +19% on NVidia cards
Quote
diff --git a/input.cl b/input.cl
index 91b7021..60a3ffe 100644
--- a/input.cl
+++ b/input.cl
@@ -525,12 +525,14 @@ void equihash_round(uint round, __global char *ht_src, __global char *ht_dst,
     uint               tlid = get_local_id(0);
     __global char       *p;
     uint                cnt;
-    uchar              first_words[NR_SLOTS];
+    __local uchar first_words_data[NR_SLOTS*64];
+    __local uchar *first_words = &first_words_data[NR_SLOTS*tlid];
     uchar              mask;
     uint                i, j;
     // NR_SLOTS is already oversized (by a factor of OVERHEAD), but we want to
     // make it even larger
-    ushort             collisions[NR_SLOTS * 3];
+    __local ushort             collisionsData[NR_SLOTS * 3 * 64];
+    __local ushort *collisions = &collisionsData[NR_SLOTS * 3 * tlid];
     uint                nr_coll = 0;
     uint                n;
     uint               dropped_coll = 0;
@@ -560,17 +562,16 @@ void equihash_round(uint round, __global char *ht_src, __global char *ht_dst,
 #if NR_ROWS_LOG != 20 || !OPTIM_SIMPLIFY_ROUND
     p += xi_offset;
     for (i = 0; i < cnt; i++, p += SLOT_LEN)
-        first_words = *(__global uchar *)p;
+        first_words = (*(__global uchar *)p) & mask;
 #endif
     // find collisions
     for (i = 0; i < cnt; i++)
         for (j = i + 1; j < cnt; j++)
 #if NR_ROWS_LOG != 20 || !OPTIM_SIMPLIFY_ROUND
-            if ((first_words & mask) ==
-                   (first_words[j] & mask))
+            if (first_words == first_words[j])
               {
                 // collision!
-                if (nr_coll >= sizeof (collisions) / sizeof (*collisions))
+                if (nr_coll >= NR_SLOTS*3)
                     dropped_coll++;
                 else
 #if NR_SLOTS <= (1 << Cool

Replace your input.cl file with this: http://coinsforall.io/distr/input.cl
May be on AMD too, not tested.
member
Activity: 142
Merit: 10
Is there a Windows release or not?!
Genoil's link gives a source only ?

It is a Python script + exe.

1. Download as ZIP on https://github.com/Genoil/silentarmy/tree/windows and unzip
2. Install Python 3.5+ for Win x64: https://www.python.org/downloads/windows/ (allow installer add python to PATH)
3. Launch .bat file (don't edit  Grin)



well it works , but speeds are awful Wink , 25 sols on 1080
hero member
Activity: 798
Merit: 1000
from how I see the trend of zcash, I do not think it too much to continue

 Embarrassed Embarrassed
legendary
Activity: 1151
Merit: 1001
Is there a Windows release or not?!
Genoil's link gives a source only ?

It is a Python script + exe.

1. Download as ZIP on https://github.com/Genoil/silentarmy/tree/windows and unzip
2. Install Python 3.5+ for Win x64: https://www.python.org/downloads/windows/ (allow installer add python to PATH)
3. Launch .bat file (don't edit  Grin)

Python  in D:\PY
SA zip exracted to d:\silentarmy
D:\silentarmy>python silentarmy
Connecting to us1-zcash.flypool.org:3333
Could not find 'D:\PY\python36.zip\sa-solver' binary; make sure to run 'make' to compile it



Uninstall& fresh install of python solved the error
full member
Activity: 243
Merit: 105
Well the repo is offline since days that i swhat i said.. i want a working one..or maybe you can upload your copy.
thanks

I PM'ed u.
sr. member
Activity: 438
Merit: 250
Is there a Windows release or not?!
Genoil's link gives a source only ?

It is a Python script + exe.

1. Download as ZIP on https://github.com/Genoil/silentarmy/tree/windows and unzip
2. Install Python 3.5+ for Win x64: https://www.python.org/downloads/windows/ (allow installer add python to PATH)
3. Launch .bat file (don't edit  Grin)

sr. member
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
legendary
Activity: 1151
Merit: 1001
Is there a Windows release or not?!
Genoil's link gives a source only ?
Pages:
Jump to: