Pages:
Author

Topic: CCminer(SP-MOD) Modded NVIDIA Maxwell / Pascal kernels. - page 18. (Read 2347632 times)

legendary
Activity: 2716
Merit: 1094
Black Belt Developer
There are HBM equipped FPGAs already.
Problem is, even with restricted bitstreams, their ROI is close to infinity. Just like with ASICs.
legendary
Activity: 1764
Merit: 1024
Depending on the market there isn't demand for new ASICs (due to price of development and emission), FPGAs are the new ASICs and much more difficult to deal with. What further exasperates the issue is that the bitstreams for them are generally relegated to smokey back rooms where most people don't have access. So even if you find and buy the really expensive hardware, you don't have access to the software to run on them.

This is an awful lot like the scrypt days where people were making miners for algos and trading them in back rooms while ASICs were just starting to emerge. This entire last year or so has been dominated by this behavior, matched with market decline and over bloat of hash (along with dark hash) has lead to a relatively stark outlook.

GPUs DO have things that FPGAs don't have, which is a much lower price tag and memory. So then it comes down to hash per/$. If FPGAs give little to no advantage for the price you're paying for them, there is no reason to use them. Memory is only on a couple FPGAs and depending on which one you're talking about, it's not the best in the world, which further limits performance.

A lot of the current predicament, putting aside the market decline and bloating of hashrate caused by the boom in '18, has to do with coin devs being lazy. There are a lot of algos that are much more asic resistant then others, instead they do stupid shit like what RVN is doing by slightly altering their algo to maintain a brandname and appearing progressive, without actually tackling the problem. It's not even about having a silver bullet either, they can just swap for already available algos. MTP and Progpow for instance are very asic/fpga resistant (for now).
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
The problem is x16rv2 isn't any more ASIC resistant than v1. All it really needs is development
of a Tiger kernel which shouldn't be too difficult. The only thing that would prevent it is lack
of market demand. Lyra2v3 has a similar problem.

I expect a decline in the difficulty after the fork. Look what happened in the Beam II fork.

Quote
Permuted instructions can be worked around with a RAM code segment so it just increases RAM requirements.

You need to read the instruction from ram decode and execute. Difficult to make the fpga run at full speed.  You can create a superscalar version that execute more than one instruction per cycle, but still much slower than a static hash function. Or is it a faster way?
legendary
Activity: 1470
Merit: 1114
The problem is x16rv2 isn't any more ASIC resistant than v1. All it really needs is development
of a Tiger kernel which shouldn't be too difficult. The only thing that would prevent it is lack
of market demand. Lyra2v3 has a similar problem.

It's not easy to make an algo GPU friendly and ASIC resistant. It would have to target a resource
that GPUs have in abundance that would be too expensive to implement on an ASIC. I'm not aware of any.

Permuted instructions can be worked around with a RAM code segment so it just increases RAM requirements.
A bigger dataset has the same effect. Ironically Lyra2REv2 used a smaller dataset than Lyra2RE to give GPUs
an advantage over CPUs. Lyra2REv3 did not change the size of the dataset.

In the end coins that fork to new algos periodically as an anti-ASIC strategy do little more than create
a planned obsolenscence environment driving demand for new ASICs.
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
Let's see what the result will be after the fork. X16v2 will remove the ASIC'S and x16v3 can remove the fpga'a
Ravencoin could hardfork again in 2 months to a randomhash variant with permuted instructions in the hash x16v3, and then the FPGA's will have to mine something else. An optmized x16rv2 could do around 35MHASH on 65 watt's on the RTX 2060 SUPER.
legendary
Activity: 2716
Merit: 1094
Black Belt Developer
sorry to say but FPGA hashrate of x16rv2 will be the same as the old x16r
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
What about older cards?

Some RTX optimalizations doesn't work on gtx cards. In the code I have split execution on some of them rtx optimized kernel/gtx optimized, but not all of them.  If I do the splitting x17 will do around 24MHASH on the gtx 1080ti. ccminer 1.0 alexis is around 20MHASH.

F.ex reverting cubehash-shavite to the old version you gain a megahash on 1080ti.

To get the opensource up to date with the latest fee miners is more work. The opensource SIMD is slow and need to be rewritten. I have extracted the latest t-rex ptx code. PM me if you want to help reverse engineer to cuda and opensource.
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
X16rv2 is FPGA shit algo, you better start optimizing the FPGA bitsream to 5Ghz or so for a fee  Grin

I'm just testing the new cards and algo. I've got a rtx 2070 and a rtx 2060 SUPER.
In x16rv2 I managed to remove the new tiger192 completely from the SHA512, and partly on luffa and keccak. By merging the tiger into the other kernels the gpu can do the new multiplications and AES in the tiger192 in parallell.

So the new X16v2 will perform close to  the speed of the old x16r on the gpu, and fpga's will slow down mostly because of the multiplications.
A bether FGGA killer would be to generate PTX kernels runtime for each block by permuting the assembly instructions. The instuctions should include multiplications, logic, scrambling. The gpu miner will need to compile the ptx on the fly for every block before warping. Then the FPGA implementation would need to have an ALU (cpu emulation) and this will slow down alot.
legendary
Activity: 3164
Merit: 1003
Spmod-git #13 released.


changes from #12

-X16rv2 Around +30% faster on RTX 2060 super
-X17 +24% on RTX 2060 super

https://github.com/sp-hash/suprminer/commits/master

https://github.com/sp-hash/suprminer/releases


What about older cards?
legendary
Activity: 1901
Merit: 1024
X16rv2 is FPGA shit algo, you better start optimizing the FPGA bitsream to 5Ghz or so for a fee  Grin
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
Spmod-git #13 released.


changes from #12

-X16rv2 Around +30% faster on RTX 2060 super
-X17 +24% on RTX 2060 super

https://github.com/sp-hash/suprminer/commits/master

https://github.com/sp-hash/suprminer/releases

sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
Ravencoin will hardfork on the 1st of october and change the mining algo.

I have added support for x16rv2 in my opensource fork without any fee. My fork is the fastest opensource free miner for x16r and x16rv2.

https://github.com/sp-hash/suprminer/commits/master


-Added support for X16RV2
-Improved speed on rtx cards and 1660/1660ti (x16s,x16r,x16r2,x17)

cuda 9.2 32bit binary (compute 6.1+)

https://github.com/sp-hash/suprminer/releases
jr. member
Activity: 189
Merit: 2
Nothing is changed here.
Still milking the newbies  Cool
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
Do you have anything serious to sell?

sell? This fork free and opensource.

I think RandomX can become much faster. I will take closer look later
sr. member
Activity: 954
Merit: 250
I will be adding some more opensource speed to the NVIDIA monero miner (cryptonightR).

Here is my fork.

https://github.com/sp-hash/xmrig-nvidia


Just 1 commit with 2-5% more speed so far. Feel free to check it out. Didn't build a binary yet.
Do you have anything serious to sell?
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
I will be adding some more opensource speed to the NVIDIA monero miner (cryptonightR).

Here is my fork.

https://github.com/sp-hash/xmrig-nvidia


Just 1 commit with 2-5% more speed so far. Feel free to check it out. Didn't build a binary yet.
legendary
Activity: 3164
Merit: 1003

Sorry for off topic but I think many here have mined Boolberry BBR. I thought you may like to know BBR to Zano swap. And yes BBR will still be supported after the swap ends July 30th 2019.  Smiley

http://coinswap.zano.org/
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
sr. member
Activity: 954
Merit: 250
What algos exactly? Smiley

Beam and Grin. With private miners ofcourse..

But GRIN cannot be mined or traded by us cizitzens.

https://cointelegraph.com/news/crypto-exchange-bittrex-to-block-us-users-from-trading-in-32-cryptos


Profitable GPU coins are normally not in the top 100 list sorted by market cap..

Beam  (From Israel) is currently in 165th place


Why don't you make a good miner for Grin or Beam for sale?
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
What algos exactly? Smiley

Beam and Grin. With private miners ofcourse..

But GRIN cannot be mined or traded by us cizitzens.

https://cointelegraph.com/news/crypto-exchange-bittrex-to-block-us-users-from-trading-in-32-cryptos


Profitable GPU coins are normally not in the top 100 list sorted by market cap..

Beam  (From Israel) is currently in 165th place

Pages:
Jump to: