Topic: CCminer(SP-MOD) Modded NVIDIA Maxwell / Pascal kernels. - page 936. (Read 2347664 times)

sr. member
Activity: 438
Merit: 250
...

I was mining ETH instead of Vert for a while, but right now a 280X mines almost twice as fast as a 970, so after all the AMD miners hopped on board it was no longer all that profitable. It's about break-even with Vert right now, though still sometimes more profitable, despite being slower.


ETH can probably run much faster on the high-end cards if some of the memory accesses can be replaced with computation.

ETH performance depends on the DAG file. Without a random file, every random access lands inside the cache and the hashrate explodes.
Duh?! There is no way to get rid of those... Otherwise, as Myaguy was explaining, it may be possible to calculate some of the DAG elements, hence reducing memory usage.

Have you guys considered more compression?

Thinking about it, but it isn't really obvious it would help... We'd need to find a compression algorithm that works on it. (I tried zipping the file, just to get an idea of what could be done, and ended up with a larger file...) So it seems there isn't much to compress (well, 1.2 GB of integers...). For info, for a 780 Ti to work correctly on Windows 8.1, only 1.3% less DAG is required, but that 1.3% is still too much data to store anywhere else (too large for either registers or texture/shared memory).

The GTX 980 Ti, with 50% more bandwidth, has similar performance to a GTX 980: there should be some bottleneck elsewhere...
An AMD 280X has better performance...

It's not like the algo loads large contiguous blocks of memory like texture maps. Instead it loads a random 128-byte chunk 64 times, where the index of each can only be found based on the previous one. So the available bandwidth cannot be fully utilized. I've already tried running two of those loops in parallel, but haven't been successful at that yet.
So where does the AMD advantage come from? A better memory controller with tighter timings?

I think it's the wider memory bus and therefore higher overall bandwidth. In games, NVIDIA makes up for that with compression, but here there is nothing to compress in random 128-byte chunks of hashed hashes. The efficiency of AMD vs. NVIDIA is about the same. For a GTX 980 that hashes at about 19 MH/s, the theoretical max hashrate is 224 GB/s / 8 KB per hash = 28 MH/s, and 19 / 28 ≈ 68%. For a 280/X or 290/X that hashes at about 27 MH/s on average, the theoretical max is 320 GB/s / 8 KB per hash = 40 MH/s, and 27 / 40 ≈ 67.5%.
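
As a rough check of the arithmetic above, here is a minimal Python sketch. It assumes memory bandwidth is the only limit and that each hash reads 64 chunks of 128 bytes, as described earlier in the thread. The function names are made up for illustration. The post rounds 8,192 bytes to 8 KB = 8,000 bytes, which is what the `bytes_per_hash=8000` calls reproduce.

```python
ACCESSES_PER_HASH = 64    # from the post: 64 dependent reads per hash
BYTES_PER_ACCESS = 128    # from the post: each read is a 128-byte chunk
EXACT_BYTES_PER_HASH = ACCESSES_PER_HASH * BYTES_PER_ACCESS  # 8192


def max_hashrate_mhs(bandwidth_gb_s, bytes_per_hash=EXACT_BYTES_PER_HASH):
    """Upper bound in MH/s if memory bandwidth were the only limit."""
    return bandwidth_gb_s * 1e9 / bytes_per_hash / 1e6


def bandwidth_efficiency(measured_mhs, bandwidth_gb_s, bytes_per_hash=EXACT_BYTES_PER_HASH):
    """Fraction of the bandwidth-bound maximum that is actually reached."""
    return measured_mhs / max_hashrate_mhs(bandwidth_gb_s, bytes_per_hash)


if __name__ == "__main__":
    # Measured hashrates and bandwidths are the figures quoted in the post.
    for name, measured, bandwidth in [("GTX 980", 19, 224), ("280X/290X", 27, 320)]:
        bound = max_hashrate_mhs(bandwidth, bytes_per_hash=8000)
        eff = bandwidth_efficiency(measured, bandwidth, bytes_per_hash=8000)
        print(f"{name}: bound {bound:.1f} MH/s, efficiency {eff:.1%}")
```

With the exact 8,192 bytes per hash instead of the rounded 8,000, the bounds drop slightly (about 27.3 and 39.1 MH/s) and both efficiencies come out near 69%, so the conclusion that the two vendors sit at roughly the same fraction of their bandwidth limit still holds.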


 
legendary
Activity: 2940
Merit: 1091
--- ChainWorks Industries ---
Can anyone compare the 750 Ti, 950, 960, and 970 for power efficiency? Hashrate per watt?! :)


+1

sp probably can ...

don't you have all those cards, sp? ...

;) ...

#crysx
iom
newbie
Activity: 31
Merit: 0
Can anyone compare the 750 Ti, 950, 960, and 970 for power efficiency? Hashrate per watt?! :)


+1
legendary
Activity: 2940
Merit: 1091
--- ChainWorks Industries ---
So where does the AMD advantage come from? A better memory controller with tighter timings?

Better OpenCL coding, possibly? ...

#crysx
sr. member
Activity: 445
Merit: 255
It's not like the algo loads large contiguous blocks of memory like texture maps. Instead it loads a random 128-byte chunk 64 times, where the index of each can only be found based on the previous one. So the available bandwidth cannot be fully utilized. I've already tried running two of those loops in parallel, but haven't been successful at that yet.
So where does the AMD advantage come from? A better memory controller with tighter timings?
sr. member
Activity: 438
Merit: 250
The GTX 980 Ti, with 50% more bandwidth, has similar performance to a GTX 980: there should be some bottleneck elsewhere...
An AMD 280X has better performance...

It's not like the algo loads large contiguous blocks of memory like texture maps. Instead it loads a random 128-byte chunk 64 times, where the index of each can only be found based on the previous one. So the available bandwidth cannot be fully utilized. I've already tried running two of those loops in parallel, but haven't been successful at that yet.
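
To make the access pattern described here concrete, below is a toy Python sketch of a dependent random walk: 64 reads of 128-byte chunks, where each index is derived from the data mixed in so far. It is not the real ETH kernel. The tiny random "DAG", the FNV-style mixing function, and all names are assumptions chosen only to show why the next read cannot be issued before the previous one has returned.

```python
import random

CHUNK_WORDS = 32        # 128 bytes = 32 x 32-bit words
ACCESSES = 64           # from the post: 64 dependent reads per hash
FNV_PRIME = 0x01000193  # assumption: FNV-style mixing, for illustration only


def fnv(a, b):
    """Mix two 32-bit words into one 32-bit word."""
    return ((a * FNV_PRIME) ^ b) & 0xFFFFFFFF


def make_toy_dag(num_chunks, seed=0):
    """Build a small table of random 128-byte chunks (stand-in for the DAG)."""
    rng = random.Random(seed)
    return [[rng.getrandbits(32) for _ in range(CHUNK_WORDS)]
            for _ in range(num_chunks)]


def dependent_walk(dag, start_word):
    """Do 64 reads; each index depends on the result of the previous read."""
    mix = [start_word] * CHUNK_WORDS
    visited = []
    for i in range(ACCESSES):
        # The index needs the current mix, so reads cannot be issued ahead.
        index = fnv(i ^ start_word, mix[i % CHUNK_WORDS]) % len(dag)
        visited.append(index)
        chunk = dag[index]
        mix = [fnv(m, c) for m, c in zip(mix, chunk)]
    return visited, mix


if __name__ == "__main__":
    dag = make_toy_dag(1024, seed=1)
    visited, _ = dependent_walk(dag, start_word=12345)
    print(visited[:8])
```

Because every index depends on the previous chunk, a single walk is limited by memory latency rather than raw bandwidth. Running many independent walks at once is the usual way to hide that latency, which is presumably what the "two loops in parallel" experiment was aiming for.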
sr. member
Activity: 248
Merit: 250
Can anyone compare the 750 Ti, 950, 960, and 970 for power efficiency? Hashrate per watt?! :)
sr. member
Activity: 445
Merit: 255
Thinking about it, but it isn't really obvious it would help... We'd need to find a compression algorithm that works on it. (I tried zipping the file, just to get an idea of what could be done, and ended up with a larger file...) So it seems there isn't much to compress (well, 1.2 GB of integers...). For info, for a 780 Ti to work correctly on Windows 8.1, only 1.3% less DAG is required, but that 1.3% is still too much data to store anywhere else (too large for either registers or texture/shared memory).

The GTX 980 Ti, with 50% more bandwidth, has similar performance to a GTX 980: there should be some bottleneck elsewhere...
An AMD 280X has better performance...
ETH mining is a big win for AMD miners... On the other hand, the GTX 750 Ti has quite good performance; it is the only card on the NVIDIA side that is competitive for ETH.
legendary
Activity: 2716
Merit: 1094
Black Belt Developer
Have you guys considered more compression?
Thinking about it, but it isn't really obvious it would help... We'd need to find a compression algorithm that works on it. (I tried zipping the file, just to get an idea of what could be done, and ended up with a larger file...) So it seems there isn't much to compress (well, 1.2 GB of integers...). For info, for a 780 Ti to work correctly on Windows 8.1, only 1.3% less DAG is required, but that 1.3% is still too much data to store anywhere else (too large for either registers or texture/shared memory).

For sure it was bigger... It's like a random file, you can't compress it!
(Lossless compression relies on repeating patterns and uneven byte distribution.)
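
The point about random data can be checked with a few lines of Python and the standard `zlib` module. This is only a sketch: the seeded random bytes are a stand-in for DAG contents (an assumption, since the real file is not used here), and `compression_ratio` is a helper name made up for the example.

```python
import random
import zlib


def compression_ratio(data):
    """Compressed size divided by original size (below 1.0 means it shrank)."""
    return len(zlib.compress(data, 9)) / len(data)


rng = random.Random(0)  # fixed seed so the run is repeatable

# Stand-in for DAG contents: bytes with no pattern and an even distribution.
random_data = bytes(rng.getrandbits(8) for _ in range(100_000))

# Same size, but with an obvious repeating pattern.
repetitive_data = b"0123456789abcdef" * 6_250

if __name__ == "__main__":
    print(f"random:     {compression_ratio(random_data):.3f}")
    print(f"repetitive: {compression_ratio(repetitive_data):.3f}")
```

The repetitive buffer shrinks to a small fraction of its size, while the random one does not shrink at all and picks up a little container overhead, which matches the "zipped file came out larger" experience reported above.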
member
Activity: 81
Merit: 10
Whirlpoolx is almost unique in how it was conceived: it really looks like it was made to favour smart developers.
Most of the optimisations come from "shortcuts" rather than a general algorithm speedup. In the end it's just iterated Whirlpool (which in turn is similar to Groestl and other AES-derived algos).
So I guess that a 970 with all the "shortcuts" in place should be about as good as a 280X (500 MH/s).

That's why X11 is safer. Supported by Daesh..
I wonder why the poor NVIDIA miners are skipping the Ether algo... Such profit loss.


Have you guys considered more compression?

Thinking about it, but it isn't really obvious it would help... We'd need to find a compression algorithm that works on it. (I tried zipping the file, just to get an idea of what could be done, and ended up with a larger file...) So it seems there isn't much to compress (well, 1.2 GB of integers...). For info, for a 780 Ti to work correctly on Windows 8.1, only 1.3% less DAG is required, but that 1.3% is still too much data to store anywhere else (too large for either registers or texture/shared memory).
legendary
Activity: 1764
Merit: 1024
ETH performance depends on the DAG file. Without a random file, every random access lands inside the cache and the hashrate explodes.
Duh?! There is no way to get rid of those... Otherwise, as Myaguy was explaining, it may be possible to calculate some of the DAG elements, hence reducing memory usage.

Have you guys considered more compression?

i think a Nixie display is better



Nixie tubes are super expensive, but look cool.
sr. member
Activity: 427
Merit: 250
i think a Nixie display is better

member
Activity: 111
Merit: 10
Quote
Nice display HB9K!
I can't take credit for the added LED displays. Those are @Induktor's rigs. However, outputting info to syslog (then sending the data through a serial port) enables those displays to come alive.
If anyone else is interested, his schematics and build info are here:
https://litecointalk.org/index.php?topic=16800.msg277720#msg277720
Props to @Induktor then. Great rig. With a fourteen-segment display, can we get a vacuum tube display next? :)
sr. member
Activity: 427
Merit: 250
Quote
Nice display HB9K!

I can't take credit for the added LED displays. Those are @Induktor's rigs. However, outputting info to syslog (then sending the data through a serial port) enables those displays to come alive.

If anyone else is interested, his schematics and build info are here:

https://litecointalk.org/index.php?topic=16800.msg277720#msg277720
alz
full member
Activity: 227
Merit: 100

Oops, wrong thread.
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
SP_, which 750 Ti card model do you have? (Link on eBay/Amazon, please.)
I checked the power consumption on my Gigabyte card, and at 1420 core / 3300 mem it draws 44-45 W. But at that clock it only makes 4.6 MH/s on Lyra.
I also checked with -X1 ... -X17 and found that power consumption differs by only 1-2 W.
I have all the cards.. It's like the house of cards...
The Lyra algorithm is already using the bandwidth. The -X doesn't make any difference.
Sorry, I mean the lyra2v2 algo. You asked previously for power consumption with different -X parameters.
About all the cards: nice, but I was interested in which card model can do 5 MH/s at such low clocks (1388/2700)?

This card:

http://www.amazon.co.uk/Graphics-Gainward-nVidia-GeForce-192-Bit/dp/B00IIMVBPA

Windows 7 64-bit (sp-mod 68 x86), latest drivers


Yes, thanks for testing the wattage. Going from 44 W to 42 W is a 4.5% reduction of the power bill.
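
For anyone who wants to redo these numbers for their own card, here is a small Python sketch of the two calculations discussed in this exchange: the relative power saving and hashrate per watt. The helper names are made up, the 44.5 W input is simply the midpoint of the 44-45 W reading quoted above, and the "power bill" figure assumes the card's draw is the only thing being counted.

```python
def percent_reduction(before_w, after_w):
    """Relative drop in power draw, in percent."""
    return (before_w - after_w) / before_w * 100.0


def mhs_per_watt(hashrate_mhs, power_w):
    """Efficiency as megahashes per second for each watt drawn."""
    return hashrate_mhs / power_w


if __name__ == "__main__":
    # 44 W -> 42 W, the figures from the reply above.
    print(f"{percent_reduction(44, 42):.1f}% lower power draw")
    # 4.6 MH/s at roughly 44.5 W (assumed midpoint of the 44-45 W reading).
    print(f"{mhs_per_watt(4.6, 44.5):.3f} MH/s per watt")
```

The same two functions can be used to compare the 750 Ti, 950, 960, and 970 once measured wall-power and hashrate figures for each card are available.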

newbie
Activity: 51
Merit: 0
SP_, which 750 Ti card model do you have? (Link on eBay/Amazon, please.)
I checked the power consumption on my Gigabyte card, and at 1420 core / 3300 mem it draws 44-45 W. But at that clock it only makes 4.6 MH/s on Lyra.
I also checked with -X1 ... -X17 and found that power consumption differs by only 1-2 W.

I have all the cards.. It's like the house of cards...

The Lyra algorithm is already using the bandwidth. The -X doesn't make any difference.


Sorry, I mean the lyra2v2 algo. You asked previously for power consumption with different -X parameters.

About all the cards: nice, but I was interested in which card model can do 5 MH/s at such low clocks (1388/2700)?
legendary
Activity: 1400
Merit: 1050

ETH can probably run much faster on the high-end cards if some of the memory accesses can be replaced with computation.

ETH performance depends on the DAG file. Without a random file, every random access lands inside the cache and the hashrate explodes.
Duh?! There is no way to get rid of those... Otherwise, as Myaguy was explaining, it may be possible to calculate some of the DAG elements, hence reducing memory usage.
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer

ETH can probably run much faster on the high-end cards if some of the memory accesses can be replaced with computation.

ETH performance depends on the DAG file. Without a random file, every random access lands inside the cache and the hashrate explodes.