
Topic: CCminer(SP-MOD) Modded NVIDIA Maxwell / Pascal kernels. - page 788. (Read 2347664 times)

sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
DJM34's neoscrypt.

My test rig is showing +15.5% (GTX 970, 2x GTX 960, 1x GTX 750 Ti),
with a few validation errors.
But I have compiled with CUDA 7.5.


sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
[2016-01-24 10:24:24] GPU #3 Found nounce c0031ac7
[2016-01-24 10:24:25] GPU #0 Found nounce 000bbb86
[2016-01-24 10:24:30] GPU #2 Found nounce 8025d939
[2016-01-24 10:24:40] GPU #1 Found nounce 4029bd92
[2016-01-24 10:24:42] GPU #0 Found nounce 00aab9f6
[2016-01-24 10:24:42] GPU #0: GeForce GTX 970, 623
[2016-01-24 10:24:43] GPU #0 Found nounce 00bb9b78
[2016-01-24 10:24:43] GPU #0: GeForce GTX 970, 599
[2016-01-24 10:24:55] GPU #1 Found nounce 40522fa2
[2016-01-24 10:24:55] GPU #1: GeForce GTX 750 Ti, 169
[2016-01-24 10:25:00] GPU #0 Found nounce 015574ec
[2016-01-24 10:25:00] GPU #0: result for nonce $015574EC does not validate on CPU!
[2016-01-24 10:25:02] GPU #2 Found nounce 80d8d5c1
[2016-01-24 10:25:02] GPU #2: GeForce GTX 960, 366
[2016-01-24 10:25:11] GPU #0 Found nounce 01c362db
[2016-01-24 10:25:11] GPU #0: result for nonce $01C362DB does not validate on CPU!
[2016-01-24 10:25:29] GPU #0 Found nounce 02667a39
[2016-01-24 10:25:29] GPU #0: GeForce GTX 970, 619
member
Activity: 111
Merit: 10
djm very nice with the 30% increase.
member
Activity: 106
Merit: 10
Winners don't use drugs. My crypto kernels are profitable. Yours are not. Finish the pledge please...
lol, they are profitable to me  Grin mostly done, just had to add the names of the donors

add names of donors? ... nice touch ...

#crysx

I want my name added as king peanus
legendary
Activity: 2940
Merit: 1091
--- ChainWorks Industries ---
Winners don't use drugs. My crypto kernels are profitable. Yours are not. Finish the pledge please...
lol, they are profitable to me  Grin mostly done, just had to add the names of the donors

add names of donors? ... nice touch ...

#crysx
member
Activity: 106
Merit: 10
Im selling popcorn for 0.1 BTC with 500% faster pop.

But how are the temps and TDP?

My 980ti's run at 66 and i added extra butter

my popcorn brings all the miners to the yard cause my kernals are better then yalls.

i need another beer



why is no one working on x15 or x13..... 980ti speeds are not wut they should be meeee thinks.

HASHPOWER JUST DROPPED X13 and X15 ALGOS--

I am suspecting the coins associated with these algos are in the toilet further than others.       --scryptr

X15 had a flare for a few days, I think the coin was EverGreen. But yesterday it went wonky on zpool:
it was the highest paying algo and I was hashing away and submitting shares, but nothing was
registering at the pool.

Edit: The problem is that when an algo gets hot, it's too late to shift the optimization focus to it. By the time
something is delivered, the heat is off.

I think something wonky is going on with zpool regardless... I dumped a bunch of x11 on it when it was high and I rented real low... payout wasn't even close to what it was supposed to be. I'm gonna dump 50 MH/s of x15 on zpool for 24 hours and see if there is any fuckery going on.
legendary
Activity: 1470
Merit: 1114
Im selling popcorn for 0.1 BTC with 500% faster pop.

But how are the temps and TDP?

My 980ti's run at 66 and i added extra butter

my popcorn brings all the miners to the yard cause my kernals are better then yalls.

i need another beer



why is no one working on x15 or x13..... 980ti speeds are not wut they should be meeee thinks.

HASHPOWER JUST DROPPED X13 and X15 ALGOS--

I am suspecting the coins associated with these algos are in the toilet further than others.       --scryptr

X15 had a flare for a few days, I think the coin was EverGreen. But yesterday it went wonky on zpool:
it was the highest paying algo and I was hashing away and submitting shares, but nothing was
registering at the pool.

Edit: The problem is that when an algo gets hot, it's too late to shift the optimization focus to it. By the time
something is delivered, the heat is off.
member
Activity: 106
Merit: 10
Im selling popcorn for 0.1 BTC with 500% faster pop.

But how are the temps and TDP?

My 980ti's run at 66 and i added extra butter

my popcorn brings all the miners to the yard cause my kernals are better then yalls.

i need another beer



why is no one working on x15 or x13..... 980ti speeds are not wut they should be meeee thinks.

HASHPOWER JUST DROPPED X13 and X15 ALGOS--

I am suspecting the coins associated with these algos are in the toilet further than others.       --scryptr

zpool

0.3553*    x15

   EverGreen  coin i guess
legendary
Activity: 1797
Merit: 1028
Im selling popcorn for 0.1 BTC with 500% faster pop.

But how are the temps and TDP?

My 980ti's run at 66 and i added extra butter

my popcorn brings all the miners to the yard cause my kernals are better then yalls.

i need another beer



why is no one working on x15 or x13..... 980ti speeds are not wut they should be meeee thinks.

HASHPOWER JUST DROPPED X13 and X15 ALGOS--

I am suspecting the coins associated with these algos are in the toilet further than others.       --scryptr
member
Activity: 106
Merit: 10
Im selling popcorn for 0.1 BTC with 500% faster pop.

But how are the temps and TDP?

My 980ti's run at 66 and i added extra butter

my popcorn brings all the miners to the yard cause my kernals are better then yalls.

i need another beer



why is no one working on x15 or x13..... 980ti speeds are not wut they should be meeee thinks.
legendary
Activity: 1470
Merit: 1114
Im selling popcorn for 0.1 BTC with 500% faster pop.

But how are the temps and TDP?
legendary
Activity: 1400
Merit: 1050
so basically you expect other dev to work on the opensource so to make your private miner faster Huh
pallas says thank you to you  Grin

I will study your new optimized Neoscrypt code and learn.
You should study my code as well. We use different ways to reach our goal. I do small compiler/assembly optimizations while you are reinventing the algorithm. You should buy my private miner and analyze it. I will give you a discount of 0.05 BTC.  Roll Eyes

I'm aware of a couple of assembly code optimizing techniques I've used before, but they require an intimate knowledge
of the CPU (GPU in this case) architecture, including memory interface, cache organization and execution environment.

I intend to give this a try with cpuminer-opt once I get up to speed on Intel architecture. Would you be interested in
doing it for CUDA? I could explain the details.
yeah because it is well known that we just do random stuff  Grin

LOL. Trial and error also works sometimes, the key is to figure out exactly why it works.

In my teaser it's about code scheduling to maximize throughput by avoiding processor stalls
and maximizing superscalar operation. It's not something that compilers can do easily
because it requires so much analysis and detailed understanding of the operation of
a CPU.

You may think I'm joking or boasting but I'm willing to discuss it openly and be subject to humiliation.

You may also wonder why I would do this. If the techniques work I would ask the developers to
open some of their private code. Also I want to leverage the CUDA expertise available to improve
the product for everyone.

I'll post one technique in a while that requires CPU support, but I have to dig out my old processor manuals
for a refresher. This one is something compilers should do since it is documented.

Edit: I've reviewed my manuals and checked out the Haswell optimization manual online and didn't see
anything at first glance that indicates they have the support to do the following:

allocate load

A special form of the load instruction will cause a line of cache to be allocated without accessing memory to fill it.
This is useful when allocating memory and you don't care what data is in it. Memory isn't accessed unless the cache line
gets flushed for other reasons. And if the buffer is used for a short time it may never need to access memory.

Example:

uint32_t *p = allocate_load(size);
/* crunch some data */
free_and_invalidate_cache(&p);

What do you think?


maybe it would be better to discuss that on the irc channel #ccminer
legendary
Activity: 1400
Merit: 1050
Winners don't use drugs. My crypto kernels are profitable. Yours are not. Finish the pledge please...
lol, they are profitable to me  Grin mostly done, just had to add the names of the donors
member
Activity: 106
Merit: 10
Im selling popcorn for 0.1 BTC with 500% faster pop.
legendary
Activity: 1470
Merit: 1114
so basically you expect other dev to work on the opensource so to make your private miner faster Huh
pallas says thank you to you  Grin

I will study your new optimized Neoscrypt code and learn.
You should study my code as well. We use different ways to reach our goal. I do small compiler/assembly optimizations while you are reinventing the algorithm. You should buy my private miner and analyze it. I will give you a discount of 0.05 BTC.  Roll Eyes

I'm aware of a couple of assembly code optimizing techniques I've used before, but they require an intimate knowledge
of the CPU (GPU in this case) architecture, including memory interface, cache organization and execution environment.

I intend to give this a try with cpuminer-opt once I get up to speed on Intel architecture. Would you be interested in
doing it for CUDA? I could explain the details.
yeah because it is well known that we just do random stuff  Grin

LOL. Trial and error also works sometimes, the key is to figure out exactly why it works.

In my teaser it's about code scheduling to maximize throughput by avoiding processor stalls
and maximizing superscalar operation. It's not something that compilers can do easily
because it requires so much analysis and detailed understanding of the operation of
a CPU.
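
As a rough illustration of that scheduling idea (a minimal sketch, not the poster's actual technique): the toy C code below runs the same dependent mixing chain over two independent inputs, once serially and once interleaved in a single loop. Each chain still has to wait on its own previous step, but in the interleaved loop the core has a second, independent instruction stream to issue while the first one is waiting. The mixing round, the constants and all function names are made up for the example and come from no miner; whether it is actually faster depends on the CPU and the compiler and has to be measured.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Rotate left; r must be in 1..31. */
static uint32_t rotl32(uint32_t x, unsigned r)
{
    return (x << r) | (x >> (32u - r));
}

/* Toy mixing round (hypothetical, NOT a real hash). Each step needs the
 * previous value of h, so a single chain cannot overlap with itself. */
static uint32_t mix_step(uint32_t h, uint32_t w)
{
    return rotl32(h ^ w, 5) * 16777619u;
}

/* Serial version: one dependency chain over one input. */
static uint32_t mix_one(const uint32_t *v, size_t n)
{
    uint32_t h = 0x9e3779b9u;
    for (size_t i = 0; i < n; i++)
        h = mix_step(h, v[i]);
    return h;
}

/* Interleaved version: two independent chains advance in the same loop,
 * so the step for y does not depend on the step for x and vice versa. */
static void mix_two(const uint32_t *a, const uint32_t *b, size_t n,
                    uint32_t *ha, uint32_t *hb)
{
    assert(a != NULL && b != NULL && ha != NULL && hb != NULL);
    uint32_t x = 0x9e3779b9u, y = 0x9e3779b9u;
    for (size_t i = 0; i < n; i++) {
        x = mix_step(x, a[i]);
        y = mix_step(y, b[i]);
    }
    *ha = x;
    *hb = y;
}

/* Self-check: returns 1 when the interleaved results equal the serial ones. */
static int interleaved_matches_serial(void)
{
    uint32_t a[8], b[8], ha, hb;
    for (uint32_t i = 0; i < 8; i++) {
        a[i] = i * 7u + 1u;
        b[i] = i * 13u + 5u;
    }
    mix_two(a, b, 8, &ha, &hb);
    return ha == mix_one(a, 8) && hb == mix_one(b, 8);
}
```

The interleaving changes only the order in which the work is issued, not the results, which is what `interleaved_matches_serial()` checks.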

You may think I'm joking or boasting but I'm willing to discuss it openly and be subject to humiliation.

You may also wonder why I would do this. If the techniques work I would ask the developers to
open some of their private code. Also I want to leverage the CUDA expertise available to improve
the product for everyone.

I'll post one technique in a while that requires CPU support, but I have to dig out my old processor manuals
for a refresher. This one is something compilers should do since it is documented.

Edit: I've reviewed my manuals and checked out the Haswell optimization manual online and didn't see
anything at first glance that indicates they have the support to do the following:

allocate load

A special form of the load instruction will cause a line of cache to be allocated without accessing memory to fill it.
This is useful when allocating memory and you don't care what data is in it. Memory isn't accessed unless the cache line
gets flushed for other reasons. And if the buffer is used for a short time it may never need to access memory.

Example:

uint32_t *p = allocate_load(size);
/* crunch some data */
free_and_invalidate_cache(&p);

Even better, lock the cache line to guarantee it never gets flushed, effectively expanding the
register set.

struct my_struct *my_regs = allocate_load_and_lock_line(size);
/* do stuff with my_regs->r1 etc. */
free_unlock_and_invalidate_cache(&my_regs);
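
For comparison, here is a minimal sketch of the closest thing that can be written in plain C today, assuming GCC or Clang: a cache-line-aligned scratch buffer with a write-intent prefetch hint from `__builtin_prefetch`. This is a swapped-in technique, not the allocate load described above: the hint is only advisory, the line may still be read from memory, and nothing gets locked. The 64-byte line size and the helper names `scratch_alloc` and `crunch` are assumptions made for the example.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

#define CACHE_LINE 64 /* assumed line size; common on x86 but not guaranteed */

/* Hypothetical helper: allocate a line-aligned scratch buffer and hint
 * that every line is about to be written. Unlike the proposed
 * allocate_load, the prefetch may still fetch the line from memory. */
static uint32_t *scratch_alloc(size_t bytes)
{
    /* aligned_alloc (C11) wants a size that is a multiple of the alignment. */
    size_t rounded = (bytes + CACHE_LINE - 1) / CACHE_LINE * CACHE_LINE;
    uint32_t *p = aligned_alloc(CACHE_LINE, rounded);
    if (p == NULL)
        return NULL;
    for (size_t off = 0; off < rounded; off += CACHE_LINE)
        __builtin_prefetch((char *)p + off, 1, 3); /* rw = 1: write intent */
    return p;
}

/* "Crunch some data": fill the short-lived buffer, reduce it, release it.
 * Returns the sum of 3*i for i in [0, words), or 0 if allocation fails. */
static uint32_t crunch(size_t words)
{
    assert(words > 0);
    uint32_t *p = scratch_alloc(words * sizeof *p);
    if (p == NULL)
        return 0;
    uint32_t sum = 0;
    for (size_t i = 0; i < words; i++)
        p[i] = (uint32_t)i * 3u;
    for (size_t i = 0; i < words; i++)
        sum += p[i];
    free(p); /* plain C has no portable way to invalidate the lines */
    return sum;
}
```

If the buffer is small and short-lived it will usually stay in cache for its whole life anyway; what plain C cannot express is the guarantee that memory is never touched.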

What do you think?
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
Winners don't use drugs. My crypto kernels are profitable. Yours are not. Finish the pledge please...

legendary
Activity: 1400
Merit: 1050
so basically you expect other dev to work on the opensource so to make your private miner faster Huh
pallas says thank you to you  Grin

I will study your new optimized Neoscrypt code and learn.
You should study my code as well. We use different ways to reach our goal. I do small compiler/assembly optimizations while you are reinventing the algorithm. You should buy my private miner and analyze it. I will give you a discount of 0.05 BTC.  Roll Eyes

I'm aware of a couple of assembly code optimizing techniques I've used before, but they require an intimate knowledge
of the CPU (GPU in this case) architecture, including memory interface, cache organization and execution environment.

I intend to give this a try with cpuminer-opt once I get up to speed on Intel architecture. Would you be interested in
doing it for CUDA? I could explain the details.
yeah because it is well known that we just do random stuff  Grin
legendary
Activity: 1470
Merit: 1114
so basically you expect other dev to work on the opensource so to make your private miner faster Huh
pallas says thank you to you  Grin

I will study your new optimized Neoscrypt code and learn.
You should study my code as well. We use different ways to reach our goal. I do small compiler/assembly optimizations while you are reinventing the algorithm. You should buy my private miner and analyze it. I will give you a discount of 0.05 BTC.  Roll Eyes

I'm aware of a couple of assembly code optimizing techniques I've used before, but they require an intimate knowledge
of the CPU (GPU in this case) architecture, including memory interface, cache organization and execution environment.

I intend to give this a try with cpuminer-opt once I get up to speed on Intel architecture. Would you be interested in
doing it for CUDA? I could explain the details.
legendary
Activity: 1400
Merit: 1050
so basically you expect other dev to work on the opensource so to make your private miner faster Huh
pallas says thank you to you  Grin
I will study your new optimized Neoscrypt code and learn.
You should study my code as well. We use different ways to reach our goal. I do small compiler/assembly optimizations while you are reinventing the algorithm. You should buy my private miner and analyze it. I will give you a discount of 0.05 BTC.  Roll Eyes
that's your problem, you need to study my code to get something out, while if you knew what you were doing it is the original code you would have to read to make it easier for the GPU.
Basically you don't get any real upgrade because you don't look at what the code does, you are just trying random stuff expecting it will work based on what other devs wrote... well good luck with that Grin
I must be the luckiest man in the universe then. My private kernels are the fastest in the world.
I feel like discussing with a tv commercial right now... where's the fucking remote ? Grin

Saturday night. You should go and DJ. That is your profession isn't it?
lol what makes you think that? The vibe I put on your thread  Grin
is SP your profession ?  Grin

I am a DJ. You are not
never said I was Grin
You are making stuff up then telling me that I am not what you made up Grin good one Grin
(you should stop drugs, it clearly alters your consciousness and your programming skill)
sp_
legendary
Activity: 2954
Merit: 1087
Team Black developer
so basically you expect other dev to work on the opensource so to make your private miner faster Huh
pallas says thank you to you  Grin
I will study your new optimized Neoscrypt code and learn.
You should study my code as well. We use different ways to reach our goal. I do small compiler/assembly optimizations while you are reinventing the algorithm. You should buy my private miner and analyze it. I will give you a discount of 0.05 BTC.  Roll Eyes
that's your problem, you need to study my code to get something out, while if you knew what you were doing it is the original code you would have to read to make it easier for the GPU.
Basically you don't get any real upgrade because you don't look at what the code does, you are just trying random stuff expecting it will work based on what other devs wrote... well good luck with that Grin
I must be the luckiest man in the universe then. My private kernels are the fastest in the world.
I feel like discussing with a tv commercial right now... where's the fucking remote ? Grin

Saturday night. You should go and DJ. That is your profession isn't it?
lol what makes you think that? The vibe I put on your thread  Grin
is SP your profession ?  Grin

I am a DJ. You are not