So basically you expect other devs to work on the open-source code so your private miner gets faster.
Pallas says thank you to you.
I will study your new optimized Neoscrypt code and learn.
You should study my code as well. We take different routes to the same goal. I do small compiler/assembly optimizations while you are reinventing the algorithm. You should buy my private miner and analyze it. I will give you a discount of 0.05 BTC.
I'm aware of a couple of assembly-level optimization techniques I've used before, but they require an intimate knowledge
of the CPU (GPU in this case) architecture, including the memory interface, cache organization and execution environment.
I intend to give this a try with cpuminer-opt once I get up to speed on the Intel architecture. Would you be interested in
doing it for CUDA? I could explain the details.
yeah because it is well known that we just do random stuff
LOL. Trial and error also works sometimes, the key is to figure out exactly why it works.
In my teaser it's about code scheduling to maximize throughput by avoiding processor stalls
and maximizing superscalar operation. It's not something that compilers can do easily
because it requires so much analysis and detailed understanding of the operation of
a CPU.
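To make that concrete, here's a rough sketch of what I mean (the 4-way split and the names are purely illustrative, nothing to do with any miner code): a single dependency chain leaves the extra ALUs idle, while independent chains let them run in parallel.

#include <stddef.h>
#include <stdint.h>

/* Serial version: every add depends on the previous result, so a superscalar
   core's extra execution units sit idle waiting on one dependency chain. */
uint32_t sum_serial( const uint32_t *v, size_t n )
{
   uint32_t s = 0;
   for ( size_t i = 0; i < n; i++ )
      s += v[i];
   return s;
}

/* Scheduled version: four independent accumulators can issue in parallel and
   hide the latency of each add. The right width depends on the issue width
   and the latency of the operation being chained. */
uint32_t sum_scheduled( const uint32_t *v, size_t n )
{
   uint32_t s0 = 0, s1 = 0, s2 = 0, s3 = 0;
   size_t i = 0;
   for ( ; i + 4 <= n; i += 4 )
   {
      s0 += v[i];
      s1 += v[i+1];
      s2 += v[i+2];
      s3 += v[i+3];
   }
   for ( ; i < n; i++ )       /* leftover elements */
      s0 += v[i];
   return s0 + s1 + s2 + s3;
}

For a trivial reduction like this a compiler can often unroll it on its own; it's the long dependency chains inside hash rounds, where the order of operations is fixed by the algorithm, that need the hand analysis and careful interleaving.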
You may think I'm joking or boasting but I'm willing to discuss it openly and be subject to humiliation.
You may also wonder why I would do this. If the techniques work, I would ask the developers to
open some of their private code. Also I want to leverage the CUDA expertise available to improve
the product for everyone.
I'll post one technique in a while that requires CPU support, but I have to dig out my old processor manuals
for a refresher. This one is something compilers should do since it is documented.
Edit: I've reviewed my manuals and checked out the Haswell optimization manual online, and didn't see
anything at first glance that indicates they have support to do the following:
allocate load
A special form of the load instruction will cause a line of cache to be allocated without accessing memory to fill it.
This is useful when allocating memory and you don't care what data is in it. Memory isn't accessed unless the cache line
gets flushed for other reasons. And if the buffer is used only for a short time, it may never need to touch memory at all.
Example:
uint32_t* p = allocate_load( size );    // hypothetical: allocate the cache lines, no memory read to fill them
// crunch some data in p
free_and_invalidate_cache( &p );        // hypothetical: discard the lines, no writeback
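For comparison, the closest real instruction I know of is PowerPC's dcbz (data cache block zero), which establishes the cache line containing an address and zeroes it without fetching its old contents from memory. Here's a minimal sketch of the same buffer lifetime built on it; the 128-byte line size is an assumption (older cores use 32 or 64), and the final invalidate step stays a comment because I don't know of a user-level discard-without-writeback on current CPUs:

#include <stdint.h>
#include <stdlib.h>

#define LINE_SIZE 128   /* assumed cache line size: 32, 64 or 128 depending on the core */

/* PowerPC only: dcbz establishes the cache line holding p and zeroes it
   without reading the line's old contents from memory. */
static inline void cacheline_zero_alloc( void *p )
{
#if defined(__powerpc__) || defined(__powerpc64__)
   __asm__ volatile ( "dcbz 0,%0" : : "r" (p) : "memory" );
#else
   (void) p;   /* no user-level equivalent on x86: plain stores still fetch the line (RFO) first */
#endif
}

int main(void)
{
   size_t size = 4 * LINE_SIZE;
   uint32_t *p = aligned_alloc( LINE_SIZE, size );
   if ( !p ) return 1;

   /* "allocate load": bring the buffer's lines into cache zeroed,
      without reading whatever stale data memory holds. */
   for ( size_t off = 0; off < size; off += LINE_SIZE )
      cacheline_zero_alloc( (char*)p + off );

   /* crunch some data in p ... */

   /* "free_and_invalidate_cache": no portable user-level way to drop the
      dirty lines without writeback, so that part remains wishful thinking. */
   free( p );
   return 0;
}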
Even better: lock the cache line to guarantee it never gets flushed, effectively expanding the
register set.
struct my_struct *my_regs = allocate_load_and_lock_line( size );   // hypothetical
// do stuff with my_regs->r1 etc.
free_unlock_and_invalidate_cache( &my_regs );
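As far as I know mainstream x86 doesn't expose user-level cache line locking (some embedded PowerPC and ARM parts do), so the closest you can get today is keeping one line-aligned block of working state hot in the inner loop and counting on the LRU to keep it resident. Purely an illustrative sketch of the usage pattern, with made-up state and a 64-byte line assumed:

#include <stddef.h>
#include <stdint.h>

/* One cache line of working state. _Alignas(64) assumes a 64-byte line.
   Without a real lock instruction it only stays resident because the hot
   loop touches it constantly (and state this small may simply end up in
   real registers anyway). */
struct my_regs
{
   _Alignas(64) uint32_t r[16];   /* 16 x 4 bytes = exactly one 64-byte line */
};

uint32_t hot_loop( const uint32_t *data, size_t n )
{
   struct my_regs regs = { {0} };
   for ( size_t i = 0; i < n; i++ )
   {
      regs.r[ i & 15 ]     += data[i];            /* spread work across the line */
      regs.r[ (i+1) & 15 ] ^= regs.r[ i & 15 ];
   }
   uint32_t out = 0;
   for ( int j = 0; j < 16; j++ )
      out ^= regs.r[j];
   return out;
}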
What do you think?