Author

Topic: Is block-chain compressed? (Read 1128 times)

member
Activity: 75
Merit: 10
April 11, 2013, 03:55:07 AM
#8
Quote
I already indicated bit-coin addresses and part of signed transactions being random, what I was talking about the address occurring multiple times in block chain.

This would require some sort of local index of addresses. This would be bloat. Further, how big would you make the index field size? 4 bytes? This was once considered enough for packet-switched networks.. and looked what happened there!

Satoshi intended for addresses to be only be used once. Future address generation could far exceed addresses used in other protocols (ie IPv6 = 16 bytes). 

Mr Smith is correct. Great minds have already thought of such things for us already. In fact, somewhere along the way someone already figured a way to compress public keys to half their original size. No harm in us mere mortals trying to understand however.

full member
Activity: 151
Merit: 100
April 11, 2013, 02:39:31 AM
#7
Please do a search instead of creating new topics. This has been analyzed in detail by very smart people already. Yes, some % of compression is possible.
Dear Sir, I did search before posting but could not find any relevant page in first few search results, so will you be kind enough to point me to those threads which you were referring to.
hero member
Activity: 812
Merit: 1022
No Maps for These Territories
April 11, 2013, 02:20:11 AM
#6
Please do a search instead of creating new topics. This has been analyzed in detail by very smart people already. Yes, some % of compression is possible.
full member
Activity: 151
Merit: 100
April 11, 2013, 02:15:25 AM
#5
Quote
because all the transaction are between limited address out of a HUGE set, at-least that part can be compressed
Do you know how compression works? If not, try to put some bitcoin addresses into a file (in binary form of course) and compress them with zip or whatever, then check results.
I think you don't understand compression, I already indicated bit-coin addresses and part of signed transactions being random, what I was talking about the address occurring multiple times in block chain.

e.g. a very sketchy compression of a simplest blockchain

14x999VDuEd48ykqYrg8K6a1kto45a3BHf => 1HQ3Go3ggs8pFnXuHVHRytPCq5fGG8Hbhx
1C8ToSDT9dwxy7uXmU5AXpx1oXbV84GwpN => 1HQ3Go3ggs8pFnXuHVHRytPCq5fGG8Hbhx
1MZqmDHxyW4htFaZFsQ5q78i9nL3r99BtW =>1C8ToSDT9dwxy7uXmU5AXpx1oXbV84GwpN

can be compressed to
a=14x999VDuEd48ykqYrg8K6a1kto45a3BHf
b=1HQ3Go3ggs8pFnXuHVHRytPCq5fGG8Hbhx
c=1MZqmDHxyW4htFaZFsQ5q78i9nL3r99BtW
d=1C8ToSDT9dwxy7uXmU5AXpx1oXbV84GwpN
a=>b
d=>b
c=>d
member
Activity: 75
Merit: 10
April 10, 2013, 11:18:38 PM
#4
Humans read base58 bitcoin addresses (20bytes) as ASCII numerical representations. Saving these into notepad is going to result in 34 bytes and will be compressible given a large enough sample. The block chain is a binary beast and is very efficient. Addresses are saved as 20bytes. If you can compress this (regardless of set size)  we are all in trouble.

All other data consists of arbitrary data (ie output values) and indistinguishable (hopefully) random data (cryptographic hashes and signatures).

I suppose if someone embedded structure into the block chain, ie through output values or non-working addresses (no private pair) you could somewhat compress it.
kjj
legendary
Activity: 1302
Merit: 1025
April 10, 2013, 09:57:03 AM
#3
bzip2 actually gets around 25-35% compression on the files, at a huge time and CPU cost.

While parts of the data are pathological, other parts are very regular.  You might be able to get somewhat better results by writing a hybrid compressor that skips over the parts that we already know are random.
full member
Activity: 203
Merit: 100
April 10, 2013, 09:28:01 AM
#2
Quote
because all the transaction are between limited address out of a HUGE set, at-least that part can be compressed
Do you know how compression works? If not, try to put some bitcoin addresses into a file (in binary form of course) and compress them with zip or whatever, then check results.
full member
Activity: 151
Merit: 100
April 10, 2013, 01:44:42 AM
#1
Is block-chain currently compressed in someway while storing, it may not be possible if data is almost random but I think because all the transaction are between limited address out of a HUGE set, at-least that part can be compressed
Jump to: