I was reading through a very interesting topic on
Database Sharding that seems to be designed for handling huge datasets and distributing them to various channels so that single system doesn't have to overload itself.
What I understand from the concept is either you can distribute the data to various nodes in horizontal manner that is increasing the capacity of processing with more number of devices and also it has virtually no limit on how far you go. Or one can simply start sharding vertically but in that you have to make the machine more powerful. Such papers often confuse people. Before diving into this topic, you should understand that the database is not a blockchain network. "Database sharding" has nothing to do with "blockchain sharding". Think about this and don't get confused by plausible reasoning of prominent blockchain gurus.
Will it help in blockchain scaling?
I think the concept is pretty straight. If we are already using...
No, it doesn't work that way.
So if we do apply the horizontal sharding to the blockchain generated nodes then each node can further be divided into more data sets / nodes and thus processing could be accelerated based on more "machines" will be working on same problem.
No, it doesn't work that way. Let's put aside decentralized blockchain networks and consider original "database sharding". Database sharding is not always an improvement. It's a trade-off that is helpful only in certain cases in certain circumstances. When you split data into "shards", split processes into threads and distribute everything between separate hardware devices you might get some benefit from parallelism. However, you often get an overhead created by managing concurrent threads and associated delays. Sometimes this overhead is so huge that gains from parallelisation can't outperform it.
Therefore, database sharding is often a trade-off. It might be acceptable and it might be unacceptable. Database sharding is not a magic stick which solves all problems of databases. Neither it's not a magic stick in the context of decentralised blockchain networks. On top of that, as I said, the concept "database sharding in blockchain context" is not clear and straightforward.
If Data Sharding is already on the blockchain then how do we know that it is already implemented? Is there any example of such implementation to relieve the Blockchain Burden?
What you call "data sharding" has been already implemented in Bitcoin from the day one. In Bitcoin, transactions within one block often could be processed in parallel by a node.