Here is the website that collect the op_return message: http://coinsecrets.org/
Thanks for the information. I checked the website. Most of the op_return messages are gibberish. Perhaps there are more data mining techniques that can be applied to decode the message and figure out the purpose of the transactions. So far I don't intend to go that deep in the direction yet. Would be nice to know if other people have done related study about it.
As for the transaction patterns.
Relay: the transaction that only has one input, one output. Usually used to move Bitcoin from one party to another without leaving any change behind. According to my study about the block chain, this kind of transaction are mostly used to mix with other patterns to hide its money flow.
Sweep: This kind of transaction happen when a transaction controller wants to combine multiple separate unspent transaction outputs into a single transaction output that is easier to process and control. Hence, it is very likely that the ownership of the inputs and output are the same.
Peeling Chain: A peeling transaction consists mostly of peel transactions where the change transaction is used as the input to a subsequent peel transaction. A peel transaction has any number of inputs are combined in a transaction and two outputs are created. However, most of the peeling chains that have been studied start with one input and huge amount. It is possible that the pooled mining use peeling chain to pay miners. Hence, we need to cluster peeling chain carefully to reduce the false positive rate.
Those information are quoted from the document in the previous comment.
I think the first two kinds of transaction are easier to detect. Still trying to figure out how to detect a peeling chain precisely.