The actual interest in this topic came from wanting to model and detect pump-and-dump schemes, which were popular and openly coordinated back in the day. I can probably find and list these coordinated pump and dumps to use as labeled data, since we know some of them were shilled. And yeah, I definitely need to bring in coin volume, sentiment volume, and maybe even some bumping metrics.
Bumping farms "should" just increase volume but, in a big-data sense, preserve the sentiment of the users, which may or may not indicate positive or negative shilling. That is, the patterns should still be there, right? Unless they bump with bias, in which case that's just adding to the shilling pattern even more.
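Here is a toy sketch of that argument in Python. Everything in it is made up for illustration: the labels, the counts, and the `positive_share` helper are my own assumptions, not real BCT data or an existing tool. It only shows that echoing every post equally leaves the share of positive posts unchanged, while echoing only the positive posts shifts it.

```python
from collections import Counter


def positive_share(labels):
    """Fraction of posts labeled 'pos' among all posts (hypothetical helper)."""
    counts = Counter(labels)
    total = sum(counts.values())
    return counts["pos"] / total if total else 0.0


# Made-up sentiment labels for one thread: 30 positive, 20 negative, 50 neutral.
organic = ["pos"] * 30 + ["neg"] * 20 + ["neu"] * 50

# Unbiased bumping (assumed model): every post gets echoed the same number of times.
unbiased = organic * 3

# Biased bumping (assumed model): only positive posts get echoed.
biased = organic + ["pos"] * 200

for name, posts in [("organic", organic), ("unbiased", unbiased), ("biased", biased)]:
    print(f"{name:9s} volume={len(posts):4d} positive_share={positive_share(posts):.2f}")
```

Real bumping would be noisier than an exact copy of every post, so the share would only be preserved approximately. The point is that volume and sentiment share are separate signals, and a jump in both at once is the more suspicious pattern.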
You have no idea how hard your eyes will roll when I tell you what I have up my sleeve next. You see, I recently decided to get into game development, so I bought a bunch of books on Unity 2021, but now I've gotten back into bitcointalk and excited about it, which leads me in a different direction (an incongruity!). I also have a really weird sense of humor, so I came up with the most ridiculous thing to blog about on here, which is to do the Bitcointalk Visualization Project (um, there is a GitHub somewhere with the project manifesto) inside a Unity-based VR game. Like, it is going to be so ridiculous: prodding the crawler to get it to process a thread, which makes it spit out page objects I have to stack up before hobbling over to the chute you throw them in, which spits out a word cloud. Or the NLP pipeline being an actual pipeline of weird objects, the stopword remover spitting out a pile of removed words I have to shovel away, and so on. Inverted decision trees everywhere! Oh god, I get so giddy thinking about throwing that out with a straight look on my face.
Oh, and it might actually be pretty awesome to be able to interface with BCT as well as analyze historical BCT data (keeping strictly to benign page access limits!) inside a weird VR world... but yeah, not sure if anyone else will get it.