Author

Topic: Larger word clouds. Can we guess who is the bounty hunter and who is the miner? (Read 304 times)

sr. member
Activity: 602
Merit: 295
Hail Eris!
Can you publish your code on github or somewhere? It'd be interesting to see for sure.

Sure.  I attached a script to a previous post but it was like written in an hour.  I am refactoring the code into small testable functions and adding some functionality so I can generate wordclouds based on data ranges and other things.  Once that is done today or tomorrow I will start a github for this and other data visualization projects and start breaking it out into modules.

Sounds amazing. Thanks for your hard work!


ANN thread visualization and analysis?  ICO analysis?
User Post History analysis?  Safely and Benignly Crawling Bitcointalk?
Shitpost modeling?

If any of these things interest you come join me in developing an Open Source project.  https://github.com/pandora23/bitcointalk-visualization/blob/master/README.md

https://trello.com/b/fBefa8LD

Slack: bctminingcollective.slack.com
legendary
Activity: 2674
Merit: 2965
Terminated.
Looks very interesting and I do tend to like visualizations. If you have the time, please do one for me as well.
full member
Activity: 574
Merit: 152
Can you publish your code on github or somewhere? It'd be interesting to see for sure.

Sure.  I attached a script to a previous post but it was like written in an hour.  I am refactoring the code into small testable functions and adding some functionality so I can generate wordclouds based on data ranges and other things.  Once that is done today or tomorrow I will start a github for this and other data visualization projects and start breaking it out into modules.

Sounds amazing. Thanks for your hard work!
sr. member
Activity: 602
Merit: 295
Hail Eris!
Can you publish your code on github or somewhere? It'd be interesting to see for sure.

Sure.  I attached a script to a previous post but it was like written in an hour.  I am refactoring the code into small testable functions and adding some functionality so I can generate wordclouds based on data ranges and other things.  Once that is done today or tomorrow I will start a github for this and other data visualization projects and start breaking it out into modules.
full member
Activity: 574
Merit: 152
Can you publish your code on github or somewhere? It'd be interesting to see for sure.
hero member
Activity: 908
Merit: 657
Do one for me and my alt hilariousetc. I want to see if the word shitposters stands out like a sore cock haha (or anything else funny).


Well here goes one of them.  I do see 'shitposters' in there, and oddly enough you don't actually say 'shitpost' or 'shitposter' but talk about them as a collective.  I put the word 'one' in the stopwords since you use it so much and it was so large, but it will be one way we can see that the two accounts have a similar writing style.  One neat thing is that your other account also says 'shitposters' but not the others as well and also says 'one' a lot.   There is a Kaggle contest going on right now where you train a classifier to guess who the author of snippets are using their writing for training. Wink



Currently refactoring that script which was put together in an hour of inspiration and hypomania.  It does not reflect thought out design. Will be able to show monthly trends after the refactor.  

Then I will compare your two accounts.

I wonder who this "themos" person is  Cheesy (top middle)
sr. member
Activity: 602
Merit: 295
Hail Eris!
Do one for me and my alt hilariousetc. I want to see if the word shitposters stands out like a sore cock haha (or anything else funny).


Well here goes one of them.  I do see 'shitposters' in there, and oddly enough you don't actually say 'shitpost' or 'shitposter' but talk about them as a collective.  I put the word 'one' in the stopwords since you use it so much and it was so large, but it will be one way we can see that the two accounts have a similar writing style.  One neat thing is that your other account also says 'shitposters' but not the others as well and also says 'one' a lot.   There is a Kaggle contest going on right now where you train a classifier to guess who the author of snippets are using their writing for training. Wink



Currently refactoring that script which was put together in an hour of inspiration and hypomania.  It does not reflect thought out design. Will be able to show monthly trends after the refactor.  

Then I will compare your two accounts.
global moderator
Activity: 3990
Merit: 2717
Join the world-leading crypto sportsbook NOW!
Do one for me and my alt hilariousetc. I want to see if the word shitposters stands out like a sore cock haha (or anything else funny).
legendary
Activity: 2772
Merit: 3284
I'm going to say person #1 is the miner and person #2 is the bounty hunter /s
sr. member
Activity: 602
Merit: 295
Hail Eris!
This one creates larger word clouds.  Each consists of the texts for twenty posts.  Can you guess which one is more likely to be bounty hunters and which is the miner? Wink  

Candidate One


Candidate Two
Jump to: