See loyce.club/badposts
All categories / spam / scam / other / advertising / email
Most recent posts are shown first.
Whitelisted users
LoyceMobile
TheBeardedBaby
Lafu
Rizzrack
Timelord2067
marlboroza
morvillz7z
NotATether
actmyname
nutildah
hosseinimr93
Mitchell
Bthd
light_warrior
Post keywords
Whitelisted users can post keywords. Please don't use very common words (such as "scam") that trigger too many false positives. And please keep all keywords in one code tag in one post: edit it to add/remove keywords.
Other users can also post, with 2 possible outcomes:
- I whitelist you and process your keywords
- I remove your post
Tips
Leave out "https" from scam links.
Format
Within code-tags, post either "scam:thiswebsiteisascam" or "spam:Ikeeppostingthesamecrapeverywhere". One phrase per line. See this example.
Remove the line to exclude the keyword in the next update.
Only use a space after "scam:" if you want to include a space in your search.
Keyword "other:" is meant for text that isn't necessarily spam or a scam, but needs highlighting nonetheless.
Keyword "advertising" is meant for sites that are often used to copy an article and create a backlink.
Keyword categories:
- spam:
- scam:
- other:
- advertising:
- email:
Features
I search the last ~200,000 posts for new keywords. This covers approximately 1 month.
I update all keywords every 20 minutes.
I show all matches for all usernames and ranks. No exceptions. Not all of them are bad.
Whitelisted users are shown in green.
Limitations
There's a maximum of 50 keywords per post (if you add more those will be ignored). I can increase this if needed.
There's a maximum of 4000 matches per category. If there are more, the oldest are removed.
The minimum keyword length is 4 (for other) or 5 (for spam/scam) characters.
I also search quotes.
Report posts!
This list is only useful if someone actually reports the bad posts
Post removal
I want to keep this topic compact (so I can quickly scrape it many times). That means I'll delete almost all posts that don't contain a list from a Whitelisted user. I would say "I hope I'm not offending anyone", but really, I'm okay with that
Q&A
Some types of scams involve posting very little content in the OP body but then go on to include important keywords in the title.
I don't keep track of titles with this data.
Can I use it for catching plagiarism, like searching for a whole sentence or this will clogged the server, or maybe only a phrases and not so common words like we did in the SpamBuster club with suchmoon?
Searching all my data without database takes too long to do on a regular basis. You should try TryNinja's database though!
I've added category "other" for things like "www.bitcointalk.org".
Each 15 minute update takes about 1 second to process.
Each new keyword takes a few minutes to process, reading all 200,000 posts is slow. Processing several new keywords at once is more efficient, so feel free to add them
scam:github.com/pillforeth/
other: moron
other: moron
You should probably omit the "https://"-part, a scammer can do the same.
Unfortunately, I can't know which posts have been edited, so this is a loophole to escape my badposts list.
github_com/ProjectEthereumPill
github_com/ProjectEthereumPill/EthereumPill/
The following overlapping keywords have been removed:
github.com/ProjectEthereumPill/EthereumPill
github.com/pillforethereum/ETHpillAN (because of github.com/pillforeth)
I also noticed the "Banned" notification is only useful for new keywords, because my banned list is only updated once a day. I ran a one-time update from scratch, searching the last ~800,000 posts, this updates the banned-status on older posts.
I've manually reset it to re-check all keywords in the last ~200k posts. This restored a longer list again.
However the "scam : ETChash" keyword just helps catch posts like this one that have malware download links
I made a new toy:
[Newbie scrutiny instead of jail] Every new user's first post: loyce.club/patrol:
Please Report (or Merit) the posts when needed
It's updated once a minute.
Sample: