Author

Topic: [Active] Finding spam and scams by keyword (Read 3726 times)

hero member
Activity: 1722
Merit: 801
August 14, 2020, 11:58:41 PM
#24
Would you mind adding keywords of the scam group to detect them if they come back, please.

Keywords: Beta tester, Nitrogensports

theymos, please automatic nuke them: a group of scammers.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
would you consider adding this keyword?
I've whitelisted you. Your keywords should show up in the next update.
legendary
Activity: 2870
Merit: 7490
Crypto Swap Exchange
Excuse me for the bump, but would you consider adding this keyword?

Code:
spam:lowest.ltd
spam:waterfall
spam:$water


Note,
1. I found someone create 15 account to spam website "lowest.ltd" today, https://ninjastic.space/search?title=lowest.ltd
2. "waterfall" or "$water" used to shill certain project/using many new accounts.
hero member
Activity: 1659
Merit: 687
LoyceV on the road. Or couch.
Code:
spam:https://tabi.foundation
copper member
Activity: 588
Merit: 926
Several users are spamming ShibaMemu. I found 24 posts containing ShibaMemu spam. Some posts have already been removed using reports to the moderators. The user wrangler26 is spamming the most.

Code:
spam:ShibaMemu
spam:Shiba Memu

P.S.. Sorry if I wrote something wrong.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
2 month bump
3 month bump

Code:
spam:-fortnite-
It turns out a search string starting with a dash (-) doesn't work. I only noticed it after 11,851 errors in my logs. It's fixed now. I'll report this to test it > it works now Smiley

@NotATether: can you merge your 2 posts in this topic into one?
legendary
Activity: 1568
Merit: 6660
bitcoincleanup.com / bitmixlist.org
September 24, 2021, 03:10:50 AM
#14
Code:
spam:PhoenixMiner
scam:github.com/PhoeixMiner-TeamDev
legendary
Activity: 2352
Merit: 2592
Chancellor on Brink of Second Bailout for Banks
November 19, 2020, 03:59:49 PM
#13
Code:
spam:fairspin
scam:xlmwin.com
scam:bitexper
scam:bedavabitcoin
scam:dogecoinbonus
copper member
Activity: 3948
Merit: 2201
Verified awesomeness ✔
November 15, 2020, 01:35:43 PM
#12
Code:
spam:coca-colascholarsfoundation.org
spam:music.missouri.edu
spam:stagioniclt.com
spam:rodgershealth.org
scam:HashrateUp for ETH
spam:reativelearning.org
spam:vasd.instructure.com
spam:mae.rutgers.edu
spam:ucen.ucsb.edu
scam:ETHlargement
spam:iamladp.org
scam:hack-robux
scam:hack-roblox
spam:cstem.uncc.edu
spam:forms.swmich.edu
spam:hcc-web1.hagerstowncc.edu
spam:www.cwu.edu
spam:cts.cwu.edu.ce
scam:DeadManWalkingT0
scam:goo.su/37nb
scam:up your hashrate
scam:mega.nz/file/88xqkj6i#ci_z5ht-r5cug1owda-rkesyxxlom1n88yycrrugxrc
scam:mega.nz/file/i9halbqa#5yoicx_yigfax2ku14biptgczex6dc56drwvdd3llb8
spam:umich.edu
spam:unc.edu
spam:video-
spam:video_
spam:v-ideos
spam:v_ideos
spam:roblux_
spam:generator-
spam:generator_
spam:-fortnite-
scam:etchash
legendary
Activity: 2380
Merit: 5213
November 14, 2020, 09:28:26 AM
#11
Recently, I've reported more than 30 posts including links below.
Their posts get deleted. They make new accounts and spam again and again.  

Code:
spam:247sports.com
spam:game.tv
spam:deviantart
spam:gitlab
spam:raftingsort
spam:ec.europa.eu
spam:battlefy
spam:residentadvisor
legendary
Activity: 3010
Merit: 8114
September 16, 2020, 04:07:02 PM
#10
Hope I did it right.

Code:
scam:HYIP
scam:high yield
scam:doubler
scam:MLM
scam:cloud mining


I noticed that the news-related tags get used far more often that the others... my suggestion would be to take off coindesk and cointelegraph, but leave on the ones that come across as desperate spammers, like coinidol. There's probably a lot of people posting coindesk links out of genuine interest and aren't advertising for them on purpose.
copper member
Activity: 2562
Merit: 2510
Spear the bees
September 05, 2020, 05:09:11 AM
#9
Code:
spam:>agree,
spam:eth of course
other:sesc=
legendary
Activity: 1568
Merit: 6660
bitcoincleanup.com / bitmixlist.org
September 04, 2020, 07:34:13 PM
#8
I want to test this tool against a known spammer.

Code:
spam:TradingView Social

Bump: should I remove the cointelegraph link? There are so many of those, it fills the list, and I haven't reported them anyway.

I think you should, in order to lower the signal to noise ratio for the spam list. It makes it harder to search for posts with the other keywords.
legendary
Activity: 2240
Merit: 3150
₿uy / $ell ..oeleo ;(
August 26, 2020, 02:55:36 AM
#7
Let's see if we can catch some word spinners with this tool Smiley
Code:
other:conversant
other:impeccable
other:impeccably
other:irreproachable
other:apperceptive
other:cerebral
spam:successismoney.com
spam:Koinly

legendary
Activity: 2212
Merit: 2061
Join the world-leading crypto sportsbook NOW!
August 19, 2020, 12:30:21 PM
#6
Code:
scam:github.com/pillforethereum/ETHpillAN/
scam:github.com/MorpheusPill/MorpheusPillETH/
scam:github.com/EthereumPillProject/EthereumPill/
scam:github.com/ProjectEthereumPill/EthereumPill/
scam:github.com/ProjectEthereum/EthereumPill/
scam:github.com/ProjectPill/

Changelog:
10/3/2020 - https_://github.com/EthereumPill/PillForETH/
10/8/2020 - https_://github.com/EthereumPillProject/EthereumPill/
10/12/2020 - https_://github.com/ProjectEthereumPill/EthereumPill/
10/19/2020 - https_://github.com/ProjectEthereum/EthereumPill/
10/22/2020 - https_://github.com/ProjectPill/
legendary
Activity: 3696
Merit: 2219
💲🏎️💨🚓
August 14, 2020, 08:43:21 PM
#5
I'm just trying to get this clear in my head, so if you will permit me to try out a couple of test samples?

Code:

other:hellow
other:Humbertin
other:Humberton
other:Francisco Carvajal
other:majidkhann
other:ixchanger
other:BTCaccelerator9
other:Mastergerundx
email:franklinma81s
other:Hello I am Humb
other:we glade
email:pibworld

scam:mstk.jal2
scam:bet365supplier
scam:Betaccountsupplier
scam:bet365_accounts_seller
scam:Buying LBC account
scam:fhcp9999
scam:kingnumas
scam:for salle

other:yolonu
other:grunch
other:unforchinately
other:moronbozo
other:Suchmoron
other:robovac

email:anikhasan365
email:aymansadiq365
email:devilperson96
email:Ineedfacs
email:Embroiderymate


Thanks.  (If I'm reading this right, I just edit this one post - yes?)









Timeline:

Other:


20/08/20 Initial list Humbertin + hellow (the latter used by the former)
06/10/20 Added grunch while investigating spike420211/TrevorS/bitcoinst
21/10/2020 Changed category to "email" for a few entries.


legendary
Activity: 3178
Merit: 3295
August 13, 2020, 03:43:46 PM
#4
Code:
spam:unlimited-hash.com
scam:ProjectETH+
scam:anonfiles.com/P9bcXbO6o7/Money_Plus_zip
scam:anonfiles.com/5dx7N8O3o3/IMiner_zip
scam:github.com/ethpillandev/
scam:github.com/pillforethereum/
spam:minepi.com
scam:github.com/pillforeth
scam:bitbucket.org
scam:strongfiles.net/ghUQD
scam:github.com/ProjectEthereumPill/
scam:ethereumpill.info/
scam:howtohashrateup
scam:goo.su/38Yx
scam:hashup-utility.info/HashUpUtility.zip

Edited
copper member
Activity: 784
Merit: 710
Defend Bitcoin and its PoW: bitcoincleanup.com
August 13, 2020, 10:47:41 AM
#3
Code:
spam:kintum.io/
spam:uselectionnews.medium
spam:laylalillian4482.medium
spam:azam24.medium
spam:letek23921.medium
spam:canvas.spu.edu
spam:cimis.water.ca.gov
spam:curio.instructure.com
spam:medium.com/@sakurasujukee
spam:sakurasujukee.medium
spam:medium.com/@roanburrows
spam:resources.instructure.com/eportfolios
spam:loxobudum.medium.com
spam:muqiqoq.medium
spam:bathnes.gov
spam:core.colomboserboli.com
spam:therockc4yd.org
spam:jwzfunniemotw19.medium
spam:medium.com/@articlesforyoureyes
spam:twitter.com/iihwjclive

scam:https://github.com/ethpillan/PillForETH
scam:hashup-utilite.info
scam:file/IlwAHRSD#H82iNa6Pm9dQvsG8pIlOyKWh7gPe9q0JkIkteIbG3YM
scam:https://github.com/Cache-engine
scam:https://github.com/tobxx/teamredminer
scam:https://github.com/PhonixNetwork
scam:https://github.com/devlsoftware
scam:https://github.com/Phoenix-MinerDev
scam:https://github.com/PhoenixDev-Team-Miner/PhoenixMiner
scam:https://github.com/Phoenix-DevMinerTeam/PhoenixMiner
scam:https://github.com/chia-plotter/chia
scam:https://github.com/Awesome-MinerDev
scam:https://github.com/Phoenix-Miner-TeamDev/
scam:https://github.com/PhoenixDev-Miner-Team/
scam:https://github.com/Phoenix-TeamDev/
scam:https://github.com/PhoenixBetaMiner
scam:https://github.com/Nebu-Tech/
scam:https://github.com/LHR-Pill/
scam:https://github.com/Phoenix-mine
scam:https://github.com/Phoenixmine
scam:https://github.com/trexminer-TRex/
scam:https://github.com/Phoenixmin/
scam:https://github.com/Phoenix-mine-core/
scam:https://github.com/PhoenixMiner
scam:https://github.com/PhoenixMinerCore/
scam:https://github.com/PhoenixMine-Team
scam:https://github.com/PhoenixMine-Core

github.com/ethpillan => was malware (I said "was" because the repo is currently deleted, but maybe it can find some deleted posts with that bad link)
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
August 13, 2020, 09:44:12 AM
#2
Code:
scam:trustcoin.exchange
other:www.bitcointalk.org
spam:>good project
scam:bitcointaIk
spam:coinidol.com
scam:Seems to be lower fee
scam:provides a free bonus of
advertising:dailyhodl.com
advertising:europeangaming.eu
advertising:crypto-news-flash.com
advertising:theblockcrypto.com
advertising:newsbtc.com
advertising:cryptoglobe.com
advertising:decrypt.co
advertising:financemagnates.cmo
advertising:thecurrencyanalytics.com
advertising:coinjournal.net
advertising:ambcrypto.com
advertising:ethereumworldnews.com
advertising:nairametrics.com
advertising:ghacks.net
advertising:Download The Decrypt App.
spam:AnonymousMask.Finance
spam:dabbyfinance
spam:mediastudies.as.virginia
spam:scholastic.com
spam:lcluc.umd.edu
scam:gun-bot.com
other:1HZwkjkeaoZfTSaJxDw6aKkxp45agDiEzN
spam:satoshihill
spam:dissertation-service.org
spam:WTS account PAYPAL with huge balance
spam:50k paypal exchange 5k btc
spam:t.me/Paypal_Seller_Account
spam:articles.whalesheaven.com
spam:A new standard in mining resource management! - goodhash.io
spam:https://t.me/CryptoGeneration_official
spam:Features: 1. Auto shilling, more than 1,999 groups
spam:>Thank You<
spam:coingabbar.com
spam:unipayofficial.com
spam:bitcoingold.me
spam:As an AI language model
spam:As of my last knowledge update in
spam:my learning base is limited to information until
spam:neonlink
spam:neon link
scam:Binarium
spam:Magic Square
spam:martelgold.com
spam:Bitget
spam:SOEWFr1JxBDFOeClcPu5Wv
spam:2dfb4fc1d90a9d80
spam:1c6q3Wzp9hHarfqP0I1SBhAVlIjyeF7b2
I'm testing with ">" in front of "good project", so it only matches something like the beginning of a line.

Feel free to explain the reason why you add a certain phrase in your post
Just don't put it inside the code-tag.

Explanation
provides a free bonus of
For advertising, see this user's post history.
For 1HZwkjkeaoZfTSaJxDw6aKkxp45agDiEzN, see this topic.
Binarium, see this post.

Removed from my list:
Quote
advertising:cointelegraph.com/news
advertising:coindesk.com
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
August 13, 2020, 09:42:47 AM
#1
The data
See loyce.club/badposts
All categories / spam / scam / other / advertising / email
Most recent posts are shown first.

Whitelisted users
Code:
LoyceV
LoyceMobile
TheBeardedBaby
Lafu
Rizzrack
Timelord2067
marlboroza
morvillz7z
NotATether
actmyname
nutildah
hosseinimr93
Mitchell
Bthd
light_warrior
ABCbits

Post keywords
Whitelisted users can post keywords. Please don't use very common words (such as "scam") that trigger too many false positives. And please keep all keywords in one code tag in one post: edit it to add/remove keywords.
Other users can also post, with 2 possible outcomes:
  • I whitelist you and process your keywords
  • I remove your post
Please don't quote code-tags.

Tips
Leave out "https" from scam links.

Format
Within code-tags, post either "scam:thiswebsiteisascam" or "spam:Ikeeppostingthesamecrapeverywhere". One phrase per line. See this example.
Remove the line to exclude the keyword in the next update.
Only use a space after "scam:" if you want to include a space in your search.
Keyword "other:" is meant for text that isn't necessarily spam or a scam, but needs highlighting nonetheless.
Keyword "advertising" is meant for sites that are often used to copy an article and create a backlink.
Keyword categories:
  • spam:
  • scam:
  • other:
  • advertising:
  • email:

Features
I search the last ~200,000 posts for new keywords. This covers approximately 1 month.
I update all keywords every 20 minutes.
I show all matches for all usernames and ranks. No exceptions. Not all of them are bad.
Whitelisted users are shown in green.

Limitations
There's a maximum of 60 keywords per post (if you add more those will be ignored). I can increase this if needed.
There's a maximum of 4000 matches per category. If there are more, the oldest are removed.
The minimum keyword length is 4 (for other) or 5 (for spam/scam) characters.
I also search quotes.

Report posts!
This list is only useful if someone actually reports the bad posts Smiley

Post removal
I want to keep this topic compact (so I can quickly scrape it many times). That means I'll delete almost all posts that don't contain a list from a Whitelisted user. I would say "I hope I'm not offending anyone", but really, I'm okay with that Tongue



Q&A
What are you trying to accomplish with this thread?
See around here Smiley

Will this look into titles?
Some types of scams involve posting very little content in the OP body but then go on to include important keywords in the title.
For now: nope Sad
I don't keep track of titles with this data.

This could be cross-checked with your list of banned accounts.
Thanks, I've added it.

Can I use it for searching for alts? Like links to twitter, facebook, telegram usernames, etc?
Can I use it for catching plagiarism, like searching for a whole sentence or this will clogged the server, or maybe only a phrases and not so common words like we did in the SpamBuster club with suchmoon?
My plan was to only look back about 100k posts (currently just over 2 weeks), so it won't really help you here. But it would (near) instantly add new posts, and that's what I'm aiming for here.
Searching all my data without database takes too long to do on a regular basis. You should try TryNinja's database though!

would it catch spoofed urls? www.bitcointalk.org
Without "www", the url turns into 9gag.com. I think theymos should give "www.bitcointalk.org" the same treatment.
I've added category "other" for things like "www.bitcointalk.org".

It shows every word which has keyword "moron" or "moran" inside:
umoran
Is it supposed to work like this?
I search for the exact phrase (case insensitive), so it matches anything. You can add a space in front of it (as you did already), but that might miss some matches too. It's more or less as intended, if I change this, it might overlook other matches.

Can we also include the reasoning behind why something is a scam? Perhaps a link to a thread that explains it?
It can be explained in the post in this topic. I don't want to add repetitive explanations to my badposts page.

"the eth pill stuff" has malware https://bitcointalksearch.org/topic/m.54876299
It would be great if you can add this to your earlier post Smiley



Please don't quote code-tags.

Please don't quote code-tags.
You should write that on the first row of the OP.
It's a tad higher now. Don't worry about overlooking it: I only added it today. I don't think it's much of a problem though: only the first code tag in each post is processed, and I now remove duplicate keywords to reduce search time.

Each 15 minute update takes about 1 second to process.
Each new keyword takes a few minutes to process, reading all 200,000 posts is slow. Processing several new keywords at once is more efficient, so feel free to add them Smiley

spam:minepi.com/
scam:github.com/pillforeth/
I'm trying to improve searching for whole words only. I now remove the trailing slash ("/") from the keyword before searching. I don't think it matters for your keywords, but it can improve other strings.

other: moran
other: moron
other: moron
Try without the spaces now Smiley

I've tried with space in front and the end, it didn't find anything. I have also tried with space in front it also didn't find anything.
You were trying to adjust for my old search, right while I was adjusting it to improve matching complete words only.

scam:https://github.com/pillforethereum/ETHpillAN/
There's many of those altcoin pills nowadays.
You should probably omit the "https://"-part, a scammer can do the same.

How far back does your search go?
See:
I search the last ~200,000 posts for new keywords. This covers approximately 1 month.
This search takes about 2 minutes for new keywords. It's mainly meant to catch new posts, older posts can be found through other means.

Any thoughts why this post and User wasnt catched today
The post was edited, see the unedited post.
Unfortunately, I can't know which posts have been edited, so this is a loophole to escape my badposts list.

I think you should just make another link/html file that contains all cointelegraph spammers.
I did already, see loyce.club/badposts/advertising.html.

@Loyce can you please remove one of these keywords :
github_com/ProjectEthereumPill
github_com/ProjectEthereumPill/EthereumPill/
Thanks, done:
The following overlapping keywords have been removed:
github.com/ProjectEthereumPill/EthereumPill
github.com/pillforethereum/ETHpillAN (because of github.com/pillforeth)

I also noticed the "Banned" notification is only useful for new keywords, because my banned list is only updated once a day. I ran a one-time update from scratch, searching the last ~800,000 posts, this updates the banned-status on older posts.

Hope I did it right.
HYIP and MLM are too short, the minimum word length is 5 for scams.

@LoyceV did you delete the old info ? The scam link displays only 22 archived posts.
I do have logs, but it's too many lines to search now. I think someone must have entered a keyword with many hits, then removed the keyword again. My "badposts" only shows the latest 4000 posts each time I update it, but when a keyword is removed, all those entries are removed too.
I've manually reset it to re-check all keywords in the last ~200k posts. This restored a longer list again.

Is ETC officially a scam? I see it in the blacklist words as a scam.
That is debatable...
However the "scam : ETChash" keyword just helps catch posts like this one that have malware download links

I think the keyword "PhoenixMiner" is too general and gives out a lot of fake positives...



I made a new toy:
[Newbie scrutiny instead of jail] Every new user's first post: loyce.club/patrol:
See loyce.club/patrol/

Please Report (or Merit) the posts when needed Wink

It's updated once a minute.

Sample:
Image loading...
Jump to: