Pages:
Author

Topic: Viewing unedited posts and deleted posts, view per post, per user or per topic - page 10. (Read 8803 times)

legendary
Activity: 2296
Merit: 2262
BTC or BUST
any government agency that is interested in tracking their citizens' involvement in crypto could be doing what LoyceV does and scrape those posts

legendary
Activity: 3654
Merit: 8909
https://bpip.org
In countries like Bangladesh the gov does not have the diplomatic resources to use the same tools that NSA offers to the US or any US allies. They employ low-level white hat / grey hat servicemen to do their online digging work. It is the same with many developing and underdeveloped nations.

Besides, I am sure NSA data is accessed in case of national security - NOT detecting / tracking crypto users for political purposes.

You're missing the point. NSA or not NSA, any government agency that is interested in tracking their citizens' involvement in crypto could be doing what LoyceV does and scrape those posts without making an announcement thread here. They could be doing even more, e.g. scrape posts in Investigations.

Information, shared by anyone on this forum is kind of made with the confidence that they can perhaps edit out any unrequired information in the future - hence the existence of the edit button.

You cannot "edit out" anything from the internet. The edit button is useful to fix spelling errors and such. It's utterly useless for hiding information, which could have been copied by anyone, could have ended in Bitcoinalk backups (and yes, likely to be handed over to law enforcement if theymos gets a subpoena), etc.

legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
But now let's talk about the bad side.

Your tool is freely available for anyone to use. The very same person I am trying to bust for multi-accounting, is also using information scattered about me on this forum to track my identity and send me threatening messages on my telegram, phone etc.
Anything you put on the internet, should be considered public information for the rest of eternity.

Now, I'd like to hear your thoughts about these 2 situations. What do you intend to do to avoid / solve such issues?
It's not LoyceV's responsibility to solve it. If you post something you shouldn't have - it's your problem. The government doesn't need LoyceV's site, they can just grab the info directly from Bitcointalk, Google cache, archive sites, or NSA hard drives.
Agreed!
I don't scrape Investigations, which is the only place on Bitcointalk that allows DOXing. If someone gets DOXed anywhere else, posts can be reported and the user gets banned. If that happens, feel free to contact me to edit one of the archived posts. When I do that, I'll also edit the filename to make it very obvious it was edited by me, and I'll probably create a log here. Until now, I haven't done this.
Another thing I've been thinking about is if someone posts something that's not allowed by my webhost. If that would happen, I'll have to edit it too.
I'm not sure how Archive.org and the likes handle illegal stuff on their servers.

You're glowing..
I'm not sure what that means
hero member
Activity: 1778
Merit: 764
www.V.systems
Now, I'd like to hear your thoughts about these 2 situations. What do you intend to do to avoid / solve such issues?

It's not LoyceV's responsibility to solve it. If you post something you shouldn't have - it's your problem. The government doesn't need LoyceV's site, they can just grab the info directly from Bitcointalk, Google cache, archive sites, or NSA hard drives.

Not all posts are archived.

Google caches have an expiry date.

And there is, as far as I know, no way to grab info from Bitcointalk if the info is never quoted by someone else and deleted / edited out. And with Theymos' response rate I don't think any gov agency would have much luck getting him to dish out user data. He's pro anonymity as far as I can tell.

Btw. you seem to think that by gov. I implied only US. Obviously that's not the case.

In countries like Bangladesh the gov does not have the diplomatic resources to use the same tools that NSA offers to the US or any US allies. They employ low-level white hat / grey hat servicemen to do their online digging work. It is the same with many developing and underdeveloped nations.

Besides, I am sure NSA data is accessed in case of national security - NOT detecting / tracking crypto users for political purposes.

Information, shared by anyone on this forum is kind of made with the confidence that they can perhaps edit out any unrequired information in the future - hence the existence of the edit button.

But Loyce's tool is circumventing that basic forum function that the average user takes for granted.

idk guys - this is a big area of grey for me. This is dwelling into the areas of retaining user data, and user privacy - It is a bit uncomfortable for me to endorse it - even though this is great work and can be used to keep the forum clean. But it has a huge potential of misuse and abuse as well.

Loyce - maybe you should think this through and make your stance clear.
legendary
Activity: 2296
Merit: 2262
BTC or BUST
What I do
  • I scrape posts within seconds, but upload them in batches every minute.
  • The list of posts per user is updated once a day (5:50 AM Amsterdam time) > This is still messy, I'm working on it!
  • Files are stored with their post number as file name. I use the first 4 digits as directory name, then upload 10,000 files per directory. You're going to want to use CTRL-F Tongue


You're glowing..
legendary
Activity: 3654
Merit: 8909
https://bpip.org
Now, I'd like to hear your thoughts about these 2 situations. What do you intend to do to avoid / solve such issues?

It's not LoyceV's responsibility to solve it. If you post something you shouldn't have - it's your problem. The government doesn't need LoyceV's site, they can just grab the info directly from Bitcointalk, Google cache, archive sites, or NSA hard drives.
hero member
Activity: 1778
Merit: 764
www.V.systems
A great tool. But I have some thoughts about the possible use and misuse of this.

Let's talk about the positive first.

I am amidst manually collecting data from a few accounts that is operated by one individual. Now, I have an X number of accounts linked with this person but I am sure this person has 2X the number of accounts in total.

I am just being lucky to collect the ones where he has made some errors.

So if I post this half report of these linked accounts, then the person would be alerted and would, in theory, try to delete all his errors from all of his accounts' post history.

Your tool, could stop this. This basically enables you to get the posts made by anyone at the first instance they hit post. Which is great in this kind of example.


But now let's talk about the bad side.

Your tool is freely available for anyone to use. The very same person I am trying to bust for multi-accounting, is also using information scattered about me on this forum to track my identity and send me threatening messages on my telegram, phone etc.

This kind of tool would essentially be a weapon in the hands of such blackmailers and extortioners.

Furthermore,

In our country, our government is banning crypto, so if I posted my information here and if tomorrow some government agency wanted to track me down - then your tool is going to enable them to do that. And possibly land a person in jail for 10 years - just for being associated with crypto.



Now, I'd like to hear your thoughts about these 2 situations. What do you intend to do to avoid / solve such issues?
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
I just checked loyce.club/archive/posts/5233/: it shows 9994 files. That means my scraper missed 6 posts, or 0.06%.
This might have been caused by burst posting (faster than my scraper can handle), or those 6 posts could be in Investigation or some other hidden board that normal users can't access.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
Last night, "recent" was broken for 8 hours.  It's the first time I've seen that, theymos didn't post yet what caused it.

As a result, I couldn't scrape anything between post 52295210 and post 52297852. That means I'm missing 2641 posts.
legendary
Activity: 3696
Merit: 2219
💲🏎️💨🚓
Is this user's posts able to be updated at all?
The scraping should be okay now, but I didn't fix the upload yet. I've uploaded an update for just this file.

I didn't notice before, but it shows many duplicate lines. The links to "scraped" don't work either, you'll have to manually change "/posts/posts/" in the URL to "/posts/".

Sorry for this, I haven't had the time to fix this yet.

Thanks, I'll check back this time tomorrow (my free time this week is fairly limited)

Regards,
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
Is this user's posts able to be updated at all?
The scraping should be okay now, but I didn't fix the upload yet. I've uploaded an update for just this file.

I didn't notice before, but it shows many duplicate lines. The links to "scraped" don't work either, you'll have to manually change "/posts/posts/" in the URL to "/posts/".

Sorry for this, I haven't had the time to fix this yet.
legendary
Activity: 3696
Merit: 2219
💲🏎️💨🚓
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
There was a bug that overwrote files a few times. Some of the scraped posts were as old as a minute or more.
I fixed the bug, let's see how long it takes to post scrape this.

Update: http://loyce.club/archive/posts/5224/52243599.html was scraped 1 second after posting. Let me know if anything else fails.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
This is currently uploading data from the past days. When done, new posts should be available online in less than a minute.
Update August 23: Upload will take a few more hours to catch up.

Quote
See http://loyce.club/archive/posts/members/ for all posts made per a certain userID
This part is still disabled.
legendary
Activity: 1554
Merit: 2037
Does it have anything to do with the recount theymos did? Not sure how that would affect the data LoyceV uses.

I did a recount of post counts earlier today. There are several bugs which cause the post count to drift from its real value over time. The current count is the accurate one.

I do recounts from time to time.

Edit: I guess not, just behind the scenes
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
Hmm... Website seems to be down, might want to take a look at it...
Great! I was waiting for that Smiley

Now see if I can get it back up Cheesy
legendary
Activity: 3010
Merit: 8114
Hmm... Website seems to be down, might want to take a look at it...
legendary
Activity: 3010
Merit: 8114
I'm sure it could be quite a weapon, lol.
If the truth can be used as a weapon against someone, he totally deserves it Tongue

Well said. Quotable LoyceV.

Of course you are inadvertently insinuating that women never lie.  Cheesy

I'm thinking eventually it will be handy in trying to compare writing styles between users, in addition to the usual "but you originally said this" type situations.

I know you could probably parse all the text from particular users from the forum itself, but in the format on your server its easier for me to attempt such a thing.

I encourage you to keep it up as long as you can.

legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
I'm sure it could be quite a weapon, lol.
If the truth can be used as a weapon against someone, he totally deserves it Tongue

Checked it, and saw only ten pages available. Is it a fixed one (for all)?
I only download the first page, there's not really a need to download other pages, as long as I get the first one often enough.



Just a thought: if I get deleted posts from modlog, I can highlight them too.
To answer my own suggestion: this won't work, modlog doesn't show which post was deleted.
legendary
Activity: 2310
Merit: 4085
Farewell o_e_l_e_o
I think it should be the version of posts within 15 minutes (if I am remembering correctly) after published. It will be more matched with forum data. Only posts edited after 15 minutes will be shown with editing history and last editing time.
Posts can be edited for 10 minutes without showing (or even keeping!) an edit history. But I'm not aiming to match the forum, I'm aiming to show the unedited post. And I can only download posts from recent when they're new, searching for posts that are 10 minutes old will be more work.
Ooops. I did not know that page. Checked it, and saw only ten pages available. Is it a fixed one (for all)? Or it is just a default page, and I can modify total displayed pages if I want?
Pages:
Jump to: