Pages:
Author

Topic: Viewing unedited posts and deleted posts, view per post, per user or per topic - page 11. (Read 8635 times)

legendary
Activity: 2170
Merit: 3858
Farewell o_e_l_e_o
you're saving the first version of every post made by everybody, ever?
Correct.

Update.
I think it should be the version of posts within 15 minutes (if I am remembering correctly) after published. It will be more matched with forum data. Only posts edited after 15 minutes will be shown with editing history and last editing time.
are all my edits within the first 10 minutes also logged?

No, edits in the grace period are not logged.

btw, is this still the same TradeFortress?

Probably.
[New Feature] "Last edit" to be shown as text on mobile. FIXED! 10x Theymos:)
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
you're saving the first version of every post made by everybody, ever?
Correct.

Update.
legendary
Activity: 2940
Merit: 7892
So, just that I understand what's going on here, you're saving the first version of every post made by everybody, ever? I don't quite get what you're doing I guess.
hero member
Activity: 1643
Merit: 683
LoyceV on the road. Or couch.
The member directory got messed up, I can't access my VPS from here so just don't look at it for the coming week.....
hero member
Activity: 1643
Merit: 683
LoyceV on the road. Or couch.
Just a thought: if I get deleted posts from modlog, I can highlight them too.

Another idea: list posts for each topicID, so it's easier to find posts that have been deleted from a certain topic.
legendary
Activity: 3626
Merit: 2209
💲🏎️💨🚓
Instead of the 8k/4k approx file size an actual post count?

Also, (am making work for you now) a sort by file name / last posted etc?

Quote
Quote
Where might we post our findings?
I'm not sure, maybe a separate thread?

Perhaps self moderated and a simple code

Code:
date: (time GMT)
name+uid
post count
post type: scam [] one line [] signature [] bump []

people can see when the posts were last reviewed so they aren't doubling up on work?
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
I thought I'd see how significant the posts step down at the 8k/4k file size:



the middle one had just one post while the first and third have 29 and eight respectively. 
The third one in your list has 8 posts. Those 8k/4k aren't real file sizes, I think the webserver shows the block size it uses on the file system.
That means this isn't the best way to sort files. I'll add it to my TODO: create a index.html with more information.

Quote
Perhaps a known spammers' link be changed to another colour? (red/purple/orange etc for spammer/scammer nuked Flag etc)??
I can strike out banned users (also on my TODO now), but since they're still posting, that won't be many users yet.

Quote
Where might we post our findings?
I'm not sure, maybe a separate thread?
legendary
Activity: 3626
Merit: 2209
💲🏎️💨🚓
Bump!

This morning, I checked http://loyce.club/archive/posts/members/?SD and it instantly revealed a spammer: http://loyce.club/archive/posts/members/1514722.html
It got me thinking: I can create a daily list of users (sorted by the number of posts they made in the past 24 hours). That would instantly highlight users who post a lot, and makes it easy to identify bump spammers. If anyone's interested to check it once in a while, I'll make it Smiley

I thought I'd see how significant the posts step down at the 8k/4k file size:



the middle one had just one post while the first and third have 29 and eight respectively.  I guess the posting list will ebb and surge during holidays and work days/week-ends.  Perhaps a known spammers' link be changed to another colour? (red/purple/orange etc for spammer/scammer nuked Flag etc)??

Where might we post our findings?
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
This morning, I checked http://loyce.club/archive/posts/members/?SD and it instantly revealed a spammer: http://loyce.club/archive/posts/members/1514722.html
It got me thinking: I can create a daily list of users (sorted by the number of posts they made in the past 24 hours). That would instantly highlight users who post a lot, and makes it easy to identify bump spammers. If anyone's interested to check it once in a while, I'll make it Smiley
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
Feel free to call it theymos.css because it's mostly stolen from here: https://bitcointalk.org/Themes/custom1/style.css Smiley

You can steal borrow more stuff from the above file, like the .code class
I didn't know CSS is that easy Cheesy

It works Cheesy See http://loyce.club/archive/posts/5190/51904241.html
legendary
Activity: 3654
Merit: 8909
https://bpip.org
The "div class post" part is there already, I never removed it. I'll make some more adjustments, I was lazy using some headers from the forum HTML, but I'll recreate them on my own.

I've named it suchmoon.css Smiley

Feel free to call it theymos.css because it's mostly stolen from here: https://bitcointalk.org/Themes/custom1/style.css Smiley

You can steal borrow more stuff from the above file, like the .code class
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
If you'd like to fix that: wrap the HTML in
...
and use the following CSS:
Thanks!
The "div class post" part is there already, I never removed it. I'll make some more adjustments, I was lazy using some headers from the forum HTML, but I'll recreate them on my own.

I've named it suchmoon.css Smiley

Code:
I'm creating a new post to test the new version. I'm also adding some code

Update: see http://loyce.club/archive/posts/5190/51903915.html
legendary
Activity: 3654
Merit: 8909
https://bpip.org
quotes are not very clear

If you'd like to fix that: wrap the HTML in
...
and use the following CSS:

Code:
    .post {
        color: #000000;
        background-color: #ECEDF3;
        font-size: 12px;
        font-family: verdana, sans-serif;
        margin-bottom: 5px;
        padding: 5px;
    }

    .post .quoteheader {
        color: #476C8E;
        text-decoration: none;
        font-style: normal;
        font-weight: bold;
        font-size: 10px;
        line-height: 1.2em;
        margin-left: 6px;
    }

    .post .quote {
        color: #000000;
        background-color: #f1f2f4;
        border: 1px solid #d0d0e0;
        padding: 5px;
        margin: 1px 3px 6px 6px;
        font-size: 11px;
        line-height: 1.4em;
    }

It's by no means complete (still has problems with code tags etc) but should help with the quotes and makes it look similar to Bitcointalk styling. You can save it as a .css file and just reference in each html so space usage would be minimal and then you can adjust the CSS as needed.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
Ever wanted to see who's lying when a post has been deleted? This could help!

I've saved (most) unedited posts since September 12 last year, and sometimes get a request to dig up unedited data. Since I now (finally) have a VPS, I'm uploading the files (only for newly created posts).

See http://loyce.club/archive/posts/ for all posts
See http://loyce.club/archive/members/ for all posts made per a certain userID (currently not working)

URL format
I remove the last 4 digits from each userID, postID or topicID to get a directory name. Each directory contains up to 10,000 files.
Example: my userID is 459836. I use 45 as directory name, and 459836.html as file name:
http://loyce.club/archive/posts/45/459836.html (this link doesn't work yet)
If there's less than 4 digits, the directory name is 0.

What I do
  • I scrape posts within seconds, but upload them in batches every minute.
  • The list of posts per user is updated once a day (5:50 AM Amsterdam time) > This is still messy, I'm working on it!
  • Files are stored with their post number as file name. I use the first 4 digits as directory name, then upload 10,000 files per directory. You're going to want to use CTRL-F Tongue
  • I don't scrape hidden boards (such as Investigations).
  • I don't keep post titles
  • I save raw HTML. That means quotes are not very clear, you'll have to read the source code (CTRL-U) to be sure who posted what.
    This is experimental, uploading might fail. By the looks of it, my current webhost will run out of disk space in a couple of weeks. I'll see how it goes. If I run out of space, I might create compressed archives per 10,000 posts.
  • Or I might delete them, so feel free to archive posts.

If anything goes wrong, let me know here.



Example
This is the unedited version of this post: http://loyce.club/archive/posts/5190/51902990.html (the layout looks better in more recent archived posts).

Update
I've added a list of all posts per user (for posts made from now on). This could be used to find deleted posts.



See [overview] LoyceV's useful data on Bitcointalk for more of my forum-related topics
Pages:
Jump to: