Pages:
Author

Topic: [BPIP] Bitcointalk Public Information Project [Back in Action] - page 26. (Read 21771 times)

sr. member
Activity: 840
Merit: 375
Your bot will need a login, possibly deal with CloudFlare
Damn I forgot that to view profiles you need to be logged in


Since you mentioned Excel I created it in a CSV format, which you should be able to import into a spreadsheet. If you prefer a different format, like JSON or XML - let me know. The columns are:

Code:
user_id, positive_score, negative_score, neutral_score, trust_color, dt_status, dt1_strength, dt2_strength, flag_count

If this is good I can set it up to be updated once a day.

That format works for me! It would be great if you also added a "username" column so I can make it cross-compatible wether the user included bitcointalk profiles urls or usernames in the excel file.



Now, there is only ~27k users, is that what you meant by

It's technically possible that some trust ratings (colors) aren't updated quickly if e.g. a rarely active DT member posts a rating for another rarely active user, so keep that in mind.
because if it's the case from my tests 27k is definitely not enough , like half the users I tested are not found in it Cry

Hmm, maybe I should at first use this CSV to check the user's trust, if he's not in it, I will search for him in LoyceV's Custom trust list


I assume you included me because I created it.  thx!   But I'm not part of the official team anymore.  Smiley
Yes that's why I included you,I wasn't aware you left the team sorry!
Vod
legendary
Activity: 3668
Merit: 3010
Licking my boob since 1970
Hey BPIP team @ibminer @Vod @suchmoon

Hey Aveatrex

https://bpip.org/about.aspx

I assume you included me because I created it.  thx!   But I'm not part of the official team anymore.  Smiley
legendary
Activity: 3654
Merit: 8909
https://bpip.org
Now that you say it, I wonder if i shouldn't make the bot do the same? i.e scrape directly bitcointalk profiles instead of using bpip as a middleman? That for sure wouldn't solve the false postives detected by 2 out of 68 anti viruses (using virus total) since it will still be making automated requests  Cry but is a solution to be considered

You could do that but it's quite a bit of hassle. Your bot will need a login, possibly deal with CloudFlare, and of course scrape multiple pages.

Here's a file of all trust ratings that we have on BPIP - let me know if it works for you: https://bpip.org/all_trust.csv

Since you mentioned Excel I created it in a CSV format, which you should be able to import into a spreadsheet. If you prefer a different format, like JSON or XML - let me know. The columns are:

Code:
user_id, positive_score, negative_score, neutral_score, trust_color, dt_status, dt1_strength, dt2_strength, flag_count

If this is good I can set it up to be updated once a day.
sr. member
Activity: 840
Merit: 375
No, the part just under that, if it's Red, Black or Green.
I also have Trust score images, but it depends on how many you need.

Well if I'm going to release the software publicly, I can't predict how many users an user X will include in his excel file so that won't work.

Technically, I can use and scrape this page instead of bpip it would be much easier but the downside is that it updates every week which is a bit too long to my taste.

We continuously scrape every user profile, prioritizing those who are most active.
Now that you say it, I wonder if i shouldn't make the bot do the same? i.e scrape directly bitcointalk profiles instead of using bpip as a middleman? That for sure wouldn't solve the false postives detected by 2 out of 68 anti viruses (using virus total) since it will still be making automated requests  Cry but is a solution to be considered
legendary
Activity: 3654
Merit: 8909
https://bpip.org
Edit: I'm curious, if theymos trust dump updates every week, how bpip updates the trust of users on demand?

We continuously scrape every user profile, prioritizing those who are most active. It's technically possible that some trust ratings (colors) aren't updated quickly if e.g. a rarely active DT member posts a rating for another rarely active user, so keep that in mind.

I'll see how I can make this easier for you.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
No, the part just under that, if it's Red, Black or Green.
I also have Trust score images, but it depends on how many you need.

Quote
Edit: I'm curious, if theymos trust dump updates every week, how blip updates the trust of users on demand?
Theymos only dumps Trust lists (only for users who have at least one post), not Trust (feedback) scores. BPIP has to scrape each profile for each update.
sr. member
Activity: 840
Merit: 375

What do you mean by trust - is it this part
No, the part just under that, if it's Red, Black or Green.

You can also use the tool created by LoyceV to see trust scores of all users in a single page.
Custom Trust list viewer, (Topic on bitcointalk)

Thank you for your suggestion I'll take a look at it

Edit: I'm curious, if theymos trust dump updates every week, how bpip updates the trust of users on demand?
legendary
Activity: 2380
Merit: 5213
...Is it possible to have a compressed file of the trust of each user updated like each 24 hours, or is that too much to ask for?
I don't think you need to scrape BPIP.
You can use the following file provided by theymos to extract all trusts and distrusts. It is updated once per week.
trust.txt.xz

You can also use the tool created by LoyceV to see trust scores of all users in a single page.
Custom Trust list viewer, (Topic on bitcointalk)
legendary
Activity: 3654
Merit: 8909
https://bpip.org
I'm scraping the trust of a set list of users from an excel file that the user selects and saves the results on the same excel file. The problem of this is that it's slow, is heavy on the server and triggers false positives from 1-2 anti-virus that thinks the software is performing a ddos attack or is part of some bot net. Is it possible to have a compressed file of the trust of each user updated like each 24 hours, or is that too much to ask for?

What do you mean by trust - is it this part?

Loading...
Edited 2020-11-30 to fix a broken image
sr. member
Activity: 840
Merit: 375
Hey BPIP team @ibminer @Vod @suchmoon,

I'm currently developing a mini-software for the community, is web scraping your website allowed via automated requests? I prefer to ask before as I don't know if you can afford that with your hosting especially that right now, the website's response seem to be slow

Let me know if you need additional information on the intended usage

Let me know what you want to scrape (via PM if you don't want to make it public) and we might have a better solution for it.

I'm scraping the trust of a set list of users from an excel file that the user selects and saves the results on the same excel file. The problem of this is that it's slow, is heavy on the server and triggers false positives from 1-2 anti-virus that thinks the software is performing a ddos attack or is part of some bot net. Is it possible to have a compressed file of the trust of each user updated like each 24 hours, or is that too much to ask for?
legendary
Activity: 3654
Merit: 8909
https://bpip.org
Hey BPIP team @ibminer @Vod @suchmoon,

I'm currently developing a mini-software for the community, is web scraping your website allowed via automated requests? I prefer to ask before as I don't know if you can afford that with your hosting especially that right now, the website's response seem to be slow

Let me know if you need additional information on the intended usage

Let me know what you want to scrape (via PM if you don't want to make it public) and we might have a better solution for it.
sr. member
Activity: 840
Merit: 375
Hey BPIP team @ibminer @Vod @suchmoon,

I'm currently developing a mini-software for the community, is web scraping your website allowed via automated requests? I prefer to ask before as I don't know if you can afford that with your hosting especially that right now, the website's response seem to be slow


Let me know if you need additional information on the intended usage
legendary
Activity: 2240
Merit: 3150
₿uy / $ell ..oeleo ;(
Just change the :
Code:
https://bpip.org/report.aspx?r=mostmerited
to:
Code:
https://bpip.org/report.aspx?r=earnedmerit
and it will be fine. I think there was an update, now I get also reward for the "Most earned merit" Smiley Well I lost 2 positions due to inactivity the last few months but.. still in top 25 Tongue

Most merited is not the same as earned merit - the link is contained in the first shield of the five shields on each users' page information.

Hopefully it's just a glitch during the fortnightly update. (have gone back from 39th to 37th with my last fortnight's posts, so about the same as before. Wink )




Are your house building works done yet?

Then mostmerited = mostmerit
Code:
https://bpip.org/report.aspx?r=mostmerit

Quote
Are your house building works done yet?
Renovating. 2 floors done, now is the basement this winter. Build some small things in the garden as well, almost finished with shed for wood. I'm doing all alone so it takes time.
legendary
Activity: 3696
Merit: 2219
💲🏎️💨🚓
Just change the :
Code:
https://bpip.org/report.aspx?r=mostmerited
to:
Code:
https://bpip.org/report.aspx?r=earnedmerit
and it will be fine. I think there was an update, now I get also reward for the "Most earned merit" Smiley Well I lost 2 positions due to inactivity the last few months but.. still in top 25 Tongue

Most merited is not the same as earned merit - the link is contained in the first shield of the five shields on each users' page information.

Hopefully it's just a glitch during the fortnightly update. (have gone back from 39th to 37th with my last fortnight's posts, so about the same as before. Wink )




Are your house building works done yet?
legendary
Activity: 2240
Merit: 3150
₿uy / $ell ..oeleo ;(
Hi guys,

just a quick heads up, the link

Code:
https://bpip.org/report.aspx?r=mostmerited

returns a page:

Code:
https://bpip.org/?noreportname

Regards,

Just change the :
Code:
https://bpip.org/report.aspx?r=mostmerited
to:
Code:
https://bpip.org/report.aspx?r=earnedmerit
and it will be fine. I think there was an update, now I get also reward for the "Most earned merit" Smiley Well I lost 2 positions due to inactivity the last few months but.. still in top 25 Tongue

Edit: wasn't mostmerit but earnedmerit, corrected now.
legendary
Activity: 3696
Merit: 2219
💲🏎️💨🚓
Hi guys,

just a quick heads up, the link

Code:
https://bpip.org/report.aspx?r=mostmerited

returns a page:

Code:
https://bpip.org/?noreportname

Regards,
legendary
Activity: 3654
Merit: 8909
https://bpip.org
https://epochtalk.org/reference/api-reference.html#get-recent-messages
That is only per user, but they do have a framework.

I know... it looks like it's just some autogenerated doc for the node.js crap. So basically what you see on the website. I don't think it has "get all new and updated trust lists since " or anything like that. Hopefully there will still be at least the weekly downloadable files.

As long as the forum displays the last update, you can always recursively parse any top level category as needed.

Possibly. But on a busy forum like Bitcointalk you'd expect a lot of boards have activity so multiple pages would need to be reparsed every time. A patrol/recent posts page/API would be better and I don't think EpochTalk has that. Maybe putting all boards on a watchlist would solve it.
Vod
legendary
Activity: 3668
Merit: 3010
Licking my boob since 1970
Ideally the forum should have a basic API, which would make it easier for everyone, but I'm not holding my breath.

https://epochtalk.org/reference/api-reference.html#get-recent-messages
That is only per user, but they do have a framework.

As long as the forum displays the last update, you can always recursively parse any top level category as needed.
member
Activity: 382
Merit: 40
Ditty! £ $ ₹ € ¥ ¢ ≠ ÷ ™
My apologies... But... Not really.... My list was long overdue for a major overhaul.

There were also other changes as well, if that makes anyone feel better Roll Eyes

How does adding multiple hundreds of UID's to your distrust list qualify as a "major overhaul"?
legendary
Activity: 3654
Merit: 8909
https://bpip.org
Is it possible to create a search filter on the website that can help Zero down to specific profiles using some of this parameters like;
- Number of received Merits,
- Number of Activity Points
- Date or Year of Registration
- Number of posts?

I have some thoughts on that. It would likely be an API but technically creating a user-friendly search form on top of it wouldn't be too hard. But it's quite far down the TODO list.

Like If I wanted to see profile that was registered in 2015, with 167 Merits and has 777 activity points. I would just feed in the data and get the results of profiles in that range.

There are 67 users registered in 2015 with activity between 750-800 and most of them have 250+ merits with a few having ~100. Nobody close to 167.

It would be a good tool for bursting account sales and other investigations if it's implementable.

Might be this one: The Management

But even if you find a match, anyone can try to sell an account with 13587 Posts, 1946 Activity, 6823 Merit and green Trust. It doesn't prove anything.

Not finding a match might be better - it's a 100% scam then.
Pages:
Jump to: