Pages:
Author

Topic: Ninjastic.space - BitcoinTalk Post/Address archive + API - page 15. (Read 16671 times)

legendary
Activity: 2758
Merit: 6830
That bad? I've only seen Ninjastic.space offline once, but your webhost was on my shortlist in case my AWS hosting (until now kindly sponsored by suchmoon) expires in about half a year. Your webhost seemed like very good value for money, but AWS just keeps running. Especially for scraping posts uptime is kinda important.
I thought they were fine, but I had 3 unscheduled maintenances on another VPS instance I have with them and I once had downtime because of "temporary issues with the DNS resolver" on their end (they first claimed they had nothing to do with it). I will probably try to find another provider next time I need one.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
My VPS proving to be as unreliable as always. Cheesy
That bad? I've only seen Ninjastic.space offline once, but your webhost was on my shortlist in case my AWS hosting (until now kindly sponsored by suchmoon) expires in about half a year. Your webhost seemed like very good value for money, but AWS just keeps running. Especially for scraping posts uptime is kinda important.
legendary
Activity: 2758
Merit: 6830
@TryNinja something wrong happens with your website, I found error from the page, here the warning.

Code:
An error occurred during a connection to ninjastic.space. PR_CONNECT_RESET_ERROR

I don't know this error only on my side or anyone also have same issue. If you guys, open this sites and its working let me know.
Thanks, it should be back online. My VPS proving to be as unreliable as always. Cheesy
legendary
Activity: 2324
Merit: 1604
hmph..
@TryNinja something wrong happens with your website, I found error from the page, here the warning.

Code:
An error occurred during a connection to ninjastic.space. PR_CONNECT_RESET_ERROR

I don't know this error only on my side or anyone also have same issue. If you guys, open this sites and its working let me know.


Thanks, it should be back online. My VPS proving to be as unreliable as always. Cheesy
Haha. its working now, great works bro Smiley

legendary
Activity: 2380
Merit: 5213
Okay, here's another example, what's wrong with it?
The post in question has been edited. (Note that you don't see the "last edit ..." message, because it has been edited in less than 10 minutes after its creation.)
As ninjastic.space searches in unedited versions of the posts and you are searching for the edited content, it can't find the post you are looking for.
staff
Activity: 2436
Merit: 2347
Something broke when searching for some phrases (not all). For example, when searching for this phrase, it gives "No results...", although there is such a post.
There is no problem with ninjastic.space.
The problem is with the word "don't" and how the poster has typed it. If you remove that word from your search, you will get the correct result. (For testing, you can click here.)

The poster has typed the word "don't" using an incorrect punctuation mark.
-snip-

Okay, here's another example, what's wrong with it?

PDX An interesting project with great prospects! A good and confident start, a large team that is interested in the rapid and powerful development of the project. Price over $43, that's good looking for real. A good and promising project capable of pleasantly surprise supporters in the near future. I believe that PDX is no doubt prosperous as it is headed by a strong team to make a good future of it.

The only word that contains the wrong character I deleted (the word "that's"). And still the search result is 0.
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
that also bring up the question how those posts with homographs are handled
The only way for Ninjastic to show homographs would be if Recent Posts shows them. I'm not sure if theymos' fix applies there too.
legendary
Activity: 3668
Merit: 6382
Looking for campaign manager? Contact icopress!
Back in the day were may homograph attacks. Ppl would just change one or few vocal letter from Latin to Cyrillic or other and then it was impossible to check for plagiarism. After me burbling for months, theymos fixed it.
I think if you still try to search for cyrillic "a"  outside of the local section and archive you still will find some hits.

I didn't know about it, sorry.
From what I see it may not be a problem. At least the example I've looked at seems to have the backup with proper latin characters.
So if one searches with wrong characters.. it should be his own problem... And changing that may not be OK because those characters may be valid on certain regional/national pages.
legendary
Activity: 2240
Merit: 3150
₿uy / $ell ..oeleo ;(
Interesting, that also bring up the question how those posts with homographs are handled, because theymos did some changes, and they are displayed without the homographs in the threads, but you are still able to search based on homographs an they came up in the results as homographs. I don't think ppl still do that as there's no point anymore, I'll check later and come with input.

I'm not sure what you mean, since homographs are just normal words (unless there's other meaning I don't know). And (English) words are treated as words.
On the other hand, basically any proper web crawler engine will ignore (treat as space) all kinds punctuation to avoid problems. (Also usually treats multiple spaces as one, but that doesn't matter much here).
I guessed that this one may do the same, made a test, it worked out, hence that post of mine.

Back in the day were may homograph attacks. Ppl would just change one or few vocal letter from Latin to Cyrillic or other and then it was impossible to check for plagiarism. After me burbling for months, theymos fixed it.
I think if you still try to search for cyrillic "a"  outside of the local section and archive you still will find some hits.
legendary
Activity: 3668
Merit: 6382
Looking for campaign manager? Contact icopress!
Interesting, that also bring up the question how those posts with homographs are handled, because theymos did some changes, and they are displayed without the homographs in the threads, but you are still able to search based on homographs an they came up in the results as homographs. I don't think ppl still do that as there's no point anymore, I'll check later and come with input.

I'm not sure what you mean, since homographs are just normal words (unless there's other meaning I don't know). And (English) words are treated as words.
On the other hand, basically any proper web crawler engine will ignore (treat as space) all kinds punctuation to avoid problems. (Also usually treats multiple spaces as one, but that doesn't matter much here).
I guessed that this one may do the same, made a test, it worked out, hence that post of mine.
legendary
Activity: 2240
Merit: 3150
₿uy / $ell ..oeleo ;(
Something broke when searching for some phrases (not all). For example, when searching for this phrase, it gives "No results...", although there is such a post.

And in the database of the site Ninjastic.space, it is also there.

Or maybe there is no result because the post was edited?

I think that the only thing TryNinja has to can "fix" is to strip out all punctuation marks from the query string, normal and odd ones too.
I'm telling this because a search for Both workers and employers are making money with Bondex so why don t you take a part in this money earning machine by investing in it does return exactly the expected results.
Of course, some more tests could be needed.

Interesting, that also bring up the question how those posts with homographs are handled, because theymos did some changes, and they are displayed without the homographs in the threads, but you are still able to search based on homographs an they came up in the results as homographs. I don't think ppl still do that as there's no point anymore, I'll check later and come with input.
legendary
Activity: 3668
Merit: 6382
Looking for campaign manager? Contact icopress!
Something broke when searching for some phrases (not all). For example, when searching for this phrase, it gives "No results...", although there is such a post.

And in the database of the site Ninjastic.space, it is also there.

Or maybe there is no result because the post was edited?

I think that the only thing TryNinja has to can "fix" is to strip out all punctuation marks from the query string, normal and odd ones too.
I'm telling this because a search for Both workers and employers are making money with Bondex so why don t you take a part in this money earning machine by investing in it does return exactly the expected results.
Of course, some more tests could be needed.
legendary
Activity: 2380
Merit: 5213
Something broke when searching for some phrases (not all). For example, when searching for this phrase, it gives "No results...", although there is such a post.
There is no problem with ninjastic.space.
The problem is with the word "don't" and how the poster has typed it. If you remove that word from your search, you will get the correct result. (For testing, you can click here.)

The poster has typed the word "don't" using an incorrect punctuation mark.

From HTML content of the page:



The word "don't" should have been typed as shown below.


staff
Activity: 2436
Merit: 2347
Something broke when searching for some phrases (not all). For example, when searching for this phrase, it gives "No results...", although there is such a post.

And in the database of the site Ninjastic.space, it is also there.

Or maybe there is no result because the post was edited?
legendary
Activity: 2758
Merit: 6830
I just ordered a laptop which should arrive in ~3 days. Then I’ll try to take some time to work on the huge backlog of requests. Smiley
legendary
Activity: 1456
Merit: 5874
light_warrior ... 🕯️
The problem is my lack of time
No rush [...]
+1 ... But when you have free time it would be great if you add the ability to split all charts [not only Merit] into smaller [and larger] time ranges. When I rushed to look for what I needed, I did not find it because Loyce/ has hundreds of threads and I sometimes get confused, so if you would add this feature it would be great.

Quote
legendary
Activity: 3290
Merit: 16489
Thick-Skinned Gang Leader and Golden Feather 2021
The problem is my lack of time
No rush Cheesy But I have another request for your list: when searching posts, I'd like a tick box to only show Merited posts. That makes it easier to find what I'm looking for. Even better if I can also select by who the post was Merited.

In this case I'm looking for this post:
There was a nice explaination about NFTs in the Wall Observer thread a while ago (but I can't find it back). It showed NFTs only mean something on a certain place and have no use whatsoever anywhere else.
copper member
Activity: 1666
Merit: 1901
Amazon Prime Member #7
It looks like when you scrape posts with quotes if the quoted post was made that day, or the quote will say the date of the quoted post is "today". For example, this post quotes a post that was written on July 8, but the quote says "today".

It looks like you already replace the post date with the current date for the date of the post.

It would be nice if the quoted date reflected the actual date of the quoted post.
legendary
Activity: 2758
Merit: 6830
Would it be too complicated to add some searcg filter/parameters for links and media content? Some sort of "check to see only posts with media content" or "check to exclude posts with links"
I need to reindex all the data to make @LoyceV's suggestion and yours to work. The problem is my lack of time, now that I've been contributing to a few projects outside BTT. Undecided

I'll see if I can take a few hours this weekend to do this.



Is it possible to track the number of unique threads and somehow integrate it with the weekly/monthly polygon graphs?
- In theory [CMIIW], it shouldn't go beyond the request limit.
Not that easily. I would need to figure out how to tell which post is a "topic" (a.k.a the OP). Maybe assuming the older post is the OP, but it will never be as accurate as it should. Also, same issue as I said above.
copper member
Activity: 1652
Merit: 1325
I'm sometimes known as "miniadmin"
Hey!

Would it be too complicated to add some searcg filter/parameters for links and media content? Some sort of "check to see only posts with media content" or "check to exclude posts with links"
Pages:
Jump to: