Topic: Remember to archive pages on the Wayback Machine (Read 803 times)

sr. member
Activity: 322
Merit: 250
September 13, 2014, 06:30:29 PM
#8
Rather use this one: https://bitcointa.lk/ All deleted posts are still readable on it.

I searched for some threads, but could not find them on there. Other threads there were way behind and had far fewer posts than the original bitcointalk threads. On the other hand, some threads were right up to date and immediately archived anything I posted.

Why does it rapidly archive some threads, but not others?

Don't know, I think we need to ask the moderators about that.

Bitcointalk is harder for the crawlers to reach, so what you see archived here is mostly archived manually by users like you and me. I'm not sure to what extent Bitcointalk pages are archived automatically.

https://archive.org/about/faqs.php

Quote
Why are some sites harder to archive than others?

If you look at our collection of archived sites, you will find some broken pages, missing graphics, and some sites that aren't archived at all. Here are some things that make it difficult to archive a web site:

Robots.txt -- We respect robot exclusion headers.
Javascript -- Javascript elements are often hard to archive, but especially if they generate links without having the full name in the page. Plus, if javascript needs to contact the originating server in order to work, it will fail when archived.
Server side image maps -- Like any functionality on the web, if it needs to contact the originating server in order to work, it will fail when archived.
Unknown sites -- The archive contains crawls of the Web completed by Alexa Internet. If Alexa doesn't know about your site, it won't be archived. Use the Alexa Toolbar (available at www.alexa.com), and it will know about your page. Or you can visit Alexa's Archive Your Site page at http://pages.alexa.com/help/webmasters/index.html#crawl_site.
Orphan pages -- If there are no links to your pages, the robot won't find it (the robots don't enter queries in search boxes.)
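
To make the robots.txt point above concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the example.com URLs, and the `ia_archiver` user-agent token are assumptions for illustration only; they are not taken from Bitcointalk or from the FAQ.

```python
from urllib.robotparser import RobotFileParser


def crawler_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt: one crawler is kept out of /private/,
# every other crawler may fetch everything.
EXAMPLE_ROBOTS = """\
User-agent: ia_archiver
Disallow: /private/

User-agent: *
Disallow:
"""

if __name__ == "__main__":
    for path in ("/private/page", "/index.php"):
        url = "https://example.com" + path
        print(path, crawler_allowed(EXAMPLE_ROBOTS, "ia_archiver", url))
```

A crawler that respects these rules would skip anything under /private/, so those pages would only end up in an archive if the rules changed.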

Edit: My bad, you were talking about a different site.
member
Activity: 77
Merit: 10
Rather use this one: https://bitcointa.lk/ All deleted posts are still readable on it.

I searched for some threads, but could not find them on there. Other threads there were way behind and had far fewer posts than the original bitcointalk threads. On the other hand, some threads were right up to date and immediately archived anything I posted.

Why does it rapidly archive some threads, but not others?
sr. member
Activity: 322
Merit: 250
Do we have to register there to be able to archive things?

No, you can archive as much as you want for free. You don't need to register or have an account. They accept Bitcoin donations.

They treat you as an anonymous user.

What link do we have to click on the home page to start off the archiving process?

Paste the link in the bottom-right area, then click Save Page. The saved page will look like the original website, except that clicking a link will either prompt you to archive it or take you to an older version of that page. To make a new version, copy the page link and paste it into the bottom-right area on the Wayback Machine's home page.
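
If you would rather not paste links by hand, the same thing can be sketched in Python. As far as I know, the Wayback Machine captures a page when you open `https://web.archive.org/save/` followed by the page address, and `https://archive.org/wayback/available` reports the closest existing snapshot. The helper names and the thread URL below are made up for illustration, and the sketch only builds the links; it does not contact archive.org.

```python
from urllib.parse import quote, urlencode

SAVE_ENDPOINT = "https://web.archive.org/save/"
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def save_url(page_url: str) -> str:
    """Build the link that asks the Wayback Machine to capture page_url."""
    # Leave normal URL punctuation alone; escape only things like spaces.
    return SAVE_ENDPOINT + quote(page_url, safe=":/?&=%")


def availability_url(page_url: str) -> str:
    """Build the link that asks whether page_url already has a snapshot."""
    return AVAILABILITY_ENDPOINT + "?" + urlencode({"url": page_url})


if __name__ == "__main__":
    # Hypothetical thread address, used only as an example.
    thread = "https://bitcointalk.org/index.php?topic=1.0"
    print(save_url(thread))
    print(availability_url(thread))
```

Opening the first printed link in a browser should have the same effect as the Save Page box. Expect the service to throttle you if you request many pages quickly.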
member
Activity: 77
Merit: 10
Do we have to register there to be able to archive things?

No, you can archive as much as you want for free. You don't need to register or have an account. They accept Bitcoin donations.

They treat you as an anonymous user.

What link do we have to click on the home page to start off the archiving process?
sr. member
Activity: 322
Merit: 250
Do we have to register there to be able to archive things?

No, you can archive as much as you want for free. You don't need to register or have an account. They accept Bitcoin donations.

They treat you as an anonymous user.
member
Activity: 77
Merit: 10
Do we have to register there to be able to archive things?
sr. member
Activity: 322
Merit: 250
Archive any Bitcointalk pages you want here, or pages from any other site. Scammers can delete or hide information, but if we archive it, the information will exist forever.

For Bitcointalk, archive the posts of anyone you find suspicious or believe to have malicious intent. You could also record the fudders and day-to-day cause so that it is preserved for future investigations and research.

Link to the Wayback Machine:
http://archive.org/web/

Data storage is cheap enough that any archived items will exist and be accessible for as long as the internet exists.