Warrick

(L) [2011/11/30] [tby jbikker] [Warrick] Wayback!

Apparently there is software that can recursively fetch webpages from Google's cache and from other sources, combining sources when the individual copies are partial. The software is called Warrick:
[LINK http://frankmccown.blogspot.com/2011/08/warricks-status.html]
Unfortunately, Warrick is currently undergoing a drastic update, required by changes in the Google APIs and at Archive.org. An updated version is expected 'in a couple of weeks' (see the URL above). Apparently, Google may delete a site from its cache when it fails to crawl it, so let's hope a couple of weeks is fast enough. Warrick may provide us with a full copy of ompf.
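For anyone curious what "combining sources when copies are partial" could look like, here is a minimal sketch. It is not Warrick's actual code: the function names, the source labels and the sample data are all made up for illustration, and it assumes the pages have already been fetched into per-source dictionaries keyed by URL path.

```python
from typing import Dict, List


def merge_partial_copies(
    copies: Dict[str, Dict[str, str]],
    priority: List[str],
) -> Dict[str, str]:
    """Combine per-source page maps into a single site copy.

    For each URL, keep the page from the first source in `priority`
    that has it. Sources not listed in `priority` are ignored.
    """
    merged: Dict[str, str] = {}
    for source in priority:
        for url, html in copies.get(source, {}).items():
            # setdefault keeps the page from the higher-priority source
            merged.setdefault(url, html)
    return merged


def missing_pages(wanted: List[str], merged: Dict[str, str]) -> List[str]:
    """Return the wanted URLs that no source could supply."""
    return [url for url in wanted if url not in merged]


if __name__ == "__main__":
    # Hypothetical data: each source only has part of the site.
    copies = {
        "google_cache": {"/index": "<html>index (google)</html>"},
        "internet_archive": {
            "/index": "<html>index (ia)</html>",
            "/forum": "<html>forum (ia)</html>",
        },
    }
    site = merge_partial_copies(copies, ["google_cache", "internet_archive"])
    print(sorted(site))
    print(missing_pages(["/index", "/forum", "/wiki"], site))
```

The priority list decides which source wins when several have the same page; the ordering used here is arbitrary, and a real tool would more likely prefer the most recent copy.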
(L) [2011/12/08] [tby nhm] [Warrick] Wayback!

I tried downloading and running Warrick, but as stated it no longer works with the Internet Archive. I stopped playing with it pretty quickly, as I didn't want Google to ban my server's IP if I happened to get it working without IA. It might be worth looking into the technique they use here (ignore the spammy-sounding URL):
[LINK http://www.startuploans.org/archive-recovery/]
Not sure I'll have time to look into this before the next version of Warrick is released, but I thought I'd mention it if anyone else has free time.
-nhm
(L) [2011/12/09] [tby davepermen] [Warrick] Wayback!

Just in case: does Bing have a web archive? If we mess up the Google one...
