Monday, May 29, 2006

Google's cache as a service

I have been working on this blog for over two years, and I know that many of its links are now dead.  This may be because the host is no longer available, or because the file on the web server was moved or removed.

I was thinking that Google's cache could help solve the problem of dead links.  It could be useful in one of two ways:

  1. A web server could have the page it displays on 404 errors include a link to the requested url in Google's cache.

  2. On the host with the outgoing links, set up a process that walks all of the outgoing links and, when a 404 error is returned, replaces the url with the url from Google's cache.
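
The second option could be sketched roughly like this in Python. This is only an illustration under some assumptions: the cache url form (`http://www.google.com/search?q=cache:` followed by the escaped url) is my assumption rather than something stated above, and the names `cache_url`, `is_dead`, and `rewrite_dead_links` are made up for the example. It only looks at double-quoted `href` attributes, so a real blog would likely need a proper HTML parser.

```python
import re
import urllib.error
import urllib.parse
import urllib.request

# Assumed form of a Google cache url; not confirmed by the post.
CACHE_PREFIX = "http://www.google.com/search?q=cache:"

# Matches double-quoted absolute http/https links only.
HREF_RE = re.compile(r'href="(https?://[^"]+)"')


def cache_url(url):
    """Build the (assumed) Google cache url for a page."""
    return CACHE_PREFIX + urllib.parse.quote(url, safe="")


def is_dead(url, timeout=10):
    """Return True only when the server answers 404 for the url."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout):
            return False
    except urllib.error.HTTPError as err:
        return err.code == 404
    except OSError:
        # Host unreachable or timed out: not a 404, so the link is left alone.
        return False


def rewrite_dead_links(html, check=is_dead):
    """Replace each outgoing link that 404s with its cache url."""
    def swap(match):
        url = match.group(1)
        if check(url):
            return 'href="%s"' % cache_url(url)
        return match.group(0)

    return HREF_RE.sub(swap, html)
```

Passing the check in as a parameter makes it easy to try the rewriting on a saved page without touching the network, for example `rewrite_dead_links(page, check=lambda u: u in known_dead)` where `known_dead` is a hypothetical set of urls already known to be gone. The same `cache_url` helper could also build the link for the 404 page in the first option.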

