First post, by tkrn
Dear Community -
Longtime listener, first-time caller here. It's wonderful what this community has done and soo many other communities have done. As the internet ages I've taken it upon myself to help index, crawl and preserve internet sites that are disappearing as time goes on. This is especially relevant for those sites that have been locked in time where the maintainer is no longer maintaining it or has gone abandoned. As these sites become abandoned, it's a matter of time before they go offline indefinitely! My goal is to capture it before that happens!
I'm looking for a call to action to help me capture a list of sites that fit the criteria to get indexed in this archive collection. You may ask, why not leverage the Wayback Machine? This is a fair question, in my observations, the Wayback Machine does not always capture the binaries associated with these types of sites for whatever reason or another thus leaving important binaries/data unarchived.
The archiving that takes place leverages the Heritrix engine which is the same crawl engine that is behind the Wayback Machine along with a number of other private archive projects generally funded by universities. This means, the archives in this collection are in archive quality collections (warc format). It's also worthy to note, I have approximately 250TB of current storage and have the potential to scale higher.
For more details, you can find them here: https://blog.tkrn.io/vintage-computer-game-console-archive/
And a list of currently archived sites: https://blog.tkrn.io/tkrns-archive-indexed-sites/
Call to action, drop here or use the submission form on my blog for sites that fit the following mission statement:
tkrn’s archive is a niche web archive specifically focused on the preservation of vintage computer technology and video game console related information. There is an emphasis on preserving sites that have information to reverse engineering hardware/software and or driver/game/mod repositories.