First post, by Ozzuneoj
- Rank
- l33t
Big post warning. This is a brain dump for those who are deeply interested in this topic or in data preservation.
With the recent complete loss of Anandtech (27 years of work now redirects to the forum), I have been thinking about whether there's any way for people to bring sites like this back in a way that maintains their original usability.
Archive.org is an amazing resource, but I find that 80% of the time that I access a page, something is broken, images\charts are missing or links go nowhere.... and holy cow is it slow. Ironically, it reminds me of dial-up, except with more broken links. It's really too bad that so much of their resources are taken up by 500,000 snap shots of the same web page, taken minutes\hours\days apart, with tons of broken links, forced redirects (to dead ads) that render pages unusable. It makes me wish we had at least SOME copies of entire websites without broken links.
I'm not really even sure what I'm asking for here. I guess just a place for people to list websites that were a valuable resource that are now completely gone, or are only available in bits and pieces on archive.org. OR, list websites that should be backed up or archived fully before they are gone.
If anyone wants to brainstorm ways (realistic or pure fantasy...) that these sites could be brought back and maintained, that would be cool too.
In the case of Anandtech, I found what is supposed to be an archive of the website as of September 1st 2024. Looks to be about 73GB compressed. There is some more information about it here, though I don't know what to do with any of that information. I have no experience with web hosting or what those commands are for.
Does such an archive exist for other sites? There were so many that are now completely gone or broken...
Tech Report (still online but totally broken and probably entirely managed by AI or something)
Firingsquad.com
xbitlabs
iXBT Labs (which is still online but almost all of the images and charts are gone)
nvnews.net
Rage3D (just a forum now)
... and there are many more.
How about forums?
In addition to the forums at all the sites above, I think of ones like Quest Studios, which I don't believe I'd ever had a chance to use back in the day. When I got into retro computing about 10 years ago I saw a ton of references to it here due to the huge amount of retro sound card and MIDI related posts. There is an archive of the site but the forums are seemingly gone, along with all of the collective knowledge contained in them.
Solutions?
1. One pipe-dream fantasy I can come up with would be a single organization\server dedicated to hosting these old sites exactly as they were. It would probably need to either by crowdfunded or paid for by some benevolent tech person\people (with deep pockets) who just want to give back and preserve computing history. I feel like bandwidth costs wouldn't be super high since it's a relatively small niche these days. But I do wonder if lawyers\vampires would come hunting for the web host to shut it down, even if it wasn't being used to generate income. Obviously with sites that are still up in some form (like Tech Report) it would be less likely to be able to exist without legal problems... but in the case of Anandtech, I do wonder if the new "owners" even care. The company that bought Anandtech apparently lied to the former Editor in Chief about keeping the website and articles up "indefinitely"... though I guess technically the "indefinite" amount of time could have always been "one year" to them.
2. My other thought is... is there such a thing as peer-to-peer web hosting? Like, torrenting, except with websites. For example, 100 systems have "anandtech" seeding, so when you browse the site it pulls it from them all to distribute bandwidth. There are probably a lot of issues with this that would make it impossible to use, but it's a thought anyway.
3. Last solution would simply be to have offline copies of the sites that can be run entirely as they should within a browser. I would absolutely, without hesitating, set up an old hard drive (or even a new SSD...) dedicated to hosting all of these sites so I can just browse them when I want to, on my own. Obviously, this is far far far less expensive and less complicated than any of the other solutions since it doesn't require the user to distribute\host someone else's website without their consent. Still... we'd need to get in touch with whoever ran these old sites to see if they'd support such an endeavor by digging out any old backups they had and mailing them or uploading them somewhere.
I feel like if people don't make this effort before it's too late, everything other than the "modern web" click bait and AI generated slop made FROM that click bait will be lost to time much the same way as the pre-world wide web has been. I bet people who had a chance to make full back ups of whatever was on compuserve and didn't do it regret the missed opportunity... and those effects are likely still felt in places like VOGONS as retro enthusiasts and historians try to figure things out that others probably figured out 30-35 years ago.
Internet archive is great for what it is... a chance to take small glimpses back in time, but a more focused and much smaller-scale effort to bring individual sites back to full functionality would be such a cool thing for humans to have bothered to do for the sake of preservation.
EDIT: Also, it would be interesting to hear from the site\forum admin of VOGONS with regard to any efforts that may be made to back up or archive it. Almost 25 years of forum posts about all things involving retro computing has made this an immense knowledge base. I know I have personally spent many many hours just posting information to get it out there on the internet for people to use if they ever need it. I've done that on many forums (tens of thousands of posts) in the past and they almost all have been lost to sites shutting down. The value of what I posted on most of those was fairly low most of the time, admittedly... a lot of us were just killing time and talking about games. Still, I hope that the value of what I and others post on VOGONS is high enough to be worth making an off-site backup that more than one person has access to.