After writing on the
history of offline rendering, I realized once again how
ephemeral the web it.
With a bit of nostalgia, and having done something similar already for certain
usenet newsgroups, I decided to resuscitate the historic OMPF forums, that were once an amazing resource for people interested in the state of the art of raytracing and pathtracing.
I made a script to scrape everything the
Wayback Machine archived, and then parse it and translate it to static web pages, including all the images I could get.
Here's the results:
-
Original OMPF forum (2006-2011).
-
OMPF2 forum (2011-2021).
If you want to do something similar, the script is
here, should be a good starting point for PHPBB-based forum scraping from wayback.
Some considerations:
- Even if we have the IA, that does not mean that the archived content is usable. Currently, I'm sure in an effort to stop AI and other bots, the IA throttling limits are painfully low (5 requests per minute), which makes projects like this take much longer than ideal.
-- I wonder if there is a future for the IA to have some of these transforms ran directly on their machines and linked as part of the archive. For example, it could be possible to detect PHPBB boards and, similarly to what I've done, convert their contents into a format that is easier to deal with.
- The modern web will be entirely lost, now most of the content is not on webpages or static archives, but behind proprietary APIs.
- We moved most of the discussions to real-time forums, especially in computer graphics - we have discords/slacks, groups on twitter/mastodon/bluesky - and all these things that build communities and friendships are amazing! We had that in the old web in IRC. But having everything in these real-time platforms, instead of separating the space for chats from the space for long-form discussions (mailing lists, newsgroups, forums), is not imho the right arrangement. Real-time platforms should look and be ephemeral!
-- I'm convinced this is the right setup also at work - e.g. the old jabber (messenger etc)/email separation is better than Slack/Teams for everything...