Train coding

When I went up to London for the State of the Browser conference last month, I shared the train journey with Remy.

I always like getting together with Remy. We usually end up discussing sci-fi books we’re reading, commiserating with one another about conference-organising, discussing the minutiae of browser APIs, or talking about the big-picture vision of the World Wide Web.

On this train ride we ended up talking about the march of time and how death comes for us all …and our websites.

Take The Session, for example. It’s been running for two and a half decades in one form or another. I plan to keep it running for many more decades to come. But I’m the weak link in that plan.

If I get hit by a bus tomorrow, The Session will keep running. The hosting is paid up for a while. The domain name is registered for as long as possible. But inevitably things will need to be updated. Even if no new features get added to the site, someone’s got to install updates to keep the underlying software safe and secure.

Remy and I discussed the long-term prospects for widening out the admin work to more people. But we also discussed smaller steps I could take in the meantime.

Like, there’s the actual content of the website. Now, I currently share exports from the database every week in JSON, CSV, and SQLite. That’s good. But you need to be tech nerd to do anything useful with that data.

The more I talked about it with Remy, the more I realised that HTML would be the most useful format for the most people.

There’s a cute acronym in the world of digital preservation: LOCKSS. Lots Of Copies Keep Stuff Safe. If there were multiple copies of The Session’s content out there in the world, then I’d have a nice little insurance policy against some future catastrophe befalling the live site.

With the seed of the idea planted in my head, I waited until I had some time to dive in and see if this was doable.

Fortunately I had plenty of opportunity to do just that on some other train rides. When I was in Spain and France recently, I spent hours and hours on trains. For some reason, I find train journeys very conducive to coding, especially if you don’t need an internet connection.

By the time I was back home, the code was done. Here’s the result:

The Session archive: a static copy of the content on thesession.org.

If you want to grab a copy for yourself, go ahead and download this .zip file. Be warned that it’s quite large! The .zip file is over two gigabytes in size and the unzipped collection of web pages is almost ten gigabytes. I plan to update the content every week or so.

I’ve put a copy up on Netlify and I’m serving it from the subdomain archive.thesession.org if you want to check out the results without downloading the whole thing.

Because this is a collection of static files, there’s no search. But you can use your browser’s “Find in Page” feature to search within the (very long) index pages of each section of the site.

You don’t need to a web server to click around between the pages: they should all work straight from your file system. Double-clicking any HTML file should give a starting point.

I wanted to reduce the dependencies on each page to as close to zero as I could. All the CSS is embedded in the the page. Likewise with most of the JavaScript (you’ll still need an internet connection to get audio playback and dynamic maps). This keeps the individual pages nice and self-contained. That means they can be shared around (as an email attachment, for example).

I’ve shared this project with the community on The Session and people are into it. If nothing else, it could be handy to have an offline copy of the site’s content on your hard drive for those situations when you can’t access the site itself.

Have you published a response to this? :

Responses

Tom Morris

@adactio I’d suggest periodically uploading copies to the Internet Archive… but *sigh* maybe wait a bit

# Posted by Tom Morris on Monday, October 14th, 2024 at 2:55pm

Simon Willison

@adactio I like using GitHub for this kind of thing because they feel more likely to exist long into perpetuity than most other hosting providers - and I believe they back everything up to three different continents (and occasionally might bury some of it in the arctic seed vault too)

Simon Willison

1 Share

# Shared by Fynn Becker on Tuesday, October 15th, 2024 at 12:04pm

9 Likes

# Liked by Aaron Crowder :vim: :go: on Monday, October 14th, 2024 at 3:10pm

# Liked by Simon Cox :SEO: on Monday, October 14th, 2024 at 3:10pm

# Liked by Michael Gearon on Monday, October 14th, 2024 at 3:29pm

# Liked by rem on Monday, October 14th, 2024 at 4:14pm

# Liked by Ms. Jen on Monday, October 14th, 2024 at 4:29pm

# Liked by Ms. Jen on Monday, October 14th, 2024 at 5:22pm

# Liked by Andy Davies on Monday, October 14th, 2024 at 6:21pm

# Liked by Thomas Vander Wal on Monday, October 14th, 2024 at 7:56pm

# Liked by Adam Perfect on Monday, October 14th, 2024 at 11:23pm

Related posts

Preventing automated sign-ups

Here’s a bit of PHP I’m using on The Session.

My approach to HTML web components

Naming custom elements, naming attributes, the single responsibility principle, and communicating across components.

Pickin’ dates

HTML web components for augmenting date inputs.

Progressive disclosure with HTML

The `details` element is like the TL;DR of markup.

Web Audio API update on iOS

The behaviour is more consistent now.

Related links

To remember, or to forget?

What are your own scribbles, your own ordinary plenty, not worth much to you now but that someone in the future may treasure?

Tagged with

Shining a Light on the Digital Dark Age - Long Now

A false sense of security persists surrounding digitized documents: because an infinite number of identical copies can be made of any original, most of us believe that our electronic files have an indefinite shelf life and unlimited retrieval opportunities. In fact, preserving the world’s online content is an increasing concern, particularly as file formats (and the hardware and software used to run them) become scarce, inaccessible, or antiquated, technologies evolve, and data decays. Without constant maintenance and management, most digital information will be lost in just a few decades. Our modern records are far from permanent.

Tagged with

Worse than LaserDiscs?

Kevin takes my eleven-year old remark literally and points out at least you can emulate LaserDiscs:

So LaserDiscs aren’t the worst things to archive, networks of servers running code that isn’t available or archivable are, and we are building a lot more of those these days, whether on the web or in apps.

Tagged with

A Long Bet on Link Rot is Resolved, but Questions About the Durability of the Web Still Remain - Long Now

The Long Now foundation has a write-up on my recently-lost long bet:

On February 22, 02011, Jeremy Keith made a prediction that he hoped would be proven wrong.

Tagged with

A Long Bet Pays Off - Internet Archive Blogs

The bet was been won (not by me, thankfully) and Jason has some thoughts.

Tagged with

Previously on this day

4 years ago I wrote Saving forms

A defensive enhancement to avoid losing everything you just typed into a textarea.

5 years ago I wrote Something for the weekend

Science Hack Day in San Francisco and Indie Web Camp in Brighton

10 years ago I wrote Celebrating CSS

Here’s to the next twenty years.

11 years ago I wrote Listen to dConstruct 2013

For your huffduffing pleasure.

15 years ago I wrote Optimisation

Optimise for ugly bags of mostly water, not your plastic pal that’s fun to be with.

17 years ago I wrote Semantic brevity

Make microformats work with your writing style.

23 years ago I wrote The roots of conflict

Here’s some more grist for the fundamentalist mill.

23 years ago I wrote Falwell-Robertson-Bin Laden Quiz

This is too perfect.