Train coding

October 14th, 2024

When I went up to London for the State of the Browser conference last month, I shared the train journey with Remy.

I always like getting together with Remy. We usually end up discussing sci-fi books we’re reading, commiserating with one another about conference-organising, discussing the minutiae of browser APIs, or talking about the big-picture vision of the World Wide Web.

On this train ride we ended up talking about the march of time and how death comes for us all …and our websites.

Take The Session, for example. It’s been running for two and a half decades in one form or another. I plan to keep it running for many more decades to come. But I’m the weak link in that plan.

If I get hit by a bus tomorrow, The Session will keep running. The hosting is paid up for a while. The domain name is registered for as long as possible. But inevitably things will need to be updated. Even if no new features get added to the site, someone’s got to install updates to keep the underlying software safe and secure.

Remy and I discussed the long-term prospects for widening out the admin work to more people. But we also discussed smaller steps I could take in the meantime.

Like, there’s the actual content of the website. Now, I currently share exports from the database every week in JSON, CSV, and SQLite. That’s good. But you need to be a tech nerd to do anything useful with that data.

The more I talked about it with Remy, the more I realised that HTML would be the most useful format for the most people.

There’s a cute acronym in the world of digital preservation: LOCKSS. Lots Of Copies Keep Stuff Safe. If there were multiple copies of The Session’s content out there in the world, then I’d have a nice little insurance policy against some future catastrophe befalling the live site.

With the seed of the idea planted in my head, I waited until I had some time to dive in and see if this was doable.

Fortunately I had plenty of opportunity to do just that on some other train rides. When I was in Spain and France recently, I spent hours and hours on trains. For some reason, I find train journeys very conducive to coding, especially if you don’t need an internet connection.

By the time I was back home, the code was done. Here’s the result:

The Session archive: a static copy of the content on thesession.org.

If you want to grab a copy for yourself, go ahead and download this .zip file. Be warned that it’s quite large! The .zip file is over two gigabytes in size and the unzipped collection of web pages is almost ten gigabytes. I plan to update the content every week or so.

I’ve put a copy up on Netlify and I’m serving it from the subdomain archive.thesession.org if you want to check out the results without downloading the whole thing.

Because this is a collection of static files, there’s no search. But you can use your browser’s “Find in Page” feature to search within the (very long) index pages of each section of the site.

You don’t need a web server to click around between the pages: they should all work straight from your file system. Double-clicking any HTML file should give you a starting point.
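For pages to work straight from the file system, links between them generally have to be relative rather than root-relative. Here is a minimal Python sketch of that idea. It is an illustration only, not the code behind the archive: the function name `relative_link` and the folder layout in the example paths are assumptions.

```python
import posixpath


def relative_link(from_page: str, to_page: str) -> str:
    """Return a link to to_page that works relative to from_page's folder.

    Both arguments are paths from the root of the archive, for example
    "/tunes/1/index.html". This layout is hypothetical.
    """
    from_dir = posixpath.dirname(from_page)
    return posixpath.relpath(to_page, start=from_dir)


# A page in one section linking to a page in another section.
cross_section = relative_link("/sessions/2/index.html", "/tunes/1/index.html")
print(cross_section)  # ../../tunes/1/index.html

# A section index linking to a page below it.
same_section = relative_link("/tunes/index.html", "/tunes/1/index.html")
print(same_section)  # 1/index.html
```

Using `posixpath` rather than `os.path` keeps the forward slashes that URLs expect, whatever operating system the script runs on.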

I wanted to reduce the dependencies on each page to as close to zero as I could. All the CSS is embedded in the page. Likewise with most of the JavaScript (you’ll still need an internet connection to get audio playback and dynamic maps). This keeps the individual pages nice and self-contained. That means they can be shared around (as an email attachment, for example).
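To make the embedding idea concrete, here is a rough Python sketch that swaps a stylesheet link for an inline style block. It is not the actual build code for the archive: the function name `inline_stylesheets`, the `/css/global.css` path, and the sample CSS are all made up for illustration, and the regular expression assumes a very regular `<link>` tag.

```python
import re

# Matches a <link> tag whose rel is "stylesheet" and captures its href.
# Assumes rel comes before href and both use double quotes; a real
# build script would be better served by a proper HTML parser.
LINK_PATTERN = re.compile(
    r'<link\s+rel="stylesheet"\s+href="([^"]+)"\s*/?>',
    re.IGNORECASE,
)


def inline_stylesheets(html: str, stylesheets: dict[str, str]) -> str:
    """Replace each stylesheet <link> with a <style> block holding its CSS.

    stylesheets maps an href to the CSS text for that file.
    """
    def replace(match: re.Match) -> str:
        css = stylesheets.get(match.group(1))
        if css is None:
            # Unknown stylesheet: leave the original tag untouched.
            return match.group(0)
        return f"<style>\n{css}\n</style>"

    return LINK_PATTERN.sub(replace, html)


# Hypothetical page and stylesheet.
page = '<head><link rel="stylesheet" href="/css/global.css"></head>'
result = inline_stylesheets(page, {"/css/global.css": "body { margin: 0; }"})
print(result)
```

The trade-off is that every page carries its own copy of the styles, which makes the whole collection bigger but lets any single file stand alone.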

I’ve shared this project with the community on The Session and people are into it. If nothing else, it could be handy to have an offline copy of the site’s content on your hard drive for those situations when you can’t access the site itself.

Responses

Tom Morris

@adactio I’d suggest periodically uploading copies to the Internet Archive… but *sigh* maybe wait a bit

Simon Willison

@adactio I like using GitHub for this kind of thing because they feel more likely to exist long into perpetuity than most other hosting providers - and I believe they back everything up to three different continents (and occasionally might bury some of it in the arctic seed vault too)

Simon Willison

@adactio plus JSON in GitHub means you can open the files in Datasette Lite - here are the three smaller tables (large ones should work too but take a while to load in the browser) https://lite.datasette.io/?json=https://github.com/adactio/TheSession-data/blob/main/json/aliases.json%20&json=https://github.com/adactio/TheSession-data/blob/main/json/events.json%20&json=https://github.com/adactio/TheSession-data/blob/main/json/sessions.json#/data

rem

@adactio superb effort 💪

Related links

Century-Scale Storage

This magnificent piece by Maxwell Neely-Cohen—with some tasteful art-direction—is right up my alley!

This piece looks at a single question. If you, right now, had the goal of digitally storing something for 100 years, how should you even begin to think about making that happen? How should the bits in your stewardship be stored with such a target in mind? How do our methods and platforms look when considered under the harsh unknowns of a century? There are plenty of worthy related subjects and discourses that this piece does not touch at all. This is not a piece about the sheer volume of data we are creating each day, and how we might store all of it. Nor is it a piece about the extremely tough curatorial process of deciding what is and isn’t worth preserving and storing. It is about longevity, about the potential methods of preserving what we make for future generations, about how we make bits endure. If you had to store something for 100 years, how would you do it? That’s it.

Sunday, December 15th, 2024 11:27am

Tagged with archives digital preservation longevity century storage formats materials access software hardware digital analogue

To remember, or to forget?

What are your own scribbles, your own ordinary plenty, not worth much to you now but that someone in the future may treasure?

Tuesday, September 17th, 2024 7:42am

Tagged with digital preservation archives longevity memory deletion forgetting remembering value

Shining a Light on the Digital Dark Age - Long Now

A false sense of security persists surrounding digitized documents: because an infinite number of identical copies can be made of any original, most of us believe that our electronic files have an indefinite shelf life and unlimited retrieval opportunities. In fact, preserving the world’s online content is an increasing concern, particularly as file formats (and the hardware and software used to run them) become scarce, inaccessible, or antiquated, technologies evolve, and data decays. Without constant maintenance and management, most digital information will be lost in just a few decades. Our modern records are far from permanent.

Friday, September 1st, 2023 5:28pm

Tagged with digital preservation archives longnow archiving storage formats

Worse than LaserDiscs?

Kevin takes my eleven-year-old remark literally and points out that at least you can emulate LaserDiscs:

So LaserDiscs aren’t the worst things to archive, networks of servers running code that isn’t available or archivable are, and we are building a lot more of those these days, whether on the web or in apps.

Thursday, September 8th, 2022 7:50am

Tagged with web native digital preservation longevity laserdiscs formats apps emulation archives

A Long Bet on Link Rot is Resolved, but Questions About the Durability of the Web Still Remain - Long Now

The Long Now foundation has a write-up on my recently-lost long bet:

On February 22, 02011, Jeremy Keith made a prediction that he hoped would be proven wrong.

Monday, February 28th, 2022 5:25pm

Tagged with longbets urls longevity bitrot digital preservation future past internet archive longnow

Previously on this day

4 years ago I wrote Saving forms

A defensive enhancement to avoid losing everything you just typed into a textarea.

5 years ago I wrote Something for the weekend

Science Hack Day in San Francisco and Indie Web Camp in Brighton

10 years ago I wrote Celebrating CSS

Here’s to the next twenty years.

11 years ago I wrote Listen to dConstruct 2013

For your huffduffing pleasure.

15 years ago I wrote Optimisation

Optimise for ugly bags of mostly water, not your plastic pal that’s fun to be with.

17 years ago I wrote Semantic brevity

Make microformats work with your writing style.

23 years ago I wrote The roots of conflict

Here’s some more grist for the fundamentalist mill.

23 years ago I wrote Falwell-Robertson-Bin Laden Quiz

This is too perfect.

Related posts

Preventing automated sign-ups

My approach to HTML web components

Pickin’ dates

Progressive disclosure with HTML

Web Audio API update on iOS
