There are flat-file CMSes (content management systems) like Grav: <a href="https://getgrav.org/" rel="nofollow">https://getgrav.org/</a><p>Ironically, old-fashioned desktop office apps (Microsoft's or otherwise) save their outputs in a relatively parseable, self-contained XML format, but that's so rare to see in today's world when everything is like fifty JSONs and YAMLs and build artifacts spread across disparate systems and CDNs and such.<p>There are some interesting file formats that encode temporal elements alongside content, like video subtitles (SRT), chess game recordings (PGN), some simpler video game save files, etc. They're different from binary video/audio blobs in that they're entirely human-readable too.<p>In code, there are some rules engines that attempt to declaratively encode logic into flat JSON instead of conditional operators: <a href="https://github.com/CacheControl/json-rules-engine">https://github.com/CacheControl/json-rules-engine</a><p>I guess, in some vague/broad sense, maybe config-as-code systems also implement a similar philosophy? Maybe even OpenAPI schemas could count to some degree...?<p>In the old days, the "semantic web" movement was an attempt to make more webpages both human- and machine-readable indefinitely by tagging them with proper schema: <a href="https://en.wikipedia.org/wiki/Resource_Description_Framework" rel="nofollow">https://en.wikipedia.org/wiki/Resource_Description_Framework</a>. Even Google was on board for a while, but I guess it never saw much uptake. As far as I can tell it's basically dead now, both because of non-semantic HTML (everything as a React div), general laziness, and LLMs being able to parse things loosely.<p>Maybe a continuation of that effort is the Wikidata movement, which is trying to add schema and semantic data to other MediaWiki projects whenever they can: <a href="https://www.wikidata.org/wiki/Wikidata:Introduction" rel="nofollow">https://www.wikidata.org/wiki/Wikidata:Introduction</a><p>-------------<p>Side thoughts...<p>Philosophically, I don't know that capturing raw data alone as files is really sufficient to capture the nuances of any particular experience, or the overall zeitgeist of an era. You can archive Geocities pages, but that doesn't really capture the novelty and indie-ness of that era. Similarly, you can save TikTok videos, but absent the cultural environment that created them (and a faithful recreation of the recommendation algorithm), they wouldn't really show future archaeologists how teenagers today lived.<p>I worked for a natural history museum for a while, and one of the interesting side questions (well, to me anyway) was whether our web content was in and of itself worth preserving as a cultural artifact -- both so that future generations can see what exhibits were interesting/apropos for the cultures of our times, but also so they could see how our generation found out about those exhibitions to begin with (who knows what the Web will morph into 50 years later). It wasn't enough to simply save the HTML of our web pages, both because they tie into various other APIs and databases (like zoological collections) and because some were interactive experiences, like games designed to be played with a mouse (before phones were popular), or phone chatbots with some of our specimens. To really capture the experience authentically would've required emulating not just files but also our tech stacks and devices, among other things. And that doesn't even begin to address the cultural aspects. (Ultimately, this was important to nobody else, and it all just disappeared forever, lol.)<p>Like for the earlier Geocities example, sure you could just save the old HTML and render it with a modern browser, but that's not the same as something like <a href="https://oldweb.today/?browser=ns3-mac#http://geocities.com/" rel="nofollow">https://oldweb.today/?browser=ns3-mac#http://geocities.com/</a> , which emulates the whole OS and browser too (<a href="https://github.com/oldweb-today/oldweb-today?tab=readme-ov-file#how-it-works">https://github.com/oldweb-today/oldweb-today?tab=readme-ov-f...</a>). And that still isn't the same as having to sit in front of a tiny CRT and wait minutes for the 386 to boot and everything to download over a 14.4k modem, only to be interrupted when mom had to make a call.<p>I guess that's a longwinded way of critiquing "file over app": It only makes sense for things that are originally files/documents to begin with. Much of our lives now are not flat docs but "experiences" that take much more thought and effort to archive. If the goal is truly to preserve those for posterity, it's not enough to just archive their raw data, but to also develop ways to record and later emulate entire experiences, both technological and cultural. It ain't easy!