Out of curiosity, would cutting out the filesystem out of the equation entirely improve things further still? Just put the database on /dev/sda2 or a LV? Also, how much overhead does LVM have?
Sounds like we should all just be doing sqlite3 /dev/sda2<p>Filesystems and databases solve similar problems, so putting one on top of the other is a bit redundant.
Related. Others?<p><i>SQLite: 35% Faster Than the Filesystem</i> - <a href="https://news.ycombinator.com/item?id=41085376">https://news.ycombinator.com/item?id=41085376</a> - July 2024 (193 comments)<p><i>SQLite is 35% Faster Than The Filesystem (2017)</i> - <a href="https://news.ycombinator.com/item?id=27897427">https://news.ycombinator.com/item?id=27897427</a> - July 2021 (127 comments)<p><i>35% Faster Than The Filesystem (2017)</i> - <a href="https://news.ycombinator.com/item?id=20729930">https://news.ycombinator.com/item?id=20729930</a> - Aug 2019 (164 comments)<p><i>SQLite small blob storage: 35% Faster Than the Filesystem</i> - <a href="https://news.ycombinator.com/item?id=14550060">https://news.ycombinator.com/item?id=14550060</a> - June 2017 (202 comments)<p>Since the July 2024 had a lot of attention, the current repost counts as a dupe. Reposts are ok after a year or so—this is in the FAQ: <a href="https://news.ycombinator.com/newsfaq.html">https://news.ycombinator.com/newsfaq.html</a>.
Is anybody depending on this for mission critical data? Can I throw 1TB of images, pdfs, and miscellanea in there, delete the originals, and keep humming along in practical perpetuity? I would presume use lightstream to backup on top of other more general backup solutions, and use a few Attached Databases instead one monolithic for a bit more flexibility / safety?
I feel like some kind of compression is the way to go if you are interested in fast reads/writes. Depending on the algorithm used, you could probably decompress the data faster than it can be read off the disk. In an archive file, you would also have the same benefit of fewer read/write calls.
It's talking about comparing to reading the data from individual files. If you keep them in a single file and seek, the 35% no longer holds.<p>Read the post :)