Recently minted database technologies that I find intriguing

473 pointsby biggestloualmost 5 years ago

30 comments

petercooperalmost 5 years ago

I edit a database newsletter – <a href="https://dbweekly.com/" rel="nofollow">https://dbweekly.com/</a> – so tend to always have my eyes out for new releases, what's coming along, and what not. And I thought I'd share a few more things that have jumped out at me recently in case anyone's in the mood for spelunking.1. QuestDB – <a href="https://questdb.io/" rel="nofollow">https://questdb.io/</a> – is a performance-focused, open-source time-series database that uses SQL. It makes heavy use of SIMD and vectorization for the performance end of things.2. GridDB - <a href="https://griddb.net/en/" rel="nofollow">https://griddb.net/en/</a> - is an in-memory NoSQL time-series database (there's a theme lately with these!) out of Toshiba that was boasting doing 5m writes per second and 60m reads per second on a 20 node cluster recently.3. MeiliSearch - <a href="https://github.com/meilisearch/MeiliSearch" rel="nofollow">https://github.com/meilisearch/MeiliSearch</a> – not exactly a database but basically an Elastic-esque search server written in Rust. Seems to have really taken off.4. Dolt – <a href="https://github.com/liquidata-inc/dolt" rel="nofollow">https://github.com/liquidata-inc/dolt</a> – bills itself as a 'Git for data'. It's relational, speaks SQL, but has version control on everything.TerminusDB, KVRocks, and ImmuDB also get honorable mentions.InfoWorld also had an article recently about 9 'offbeat' databases to check out if you want to go even further: <a href="https://www.infoworld.com/article/3533410/9-offbeat-databases-worth-a-look.html" rel="nofollow">https://www.infoworld.com/article/3533410/9-offbeat-database...</a>Exciting times in the database space!

评论 #23534176 未加载

评论 #23533223 未加载

评论 #23532974 未加载

评论 #23533217 未加载

评论 #23534816 未加载

评论 #23536714 未加载

评论 #23533982 未加载

评论 #23532691 未加载

评论 #23537165 未加载

评论 #23532688 未加载

评论 #23536102 未加载

评论 #23535957 未加载

评论 #23536311 未加载

chickenpotpiealmost 5 years ago

I’ve always felt it strange that in almost every job I’ve had databases have been one of the most important pieces of the architecture but the least debated. I’ve spent hours debating languages and frameworks, but databases always come down to whatever we have a license for/what others at the company are using. Engineering teams will always say they make sure to use the right tool for the job, but no one ever talks about if it’s right to keep using the same database for a new product.

评论 #23533123 未加载

评论 #23534960 未加载

评论 #23536456 未加载

评论 #23538558 未加载

rstalmost 5 years ago

Materialize is neat, but there are other database systems that refresh at least some materialized views on the fly, while being smart about not rebuilding the entire view every time. See for example, Oracle, where FAST REFRESH ON COMMIT does most of what Materialize is advertised as doing, at least for views which that feature can support (restriction list here: <a href="https://stackoverflow.com/questions/49578932/materialized-view-in-oracle-with-fast-refresh-instead-of-complete-dosnt-work" rel="nofollow">https://stackoverflow.com/questions/49578932/materialized-vi...</a> ). Mind you, this comes with Oracle's extremely hefty price tag, so I'm not sure I'd recommend it to anyone who isn't already stuck with Oracle, but it is technical precedent.It would be interesting to compare notes, and see what Materialize does better.

评论 #23533076 未加载

评论 #23533177 未加载

评论 #23532595 未加载

评论 #23533180 未加载

评论 #23532641 未加载

评论 #23532504 未加载

评论 #23532413 未加载

评论 #23532536 未加载

sudhirjalmost 5 years ago

I'm working on Redis adapter for DynamoDB - Dynamo is really a distributed superset of Redis, and most of the data structures that Redis has scale effectively to the distributed hash table + B-Tree-like system that Dynamo offers. Having a well known and understood API like Redis is a boon for Dynamo, whose API is much more low level and esoteric.The Go library is in beta, working on a server that's wire compatible with Redis.<a href="https://dbproject.red" rel="nofollow">https://dbproject.red</a><a href="https://github.com/dbProjectRED/redimo.go" rel="nofollow">https://github.com/dbProjectRED/redimo.go</a>

评论 #23540384 未加载

barkingalmost 5 years ago

I really wish one of the existing db technologies, Firebird, got a shot in the arm. It has both embedded and server modes which makes it unique as far as I know. Also the database is a single file which with firebirds "careful write" methodology remains consistent at all times so while you can make a backup at any time because it has MVCC, even a file copy of the database file with open transactions should not be corrupted. The installer size comes in under 10 MB. It's being actively improved, is open source with a very liberal licence but sadly it only gets a tiny fraction of the attention that SQLite, postgres etc receive

willvarfaralmost 5 years ago

My understanding of TileDB is that it is 100% client-side. There is no server. In a sense it’s like handling orc or paraquet or even SQLite files on S3, (except tiledb are fancy r-trees) with a delta-lake-like manifest file for transactions too.I think in the future there’s going to be a sine-wave of smart-clients consuming S3 cleverly, and then smartness growing in the S3 interface so constraints and indices and things happen in the storage again, and back and forth...

评论 #23533747 未加载

评论 #23532891 未加载

评论 #23532686 未加载

slifinalmost 5 years ago

I do wonder how databases like Datomic, and Crux are perceived (if at all) in the wider database community

评论 #23534681 未加载

matlinalmost 5 years ago

I support FoundationDB's approach to databases which is basically provide a consistent, distributed, and ordered Key-Value store then you can build whatever type of database you need on top of it whether that's RDBMS, Document, Graph, etc.With that said, CouchDB 4.0 (on FDB) is going to be killer. Master-Master replication between clients and server with PouchDB is phenomenal when you remove the complicated eventual consistency on the server side.And as a plug, I'm building a multi-tenant/multi-application database on top of it.

andrewstuartalmost 5 years ago

I've found databases fascinating and tried various DB's as they come out.I always find some issue or caveat or problem and I decide in the end that Postgres gets most of the way there anyway and I return to Postgres.Whenever I get tempted by a shiny new database I remind myself "don't bet against Postgres".

评论 #23533498 未加载

评论 #23535565 未加载

grizzlesalmost 5 years ago

Software is advancing so fast. Interesting to constantly reconsider the things I consider myself ahead of the curve on vs behind the curve on. Prisma looks great so I've updated my I want functional dbs, not ORMs post: <a href="https://github.com/ericbets/erics-designs/blob/master/funcdb.md" rel="nofollow">https://github.com/ericbets/erics-designs/blob/master/funcdb...</a>

评论 #23534798 未加载

pachicoalmost 5 years ago

> What I have yet to see but always secretly wanted, however, is a database that natively supports incremental updates to materialized views. Yep, that’s right: Materialize listens for changes in the data sources that you specify and updates your views as those sources change.This is precisely one of the features that make ClickHouse shine

mywacadayalmost 5 years ago

Does anybody know of a good educational resource on software/best practices that is kept up to date. Ideally something that does not include the latest bleeding edge but things that are battle hardened or getting there. Something that includes open source and commercial software would be ideal.

StavrosKalmost 5 years ago

This is only tangentially related, but I rediscovered an old project of mine from years ago today and am rather excited about it:<a href="https://github.com/skorokithakis/goatfish" rel="nofollow">https://github.com/skorokithakis/goatfish</a>It's basically a 200 line document database in Python that's backed by SQLite. I need to store a bunch of scraping data from a script but don't want a huge database or the hassle of making SQLite tables.Goatfish is perfect because it stores free-form objects (JSON, basically), only it lets you index by arbitrary keys in them and get fast queries that way.It's pretty simple, but a very nice compromise between the simplicity of in-memory dicts and the reliability of SQLite.

评论 #23534022 未加载

statictypealmost 5 years ago

>What I’m really hoping for is the emergence of extremely “hackable,” resolutely non-monolithic DBs that provide a plugin interface for highly use-case-specific data types,Isn't this basically what FoundationDB is?

xchaoticalmost 5 years ago

Interesting notes but I feel like the db itself has been commoditised and the battle is elsewhere now. So anyone building a database engine today, will find out that to make it sustainable they also need an ecosystem on top of it, tooling, community, paid support, active devs, consultants (for which they may have no runway) Finally I find anything that calls itself a database and uses S3 as a backend a bit ridiculous. S3 has eventual consistency so you can’t do the operations that differentiate a database from a file system.

hn_checkalmost 5 years ago

"What I have yet to see but always secretly wanted, however, is a database that natively supports incremental updates to materialized views"SQL Server ala 10+ years ago enters the discussion.

评论 #23536813 未加载

评论 #23536930 未加载

评论 #23533963 未加载

评论 #23535542 未加载

moonchildalmost 5 years ago

> And it’s worth noting that many have tried to do what Prisma does and failed because they.Because they what?

评论 #23533007 未加载

leetroutalmost 5 years ago

I miss rethinkdb. I loved their approach and their tooling.

评论 #23534606 未加载

tehlikealmost 5 years ago

Mandatory mention: RavenDB. Probably the only LINQ native database with lots of performance optimizations squeezed in.

remorsesalmost 5 years ago

I think that the graphql adapters like hasura and goke are also an important innovation, for small mvp projects you can create a graphql api to query your database directly from the frontend, this reduces the development time by a factor of 2 at least.

评论 #23534501 未加载

imglorpalmost 5 years ago

I'll just throw a note for a new product, AWS's QLDB. It's an internal managed product that combines a replicated, immutable, versioned document database with ACID transactions and an immutable, provable history of every modification. There's some streaming and subset SQL on the back end.Something this focused should have a few applications where bit level auditability matters, eg financial, chain of events, etc. Of course it comes with some tradeoffs vs a relational or kv db.I wonder if there would be room for a self-hosted clone?

limaalmost 5 years ago

ClickHouse also supports incremental streaming from Kafka into a materialized view.You can even detach and reattach the view from its backing table.

coolleoalmost 5 years ago

What is the best source to create your own database, just for learning purposes ?Great resources will be appreciated.Thank you.

评论 #23534783 未加载

arauhalaalmost 5 years ago

Hi Luc,What's your perspective on predictive databases like <a href="https://aito.ai" rel="nofollow">https://aito.ai</a>?I'm one of the Aito.ai founders. If you would like to hear more, I'm happy to talk one-to-one.Regards, Antti

monksyalmost 5 years ago

I love these kinds of posts. They're targeted towards what people are finding interesting and they're highly tech related. It's a great way to find new technology.

melvinroestalmost 5 years ago

The website didn't load for me. So here it is: <a href="https://web.archive.org/web/20200615193041/https://lucperkins.dev/blog/new-db-tech-1/" rel="nofollow">https://web.archive.org/web/20200615193041/https://lucperkin...</a>Also, I'd like to add one database to the list (I work there for 3 weeks now): TriplyDB [0]. It is making linked data easier.Linked data is useful for when people of different organizations want a shared schema.In many commercial applications one wouldn't want this, as data is the valuable part of a company. However, scientific communities, certain government agencies and other organizations -- that I don't yet know about -- do want this.I think the coolest application of linked data is how the bio-informatics/biology community utilizes it [1, 2]. The reason I found out at all is because one person at Triply works to see if a similar thing can be achieved with psychology. It might make conducting meta-studies a bit easier.I read the HN discussions on linked data and agree with both the nay sayers (it's awkward and too idealistic [4]) and the yay sayers (it's awesome). The thing is:1. Linked data open, open as in open source, the URI [3] is baked into its design.2. While the 'API'/triple/RDF format can be awkward, anyone can quite easily understand it. The cool thing is: this includes non-programmers.3. It's geared towards collaboration. In fact, when reading between the lines, I'd argue it's really good for collaboration between a big heterogeneous group of people.Disclaimer: this is my own opinion, Triply does not know I'm posting this and I don't care ;-) I simply think it's an interesting way of thinking about data.[0] triply.cc[1] A friend of mine once modeled some biochemistry part of C. Elegans from linked data into petrinets: <a href="https://www.researchgate.net/publication/263520722_Building_Executable_Biological_Pathway_Models_Automatically_from_BioPAX" rel="nofollow">https://www.researchgate.net/publication/263520722_Building_...</a>[2] <a href="https://www.google.com/search?client=safari&rls=en&q=linked+data+and+biology&ie=UTF-8&oe=UTF-8" rel="nofollow">https://www.google.com/search?client=safari&rls=en&q=linked+...</a> -- I quickly vetted this search[3] I still don't know the difference between a URI and URL.[4] I think back in the day, linked data idealists would say that all data should be linked to interconnect all the knowledge. I'm more pragmatic and simply wonder: in which socio-technological context is linked data simply more useful than other formats? My current very tentative answer is those 3 points.

评论 #23532742 未加载

tourist_on_roadalmost 5 years ago

How does tiledb compares to something similar like milvus

评论 #23532135 未加载

gigatexalalmost 5 years ago

whatever became of rethinkDB -- i think it was just too far ahead of its time it; had some really interesting ideas

davedxalmost 5 years ago

Is tiledb useful for storing ML models?

评论 #23532666 未加载

biggestloualmost 5 years ago

REMOVED: I complained about the rewritten title in a way that was excessively harsh and have removed that comment.

评论 #23532864 未加载

评论 #23532877 未加载