It's really cool to see these techniques in the wild, and also feels encouraging to us as we're doing something very similar at Splitgraph [0] to implement our "Data Delivery Network" [1]. Recently we've started calling Splitgraph a "Data Mesh" [2]. As long as we have a plugin [3] for a data source, users can connect external data sources to Splitgraph and make them addressable alongside all the other data on the platform, including versioned snapshots of data called data images. [4] So you can `SELECT FROM namespace/repo:tag` where `tag` can refer to an immutable version of the data, or e.g. `live` to route to route to a live external data source via FDW. So far we have plugins for Snowflake, CSV in S3 buckets, MongoDB, ElasticSearch, Postgres, and a few others, like Socrata data portals (which we use to index 40k open public datasets).<p>Our goal with Splitgraph is to provide a single interface to query and discover data. Our product integrates the discovery layer (a data catalog) with the query layer (a Postgres compatible proxy to data sources, aka a "data mesh" or perhaps "data lake"). This way, we improve both the catalog and the access layer in ways that would be difficult or impossible as separate products. The catalog can index live data without "drift" problems. And since the query layer is a Postgres-compatible proxy, we can apply data governance rules at query time that the user defines in the web catalog (e.g. sharing data, access control, column masking, query whitelisting, rewriting, rate limiting, auditing, firewalling, etc.).<p>We like to use GitLab's strategy as an analogy. GitLab may not have the best CI, the best source control, the best Kubernetes deploy orchestration, but by integrating them all together in one platform, they have a multiplicative effect on the platform itself. We think the same logic can apply to the data stack. In our vision of the world, a "data mesh" integrated with a "data catalog" can augment or eventually replace various complicated ETL and warehousing workflows.<p>P.S. We're hiring immediately for all-remote Senior Software Engineer positions, frontend and backend [5].<p>P.P.S. We also have a private beta program where we can deploy a full Splitgraph stack onto either self-hosted or managed infrastructure. If you want that, get in touch. We'll probably be in beta for 12-18 months.<p>[0] <a href="https://www.splitgraph.com" rel="nofollow">https://www.splitgraph.com</a><p>[1] We talked about all this in depth on a podcast: <a href="https://softwareengineeringdaily.com/2020/11/06/splitgraph-d" rel="nofollow">https://softwareengineeringdaily.com/2020/11/06/splitgraph-d</a>...<p>[2] <a href="https://martinfowler.com/articles/data-monolith-to-mesh.html" rel="nofollow">https://martinfowler.com/articles/data-monolith-to-mesh.html</a><p>[3] <a href="https://www.splitgraph.com/blog/foreign-data-wrappers" rel="nofollow">https://www.splitgraph.com/blog/foreign-data-wrappers</a><p>[4] <a href="https://www.splitgraph.com/docs/concepts/images" rel="nofollow">https://www.splitgraph.com/docs/concepts/images</a><p>[5] Job posting: <a href="https://www.notion.so/splitgraph/Splitgraph-is-Hiring-25b421" rel="nofollow">https://www.notion.so/splitgraph/Splitgraph-is-Hiring-25b421</a>...