I've been kicking around an idea like this for a while. The train of thought that brought me to it was recognizing a distinct memory hierarchy in today's common distributed applications that parallels the one in your computer.

Your computer has memory banks that (as a rough first approximation) each get ~10x bigger, but with ~10x greater latency. The neat part, though, is that global consistency for writes happens with L1 and L2 working together, so pretty damn high up in the hierarchy. In contrast, distributed applications at best typically have a write-through cache, where global consistency happens all the way down at the actual data store. Exploring a distributed MOESI, where one client, because it had the permissions to write something in the first place, can then be the owner of that database row while it's still being flushed out, seems like a great basis for a distributed system that might not even need the dedicated datastore at all anymore, just a sea of clients participating in coherency and replication. Granted, this has oodles of consistency and availability problems that very well might kill the whole concept, like how MOESI would absolutely fall over when trying to hotplug CPUs at arbitrary times.
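To make that concrete, the per-row state machine I'm imagining is just MOESI's five states with "cache line" swapped for "row." A sketch (the state names are the protocol's; every other name here is made up):

    package sketch

    // Per-row coherence state for a hypothetical "distributed MOESI".
    type CoherenceState int

    const (
        Modified  CoherenceState = iota // only copy, dirty; this client serves it
        Owned                           // dirty but shared; this client answers reads
        Exclusive                       // clean, no other client holds it
        Shared                          // clean, other clients may hold it too
        Invalid                         // local copy must not be used
    )

    // onRemoteRead: what a client holding a row does when a peer asks to read it.
    func onRemoteRead(s CoherenceState) CoherenceState {
        switch s {
        case Modified:
            return Owned // keep serving the dirty copy instead of flushing first
        case Exclusive:
            return Shared
        default:
            return s
        }
    }

The Modified-to-Owned transition is the whole appeal: the writing client keeps serving its dirty copy to readers instead of flushing to a central store first.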
Love this. We're building a similar architecture at Splitgraph [0], which we refer to as a "Data Delivery Network." The basic idea is we implement a proxy that forwards queries to backend data (either live data sources via an FDW, or versioned snapshots via our Docker-inspired "layered querying").

Soon anyone will be able to connect data sources by adding their read-only credentials on the web (already possible via the CLI, but undocumented). The idea is to make exposing a database to the DDN as simple as exposing a website to a CDN.

We've designed all this to be multi-tenant and horizontally scalable, but we're not actually running it on a distributed network yet. Personally, I've followed Fly for a long time and always loved the angle you're taking. If any of you at Fly read this and want to potentially collaborate on a solution in this space, my email is in my profile.

[0] https://www.splitgraph.com
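Heavily simplified, the dispatch inside the proxy boils down to something like this. This is not our actual code; every type and name below is made up just to illustrate the "live source vs. versioned snapshot" split:

    package sketch

    type Row map[string]any

    // Source is anything that can answer a query: a live database reached
    // FDW-style, or a reader over a versioned snapshot.
    type Source interface {
        Query(sql string) ([]Row, error)
    }

    // DDN routes by whether the request is pinned to a snapshot tag.
    type DDN struct {
        live      Source            // proxied live data source
        snapshots map[string]Source // image tag -> layered snapshot reader
    }

    func (d *DDN) Query(imageTag, sql string) ([]Row, error) {
        if snap, ok := d.snapshots[imageTag]; ok {
            return snap.Query(sql) // versioned, "Docker-inspired" layered read
        }
        return d.live.Query(sql) // fall through to the live source
    }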
The way this is implemented, with the ability for an application server attached to a replica to say "error: this needs to perform a write - hey CDN, replay this request against the region with the database leader in it" is SO clever.
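For anyone who hasn't read the post: the trick is that the app catches Postgres's refusal to write on a replica and turns it into a replay. A rough sketch of the shape, assuming the fly-replay response header from Fly's docs and Postgres's SQLSTATE 25006 (read_only_sql_transaction) error; PRIMARY_REGION is a made-up env var:

    package sketch

    import (
        "errors"
        "net/http"
        "os"

        "github.com/jackc/pgx/v5/pgconn"
    )

    // handleWrite attempts a write locally; if we're attached to a read
    // replica, it asks the edge proxy to replay the original request in
    // the primary's region instead of serving an error to the client.
    func handleWrite(w http.ResponseWriter, r *http.Request, do func() error) {
        err := do()
        var pgErr *pgconn.PgError
        if errors.As(err, &pgErr) && pgErr.Code == "25006" { // read_only_sql_transaction
            w.Header().Set("fly-replay", "region="+os.Getenv("PRIMARY_REGION"))
            w.WriteHeader(http.StatusConflict)
            return
        }
        // ... normal response handling ...
    }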
I first read the title as "Globally Distributed Progress" and I was like, "if an article with this title made it to the front page of HN, it must be some interesting read."

I am now slightly disappointed.
Running a master in us-west and a read replica in Europe, what could go wrong...

My advice: don't do the things in this blog post. Keep the DB and the app in the same region; it will save you so many problems. If you can't, use a DB that was designed for this, like Spanner or Cockroach.
It reminds me of Yugabyte, which is Postgres compatible:

> YugabyteDB is the open source, distributed SQL database for global, internet-scale applications with low query latency and extreme resilience against failures. *

However, Fly is nicer from a developer's point of view, because it doesn't require you to learn a new query language; you can write good old Postgres.

* https://www.yugabyte.com/
> It is much, much faster to ship the whole HTTP request where it needs to be than to move the database away from an app instance.

Does that mean that the app server would just fail, and then your network replays the same HTTP request (made by the client to the edge) toward an instance running the app server inside the region where the Postgres primary instance is running? If so, you should then be able to figure out which client request provoked the SQL error, and the app server would report some errors, right? Otherwise, the app server should be able to tell you if it failed because the Postgres server does not support write statements, which would require some minor adjustment to the app server code, wouldn't it?

I am sorry if I missed something from your description, but I am trying to figure out what the flow would look like. Thank you!
This is a nice solution, although I prefer for my application to be aware of read vs. read-write connections.

Question about fly.io: how strict is the Docker image requirement? How easy would it be to deploy a Go application + systemd service?
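To be concrete about the first point, by "aware" I mean something like two pools routed by intent; the DSN env var names here are made up:

    package sketch

    import (
        "context"
        "os"

        "github.com/jackc/pgx/v5/pgxpool"
    )

    // DB holds separate pools so every call site has to choose between
    // the read and read-write paths explicitly.
    type DB struct {
        primary *pgxpool.Pool // writes, and reads that must see them
        replica *pgxpool.Pool // everything else
    }

    func open(ctx context.Context) (*DB, error) {
        p, err := pgxpool.New(ctx, os.Getenv("PRIMARY_DSN")) // hypothetical env vars
        if err != nil {
            return nil, err
        }
        r, err := pgxpool.New(ctx, os.Getenv("REPLICA_DSN"))
        if err != nil {
            return nil, err
        }
        return &DB{primary: p, replica: r}, nil
    }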
I know this is Hacker News and all and you're trying to be fun, but as I'm looking at platforms like Fly for a future SaaS, the first sentence ("cool hack") rubbed me the wrong way. I would rather hear "we found an elegant design and are working hard to make it bulletproof," etc.

Databases are like that; they're the place to get serious.

Sure, I won't let it deter me from a proper eval, but I could definitely see it scaring away suits.
I tried this a few months ago. Many features were missing then: I couldn't set up backups or connect directly with `psql`. It was such a pain. I don't know if they've addressed those issues yet. It'd be a hard sell for me to run a production app's database system out of a Docker image.
Also relevant: https://fly.io/docs/getting-started/multi-region-databases/
How are POST requests replayed into different regions? Does your edge proxy hold the whole POST request (body and all) just in case it needs to be replayed?
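I'd assume it has to buffer the body, at least up to some size cap; something shaped like this (purely a guess at the mechanism, not Fly's actual implementation):

    package sketch

    import (
        "bytes"
        "errors"
        "io"
        "net/http"
    )

    const maxReplayBody = 1 << 20 // e.g., a 1 MiB cap; bigger bodies can't be replayed

    // bufferForReplay reads the body into memory so the request could be
    // re-sent to another region, then restores it so the first upstream
    // attempt can still read it.
    func bufferForReplay(r *http.Request) ([]byte, error) {
        body, err := io.ReadAll(io.LimitReader(r.Body, maxReplayBody+1))
        if err != nil {
            return nil, err
        }
        if len(body) > maxReplayBody {
            return nil, errors.New("body too large to hold for replay")
        }
        r.Body = io.NopCloser(bytes.NewReader(body))
        return body, nil
    }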
Oh look, a distributed database with nary a mention of CAP.

And they are allowing multi-master writes? What is the update collision resolution: cell timestamps? Vector clocks?
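For reference, the vector-clock version of that question is easy to sketch: two versions conflict exactly when neither clock dominates the other. A bare-bones illustration (not anything Fly has said they do):

    package sketch

    // VClock maps a node ID to that node's logical event counter.
    type VClock map[string]uint64

    // Tick records a local event on node id.
    func (v VClock) Tick(id string) { v[id]++ }

    // Descends reports whether v happened after (or is equal to) o,
    // i.e. v's counters are all >= o's.
    func (v VClock) Descends(o VClock) bool {
        for id, n := range o {
            if v[id] < n {
                return false
            }
        }
        return true
    }

    // Concurrent means neither version dominates: a true write conflict
    // that needs application-level resolution.
    func Concurrent(a, b VClock) bool {
        return !a.Descends(b) && !b.Descends(a)
    }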