Holy crap, I am scared!<p>Please, please, please read the fine print and ensure you understand the design tradeoffs as well as your application's requirements before blindly using this.<p>The moment I heard multi-master I thought Paxos, Raft, or maybe virtual synchrony. Hmm, nothing in the documentation. Maybe a new consensus protocol was written from scratch then? That should be interesting!<p>No, none of that either - this implementation completely disregards consistency and makes write conflicts the developer's problem.<p>From <a href="http://bdr-project.org/docs/stable/weak-coupled-multimaster.html" rel="nofollow">http://bdr-project.org/docs/stable/weak-coupled-multimaster.html</a><p>* Applications using BDR are free to write to any node so long as they are careful to prevent or cope with conflicts<p>* There is no complex election of a new master if a node goes down or network problems arise. There is no wait for failover. Each node is always a master and always directly writeable.<p>* Applications can be partition-tolerant: the application can keep working even if it loses communication with some or all other nodes, then re-sync automatically when connectivity is restored. Loss of a critical VPN tunnel or WAN won't bring the entire store or satellite office to a halt.<p>Basically:<p>* Transactions are a lie<p>* Consistent reads are a lie<p>* Datasets will diverge during network partitioning<p>* Convergence is not guaranteed without a mechanism for resolving write conflicts<p>I am sure there are use-cases where the risk of this design is acceptable (or necessary), but ensure you have a plan for dealing with data inconsistencies!
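To make "cope with conflicts" concrete: one application-level pattern is to avoid UPDATE conflicts entirely by writing append-only rows and resolving "latest wins" at read time. A rough sketch in plain SQL (table and column names are made up; this is just one mitigation, not BDR's built-in behaviour, and it still misbehaves under clock skew):

    -- Hypothetical schema: each node appends immutable events instead of
    -- updating a shared row, so concurrent writers never touch the same tuple.
    CREATE TABLE account_balance_events (
        account_id  bigint      NOT NULL,
        node_name   text        NOT NULL,   -- which node accepted the write
        recorded_at timestamptz NOT NULL DEFAULT now(),
        balance     numeric     NOT NULL,
        PRIMARY KEY (account_id, node_name, recorded_at)
    );

    -- "Current" balance = most recent event, whichever node wrote it.
    SELECT DISTINCT ON (account_id) account_id, balance
    FROM account_balance_events
    ORDER BY account_id, recorded_at DESC;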
Some info from 2nd Quadrant on what BDR is: <a href="https://2ndquadrant.com/en/resources/bdr/" rel="nofollow">https://2ndquadrant.com/en/resources/bdr/</a><p>Bi-Directional Replication for PostgreSQL (Postgres-BDR, or BDR) is the first open source multi-master replication system for PostgreSQL to reach full production status, developed by 2ndQuadrant and assisted by a keen user community. BDR is specifically designed for use in geographically distributed clusters, using highly efficient asynchronous logical replication, supporting anything from 2 to more than 48 nodes in a distributed database.
While indeed very exciting, it's important to note that this makes the BDR extension from 2ndQuadrant compatible with stock Postgres. It does not mean BDR ships with core Postgres.<p>This continued improvement of the core code and extension APIs will make more and more extensions feasible, which means more can plug in and add value without having to be committed to core. That said, in time this is one that has a good chance of actually landing in core, much like pg_logical.
The title is misleading: the replication itself is not coming to stock PostgreSQL 9.6.<p>Rather, the patches a replication extension needs in order to run were merged into PostgreSQL 9.6, so you can use the extension without patching Postgres.
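For anyone wondering what "without patching Postgres" looks like in practice: the extension is enabled through ordinary configuration plus CREATE EXTENSION, roughly along these lines (illustrative values only; check the BDR docs for the exact requirements and supported versions):

    # postgresql.conf (illustrative; exact values depend on the BDR version)
    shared_preload_libraries = 'bdr'
    wal_level = 'logical'            # logical decoding, in core since 9.4
    track_commit_timestamp = on      # used for last-update-wins conflict handling
    max_wal_senders = 10
    max_replication_slots = 10

    -- then, in each participating database:
    CREATE EXTENSION btree_gist;
    CREATE EXTENSION bdr;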
As the developer who also manages the servers we deploy on, and not a full time PgDBA, things like multi-master replication scare the hell out of me. They really make me worry about what happens after downtime. And latency.<p>Could anyone here recommend good reading material for scaling out your first database onto multiple servers? How do I know which scheme is the best for me?
Some more information for anyone trying to understand this better: <a href="http://bdr-project.org/docs/stable/overview.html" rel="nofollow">http://bdr-project.org/docs/stable/overview.html</a><p>Source code: <a href="https://github.com/2ndQuadrant/bdr" rel="nofollow">https://github.com/2ndQuadrant/bdr</a>
Here is a rationale for multi-master from James Hamilton's "On Designing and Deploying Internet-Scale Services".<p><i>Designing for automation, however, involves significant service-model constraints. For example, some of the large services today depend upon database systems with asynchronous replication to a secondary, back-up server. Failing over to the secondary after the primary isn't able to service requests loses some customer data due to replicating asynchronously. However, not failing over to the secondary leads to service downtime for those users whose data is stored on the failed database server. Automating the decision to fail over is hard in this case since it's dependent upon human judgment and accurately estimating the amount of data loss compared to the likely length of the down time. A system designed for automation pays the latency and throughput cost of synchronous replication. And, having done that, failover becomes a simple decision: if the primary is down, route requests to the secondary. This approach is much more amenable to automation and is considerably less error prone.</i>
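For contrast, this is roughly what paying that synchronous cost looks like in stock single-master Postgres (ordinary streaming replication, not BDR); once it's in place, failover really is the simple decision Hamilton describes, because no acknowledged commit exists only on the failed primary:

    # postgresql.conf on the primary (stock streaming replication)
    synchronous_standby_names = 'standby1'  # must match the standby's application_name
    synchronous_commit = on                 # each commit waits for the standby to confirm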
I look forward to when this lands in PostgreSQL 9.7 without the need for an extension. Even more so when I can also include the CitusDB extension alongside it.<p>Running CitusDB with just 1 master made me nervous. They did talk about having multi-master replication as a belt and braces solution, but I don't know how far they got.<p>Thinking about it, might using both together give you a fully fault-tolerant solution?
Please make sure you understand log replication and go through fire drills for the list of things that can go wrong with bi-directional replication. The last thing you'll want to do is deploy this into production and wing operations as you go.
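Two checks worth baking into those fire drills, since a peer that stops replicating will quietly pin WAL on disk through its replication slot (9.6 catalog and function names shown; run on the sending side, and adjust for your setup):

    -- How far behind is each connected peer?
    SELECT application_name, state,
           pg_xlog_location_diff(pg_current_xlog_location(), replay_location)
               AS replay_lag_bytes
    FROM pg_stat_replication;

    -- Is any slot inactive and silently retaining WAL?
    SELECT slot_name, active,
           pg_xlog_location_diff(pg_current_xlog_location(), restart_lsn)
               AS retained_wal_bytes
    FROM pg_replication_slots;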
BDR is a nice building block towards multi-master PostgreSQL. I'm also looking forward to parallel aggregates being added to core in 9.6. Using the agg[0] extension for something as core as using more than one core per (aggregate function) query felt strange. (I wonder if the time has come to decouple connections from processes/threads in Postgres, as well...)<p>[0]: <a href="http://www.cybertec.at/en/products/agg-parallel-aggregations-postgresql/" rel="nofollow">http://www.cybertec.at/en/products/agg-parallel-aggregations-postgresql/</a>
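For reference, with parallel aggregation in core the 9.6 experience is just a GUC and a plan check, something like this (table and column names are made up, and the planner only goes parallel if the table is large enough to justify it):

    -- Allow up to 4 workers per Gather node for this session.
    SET max_parallel_workers_per_gather = 4;

    EXPLAIN (COSTS OFF)
    SELECT count(*), avg(amount) FROM payments;
    -- A parallel plan shows Finalize Aggregate -> Gather -> Partial Aggregate.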
Do all nodes need to be up 100% of the time? If not, how long can a node be down without replicating (perhaps because a server is under maintenance)?<p>Does BDR have rules for primary key insertion conflicts?
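On the primary-key question: independent of whatever BDR provides built in, a widely used way to prevent insert conflicts is to give each node a disjoint id stream via sequence offsets, e.g. (the node numbering and table are just assumptions for illustration):

    -- On node 1:
    CREATE SEQUENCE orders_id_seq START 1 INCREMENT 10;
    -- On node 2:
    CREATE SEQUENCE orders_id_seq START 2 INCREMENT 10;

    -- Same table definition everywhere; concurrent inserts can't collide on id.
    CREATE TABLE orders (
        id      bigint PRIMARY KEY DEFAULT nextval('orders_id_seq'),
        payload text
    );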
I have a (perhaps odd) situation where identical data is already being written to multiple servers. Currently handling it with a custom replication mechanism.