How to Scale PostgreSQL on AWS: Learnings from Citus Cloud

177 points by twakefield about 8 years ago

14 comments

simonw about 8 years ago

Citus are doing a fantastic job on content marketing. Every single piece they publish on https://www.citusdata.com/blog/ is a case study in how to write content (and headlines) that appeals to the kinds of developers their product targets.

"How to Scale PostgreSQL on AWS: Learnings from Citus Cloud" - seriously, how am I as a PostgreSQL-liking developer who cares about scalability NOT going to click through to that article?


kornish about 8 years ago

Citus Cloud is perhaps most exciting to me because it has tremendous momentum: as the combined product of deep technical expertise meeting top-flight open source software meeting tons of end user experience, it's quickly outpacing platforms which are locked-in anachronisms. Take Redshift: Postgres 8.4? After you've used some of the features in 9.6, it's hard to go back. It'd be interesting to see some numbers around Citus Cloud's battle-tested deployments.

As a side note, these blog posts on high-level techniques and open source tools (e.g. PgBouncer, wal-e) are useful for anyone considering deploying an on-prem version of Citus as part of a product – thanks, Ozgun!

Usual disclaimers apply: not an employee, but big fan of the team and technology and it's great to see them gaining well-deserved mindshare.

pjungwir about 8 years ago

I saw the section on EBS, but it didn't offer much advice. Getting good performance on networked storage is the biggest challenge to me. The last time I asked about that here [1], I got this answer:

    nasalgoat 161 days ago [-]
    The secret to EBS is to use General SSD, not Provisioned, but use a RAID stripe. The reason this works is because IOPS are provisioned per EBS drive and by the size of the drive. So a RAID0 stripe of, say, ten General SSD drives will outperform the more expensive PIOPS single drive.

That sounds like a great approach, although I haven't had time to try it out yet. I'm curious if anyone else has done anything like that.

[1] https://news.ycombinator.com/item?id=12609172
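A rough way to sanity-check the striping suggestion quoted above is to do the arithmetic. The sketch below assumes General Purpose SSD volumes whose baseline IOPS scale with size (3 IOPS per GiB, a 100 IOPS floor, and a per-volume cap). The cap is a parameter because it has changed over the years; 16,000 is only an assumed default, and the function names are made up for illustration. Burst credits and instance-level EBS limits are ignored, so this is arithmetic, not a benchmark.

```python
def gp2_baseline_iops(size_gib: int, iops_per_gib: int = 3,
                      floor: int = 100, cap: int = 16_000) -> int:
    """Baseline IOPS of one size-scaled SSD volume (assumed model)."""
    return max(floor, min(cap, size_gib * iops_per_gib))


def raid0_baseline_iops(volumes: int, size_gib_each: int,
                        cap: int = 16_000) -> int:
    """Idealized RAID0 stripe: member volumes' IOPS simply add up."""
    return volumes * gp2_baseline_iops(size_gib_each, cap=cap)


if __name__ == "__main__":
    # Same total capacity, provisioned two different ways.
    single = gp2_baseline_iops(10_000)        # one 10,000 GiB volume
    striped = raid0_baseline_iops(10, 1_000)  # ten 1,000 GiB volumes
    print(f"single volume: {single} IOPS, ten-volume stripe: {striped} IOPS")
```

Under these assumptions the stripe only pulls ahead because the single large volume runs into the per-volume cap. Below the cap (and above the floor), equal total capacity gives equal baseline IOPS either way.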


manigandham about 8 years ago

We use MemSQL and it has the best replication setup process for any relational database with 1 line:

    REPLICATE DATABASE db_name FROM master_user[:master_password]@master_host[:master_port][/master_db_name]

Why is it that in 2017 we still don't have any other database that can come close to this? Basic replication is very well understood and used everywhere but it seems like database creators just don't understand what should be prioritized.


cromulent about 8 years ago

I was looking for the "I want my database to be performant under high random load" question. PIOPS can hurt.

Anyone have any experience running PostgreSQL on the new I3 instances?


agentgt about 8 years ago

I have mentioned this on some previously posted articles but we are really happy users of both Citus and PipelineDB.

Check out PipelineDB if you are a Postgres fan (obviously it is for a different use case than Citus).

The only thing I don't like about PipelineDB is that it currently is a fork and not an extension but that is supposed to change.

Consequently we syndicate to Citus and PipelineDB through RabbitMQ and Kafka.

We use Google Cloud as well. I'm contemplating writing a post on what we have learned (and not :)) but I don't think I could ever match the quality of this article.

And yes, invariably someone will mention MemSQL does both but it is proprietary and not Postgres. I probably should have spent more time investigating it though (and eventually will).

jacobscott about 8 years ago

Does Citus (Cloud?) have features that offer better high availability and failover functionality than what RDS provides? Managed Patroni and packaged workflows for zero-downtime failover would be quite interesting, but I don't see anything like that mentioned on https://www.citusdata.com/product/cloud.

forgotpwtomain about 8 years ago

Why is it seemingly impossible to read a technical blog-post on a company-blog, without some seven-year-old-humor type meme mixed-in?

hayd about 8 years ago

I wonder how Postgres Aurora will fare against Citus... that's what we're considering migrating to in the next year or so.


jordanthoms about 8 years ago

Any plans to take Citus in more of a data-warehousey, complex queries direction over time? We are starting to hit the limits of Postgres 9.6 and would like to move to a columnar store, but Redshift is hosted-only, Teradata looks expensive, Greenplum looks old.


jaequery about 8 years ago

I feel DB hosting is such an underrated field right now. In terms of scaling, everything is pretty easy to scale except databases. I would love to see more services like this.

BIackSwan about 8 years ago

Aren't most of these use cases already offered/handled by Amazon RDS? Maybe not transparent sharding - but otherwise everything else?

LogicX about 8 years ago

Why does the community link on your pricing page lead to a 404?

marknadal about 8 years ago

1. I’d like my PostgreSQL database to be Highly Available

Highlight: "The first is the complexity associated with it: it takes twelve steps to setup streaming replication ... open source solutions such as Governor and Patroni aim to do just that. That said, this integration again comes with a complexity cost."

I cannot believe it is 2017 and streaming replication is still considered complex. I have spent the last half decade+ of my life trying to make this simple; here is a demo: https://youtu.be/-i-11T5ZI9o

2. I’d like my application to not worry about failovers

Highlight: "most PostgreSQL clients don’t have a mechanism to automatically retry different endpoints in case of a failure."

Master-Slave systems are not conducive to failover (determining a new Master involves its own locking/election mechanisms). If we have streaming Master-Master replication by default, you can have some easy automatic failover - https://youtu.be/-FN_J3etdvY .

4. I’d like my database to scale horizontally

Highlight: "Deploying a distributed RDBMS into production requires a good understanding of both relational databases and distributed systems."

We can do a lot of work to improve understanding out there; Kyle Kingsbury (Aphyr of Jepsen Tests) has done a lot to spread awareness. A couple of years ago I did a tech talk that explains the ideas with stick figures so that even laypersons could understand what is going on: http://gun.js.org/distributed/matters.html .

5. I’d like automatic backups for disaster recovery

Highlight: "Distributed database backups are even harder."

See the (1) demo: this doesn't have to be hard, it can be easy enough for frontend web developers IF the system is a streaming Master-Master database to begin with. On top of that, check out our "backup to S3" prototype where we scaled to doing 100M+ messages for $10/day (all costs, CPU, disk, S3) here: https://www.youtube.com/watch?v=x_WqBuEA7s8

My goal and argument here is that database vendors keep propagating the message of "this is hard, so trust us and pay for systems" that Aphyr has repeatedly proven to be broken (although, actually, Postgres did really well, Kyle was recommending it as the best general purpose database) - as Craig notes himself: "In fact, I’ve been on calls where we quoted $300K for the services work, and never heard from that user again."

We need to break these cycles, and I do believe Craig is trying to do that with these blog posts, which is great. But we have a long way to go (all of us).
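The client-side gap quoted in item 2 above ("retry different endpoints in case of a failure") can be sketched in a few lines. This is a minimal illustration, not how any particular PostgreSQL driver works: `connect_with_failover`, the `connect` callable, and the endpoint names are all hypothetical, and a real client would also need to check that the node it reached is actually the writable primary.

```python
from typing import Callable, Optional, Sequence, TypeVar

T = TypeVar("T")


def connect_with_failover(endpoints: Sequence[str],
                          connect: Callable[[str], T],
                          rounds: int = 2) -> T:
    """Try each endpoint in order; return the first successful connection.

    `connect` is any callable that opens a connection to one endpoint and
    raises ConnectionError on failure (a stand-in for a real driver call).
    """
    last_error: Optional[ConnectionError] = None
    for _ in range(rounds):
        for endpoint in endpoints:
            try:
                return connect(endpoint)
            except ConnectionError as exc:
                last_error = exc  # remember the failure, move to the next endpoint
    raise ConnectionError(f"no endpoint reachable: {list(endpoints)}") from last_error


if __name__ == "__main__":
    # Hypothetical endpoints; the fake connector pretends the primary is down.
    def fake_connect(endpoint: str) -> str:
        if endpoint.startswith("primary"):
            raise ConnectionError(f"{endpoint} is down")
        return f"connected:{endpoint}"

    print(connect_with_failover(["primary.example:5432",
                                 "standby.example:5432"], fake_connect))
```

Passing the connector in as a callable keeps the retry loop independent of any driver, which is also what makes it easy to exercise with a fake like the one above.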