Citus Unforks from PostgreSQL, Goes Open Source

763 pointsby jamesherokuabout 9 years ago

28 comments

no1youknowzabout 9 years ago

This is awesome. I have experience with running a CitusDB cluster and it pretty much solved a lot of the scaling problems I was having at the time. For it to go open source now, is of huge benefit to the future projects I have.> With the release of newly open sourced Citus v5.0, pg_shard's codebase has been merged into Citus...This is fantastic, sounds like the setup process is much simpler.I wonder if they have introduced the Active/Active Master solution they were working on? I know before, there is 1 Master and multiple Worker nodes. The solution before was to have a passive backup of the Master.If say, they released the Active/Active Master later on this year. That's huge. I can pretty much think of my DB solution as done at this point.

评论 #11354030 未加载

评论 #11360200 未加载

devitabout 9 years ago

I've been unable to find any clear description of the capabilities of Citus and competing solutions (postgres-x2 seems the other leader).Which of these are supported:1. Full PostgreSQL SQL language2. All isolation levels including Serializable (in the sense that they actually provide the same guarantees as normal PostgreSQL)3. Never losing any committed data on sub-majority failures (i.e. synchronous replication)4. Ability to automatically distribute the data (i.e. sharding)5. Ability to replicate the data instead or in addition to sharding6. Transactionally-correct read scalability7. Transactionally-correct write scalability where possible (i.e. multi-master replication)8. Automatic configuration only requiring to specify some sort of "cluster identifier" the node belongs to

评论 #11355511 未加载

评论 #11355255 未加载

exhilarationabout 9 years ago

AGPL license if anyone's curious: <a href="https://github.com/citusdata/citus/blob/master/LICENSE" rel="nofollow">https://github.com/citusdata/citus/blob/master/LICENSE</a>

评论 #11353976 未加载

评论 #11353878 未加载

评论 #11353847 未加载

gtrubetskoyabout 9 years ago

If anyone from Citus is reading this: how does this affect your business model? I remember when I asked at Strata conf a couple of years ago why isn't your stuff Open Source, the answer then was "because revenue". So what changed since then?

评论 #11354110 未加载

评论 #11354045 未加载

评论 #11353917 未加载

评论 #11353947 未加载

TYabout 9 years ago

This is awesome! Tebrikler (congrats) on the release of 5.0 and going OS, definitely great news.Can you publish competitive positioning of Citus vs Actian Matrix (nee ParAccel) and Vertica? I'd love to compare them side by side - even if it's just from your point of view :-)

评论 #11353932 未加载

评论 #11354598 未加载

erikbabout 9 years ago

Unforking is a very smart decision. Postgres also has gained a lot of favour since MySQL was bought by Oracle. Altogether Citus has earned a lot of kudos for that move, at least with me, for all that may count!

faizshahabout 9 years ago

So this sounds similar to Pivotal's Greenplum which is also open source, can anyone compare the two?

评论 #11354603 未加载

评论 #11360884 未加载

voctorabout 9 years ago

Citus can parallelize SQL queries across a cluster and across multiple CPU cores. How does it compare with the upcoming 9.6 version of PostgreSQL which will support parallel-able sequential scans, parallel joins and parallel aggregate ?

评论 #11355310 未加载

azinman2about 9 years ago

I want it to be called citrus, which is what I always read it as....

评论 #11354760 未加载

评论 #11354474 未加载

评论 #11362216 未加载

rkrzrabout 9 years ago

This is fantastic news! Postgres does not have a terribly strong High Availability story so far and of course it also does not scale out vertically. I have looked at CitusDB in the past, but was always put off by its closed-source nature. Opening it up seems like a great move for them and for all Postgres users. I can imagine that a very active open-source community will develop around it.

评论 #11353839 未加载

评论 #11354169 未加载

ccleveabout 9 years ago

I'd very much like to see what algorithm these systems are using to enable transactions in a distributed environment. Are they just using straight two-phase commit, and letting the whole transaction fail if a single server goes down? Or are are they getting fancy and doing some kind of replication with consensus?

评论 #11357424 未加载

lobster_johnsonabout 9 years ago

This is great!One thing I'm having trouble with is finding information about transactional semantics. If I make several updates (to differently sharded keys) in a single transaction, will the transaction boundaries be preserved (committed "locally" first, then replicated atomically to shards)? Or will they fan out to different shards with separate begin/commit statements? Or without transactional boundaries at all?In fact, I can't really find any information on how CitusDB achieves its transparent sharding for queries and writes. Does it add triggers to distributed tables to rewrite inserts, updates and deletes? Or are tables renamed and replaced with foreign tables? I wish the documentation was a bit more extensive.

signalnineabout 9 years ago

Congrats from Agari! We've been looking forward to this and continue to get a lot of value from both the product and the top-notch support.

jjawssdabout 9 years ago

My guess is that Citus is making enough money from consulting that they don't need to keep this code closed source when they can profit from free community-driven growth while they are expanding their sales pipeline through consulting.

评论 #11354025 未加载

评论 #11353834 未加载

ahacheteabout 9 years ago

Congratulations, Citus.Since I heard last year at PgConfSV that you will be releasing CitusDB 5.0 as open source, I've been waiting for this moment to come.It makes 9.5's awesome capabilities to be augmented with sharding and distributed queries. While this targets real-time analytics and OLAP scenarios, being an open source extension to 9.5 means that a whole lot of users will benefit from this, even under more OLTP-like scenarios.Now that Citus is open source, ToroDB will add a new CitusDB backend soon, to scale-out the Citus way, rather than in a Mongo way :)Keep up with the good work!

BinaryIdiotabout 9 years ago

I don't have a ton of experience scaling out and using different flavors of PostgreSQL but I had run across Postgres-XL not long ago; does anyone know how this compares to that?

ismailabout 9 years ago

Any thoughts on using something like postgres+citrus vs hadoop+hbase+ecosystem vs druid for olap/analytics with very large volumes of data

X86BSDabout 9 years ago

AGPL? This is dead in the water :( It will never be integrated into PG. What a shame. It should have been a 2 clause BSDL. Sigh.

评论 #11354708 未加载

评论 #11354439 未加载

satygeekabout 9 years ago

Does CitusDb fit in olap analytical workloads to do aggregations on hundreds millions of records using varying order and size of dimensions (eg druid) in max of 3 seconds response time using as few boxes as possible - Or there are other techniques have to be used along with Citusdb? Can you shed a light on your experience with CloudFlare in terms of cluster size and queries perf?

评论 #11358460 未加载

评论 #11358178 未加载

评论 #11356501 未加载

uberneoabout 9 years ago

Great product - If would be nice to have a Admin interface like RethinkDB where you can clearly define your replication and Sharding settings. Any documentation around how to do this from command line ?

albashaabout 9 years ago

I recently switched back to MariaDB because I didn't see a clear/easy path for Postgres scalability in case the project i am working on takes off. I am under the assumption there are at least two fairly simple approaches to scale MySQL; master-master replication using Galera and Aurora from AWS. What do you guys think? Am I right in thinking MySQL is easier to scale given I want to spend the least amount of time maintaining it.

Dowwieabout 9 years ago

would a natural evolutionary path for start ups be to emerge with postgresql and grow to requiring citusdb?

onRoadAgain23about 9 years ago

Being burned before,I will never use an OS infrastructure project that has enterprise features you need to pay for. They always try to move you to paid and make the OSS version unpleasant to use over time as soon as the bean counters take over to milk you"For customers with large production deployments, we also offer an enterprise edition that comes with additional functionality"

评论 #11354051 未加载

评论 #11354063 未加载

评论 #11356189 未加载

评论 #11355223 未加载

评论 #11354564 未加载

ioltasabout 9 years ago

Congrats to all for the release. That's a lot of work accomplished.

ksecabout 9 years ago

Does anyone know How does Citus compared to Postgre XL ?

Someoneabout 9 years ago

One must thank them for open sourcing this, and cannot blame them for using a different license, but using a different license makes me think calling this "unfork" is bending the truth a little bit.

评论 #11356536 未加载

评论 #11356828 未加载

lambdafuncabout 9 years ago

Any benchmarks comparing CitusDB against Presto?

Dowwieabout 9 years ago

is it correct to compare citusdb with pipelinedb?

28 comments

no1youknowzabout 9 years ago

评论 #11354030 未加载

评论 #11360200 未加载

devitabout 9 years ago

评论 #11355511 未加载

评论 #11355255 未加载

exhilarationabout 9 years ago

AGPL license if anyone's curious: <a href="https://github.com/citusdata/citus/blob/master/LICENSE" rel="nofollow">https://github.com/citusdata/citus/blob/master/LICENSE</a>

评论 #11353976 未加载

评论 #11353878 未加载

评论 #11353847 未加载

gtrubetskoyabout 9 years ago

评论 #11354110 未加载

评论 #11354045 未加载

评论 #11353917 未加载

评论 #11353947 未加载

TYabout 9 years ago

评论 #11353932 未加载

评论 #11354598 未加载

erikbabout 9 years ago

faizshahabout 9 years ago

So this sounds similar to Pivotal's Greenplum which is also open source, can anyone compare the two?

评论 #11354603 未加载

评论 #11360884 未加载

voctorabout 9 years ago

评论 #11355310 未加载

azinman2about 9 years ago

I want it to be called citrus, which is what I always read it as....

评论 #11354760 未加载

评论 #11354474 未加载

评论 #11362216 未加载

rkrzrabout 9 years ago

评论 #11353839 未加载

评论 #11354169 未加载

ccleveabout 9 years ago

评论 #11357424 未加载

lobster_johnsonabout 9 years ago

signalnineabout 9 years ago

Congrats from Agari! We've been looking forward to this and continue to get a lot of value from both the product and the top-notch support.

jjawssdabout 9 years ago

评论 #11354025 未加载

评论 #11353834 未加载

ahacheteabout 9 years ago

BinaryIdiotabout 9 years ago

I don't have a ton of experience scaling out and using different flavors of PostgreSQL but I had run across Postgres-XL not long ago; does anyone know how this compares to that?

ismailabout 9 years ago

Any thoughts on using something like postgres+citrus vs hadoop+hbase+ecosystem vs druid for olap/analytics with very large volumes of data

X86BSDabout 9 years ago

AGPL? This is dead in the water :( It will never be integrated into PG. What a shame. It should have been a 2 clause BSDL. Sigh.

评论 #11354708 未加载

评论 #11354439 未加载

satygeekabout 9 years ago

评论 #11358460 未加载

评论 #11358178 未加载

评论 #11356501 未加载

uberneoabout 9 years ago

albashaabout 9 years ago

Dowwieabout 9 years ago

would a natural evolutionary path for start ups be to emerge with postgresql and grow to requiring citusdb?

onRoadAgain23about 9 years ago

评论 #11354051 未加载

评论 #11354063 未加载

评论 #11356189 未加载

评论 #11355223 未加载

评论 #11354564 未加载

ioltasabout 9 years ago

Congrats to all for the release. That's a lot of work accomplished.

ksecabout 9 years ago

Does anyone know How does Citus compared to Postgre XL ?

Someoneabout 9 years ago

One must thank them for open sourcing this, and cannot blame them for using a different license, but using a different license makes me think calling this "unfork" is bending the truth a little bit.

评论 #11356536 未加载

评论 #11356828 未加载

lambdafuncabout 9 years ago

Any benchmarks comparing CitusDB against Presto?

Dowwieabout 9 years ago

is it correct to compare citusdb with pipelinedb?