TechEcho

16 comments

manigandham over 6 years ago

PipelineDB = Insert data with a time component to be aggregated on the fly into always up-to-date summary tables, using a variety of aggregation functions. Raw data is not persisted.

TimescaleDB = Store data with a time component in a "hypertable" that is automatically partitioned by time, for faster queries when limited by time range. It is single-node and has helper methods that make time-based bucketing and aggregation easier.

Citus = Store data in distributed tables that are automatically partitioned by any single column and spread across multiple nodes. Join across nodes with non-distributed tables.

You can definitely use PipelineDB for real-time summaries and TimescaleDB or Citus for raw long-term storage in the same database.

Side note: It would be nice if Postgres had a package manager for extensions.
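This is not PipelineDB code, just a rough Python sketch of the "aggregate on the fly, don't keep the raw rows" idea described above. The class name, the per-minute bucketing, and the sample timestamps and values are all made up for illustration.

```python
from collections import defaultdict
from datetime import datetime, timezone


class ContinuousAverage:
    """Toy stand-in for an always-up-to-date summary table (hypothetical, not PipelineDB's API)."""

    def __init__(self):
        # minute bucket -> [event count, running sum]
        self.summary = defaultdict(lambda: [0, 0.0])

    def insert(self, ts, value):
        # Fold the event into its bucket's running aggregate.
        bucket = ts.replace(second=0, microsecond=0)
        row = self.summary[bucket]
        row[0] += 1
        row[1] += value
        # The raw (ts, value) pair is not stored anywhere after this point.

    def rows(self):
        # One (bucket, count, average) tuple per minute, oldest first.
        return [(b, c, s / c) for b, (c, s) in sorted(self.summary.items())]


view = ContinuousAverage()
view.insert(datetime(2018, 10, 25, 12, 0, 5, tzinfo=timezone.utc), 10.0)
view.insert(datetime(2018, 10, 25, 12, 0, 40, tzinfo=timezone.utc), 20.0)
view.insert(datetime(2018, 10, 25, 12, 1, 2, tzinfo=timezone.utc), 30.0)
for row in view.rows():
    print(row)
```

Only the per-bucket count and sum are kept, so memory grows with the number of buckets rather than the number of events. That is the trade-off: you cannot go back and ask a question of the raw data that the summary does not already answer.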

allan_s over 6 years ago

Has anyone tried mixing PipelineDB with Timescale [1]? I think the two are working on different sides of the time-series data problem.

[1] https://www.timescale.com/how-it-works

chucky_z over 6 years ago

I've been following Pipeline since the beginning and it's so fricking cool. Please, if you can't think of a good use of Pipeline, use it instead of a count(*)! :D

skunkworker over 6 years ago

Interesting, this seems to be the other side of the Postgres time-series extension coin. TimescaleDB for writes, PipelineDB for reads.

dkulchenko over 6 years ago

How does this compare to TimescaleDB?

Are they solving the same problem in different ways or are they complementary projects? If it's the latter, what would that look like?

tracker1 over 6 years ago

Hoping this gains some traction as a de facto extension for cloud-hosted PostgreSQL. I think this is probably as useful as plv8 for a lot of use cases.

the-alchemist over 6 years ago

And it supports Postgres 10.x!

http://docs.pipelinedb.com/installation.html#install-postgresql

Can't wait for Postgres 11 support.

crescentfresh over 6 years ago

Looking over this cursorily, it looks super cool.

    INSERT INTO events_stream (ts, value) VALUES (now(), '0ef346ac');

> As soon as the continuous view reads new incoming events and the distinct count is updated to reflect new information, the raw events will be discarded.

So you create a table, insert into it, and it's always empty. Is that right?

Does this work for any table in pg? How does pg know that the insert should NOT actually insert a row?

usgroup over 6 years ago

Fantastic guys, thank you! I’ve been looking forward to it becoming an extension for half a year. This is great news.

This basically means Postgres now has continuous views and a toolbox of functions for running calculations. Combined with PG11 partitioning features and better parallel query execution, PG is an even more formidable choice for medium-sized data.

Arqu over 6 years ago

I work closely in the space of providing time-series databases as managed solutions. I am very happy to see this recent wave of new time-series databases; this, together with Timescale, is a huge boost for the industry segment. Everybody currently measures some analytics, mostly user data, and there is so much abuse of it. Yet there is so much more you can measure and do, and it is still at a very early stage: farms, industrial applications, IoT and so much more. I'd love to just measure temperature and wind speed at unprecedented resolution.

ishikawa over 6 years ago

Very interesting. Does it aggregate per day? If so, I wonder how it handles time zones. I mean, when does it start a new day if you have agents in different time zones?

Rapzid over 6 years ago

Running functions on top of the transaction log (in transaction order) is a really powerful thing.

alakin over 6 years ago

Is most of the intermediate processing done in memory, or is it limited by hd write speed?

jadbox over 6 years ago

How does this compare to Citus?

temuze over 6 years ago

Congrats!

Also, how's stride.io doing?

tnolet over 6 years ago

Big question for me: does it work on Heroku Postgres?

PipelineDB 1.0 – High-Performance Time-Series Aggregation for PostgreSQL
