TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

PipelineDB 1.0 – High-Performance Time-Series Aggregation for PostgreSQL

276 pointsby Fergiover 6 years ago

16 comments

manigandhamover 6 years ago
PipelineDB = Insert data with time component to be aggregated on the fly into always up-to-date summary tables using a variety of aggregation functions. Raw data is not persisted.<p>TimescaleDB = Store data with time component into &quot;hypertable&quot; that is automatically partitioned by time, for faster queries when limited by time range. Single node and has helper methods to make time based bucketing and aggregation easier.<p>Citus = Store data in distributed tables automatically partitioned and spread across multiple nodes, by any single column. Join across nodes with non-distributed tables.<p>Can definitely use PipelineDB for real-time summaries and TimescaleDB or Citus for raw long-term storage in the same database.<p>Side note: It would be nice if Postgres had package manager for extensions.
评论 #18297894 未加载
评论 #18298673 未加载
评论 #18298552 未加载
allan_sover 6 years ago
Has anyone tried to mix pipelinedb with timescale[1] , I think both are working on different side of playing with timeseries data ?<p>[1]<a href="https:&#x2F;&#x2F;www.timescale.com&#x2F;how-it-works" rel="nofollow">https:&#x2F;&#x2F;www.timescale.com&#x2F;how-it-works</a>
评论 #18296078 未加载
chucky_zover 6 years ago
I&#x27;ve been following Pipeline since the beginning and it&#x27;s so fricking cool. Please, if you can&#x27;t think of a good use of Pipeline, use it instead of a count(*)! :D
评论 #18295737 未加载
skunkworkerover 6 years ago
Interesting, this seems to be the other side of the postgres time series extension coin.<p>TimescaleDB for writes, PipelineDB for reads.
评论 #18295988 未加载
评论 #18297488 未加载
评论 #18298004 未加载
评论 #18298537 未加载
dkulchenkoover 6 years ago
How does this compare to TimescaleDB?<p>Are they solving the same problem in different ways or are they complementary projects? If it&#x27;s the latter, what would that look like?
评论 #18298010 未加载
tracker1over 6 years ago
Hoping this gains some traction as a defacto extension for cloud hosted postgresql. I think this is probably as useful as plv8 for a lot of use cases.
the-alchemistover 6 years ago
And it supports Postgres 10.x!<p><a href="http:&#x2F;&#x2F;docs.pipelinedb.com&#x2F;installation.html#install-postgresql" rel="nofollow">http:&#x2F;&#x2F;docs.pipelinedb.com&#x2F;installation.html#install-postgre...</a><p>Can&#x27;t wait for Postgres 11 support.
评论 #18295873 未加载
crescentfreshover 6 years ago
Looking over this cursorily, looks super cool.<p><pre><code> INSERT INTO events_stream (ts, value) VALUES (now(), &#x27;0ef346ac&#x27;); </code></pre> &gt; As soon as the continuous view reads new incoming events and the distinct count is updated the reflect new information, the raw events will be discarded.<p>So you create a table, insert into it, and it&#x27;s always empty. Is that right?<p>Does this work for any table in pg? How does pg know that the insert should NOT actually insert a row?
评论 #18296032 未加载
评论 #18296070 未加载
usgroupover 6 years ago
Fantastic guys, thank you! I’ve been looking forward to it becoming an extension for half a year. This is great news.<p>This basically means Postgres now has continuous views and a toolkbox of functions for running calculations. Combined with PG11 partitioning features and better parallel gusty execution, PG is an even more formidable choice for medium sized data.
Arquover 6 years ago
I work closely in the space of providing time series databases as managed solutions. I can say that I am very happy to see this recent development of new tsd&#x27;s and this with timescale is a huge bump to the industry&#x2F;segment. Everybody currently measures some analytics and mostly user data and there is so much abuse with it, yet there is so much more you can measure and do and it is still very early stage. Farms, industrial applications, IoT and so much more. I&#x27;d love to just measure temperature and wind speed at unprecedented resolution.
ishikawaover 6 years ago
Very interesting. Does it aggregate per day? If so, I wonder how it handles time-zone, I mean when to create a new day when you have agents on different time-zones.
评论 #18313629 未加载
Rapzidover 6 years ago
Running functions on top of the transaction log(in transaction order) is a really powerful thing.
alakinover 6 years ago
Is most of the intermediate processing done in memory, or is it limited by hd write speed?
评论 #18297336 未加载
jadboxover 6 years ago
How does this compare to Citus?
评论 #18298366 未加载
评论 #18296132 未加载
temuzeover 6 years ago
Congrats!<p>Also, how&#x27;s stride.io doing?
评论 #18295926 未加载
tnoletover 6 years ago
Big question for me: does it work on Heroku postgres?
评论 #18305168 未加载