TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Writing a Time Series Database from Scratch

180 pointsby spektomabout 8 years ago

6 comments

jnordwickabout 8 years ago
I still can&#x27;t figure out why people can&#x27;t even come close to KDB+. It is a real conundrum. I&#x27;ve been waiting patiently for something to show up, but the gap seems to keep getting bigger instead of smaller.<p>Is it that people want to make the problem more complex that it needs to be? Is it that those who know most about these issues don&#x27;t share their secrets so implemented from the outside often don&#x27;t have a good understanding of how to do things properly? If you were to asked the guy behind Prometheus if he&#x27;s looked at the commercial offerings and what he&#x27;s learned from them, would even be able to speak about them intelligently?<p>There seems to be a huge skills gap on these things that I can&#x27;t put my finger on. I&#x27;d love to be able to use a real TSDB, even at only half the speed and usefulness. It would be great for these smaller firms that cant or wont pay the license fees for a commercial offering until they get larger.
评论 #14181239 未加载
评论 #14182241 未加载
评论 #14183429 未加载
评论 #14180887 未加载
评论 #14182097 未加载
评论 #14180654 未加载
评论 #14180734 未加载
评论 #14181437 未加载
iksaifabout 8 years ago
You may also want to check <a href="https:&#x2F;&#x2F;github.com&#x2F;criteo&#x2F;biggraphite&#x2F;wiki&#x2F;BigGraphite-Announcement" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;criteo&#x2F;biggraphite&#x2F;wiki&#x2F;BigGraphite-Annou...</a> which is also about how to write a TSDB from Scatch but with different goals.
ah-about 8 years ago
Exciting times in database land! It certainly seems like the good systems are converging on very similar storage architectures. This design is so similar to how Kafka and Kudu work internally.<p>As the raw storage seems pretty optimal now, I suspect next we&#x27;ll see a comeback of indices for more precise queries to get another jump in performance.
nicolaslemabout 8 years ago
The description of this new storage engine does not explain how it manages the durability of the data.<p>When you compare with the extreme efforts traditional databases take to ensure that unplugging a server will never ever result in data loss[0], silencing this problem makes me wonder.<p>Is it that at this ingest rate even trying to ensure durability is a vain effort?<p>[0] <a href="https:&#x2F;&#x2F;www.sqlite.org&#x2F;atomiccommit.html" rel="nofollow">https:&#x2F;&#x2F;www.sqlite.org&#x2F;atomiccommit.html</a>
评论 #14182943 未加载
评论 #14182914 未加载
bogomipzabout 8 years ago
I had a question about the following statement from the post:<p>&gt;&quot;Prometheus&#x27;s storage layer has historically shown outstanding performance, where a single server is able to ingest up to one million samples per second as several million time series&quot;<p>How are there one million samples per second equating to several million time series? Is a single sample not equivalent to a single data point in a time series db for a particular metric in Prometheus?
评论 #14179949 未加载
bongonewhereabout 8 years ago
Is like everyone creating a time series database from scratch?
评论 #14180639 未加载
评论 #14180788 未加载