TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Writing a Time Series Database from Scratch

180 点作者 spektom大约 8 年前

6 条评论

jnordwick大约 8 年前
I still can&#x27;t figure out why people can&#x27;t even come close to KDB+. It is a real conundrum. I&#x27;ve been waiting patiently for something to show up, but the gap seems to keep getting bigger instead of smaller.<p>Is it that people want to make the problem more complex that it needs to be? Is it that those who know most about these issues don&#x27;t share their secrets so implemented from the outside often don&#x27;t have a good understanding of how to do things properly? If you were to asked the guy behind Prometheus if he&#x27;s looked at the commercial offerings and what he&#x27;s learned from them, would even be able to speak about them intelligently?<p>There seems to be a huge skills gap on these things that I can&#x27;t put my finger on. I&#x27;d love to be able to use a real TSDB, even at only half the speed and usefulness. It would be great for these smaller firms that cant or wont pay the license fees for a commercial offering until they get larger.
评论 #14181239 未加载
评论 #14182241 未加载
评论 #14183429 未加载
评论 #14180887 未加载
评论 #14182097 未加载
评论 #14180654 未加载
评论 #14180734 未加载
评论 #14181437 未加载
iksaif大约 8 年前
You may also want to check <a href="https:&#x2F;&#x2F;github.com&#x2F;criteo&#x2F;biggraphite&#x2F;wiki&#x2F;BigGraphite-Announcement" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;criteo&#x2F;biggraphite&#x2F;wiki&#x2F;BigGraphite-Annou...</a> which is also about how to write a TSDB from Scatch but with different goals.
ah-大约 8 年前
Exciting times in database land! It certainly seems like the good systems are converging on very similar storage architectures. This design is so similar to how Kafka and Kudu work internally.<p>As the raw storage seems pretty optimal now, I suspect next we&#x27;ll see a comeback of indices for more precise queries to get another jump in performance.
nicolaslem大约 8 年前
The description of this new storage engine does not explain how it manages the durability of the data.<p>When you compare with the extreme efforts traditional databases take to ensure that unplugging a server will never ever result in data loss[0], silencing this problem makes me wonder.<p>Is it that at this ingest rate even trying to ensure durability is a vain effort?<p>[0] <a href="https:&#x2F;&#x2F;www.sqlite.org&#x2F;atomiccommit.html" rel="nofollow">https:&#x2F;&#x2F;www.sqlite.org&#x2F;atomiccommit.html</a>
评论 #14182943 未加载
评论 #14182914 未加载
bogomipz大约 8 年前
I had a question about the following statement from the post:<p>&gt;&quot;Prometheus&#x27;s storage layer has historically shown outstanding performance, where a single server is able to ingest up to one million samples per second as several million time series&quot;<p>How are there one million samples per second equating to several million time series? Is a single sample not equivalent to a single data point in a time series db for a particular metric in Prometheus?
评论 #14179949 未加载
bongonewhere大约 8 年前
Is like everyone creating a time series database from scratch?
评论 #14180639 未加载
评论 #14180788 未加载