TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

CERN swaps out databases to feed its petabyte-a-day habit

164 点作者 valyala超过 1 年前

19 条评论

keep_reading超过 1 年前
I also dropped InfluxDB at work due to its terrible performance. VictoriaMetrics is great<p>I was using Promscale (TimescaleDB) but they EOL&#x27;d Promscale which forced us to Victoria. But either way both of these are much faster than Influx<p>Don&#x27;t get fooled into the latest InfluxDB rewrite. I think the latest is cloud hosted only too? So stupid
评论 #37608014 未加载
评论 #37603514 未加载
评论 #37605384 未加载
bouvin超过 1 年前
One of my fondest memories as a summer student at CERN in 1993 (in the Electronics and Computing for Physics department) was the visit to the basement beneath the main computing facility, where a colossal tape robot was in operation. Even at that time, CERN was grappling with exceedingly vast amounts of data.
评论 #37610274 未加载
评论 #37607292 未加载
ilyt超过 1 年前
I really like VictoriaMetrics&#x27;s architecture<p>vmagent takes care of all the pesky edge things like emulating prometheus config parsing and various scraping bits. It also does buffering in case you lose network connection for a while, and accept vast spread of different protocols<p>vminsert&#x2F;vmselect scale separately from eachother and your queries don&#x27;t bother your ingest all that much.<p>vmstorage does just that, storage. Only thing that bothers me (compared to say, Elasticsearch), is that data can&#x27;t migrate between nodes so you can&#x27;t &quot;just&quot; start a new one and drain an old one, but a tiny bit ops work in rare cases is IMO price worth paying for straightforwardness of the stack..<p>PromQL compatibility is also great, tools like Grafana &quot;just work&quot; without anyone having to write support for it.<p>We started migrating from InfluxDB at work, and on my private stuff I already did. Soo much less memory usage too.
评论 #37603666 未加载
评论 #37610061 未加载
ComputerGuru超过 1 年前
Missing from the title: leaving InfluxDB and Prometheus for VictoriaMetrics.
评论 #37603421 未加载
Havoc超过 1 年前
That’s one hell of an endorsement. Marketing team won the jackpot.
esafak超过 1 年前
tl,dr:<p>Speaking to The Register, Roman Khavronenko, co-founder of VictoriaMetrics, said the previous system had experienced problems with high cardinality, which refers to the level of repeated values – and high churn data – where applications can be redeployed multiple times over new instances.<p>Implementing VictoriaMetrics as backend storage for Prometheus, the CMS monitoring team progressed to using the solution as front-end storage to replace InfluxDB and Prometheus, helping remove cardinality issues, the company said in a statement.
eclark超过 1 年前
We&#x27;ve been using and building with VictoriaMetrics for a while at Batteries Included. I have probably created and torn down 100+ clusters now. It&#x27;s a remarkably easy to use piece of software for something with the capabilities.
qwertox超过 1 年前
At the end of the article it says<p>&quot;<i>InfluxDB said in March this year it had solved the cardinality issue with a new IOx storage engine.</i>&quot;<p>Does this mean that in the end it wasn&#x27;t really necessary to switch to VictoriaMetrics&#x27; offering?
评论 #37606820 未加载
评论 #37606750 未加载
m3kw9超过 1 年前
Over 24hr period its more then 11 Gigabytes&#x2F;second or rounding to 100 gbps. Those shards must be pretty crazy
评论 #37601785 未加载
foota超过 1 年前
Weird that the title talks about the petabyte a day, while the article is actually about their monitoring tooling, not the thing ingesting the data from experiments, iiuc.
jmakov超过 1 年前
How come they don&#x27;t support wire protocols for analytical workloads like arrow streaming or others like Clickhouse. Looks like they don&#x27;t want to compete with CK.
faeriechangling超过 1 年前
I’m amazed years after I heard about it how the tiny VictoriaMetrics team is thumping what far bigger organizations manage to do. The biggest reason I’ve heard people don’t want to adopt it is if the lead maintainer gets bussed the project is liable to fall apart.<p>I looked at some of the alternatives to victoriametrics for Prometheus and they all seem… much much worse…
__turbobrew__超过 1 年前
I’m surprised nobody has mentioned grafana mimir yet. You get all the niceties of the prometheus ecosystem with a backend which can scale into the billions of metric streams.
评论 #37649977 未加载
amelius超过 1 年前
This is nothing compared to what dragnet surveillance has to deal with.
评论 #37602212 未加载
fijiaarone超过 1 年前
Amazing how much data you can generate with a small cluster piping out &#x2F;dev&#x2F;urandom continuously over every possible socket.
inv2004超过 1 年前
Do not have very positive experience with influxdb, but the strange for me that clickhouse was not even mentioned in the article
zaps超过 1 年前
Just use sqlite amirite
评论 #37606697 未加载
iFire超过 1 年前
OPENSOURCE, APACHE2 LICENSE<p><a href="https:&#x2F;&#x2F;github.com&#x2F;VictoriaMetrics&#x2F;VictoriaMetrics&#x2F;blob&#x2F;master&#x2F;LICENSE">https:&#x2F;&#x2F;github.com&#x2F;VictoriaMetrics&#x2F;VictoriaMetrics&#x2F;blob&#x2F;mast...</a>
评论 #37603503 未加载
sgt101超过 1 年前
I can do this on my laptop<p>&#x2F;tumbleweed...