TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Can you even trust benchmarks these days? ClickHouse vs. Druid vs. Rockset

4 点作者 goenning将近 2 年前

5 条评论

ryadh将近 2 年前
One way to add some trust is to make benchmarks open-source and reproducible: <a href="https:&#x2F;&#x2F;github.com&#x2F;ClickHouse&#x2F;ClickBench&#x2F;">https:&#x2F;&#x2F;github.com&#x2F;ClickHouse&#x2F;ClickBench&#x2F;</a><p>(I work at ClickHouse)
评论 #36006366 未加载
PeterZaitsev将近 2 年前
Can you trust benchmarks Ever ? - No!<p>Or rather while some benchmarks which are relevant for your use case, are done correctly and unbiased may exist it is not easy to find and identify
bell-cot将近 2 年前
Trust for what purpose? Are you a Marketing Droid for a product which looked good&#x2F;bad in some particular benchmark? A fanboy yearning to proclaim yourself Right on the internet? A paid-per-click author cranking out web content? A PHB looking for validation of his half-assed decisions?<p>For the great majority of use cases, any looks-good benchmark will do the job.<p>(Vs. if you actually need to know about real-world performance - you&#x27;ll have to do some serious work for that information.)
PeterCorless将近 2 年前
Full disclosure: I work at StarTree, which is powered by Apache Pinot.<p>ClickHouse&#x27;s ClickBench is a good general tool. However, it&#x27;s not the end-all, be-all of performance benchmarking and testing. Its results may or may not be applicable for guidance on the performance of your specific use case when you get to production.<p>It is definitely a stab at getting an objective suite of tools for the real-time analytics space. But just like you had YCSB as a good general performance test, eventually a subset of users wanted something specific for Cassandra and Cassandra-like databases (DSE, ScyllaDB, etc.), so you eventually saw cassandra-stress. We have to consider cases where certain databases may need to have testing suites that really capture their capabilities.<p>ClickHouse themselves publishes a list of Limitations that everyone should keep in mind as they run ClickBench:<p><a href="https:&#x2F;&#x2F;github.com&#x2F;ClickHouse&#x2F;ClickBench&#x2F;#limitations">https:&#x2F;&#x2F;github.com&#x2F;ClickHouse&#x2F;ClickBench&#x2F;#limitations</a><p>CelerData (based on StarRocks) also wrote up this:<p><a href="https:&#x2F;&#x2F;celerdata.com&#x2F;blog&#x2F;what-you-should-know-before-using-clickbench" rel="nofollow">https:&#x2F;&#x2F;celerdata.com&#x2F;blog&#x2F;what-you-should-know-before-using...</a><p>Plus, I want to direct people to the discussion generated when ClickBench was first posted to HN:<p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=32084571" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=32084571</a><p>As user AdamProut commented back at the time:<p>&gt; It looks like the queries are all single table queries with group-bys and aggregates over a reasonably small data set (10s of GB)?<p>&gt;I&#x27;m sure some real workloads look like this, but I don&#x27;t think it&#x27;s a very good test case to show the strengths&#x2F;weaknesses of an analytical databases query processor or query optimizer (no joins, unions, window functions, complex query shapes ?).<p>&gt; For example, if there were any queries with some complex joins Clickhouse would likely not do very well right now given its immature query optimizer (Clickhouse blogs always recommend denormalizing data into tables with many columns to avoid joins).<p>So, again, ClickBench is a good (great) beginning. As an industry we should not let it be seen as the end. I&#x27;d be interested in the community&#x27;s opinions on what and how we should be doing better.
goenning将近 2 年前
Are there any unbiased benchmark for databases like techempower.com does for web frameworks?
评论 #36000828 未加载
评论 #36001422 未加载