TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Data Warehouse vs. Operational Database What? How? Which One?

4 pointsby alexmarquardtover 2 years ago

3 comments

swyxover 2 years ago
&gt; For example, loading data from an SSD is more than 3 orders of magnitude (more than 1000x) slower than referencing main memory, and a disk seek on a spinning disk is 5 orders of magnitude (100,000x) slower than referencing data that is in memory. The above latencies should make it clear that there is a huge performance advantage to minimizing disk access.<p>this is the first explanation i&#x27;ve seen that directly links disk latency and data architecture decisions. always felt it intuitively but never did the math.
zX41ZdbWover 2 years ago
The advantage of OLAP vs. OLTP on analytical queries is clearly visible on the benchmark: <a href="https:&#x2F;&#x2F;benchmark.clickhouse.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;benchmark.clickhouse.com&#x2F;</a><p>If we take the extreme - ClickHouse as the most thoroughly optimized column-oriented DBMS and compare it with Postgres, the difference will be more than 100 times on average.
sspaetiover 2 years ago
&gt; The column-oriented storage format used by data warehouses allows them to efficiently leverage modern SIMD computer architectures for columnar-vectorized processing.<p>I find it interesting how these vectorized processing engines with DuckDB and Photon Engine of Databricks try to combine row and columnar-oriented strengths.
评论 #34058281 未加载