TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Delta Lake vs. Data Lakes – what's the difference?

1 点作者 MrPowers11 个月前

1 comment

RoyTyrell11 个月前
This is just a marketing brochure...<p>Not to be &quot;old man yells at sky&quot; but a lot of these new cloud-based&#x2F;cloud-focused architectures seem to be geared toward highly specialized needs that 99.9% of businesses aren&#x27;t going to need. However they do one important thing - they over-use resources that line the pockets of MS, Amazon, Google, Data Bricks, etc. A Data Lakehouse is fine but what benefit does it give you over a much more simple solution of ETL&#x2F;ELTing the data in batches (weekly, daily, hourly, etc) and letting it sit in some kind of DB.<p>They say the Data Lakehouse needs all this metadata storage, API access layers, etc. Seems like an overly complex system for anything but large real-time systems that need to replicate a DB but due to data volume and throughput, are unable to. Perhaps you also aren&#x27;t just driving traditional reporting (dashboards, etc).<p>I&#x27;m happy to use this new technology to make more money for myself as a specialist, and effectively be in on the scam, but from an optimal solution pov they suck.
评论 #40719011 未加载