TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

The Data Engineering Ecosystem: An Interactive Map

47 pointsby jecs321about 10 years ago

4 comments

jkestelynabout 10 years ago
Very nice effort. A couple things missing though to make it truly up to date:<p>1. In &quot;Ingest&quot;, where&#x27;s Flume? 2. Where&#x27;s &quot;Interactive SQL&quot; (eg Impala, and for Presto)? 3. Where&#x27;s &quot;Search&quot; (Solr, ElasticSearch)?
评论 #9159876 未加载
bzz01about 10 years ago
While all maps like this tend to make little practical sense since they inevitably over-generalize and over-simplify things, I&#x27;d still like to point out that they got &quot;columnar&quot; category quite wrong: neither HBase nor Cassandra are columnar stores in a way this term is commonly understood.<p>HBase and Cassandra still store data in rows, however rows can be partitioned into column families which may be stored separately. Columnar databases are usually also relational (Vertica and Redshift) and support SQL or SQL-like query language.<p>Anyway, I think regardless of how you define columnar, HBase and Redshift shouldn&#x27;t end up in the same category as they are quite different in a way they work, throughput&#x2F;latency and read&#x2F;write balance and use cases.
评论 #9160231 未加载
iblaineabout 10 years ago
No real time MPP databases like Redshift, Netezza, Aster...otherwise good graph.
评论 #9159889 未加载
sampathwebabout 10 years ago
Love the text overlay. It would be nice to also have project links in the text.
评论 #9159880 未加载