Apache Arrow – Powering Columnar In-Memory Analytics

48 points by bertzzie over 8 years ago

5 comments

PDoyle over 8 years ago
Oops... The first sentence in the "Fast" section says "SIMD (Single input multiple data)".
filereaper over 8 years ago
Asking the stupid question here, but why create a new Apache project for this?
Apache Arrow seems to be targeting the use of SIMD, which is a very JVM/runtime-dependent feature. If the runtime can't detect this out of the box, then create a recognized method or some sort of intrinsic to coax the runtime to SIMD-ize the operation.
I understand the performance gains of this, but why not add this functionality to existing projects like Parquet or HTable etc...
This just comes to mind: https://xkcd.com/927/
ljoshua over 8 years ago
Is this similar to how QlikView's in-memory engine works?
threeseed over 8 years ago
It really is a confusing title for the project. It's more of a high-speed interchange format, e.g. for sending data to Cassandra from Spark or Storm.
Nothing that end users will ever really have to know anything about.
axman6 over 8 years ago
I'm confused, is this just Structure of Arrays as a service for columnar data? It's not clear to me what this actually does.