Apache Arrow – Powering Columnar In-Memory Analytics

48 points by bertzzie, over 8 years ago

5 comments

PDoyle, over 8 years ago
Oops... The first sentence in the "Fast" section says "SIMD (Single input multiple data)".
filereaper, over 8 years ago
Asking the stupid question here, but why create a new Apache project for this?

Apache Arrow seems to be targeting the use of SIMD which is a very JVM/Runtime dependent feature. If the runtime can't detect this out-of-the-box then create recognized method or some sort of intrinsic to coax the runtime to SIMD-ize the operation.

I understand the performance gains of this but why not add this functionality to existing projects like Parquet or HTable etc...

This just comes to mind: https://xkcd.com/927/
ljoshua, over 8 years ago
Is this similar to how QlikView's in-memory engine works?
threeseed, over 8 years ago
It really is a confusing title for the project. It's more of a high speed interchange format e.g. send data to Cassandra from Spark or Storm.

Nothing that end users will ever really have to know anything about.
axman6, over 8 years ago
I'm confused, is this just Structure of Arrays as a service for columnar data? It's not clear to me what this actually does.
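
For readers unfamiliar with the term in the comment above, here is a minimal sketch of the general "structure of arrays" idea, using only the Python standard library. It is not Arrow's actual format or API; the record fields (`id`, `price`) and the helper `to_columns` are hypothetical names chosen for illustration.

```python
from array import array

# Row-oriented ("array of structures"): each record keeps its fields together.
# The field names here are made up for the example.
rows = [
    {"id": 1, "price": 9.5},
    {"id": 2, "price": 3.0},
    {"id": 3, "price": 7.25},
]


def to_columns(records):
    """Convert row records into a structure of arrays.

    Each field becomes one contiguous, typed buffer, so a scan over a
    single field touches only that field's memory.
    """
    return {
        "id": array("q", (r["id"] for r in records)),        # signed 64-bit ints
        "price": array("d", (r["price"] for r in records)),  # 64-bit floats
    }


columns = to_columns(rows)

# An aggregate over one column reads a single buffer and skips the other fields.
total_price = sum(columns["price"])
```

Laying each field out contiguously is what makes vectorized (SIMD-style) scans practical, which appears to be the connection the thread is circling around.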