TechEcho

10 comments

skrebbel about 4 years ago

I don't understand much about this Apache/Java data streaming ecosystem (ETL, Kafka, Cassandra, they're all buzzword bingo to me and I don't know what it all means), but maybe someone here can translate this into simpler application programmer terms?

I read the overview, and I'm not sure yet, but is this like an in-memory database that runs inside your process? Like SQLite without disk persistence, or Erlang ETS, but columnar?

I can't completely tell from the overview whether it's about the data format or the querying capability. A columnar ETS alternative would be splendid indeed!

phissenschaft about 4 years ago

Great to see Ballista in Arrow: https://github.com/apache/arrow/pull/9723

xbar about 4 years ago

Here's a neat story about how an Apple M1 MacBook got 3x the performance of an Intel MacBook using a (hassle-to-compile) Apache Arrow test build.

https://uwekorn.com/2021/01/11/apache-arrow-on-the-apple-m1.html

liminal about 4 years ago

Is Arrow good for text data or does the columnar format lose its benefits when dealing with lots of arbitrary length strings?
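
For context on the question above: Arrow's columnar format stores a variable-length string column as one contiguous byte buffer plus an offsets buffer, so arbitrary-length strings still sit side by side in memory. The pure-Python sketch below only illustrates that layout idea. It is not the pyarrow API, the function names `encode_strings` and `get_string` are made up for this example, and it leaves out the validity (null) bitmap that a real Arrow array also carries.

```python
def encode_strings(values):
    """Pack strings into (offsets, data), mimicking a variable-length
    string column: one contiguous UTF-8 buffer plus n + 1 offsets.
    Hypothetical helper for illustration, not part of any Arrow library."""
    offsets = [0]
    data = bytearray()
    for value in values:
        data.extend(value.encode("utf-8"))
        # Each offset marks where the next string starts (and this one ends).
        offsets.append(len(data))
    return offsets, bytes(data)


def get_string(offsets, data, i):
    """Read string i by slicing the shared buffer between two offsets."""
    return data[offsets[i]:offsets[i + 1]].decode("utf-8")


offsets, data = encode_strings(["arrow", "", "columnar text"])
print(offsets)                       # start/end positions in the byte buffer
print(get_string(offsets, data, 2))  # random access without scanning
```

Looking up string `i` costs two offset reads and one slice regardless of how long the other strings are, which is why variable-length text does not break the columnar layout as such.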

dmitrykoval about 4 years ago

Good progress overall, especially on the Rust side. I wonder whether C++ and Rust will at some point follow the same roadmap when it comes to higher-level compute features, or instead diverge and develop at their own pace.

Special kudos to the Rust team for the Parquet predicate pushdown feature.

yazaddaruvala about 4 years ago

Is anyone familiar enough to know if Arrow is also targeting usage by libraries like Lucene?

rubatuga about 4 years ago

Finally has ARM builds for pyarrow!

poorman about 4 years ago

Love the progress being made!

mkoubaa about 4 years ago

Really great to see how fast this thing took flight!

oregontechninja about 4 years ago

It's a binary data format, supporting trees, tables, lists, and even blobs. Never used it, I already have sqlite.

Apache Arrow 4.0
