I don't understand much about this apache/java data streaming ecosystem (ETL, Kafka, Cassandra, they're all buzzword bingo to me and i don't know what it all means), but maybe someone here can translate this to simpler application programmer terms?<p>I read the overview, and I'm not sure yet, but is this like an in-memory database that runs inside your process? Like, sqlite without disk persistence, or Erlang ETS, but then columnar?<p>I can't completely tell from the overview whether it's about the data format or the querying capability. A columnar ETS alternative would be splendid indeed!
Great to see Ballista in arrow <a href="https://github.com/apache/arrow/pull/9723" rel="nofollow">https://github.com/apache/arrow/pull/9723</a>
Here's a neat story about how an Apple M1 Macbook enjoyed 3x the performance compared to an Apple Intel Macbook using a (hassle to compile) Apache Arrow test instantiation.<p><a href="https://uwekorn.com/2021/01/11/apache-arrow-on-the-apple-m1.html" rel="nofollow">https://uwekorn.com/2021/01/11/apache-arrow-on-the-apple-m1....</a>
Good progress overall, especially on the Rust side, I wonder if C++ and Rust would at some point follow the same roadmap when it comes higher-level compute features or rather deviate and develop at their own pace.<p>Special kudos to the Rust team for Parquet predicates pushdown feature.