107 pointsby andygrove12 months ago

3 comments

But why. Unless you need to use low-level map/reduce, just ditch Spark and use <a href="https://github.com/apache/datafusion-ballista">https://github.com/apache/datafusion-ballista</a> directly. It supports Python too.

评论 #40539356 未加载

评论 #40539029 未加载

评论 #40539256 未加载

评论 #40538909 未加载

评论 #40540821 未加载

pitah112 months ago

I've been keeping an eye on these kinds of Spark accelerator libraries for a while now.How does it compare to Blaze[1] and Gluten[2]?I'm interested in running some benchmarks soon against all three for my project to see how they all go.[1] <a href="https://github.com/kwai/blaze">https://github.com/kwai/blaze</a>[2] <a href="https://github.com/apache/incubator-gluten">https://github.com/apache/incubator-gluten</a>

评论 #40543701 未加载

scirob12 months ago

Imagine if data bricks switched and just started to contribute to this.I live in a dream world :)

评论 #40540139 未加载

DataFusion Comet: Apache Spark Accelerator

3 comments

DataFusion Comet: Apache Spark Accelerator

3 comments