DataFusion Comet: Apache Spark Accelerator

107 points by andygrove 12 months ago

3 comments

OutOfHere 12 months ago
But why. Unless you need to use low-level map/reduce, just ditch Spark and use https://github.com/apache/datafusion-ballista directly. It supports Python too.
pitah1 12 months ago
I've been keeping an eye on these kinds of Spark accelerator libraries for a while now.

How does it compare to Blaze[1] and Gluten[2]?

I'm interested in running some benchmarks soon against all three for my project to see how they all go.

[1] https://github.com/kwai/blaze

[2] https://github.com/apache/incubator-gluten
scirob 12 months ago
Imagine if Databricks switched and just started to contribute to this.

I live in a dream world :)