2016 Spark Summit East Keynote

52 points by mydpy, about 9 years ago

8 comments

eranation, about 9 years ago
Very excited to hear the plans for GraphFrames - finally GraphX is getting some attention!

https://spark-summit.org/east-2016/events/graphframes-graph-queries-in-spark-sql/
krcz, about 9 years ago
How advanced is the Structured Streaming functionality? Looking at the JIRA [1], I cannot find even a design prototype there, which is kind of strange if they want to have it ready by the end of April. But as there was a presentation on the topic at the summit [2], I hope they are just developing it without discussion on JIRA.

[1] https://issues.apache.org/jira/browse/SPARK-8360
[2] https://spark-summit.org/east-2016/events/keynote-day-3/
mydpy, about 9 years ago
I think the more exciting announcement was Databricks Community Edition, which allows you to use 2.0:

https://news.ycombinator.com/item?id=11126179
azth, about 9 years ago
Slide 10:

> CPU speeds have not kept up with I/O in the past 5 years.

I presume he means the other way around?

Also, what does he mean by native memory management? Does he mean off-heap allocation?

And what's he referring to regarding code generation?
TheGuyWhoCodes, about 9 years ago
Has it become easier to run ad hoc queries with Spark? I remember that a year ago the only available solution was the job server by Ooyala. That seems to be a missing feature of core Spark, and it isn't something I was willing to bet my product on.

DataStax evangelized using Spark to run queries over Cassandra, but it looks so awkward and time-consuming to copy jars around to the master. Basically, you need a DevOps team to do this, and even more scripting for production.
DannoHung, about 9 years ago
Are Spark streams ever going to reach a point where you can just have a table sitting in memory aggregating data, and then run queries on the *whole* thing without having to worry about windowing or anything?
mziel, about 9 years ago
At the last Spark Summit, the videos were up on YouTube 1-2 hours after each talk. Does anybody know where to find the ones from this summit?
josep2, about 9 years ago
Started using Spark 1.6 a few months ago. Excited for the Kafka Connector feature.