
Apache Flink: Juggling with Bits and Bytes

20 points by mxmxm, about 10 years ago

2 comments

gopalv, about 10 years ago
Flink is pretty neat and it didn't feel like an "all or nothing" monolithic chunk when I played with it.

The page-based inner loop makes it very predictable when it does data operations. They were doing this early last year, and this mechanism made it very CPU-cache friendly and didn't trigger the massive GC pauses.

I was playing with Flink a bit earlier, because Flink can be integrated into Tez, so that Tez could do DAG scheduling while Flink ran with its inner loops on turbo.

That inner loop can edge out even the hand-written Java code I wrote for PageRank (delta iterations are nice).
jtagx, about 10 years ago
Great post detailing how Flink manages data within the JVM and implements internal operators (hashing, sorting, ...) that work directly on the serialized data. I also like the performance analysis in the post.
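
To make the idea in the comments above concrete, here is a minimal sketch of sorting records that live as serialized bytes in preallocated, fixed-size pages. It is not Flink's code. The class name `PagedSortSketch`, the tiny page size, and the fixed 8-byte record layout (an `int` key followed by an `int` value) are all assumptions chosen for illustration. The sort packs each key together with the record's position into a `long` and sorts a primitive array, so no per-record objects are created.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

// Illustrative sketch only; names and sizes are hypothetical, not Flink's API.
public class PagedSortSketch {
    static final int PAGE_SIZE = 64;   // assumption: tiny page so the demo spans pages
    static final int RECORD_SIZE = 8;  // assumption: 4-byte int key + 4-byte int value
    static final int RECORDS_PER_PAGE = PAGE_SIZE / RECORD_SIZE;

    private final ByteBuffer[] pages;  // allocated once up front and reused
    private int count = 0;

    PagedSortSketch(int numPages) {
        pages = new ByteBuffer[numPages];
        for (int i = 0; i < numPages; i++) {
            pages[i] = ByteBuffer.allocate(PAGE_SIZE);
        }
    }

    // Serialize a record straight into the next free slot of a page.
    void append(int key, int value) {
        if (count >= pages.length * RECORDS_PER_PAGE) {
            throw new IllegalStateException("out of pages");
        }
        ByteBuffer page = pages[count / RECORDS_PER_PAGE];
        int offset = (count % RECORDS_PER_PAGE) * RECORD_SIZE;
        page.putInt(offset, key);
        page.putInt(offset + 4, value);
        count++;
    }

    // Read fields directly from the page bytes, without building record objects.
    int keyAt(int index) {
        return pages[index / RECORDS_PER_PAGE].getInt((index % RECORDS_PER_PAGE) * RECORD_SIZE);
    }

    int valueAt(int index) {
        return pages[index / RECORDS_PER_PAGE].getInt((index % RECORDS_PER_PAGE) * RECORD_SIZE + 4);
    }

    // Pack (key, record index) into one long and sort the primitive array.
    // The key sits in the high 32 bits, so signed long order equals key order,
    // and equal keys fall back to insertion order via the index in the low bits.
    int[] sortedOrder() {
        long[] entries = new long[count];
        for (int i = 0; i < count; i++) {
            entries[i] = ((long) keyAt(i) << 32) | i;
        }
        Arrays.sort(entries);
        int[] order = new int[count];
        for (int i = 0; i < count; i++) {
            order[i] = (int) entries[i]; // low 32 bits hold the record index
        }
        return order;
    }

    public static void main(String[] args) {
        PagedSortSketch buf = new PagedSortSketch(2);
        int[][] input = {{42, 1}, {-7, 2}, {13, 3}, {42, 4}, {0, 5}};
        for (int[] r : input) {
            buf.append(r[0], r[1]);
        }
        for (int idx : buf.sortedOrder()) {
            System.out.println(buf.keyAt(idx) + " -> " + buf.valueAt(idx));
        }
    }
}
```

Running `main` prints the records in ascending key order. The records themselves never move and never become heap objects; only the small array of packed longs is rearranged, which is the kind of layout that tends to be friendly to CPU caches and light on the garbage collector.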