TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Apache Flink: Juggling with Bits and Bytes

20 pointsby mxmxmabout 10 years ago

2 comments

gopalvabout 10 years ago
Flink is pretty neat and it didn&#x27;t feel like an &quot;all or nothing&quot; monolithic chunk when I played with it.<p>The page based inner loop makes it very predictable when it does data operations - they were doing this early last year and this mechanism made it very cpu cache friendly &amp; didn&#x27;t trigger the massive GC pauses.<p>I was playing with Flink a bit earlier, because Flink can be integrated into Tez, so that Tez could do DAG scheduling while Flink ran with its inner loops on turbo.<p>That inner loop can edge out even hand-written java code I wrote for page-rank (delta iterations are nice).
jtagxabout 10 years ago
Great post detailing how Flink is managing data within the JVM and implementing internal operators (hashing, sorting, ..) working with that serialized data. I also like the performance analysis in the post.