41 点作者 YAFZ超过 9 年前

2 条评论

kod超过 9 年前

If I'm reading this correctly, the Kafka topic only had 5 partitions, but they had 10 workers.<p>With the Spark direct stream, kafka partitions are 1:1 with spark partitions, which means at most half of the workers would be doing work without a shuffle.<p>Seems like a pretty basic oversight that should be addressed.

estefan超过 9 年前

This is the first mention I've seen of flink on HN.

评论 #10762738 未加载

Benchmarking Streaming Computation Engines at Yahoo

2 条评论

Benchmarking Streaming Computation Engines at Yahoo

2 条评论