TechEcho

9 comments

fhoffaover 9 years ago

Note that this proposal is being back not only by Google, but also Cloudera, Data Artisans, Talend, Cask, PayPal, ...Some other posts on the announcement:<a href="http://googlecloudplatform.blogspot.com/2016/01/Dataflow-and-open-source-proposal-to-join-the-Apache-Incubator.html" rel="nofollow">http://googlecloudplatform.blogspot.com/2016/01/Dataflow-and...</a><a href="http://blog.cloudera.com/blog/2016/01/spark-dataflow-joins-googles-dataflow-sdk/" rel="nofollow">http://blog.cloudera.com/blog/2016/01/spark-dataflow-joins-g...</a><a href="http://data-artisans.com/dataflow-proposed-as-apache-incubator-project/" rel="nofollow">http://data-artisans.com/dataflow-proposed-as-apache-incubat...</a><a href="http://blog.cask.co/2016/01/cask-anticipates-googles-dataflow-to-flourish-in-apache/" rel="nofollow">http://blog.cask.co/2016/01/cask-anticipates-googles-dataflo...</a>

评论 #10942032 未加载

mindprinceover 9 years ago

> While Google has previously published papers describing some of its technologies, Google decided to take a different approach with Dataflow. Google open-sourced the SDK and model alongside commercialization of the idea and ahead of publishing papers on the topic.A large number of ASF projects in the Big Data space are inspired by Google's publications. Good to see Google finally taking the lead and coming out with code.

meltedover 9 years ago

Seems like this would duplicate a rather large chunk of Apache Crunch, which implements Google Flume nearly exactly as far as public API is concerned. As far as I can tell, Google Dataflow is also a variation on top of Google Flume. It would be helpful if they could elucidate why this project would not be redundant under the Apache umbrella.

评论 #10942235 未加载

评论 #10942280 未加载

评论 #10941902 未加载

评论 #10942528 未加载

评论 #10941943 未加载

syskover 9 years ago

Can anyone ELI5 what it means for an open source project to become an Apache project? Why doesn't Google just push the code on Github?

评论 #10948595 未加载

评论 #10944396 未加载

评论 #10942914 未加载

评论 #10942998 未加载

Wonnk13over 9 years ago

what are the best resources to learn about streaming, dataflow, etc? Not necessarily the Google implementations, but the core concepts backing them.

评论 #10943014 未加载

评论 #10942999 未加载

xcelqover 9 years ago

Can we hope to see a google like search engine open source? I'm just waiting for this day to happen.

ericandover 9 years ago

O'Reilly post also released today references the Apache Dataflow submission: <a href="https://www.oreilly.com/ideas/the-world-beyond-batch-streaming-102" rel="nofollow">https://www.oreilly.com/ideas/the-world-beyond-batch-streami...</a>

评论 #10942615 未加载

obulpathiover 9 years ago

It would be awesome to have the code portable across various big data engines.

BenoitPover 9 years ago

Where does Dataflow stands? Is it only a wrapper, trying to define a standard API for combining stream producers, datastores, and stream engines?

评论 #10953401 未加载

9 comments

fhoffaover 9 years ago

评论 #10942032 未加载

mindprinceover 9 years ago

meltedover 9 years ago

评论 #10942235 未加载

评论 #10942280 未加载

评论 #10941902 未加载

评论 #10942528 未加载

评论 #10941943 未加载

syskover 9 years ago

Can anyone ELI5 what it means for an open source project to become an Apache project? Why doesn't Google just push the code on Github?

评论 #10948595 未加载

评论 #10944396 未加载

评论 #10942914 未加载

评论 #10942998 未加载

Wonnk13over 9 years ago

what are the best resources to learn about streaming, dataflow, etc? Not necessarily the Google implementations, but the core concepts backing them.

评论 #10943014 未加载

评论 #10942999 未加载

xcelqover 9 years ago

Can we hope to see a google like search engine open source? I'm just waiting for this day to happen.

ericandover 9 years ago

评论 #10942615 未加载

obulpathiover 9 years ago

It would be awesome to have the code portable across various big data engines.

BenoitPover 9 years ago

Where does Dataflow stands? Is it only a wrapper, trying to define a standard API for combining stream producers, datastores, and stream engines?

评论 #10953401 未加载

Google proposes its Dataflow batch/stream tech to the Apache Incubator

9 comments

Google proposes its Dataflow batch/stream tech to the Apache Incubator

9 comments