Standardizing OpenAI’s deep learning framework on PyTorch

242 points by pesenti, over 5 years ago

14 comments

cs702 over 5 years ago
At work, we switched over from TensorFlow to PyTorch when 1.0 was released, both for R&D and production... and our productivity and *happiness* with PyTorch noticeably and significantly improved.

Back when we were using TensorFlow, whenever we wanted to try something new that wasn't already provided out of the box by existing APIs, sooner or later we would find ourselves *wrestling* with its machinery, especially for models with more complex control flow.

TensorFlow *feels* like it was built from the ground up to scale to billions of users and all kinds of devices, with developer productivity and happiness a secondary priority. PyTorch *feels* like it was built the other way around, prioritizing developer productivity and happiness; other considerations were secondary.

That said, we are keeping an eye on Swift + MLIR + TensorFlow. We think it could unseat PyTorch for R&D and, eventually, production, due to (a) the promise of automatic creation of high-performance GPU/TPU kernels without hassle, (b) Swift's easy learning curve, and (c) Swift's fast performance and type safety. Jeremy Howard has a good post about this: https://www.fast.ai/2019/03/06/fastai-swift/
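
To make the control-flow point above concrete: in PyTorch the forward pass is ordinary Python, so data-dependent branches and loops need no special graph operators. A minimal sketch, assuming a made-up module and threshold purely for illustration:

```python
import torch
import torch.nn as nn

class AdaptiveDepthNet(nn.Module):
    """Hypothetical model whose depth depends on the input at run time."""
    def __init__(self, dim=32, max_steps=5):
        super().__init__()
        self.layer = nn.Linear(dim, dim)
        self.max_steps = max_steps

    def forward(self, x):
        # Plain Python loop and if-statement: the "graph" is simply whatever
        # code happens to execute for this particular input.
        for _ in range(self.max_steps):
            x = torch.relu(self.layer(x))
            if x.norm() < 1.0:  # data-dependent early exit
                break
        return x

model = AdaptiveDepthNet()
out = model(torch.randn(4, 32))  # eager execution, easy to step through in a debugger
```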

stabbles over 5 years ago
I've started working with Flux [1] in Julia, and it's so elegant and such a great experience :). Just look at this definition of a U-Net model for image segmentation: https://gist.github.com/haampie/bceb1d59fd9a44f092f913062e58d482. Apart from that, you can write custom loss functions in pure Julia that run efficiently on the GPU, and you get language-level automatic differentiation and proper integration with other packages. If people are moving away from TensorFlow, Flux could be a solid alternative as well.

[1] https://github.com/FluxML/Flux.jl

antome over 5 years ago
As someone who has used both PyTorch and TensorFlow for a couple of years now, I can attest to the faster research iteration times with PyTorch. TensorFlow has always felt like it was designed for some mythical researcher who could come up with a complete architecture ahead of time, based on off-the-shelf parts.

theferalrobot over 5 years ago
Happy to see PyTorch get some love. The company I am at made the same switch and everyone has loved PyTorch. It has more expressive power than TensorFlow 1.x (there are models that simply cannot be expressed with static graphs) and is simultaneously much easier to use.

sbrother over 5 years ago
Is there any equivalent of TF Serving for PyTorch? We have been thrilled with how robust and easy it is to deploy our models to production on the TF stack, and it worries me that the momentum in the deep learning community seems to be toward PyTorch.
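
For context on the PyTorch side of that question: the usual route at the time was to export a TorchScript artifact that a non-Python process (libtorch in C++, or a serving layer built on it) can load. A minimal sketch, with a stand-in model that is not from the thread:

```python
import torch
import torch.nn as nn

# Stand-in model; any nn.Module with a trace-friendly forward would do.
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 2)).eval()

# Trace with a representative input and save a self-contained artifact.
example = torch.randn(1, 16)
scripted = torch.jit.trace(model, example)
scripted.save("model.pt")

# Later, in a serving process (shown in Python; libtorch offers the same load in C++):
loaded = torch.jit.load("model.pt")
with torch.no_grad():
    print(loaded(torch.randn(1, 16)))
```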

sandGorgon over 5 years ago
This is the second large framework to make the switch to PyTorch.

https://medium.com/syncedreview/japanese-unicorn-preferred-networks-migrates-its-dl-platform-to-pytorch-a509ac8f4ba0

m0zg over 5 years ago
If PyTorch had a viable way to convert models to run on a mobile GPU or DSP, that's all I'd ever use. Currently I have to do my research in PyTorch and then laboriously port to TF to convert to TFLite, which kinda sucks because TF is full of bugs, and there are gotchas due to differences in how ops are implemented.
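
One partial workaround that existed at the time was exporting through ONNX and handing the result to a downstream converter, though op coverage is exactly where the gotchas mentioned above tend to appear. A hedged sketch, with a placeholder model:

```python
import torch
import torch.nn as nn

# Placeholder model standing in for a real vision network.
model = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU()).eval()

# Export to ONNX; mobile/DSP toolchains that accept ONNX take it from here.
dummy = torch.randn(1, 3, 224, 224)
torch.onnx.export(
    model, dummy, "model.onnx",
    input_names=["image"], output_names=["features"],
    opset_version=11,  # op support differs per opset; mismatches cause the usual gotchas
)
```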

minimaxir over 5 years ago
It's somewhat disappointing that research is the primary motivator for the switch. PyTorch still has a ways to go in tooling for toy usage of models and *deployment* of models to production compared to TensorFlow (incidentally, GPT-2, the most public of OpenAI's released models, uses TensorFlow 1.x as a base). For AI newbies, I've seen people recommend PyTorch over TensorFlow just because "all the big players are using it," without listing the caveats.

The future of AI research will likely be interoperability between multiple frameworks to support both needs (e.g. HuggingFace Transformers, which started as PyTorch-only but now also supports TF 2.x with relative feature parity).
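
As a concrete example of that interoperability, HuggingFace Transformers can load the same pretrained GPT-2 checkpoint into either backend. A minimal sketch, assuming both torch and tensorflow are installed and that the library version exposes these classes:

```python
from transformers import GPT2Tokenizer, GPT2LMHeadModel, TFGPT2LMHeadModel

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
input_ids = tokenizer.encode("PyTorch and TensorFlow", return_tensors="pt")

# The same checkpoint loaded into both backends.
pt_model = GPT2LMHeadModel.from_pretrained("gpt2")    # PyTorch weights
tf_model = TFGPT2LMHeadModel.from_pretrained("gpt2")  # TensorFlow 2.x

logits = pt_model(input_ids)[0]  # first element of the output is the LM logits
print(logits.shape)
```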

sillysaurusx over 5 years ago
This is a surprisingly unintelligent move by OpenAI. It adds corporate inertia to something as mundane as the choice of DL framework.

Imagine you worked at OpenAI. Imagine you wanted to experiment with JAX, and that it turned out to be the best solution for the problem. Now you can't ship without a solid technical justification.

Except it's not really a technical justification that you need. You need corporate clout. You can't just be a junior engineer and make a decision that goes against corporate policy. That's the point of having a corporate policy.

I can hear a thousand people about to type "C'mon, OpenAI isn't a normal corporation." But it is. Every corporation is a normal corporation. And having policies against specific tech should make productive programmers pause.

People get jobs at companies based on whether they use React or Vue, for example. And in DL, a programming library is basically a programming language, so it's one step more powerful than that.

Here's an example. PyTorch, as far as I can tell, doesn't support running code on a TPU's CPU. (I could be wrong about this!) When you enumerate the list of accelerators available after connecting to a TPU, you get a list of 8 entries. That means they only support executing code on the *cores* of a TPU, not the TPU's CPU. This is a huge difference: it means you're restricted to 8 GB on TPUv2-8s (which you get on Colab) instead of 300 GB.

Does that count as a solid technical justification to use TensorFlow for a research project instead of PyTorch? Who knows. But who wants to be the odd one out on corporate politics? Especially if a project doesn't generate any tangible results, which is often the case in research.

zackmorris over 5 years ago
Just FYI, I looked at PyTorch for the first time now, and unfortunately they require macOS users to build it from source in order to get CUDA support:

https://pytorch.org/get-started/locally/

Please, if someone at PyTorch is reading this, put in a request to make CUDA support the default on macOS.

Also, it looks like PyTorch doesn't currently support OpenCL:

https://github.com/pytorch/pytorch/issues/488

I can't tell from the issue comments whether it's been added yet or whether they plan to use Intel's oneAPI or similar.

To me, these are prerequisites for switching to PyTorch. Hopefully someone can clarify the state of these, thanks!
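
On the build question above, the quickest way to see whether an installed PyTorch was compiled with CUDA support is to ask the runtime directly; a small check along these lines, with nothing project-specific assumed:

```python
import torch

print(torch.__version__)
print(torch.version.cuda)         # None on a CPU-only build, e.g. the default macOS wheel
print(torch.cuda.is_available())  # False without a CUDA build plus a compatible GPU/driver

# Code written against an explicit device runs either way.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
x = torch.randn(2, 2, device=device)
```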

tastyminerals over 5 years ago
I think it was just a matter of time until TF got superseded by PyTorch. The only reason we kept TF in production is the Java API, which allowed us to quickly load and serve TF models. Back in the day I spent so many sleepless nights trying to port a Torch model to TF and make it work the same as the Lua-based prototype. The whole TF "experience" made us switch to plain Python services, throwing away all the boilerplate Scala/Java code for TF. It doesn't happen often in tech that the better-engineered product eventually gets more traction and recognition, and I am glad that PyTorch did.

bitL over 5 years ago
I believe these days one has to know both TensorFlow (Keras) and PyTorch: most new research is in PyTorch and most deployments are in TensorFlow. Academia can afford to run on PyTorch only, and stable businesses on TensorFlow only, but individual developers need to know both.

klowrey over 5 years ago
For folks interested in Julia and RL, I've been involved in https://www.lyceum.ml/, a set of tools for continuous control problems like robotics.

It's pretty quick.

syntaxing over 5 years ago
Has anyone taken the course mentioned, "Spinning Up in Deep RL"? I've been meaning to learn some deep RL and I was wondering if this is the best first step.