
XLA: linear algebra library for TensorFlow

176 points, by mud_dauber, about 8 years ago

7 comments

Marat_Dukhan, about 8 years ago

> Softmax can be implemented as a composition of primitive TensorFlow ops (exponent, reduction, elementwise division, etc.): softmax = exp(logits) / reduce_sum(exp(logits), dim)

No, it cannot be implemented this way: it is numerically unstable, and will produce NaNs if any input is greater than ~88.7. Luckily, that is also not how it's implemented in TensorFlow: https://github.com/tensorflow/tensorflow/blob/2c8d0dca978a246f54c506aae4587dbce5d3bcf0/tensorflow/core/kernels/softmax_op_functor.h#L43

For a clean (and more efficient) C version of this algorithm, take a look at the NNPACK reference implementation: https://github.com/Maratyszcza/NNPACK/blob/master/src/ref/softmax-output.c#L30
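The instability described above is easy to reproduce. A minimal NumPy sketch (not the actual TensorFlow kernel, which is linked above) contrasting the naive formula with the standard max-subtraction trick:

```python
import numpy as np

def naive_softmax(logits):
    # Direct translation of exp(logits) / reduce_sum(exp(logits)):
    # exp overflows float32 for inputs above ~88.7, yielding inf/NaN.
    e = np.exp(logits)
    return e / e.sum()

def stable_softmax(logits):
    # Subtracting the max first keeps every exponent <= 0, so exp
    # never overflows; the result is mathematically identical because
    # the exp(max) factor cancels in numerator and denominator.
    e = np.exp(logits - logits.max())
    return e / e.sum()

x = np.array([100.0, 101.0, 102.0], dtype=np.float32)
print(naive_softmax(x))   # all NaN: exp(100) overflows float32
print(stable_softmax(x))  # finite probabilities summing to 1
```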
theCricketer, about 8 years ago

Chris Leary, a compiler engineer at Google, gave a talk about XLA at the recent TensorFlow Dev Summit: https://www.youtube.com/watch?v=kAOanJczHA0
jakekovoor, about 8 years ago

Thank you OP, this is really helpful. :)

If you need to install TensorFlow on Windows 10, you can follow this: http://saintlad.com/install-tensorflow-on-windows/
visarga, about 8 years ago

It would seem Torch/PyTorch are faster than TF. TF uses static optimizations on the computation graph, while Torch has a dynamic computation graph. Logically, static optimizations should be faster because they know the data size beforehand.

So, why is TF slower?
shoshin23, about 8 years ago

I've been looking around in a few places, but I can't find a way to use XLA to compile TensorFlow models for mobile devices. Is there a tutorial or blog post by Google (or anyone, for that matter) talking about it? Thanks!
ndesaulniers, about 8 years ago

Even if you're not interested in machine learning or AI, XLA, and particularly its Python bindings, are a great and easy way to do GPGPU programming.
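As one illustration of driving XLA from Python, here is a minimal sketch using JAX, which uses XLA as its compiler backend (JAX is an assumption of this sketch, not something the comment names; the thread predates it):

```python
import jax
import jax.numpy as jnp

@jax.jit  # trace the function once, then compile it with XLA
def saxpy(a, x, y):
    # a*x + y is fused by XLA into a single kernel on the backing
    # device (CPU, GPU, or TPU), rather than two separate ops.
    return a * x + y

x = jnp.arange(1024.0)
y = jnp.ones(1024)
out = saxpy(2.0, x, y)  # first call compiles; later calls reuse the kernel
```

The same `jit`-compiled function runs unchanged on whatever accelerator JAX finds, which is what makes this style of GPGPU programming accessible without writing CUDA by hand.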
probdist, about 8 years ago

Why does this support JIT but not AOT for NVIDIA GPUs?