科技回声

carlmcqueen大约 6 年前

Not to spoil the article for anyone but..<p>Pretty in depth article and well laid out explanation of natural gradient descent with a small pre-fit dataset for a conclusion of 'too computationally expensive for machine learning/big data world'.<p>This is what I struggled with in school. You'd spend a class week learning some tough stuff only to be told 'this is no longer done, better methods are now used.'<p>Sometimes the work is needed to allow you to understand why/how the new method is used, but in many cases I didn't find that to be true.

评论 #19430821 未加载

评论 #19433828 未加载

评论 #19432294 未加载

评论 #19435389 未加载

dfan大约 6 年前

<a href="https://towardsdatascience.com/its-only-natural-an-excessively-deep-dive-into-natural-gradient-optimization-75d464b89dbb" rel="nofollow">https://towardsdatascience.com/its-only-natural-an-excessive...</a> is another nice overview complementary to this one.

xtacy大约 6 年前

This series is well written. The speed up is expected due because you're incorporating second-order information during the optimisation. For more insight into second order optimisation methods, take a look at Newton's method: <a href="https://en.wikipedia.org/wiki/Newton%27s_method" rel="nofollow">https://en.wikipedia.org/wiki/Newton%27s_method</a>. The intuition, derivation, and proof of correctness and convergence speed are quite illuminating.

评论 #19432890 未加载

Natural Gradient Descent (2018)

3 条评论

Natural Gradient Descent (2018)

3 条评论