TechEcho

7 comments

crapflarealmost 7 years ago

<a href="https://www.alexirpan.com/2018/02/14/rl-hard.html" rel="nofollow">https://www.alexirpan.com/2018/02/14/rl-hard.html</a> Reinforcment learning for the average person is a big waste of time. Probably for anyone atm

评论 #17258283 未加载

评论 #17261164 未加载

curiousgalalmost 7 years ago

This is mostly just a preview to a codecamp.

评论 #17258067 未加载

yonkshialmost 7 years ago

I think "learn" is a bit misleading here but I do have to say it's a nice and intuitive overview of RL. RL is quite hard and math heavy, I don't know if one can take a short cut in learning RL without solid graduate level math foundation.

评论 #17258365 未加载

评论 #17258557 未加载

评论 #17258216 未加载

master_yoda_1almost 7 years ago

Is this a click bait article? I wish I have AD BLOCKER plus plus to block this kind of &*## $#!^ :(

评论 #17258623 未加载

ninjamayoalmost 7 years ago

Just get Sutton’s and Barto’s book.

评论 #17258064 未加载

setzer22almost 7 years ago

A small tangential criticism, but using "deep" every other sentence and especially expressions like "classical deep learning" made me take this article less seriously.<p>This is not unique to this author, sadly. I'm tired of seeing the d word thrown in research papers just for the sake of adding more buzzwords per buzzword.<p>Once you've made clear you are using neural networks with a lot of layers you can start using some variation in the discourse. Maybe just call them neural networks...

ogennadialmost 7 years ago

There were so many technical terms, I'm surprised you could get through even an overview, and then practicals, in just 4 hours.<p>Do you know of any resources which list most of the common alternatives? e.g. what are the alternatives to a3c for parallelizing; or the alternatives to a2c for getting policy and value estimates?

7 comments

crapflarealmost 7 years ago

评论 #17258283 未加载

评论 #17261164 未加载

curiousgalalmost 7 years ago

This is mostly just a preview to a codecamp.

评论 #17258067 未加载

yonkshialmost 7 years ago

评论 #17258365 未加载

评论 #17258557 未加载

评论 #17258216 未加载

master_yoda_1almost 7 years ago

Is this a click bait article? I wish I have AD BLOCKER plus plus to block this kind of &*## $#!^ :(

评论 #17258623 未加载

ninjamayoalmost 7 years ago

Just get Sutton’s and Barto’s book.

评论 #17258064 未加载

setzer22almost 7 years ago

ogennadialmost 7 years ago

Reinforcement Learning from scratch

7 comments

Reinforcement Learning from scratch

7 comments