科技回声

7 条评论

crapflare将近 7 年前

<a href="https://www.alexirpan.com/2018/02/14/rl-hard.html" rel="nofollow">https://www.alexirpan.com/2018/02/14/rl-hard.html</a> Reinforcment learning for the average person is a big waste of time. Probably for anyone atm

评论 #17258283 未加载

评论 #17261164 未加载

curiousgal将近 7 年前

This is mostly just a preview to a codecamp.

评论 #17258067 未加载

yonkshi将近 7 年前

I think "learn" is a bit misleading here but I do have to say it's a nice and intuitive overview of RL. RL is quite hard and math heavy, I don't know if one can take a short cut in learning RL without solid graduate level math foundation.

评论 #17258365 未加载

评论 #17258557 未加载

评论 #17258216 未加载

master_yoda_1将近 7 年前

Is this a click bait article? I wish I have AD BLOCKER plus plus to block this kind of &*## $#!^ :(

评论 #17258623 未加载

ninjamayo将近 7 年前

Just get Sutton’s and Barto’s book.

评论 #17258064 未加载

setzer22将近 7 年前

A small tangential criticism, but using "deep" every other sentence and especially expressions like "classical deep learning" made me take this article less seriously.<p>This is not unique to this author, sadly. I'm tired of seeing the d word thrown in research papers just for the sake of adding more buzzwords per buzzword.<p>Once you've made clear you are using neural networks with a lot of layers you can start using some variation in the discourse. Maybe just call them neural networks...

ogennadi将近 7 年前

There were so many technical terms, I'm surprised you could get through even an overview, and then practicals, in just 4 hours.<p>Do you know of any resources which list most of the common alternatives? e.g. what are the alternatives to a3c for parallelizing; or the alternatives to a2c for getting policy and value estimates?

7 条评论

crapflare将近 7 年前

评论 #17258283 未加载

评论 #17261164 未加载

curiousgal将近 7 年前

This is mostly just a preview to a codecamp.

评论 #17258067 未加载

yonkshi将近 7 年前

评论 #17258365 未加载

评论 #17258557 未加载

评论 #17258216 未加载

master_yoda_1将近 7 年前

Is this a click bait article? I wish I have AD BLOCKER plus plus to block this kind of &*## $#!^ :(

评论 #17258623 未加载

ninjamayo将近 7 年前

Just get Sutton’s and Barto’s book.

评论 #17258064 未加载

setzer22将近 7 年前

ogennadi将近 7 年前

Reinforcement Learning from scratch

7 条评论

Reinforcement Learning from scratch

7 条评论