39 points by superfx over 9 years ago

3 comments

smhx over 9 years ago

This is simply a port (a reproduction) of DeepMind's original Torch version to TensorFlow. It doesn't really do anything new or different compared to the paper by Mnih et al. (http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html).

nullc over 9 years ago

Would be interesting to see if it could be regularized to make it a bit less twitchy (e.g. by giving a fitness bonus to no action and/or penalizing many changes of direction within some time window).
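
One way to read that suggestion is as reward shaping. Below is a minimal sketch, not anything from the linked repository: the class name `ShapedReward`, the `NOOP` index, and the bonus, penalty, and window values are all hypothetical. It also treats any change of action as a stand-in for a "change of direction".

```python
from collections import deque

NOOP = 0  # assumption: index 0 is the "no action" choice


class ShapedReward:
    """Hypothetical reward shaper: bonus for idling, penalty for recent action switches."""

    def __init__(self, window=8, noop_bonus=0.01, switch_penalty=0.02):
        self.noop_bonus = noop_bonus
        self.switch_penalty = switch_penalty
        # Holds 1 for each recent step whose action differed from the previous one.
        self.switches = deque(maxlen=window)
        self.prev_action = None

    def reset(self):
        """Call at the start of every episode."""
        self.switches.clear()
        self.prev_action = None

    def __call__(self, action, env_reward):
        changed = self.prev_action is not None and action != self.prev_action
        self.switches.append(1 if changed else 0)
        self.prev_action = action

        shaped = env_reward
        if action == NOOP:
            shaped += self.noop_bonus
        # Penalty grows with the number of switches inside the sliding window.
        shaped -= self.switch_penalty * sum(self.switches)
        return shaped
```

The shaped value would replace the raw game reward when storing transitions for Q-learning. The coefficients probably need to stay small relative to the game's own reward, otherwise the agent may learn to sit still rather than play.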

minimaxir over 9 years ago

GitHub page: https://github.com/asrivat1/DeepLearningVideoGames

I Trained a Deep Q Network Built in TensorFlow to Play Atari Pong
