TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

I Trained a Deep Q Network Built in TensorFlow to Play Atari Pong

39 点作者 superfx超过 9 年前

3 条评论

smhx超过 9 年前
This is simply a transpile &#x2F; reproduction of the original Torch version from Deepmind, but in TensorFlow. It doesn&#x27;t really do anything new or different compared to the paper by [Mnih et. al.](<a href="http:&#x2F;&#x2F;www.nature.com&#x2F;nature&#x2F;journal&#x2F;v518&#x2F;n7540&#x2F;full&#x2F;nature14236.html" rel="nofollow">http:&#x2F;&#x2F;www.nature.com&#x2F;nature&#x2F;journal&#x2F;v518&#x2F;n7540&#x2F;full&#x2F;nature1...</a>) .
评论 #10790508 未加载
nullc超过 9 年前
Would be interesting to see if it could be regularized to make it a bit less twitchy. (E.g. by giving a fitness bonus to no action &#x2F; and-or penalizing many changes of direction within some time window.)
minimaxir超过 9 年前
Github page: <a href="https:&#x2F;&#x2F;github.com&#x2F;asrivat1&#x2F;DeepLearningVideoGames" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;asrivat1&#x2F;DeepLearningVideoGames</a>