TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Spinning Up in Deep RL

169 点作者 stablemap超过 6 年前

8 条评论

minimaxir超过 6 年前
This developed-and-maintained package is a good approach towards furthering RL development; as the writeups state, the biggest problem in RL is subtle bugs from an implementation which don&#x27;t cause an error but tank learning performance. (+ loggers&#x2F;utils to help debug things)<p>Granted, a lot of RL thought pieces&#x2F;examples on places like Medium.com take an existing RL implementation without many tweaks, run it on a new task, and see what happens. A better RL library might make this workflow more prevalent; hence why it&#x27;s very important for researchers to make their pipelines transparent.
评论 #18409007 未加载
评论 #18409001 未加载
dimitry12超过 6 年前
This is awesome and I hope will allow more people to <i>experiment</i> with algorithms, instead of only re-applying OpenAI&#x27;s baselines. Baselines are great, but are very hard (for me, at least) to tinker with.<p>It helps me to understand something new if I can controllably break it. In other words, I progress by predicting the edge-conditions when something shouldn&#x27;t work - and then testing if algorithm indeed experienced expected type of failure. Transparent algorithm implementation is key for this.<p>One thing, which I immediately checked in the spinningup-repo is if it uses TF Eager. And it doesn&#x27;t. @OpenAI what&#x27;s your reasoning for that?
评论 #18410249 未加载
browsercoin超过 6 年前
whenever I see high quality submissions I bookmark it and promise myself to come back and spend time learning it.<p>this time...I promise myself its different
pretty_dumm_guy超过 6 年前
Hi! I really appreciate you sharing this with the community. The documents and code look really clear and concise. I do have one question. Is it possible to change the dependency on Mujoco engine to something else (to for e.g. Roboschool)?<p>I don&#x27;t have access to a computer with GPU and I am currently using google colab to do my DL projects. I tried installing Mujoco on colab but unfortunately, the computer id generated seems invalid. Any help is highly appreciated.<p>Thank you!
评论 #18411788 未加载
nshr超过 6 年前
This looks like an awesome initiative! I think it will be very valuable for people trying to enter the field. I particularly like the clear advice on how to get started doing RL research. Have you considered setting up a forum for the community to share their experiences?
rcshubhadeep超过 6 年前
Discovered two small issues in the doc. Where can I send feedback?
评论 #18410714 未加载
wnevets超过 6 年前
is there a Dockerfile with everything set up already?
评论 #18410722 未加载
enygmata超过 6 年前
I thought this was something about roguelikes. :(