科技回声

8 comments

minimaxir, over 6 years ago

This actively developed and maintained package is a good step toward furthering RL development; as the writeups state, the biggest problem in RL is subtle implementation bugs that don't raise an error but tank learning performance. (Plus, it includes loggers/utils to help debug things.)

Granted, a lot of RL thought pieces/examples on places like Medium.com take an existing RL implementation without many tweaks, run it on a new task, and see what happens. A better RL library might make this workflow more prevalent, which is why it's very important for researchers to make their pipelines transparent.


dimitry12, over 6 years ago

This is awesome, and I hope it will allow more people to experiment with algorithms instead of only re-applying OpenAI's Baselines. Baselines is great, but very hard (for me, at least) to tinker with.

It helps me understand something new if I can controllably break it. In other words, I progress by predicting the edge conditions under which something shouldn't work, and then testing whether the algorithm indeed experienced the expected type of failure. A transparent algorithm implementation is key for this.

One thing I immediately checked in the spinningup repo is whether it uses TF Eager. It doesn't. @OpenAI, what's your reasoning for that?


browsercoin, over 6 years ago

Whenever I see a high-quality submission, I bookmark it and promise myself to come back and spend time learning it. This time... I promise myself it's different.

pretty_dumm_guy, over 6 years ago

Hi! I really appreciate you sharing this with the community. The documents and code look really clear and concise. I do have one question: is it possible to change the dependency on the MuJoCo engine to something else (e.g., Roboschool)?

I don't have access to a computer with a GPU, and I am currently using Google Colab for my DL projects. I tried installing MuJoCo on Colab, but unfortunately the computer ID it generated seems to be invalid. Any help is highly appreciated.

Thank you!


nshr, over 6 years ago

This looks like an awesome initiative! I think it will be very valuable for people trying to enter the field. I particularly like the clear advice on how to get started doing RL research. Have you considered setting up a forum for the community to share their experiences?

rcshubhadeep, over 6 years ago

I discovered two small issues in the docs. Where can I send feedback?


wnevets, over 6 years ago

Is there a Dockerfile with everything set up already?


enygmata, over 6 years ago

I thought this was something about roguelikes. :(

Spinning Up in Deep RL
