Hi, I'm one of the authors of this paper (https://arxiv.org/abs/1803.10122, https://worldmodels.github.io). Happy to answer any questions you may have.
This is a neat paper - it's an interesting empirical result combining known techniques - but machine learning academics should really know better than to contribute to the over-hyping of results. For example, talking about "dreams" and "hallucinations" is not helpful - it doesn't make the work more accessible and only adds unnecessary hype.
"Our agent consists of three components that work closely together: Vision (V), Memory (M), and Controller (C)"

The next web frameworks are going to be smart!
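For anyone curious how those three pieces actually fit together, here is a rough sketch of one control step (my own reading of the paper, not the authors' code), using the CarRacing shapes: 64x64x3 frames, a 32-d latent z, a 256-d RNN hidden state, and a 3-d action. V and M are stubbed out as placeholders; only C is written the way the paper describes it, as a single linear layer.

```python
# Rough sketch of the V -> M -> C control loop (not the authors' code).
# Shapes follow the CarRacing experiment: 64x64x3 frame, 32-d z, 256-d h, 3-d action.
import numpy as np

Z_DIM, H_DIM, A_DIM = 32, 256, 3
rng = np.random.default_rng(0)

def vision_encode(frame):
    """V: stand-in for the VAE encoder (frame -> latent vector z)."""
    return rng.standard_normal(Z_DIM)                  # placeholder latent

def memory_step(z, a, h):
    """M: stand-in for the MDN-RNN hidden-state update."""
    return np.tanh(0.5 * h + 0.1 * rng.standard_normal(H_DIM))  # placeholder dynamics

# C: per the paper, the controller really is a single linear layer,
# a_t = W_c [z_t h_t] + b_c, kept tiny so it can be trained with CMA-ES.
W_c = 0.1 * rng.standard_normal((A_DIM, Z_DIM + H_DIM))
b_c = np.zeros(A_DIM)

def controller(z, h):
    return W_c @ np.concatenate([z, h]) + b_c          # clipping to action bounds omitted

h = np.zeros(H_DIM)
frame = np.zeros((64, 64, 3))                          # VAE input resolution in the paper
for t in range(10):
    z = vision_encode(frame)   # V: compress the observation
    a = controller(z, h)       # C: pick an action from z and the RNN state
    h = memory_step(z, a, h)   # M: update the model of what happens next
    # frame = env.step(a)[0] in the real loop; omitted here
```

The point of the split is that almost all the capacity sits in V and M, which are trained without rewards, while C stays small enough to be optimized with evolution strategies.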
The original interactive blog post is also really awesome: https://worldmodels.github.io/
The post talks about running "video" on a remote server for the RL training, but not how to take that image and visualize it locally (which would be helpful for debugging failing models).

Let's say I wanted to run a Twitch stream of RL training on a remote server (and stream directly from the server to Twitch). What is the intended way to render the video in real time remotely?
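Not the authors' intended workflow, just one way I'd sketch it: render frames server-side with Gym's rgb_array mode and pipe the raw RGB bytes into ffmpeg, which pushes an FLV/RTMP stream to Twitch. The environment name, frame rate, and ingest URL/stream key below are placeholders; on a fully headless box some environments (CarRacing included) still need a virtual display even for rgb_array, so running under xvfb-run may be required.

```python
# Rough sketch: stream Gym frames from a headless server to Twitch via ffmpeg.
# Assumes gym, numpy, and ffmpeg are installed; the RTMP URL/key are placeholders.
import subprocess
import gym

env = gym.make("CarRacing-v0")          # any env that supports rgb_array rendering
obs = env.reset()
frame = env.render(mode="rgb_array")    # H x W x 3 uint8 array
height, width, _ = frame.shape

ffmpeg = subprocess.Popen(
    [
        "ffmpeg",
        "-f", "rawvideo", "-pix_fmt", "rgb24",
        "-s", f"{width}x{height}", "-r", "30",       # input: raw frames at 30 fps
        "-i", "-",                                   # read frames from stdin
        "-c:v", "libx264", "-preset", "veryfast",
        "-pix_fmt", "yuv420p", "-f", "flv",
        "rtmp://live.twitch.tv/app/YOUR_STREAM_KEY", # placeholder ingest URL + key
    ],
    stdin=subprocess.PIPE,
)

done = False
while not done:
    action = env.action_space.sample()               # replace with your policy
    obs, reward, done, info = env.step(action)
    frame = env.render(mode="rgb_array")
    ffmpeg.stdin.write(frame.astype("uint8").tobytes())

ffmpeg.stdin.close()
ffmpeg.wait()
env.close()
```

The same frame loop also works for local debugging: write the frames to a file with ffmpeg instead of an RTMP URL, or save them with imageio, and pull the video down afterwards.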
Is this similar to Dyna-Q learning, but with modeling/simulation being handled by the RNN?

It looks like the VAE is just used to create a feature vector, so the main difference seems to be in the MDN-RNN - which is taking the place of the usual state/action simulation in Dyna-Q.
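For concreteness, M is an LSTM with a mixture-density output head: given z_t and a_t (and its hidden state h_t), it outputs a mixture of Gaussians over z_{t+1}, and sampling from that mixture is what the "dream" rollouts are made of. Below is a simplified sketch of just the output head (my own stand-in, ignoring the LSTM itself and the temperature parameter the paper adds at sampling time), with 5 mixture components per latent dimension as in the paper.

```python
# Rough sketch of a mixture-density (MDN) head over the next latent,
# i.e. p(z_{t+1} | a_t, z_t, h_t). Simplified: the paper's M is an LSTM
# with 5 mixture components per latent dimension and a sampling temperature.
import numpy as np

Z_DIM, H_DIM, K = 32, 256, 5            # latent size, RNN hidden size, mixtures
rng = np.random.default_rng(0)

# Output layer: for each latent dimension, K mixture logits, means, log-stddevs.
W_out = 0.01 * rng.standard_normal((Z_DIM * 3 * K, H_DIM))
b_out = np.zeros(Z_DIM * 3 * K)

def mdn_params(h):
    out = (W_out @ h + b_out).reshape(Z_DIM, 3, K)
    logit_pi, mu, log_sigma = out[:, 0], out[:, 1], out[:, 2]
    pi = np.exp(logit_pi - logit_pi.max(axis=-1, keepdims=True))
    pi /= pi.sum(axis=-1, keepdims=True)                 # softmax over the K components
    return pi, mu, np.exp(log_sigma)

def sample_next_z(h):
    """Sample z_{t+1} from the predicted mixture -- one step of a 'dream' rollout."""
    pi, mu, sigma = mdn_params(h)
    ks = np.array([rng.choice(K, p=pi[d]) for d in range(Z_DIM)])
    idx = np.arange(Z_DIM)
    return mu[idx, ks] + sigma[idx, ks] * rng.standard_normal(Z_DIM)

h = rng.standard_normal(H_DIM)           # stand-in for the LSTM state after seeing (z_t, a_t)
z_next = sample_next_z(h)                # 32-d sample of the next latent
```

Training the controller against samples from this head instead of the real environment is what the paper's "learning inside the dream" experiment does.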
Who decides what is the correct information to learn? What will prevent a bad actor from providing subject material that teaches people to bring harm to themselves or others? Post-Traumatic Stress Disorder sounds, at least to a layman, like this very design pattern, but one that obviously reinforces undesirable subjects.