A step-by-step guide to the “World Models” AI paper

261 points by datashrimp, about 7 years ago

8 comments

hardmaru, about 7 years ago
Hi, I'm one of the authors of this paper (https://arxiv.org/abs/1803.10122, https://worldmodels.github.io).

Happy to answer any questions you may have.
npr11, about 7 years ago
This is a neat paper - it's an interesting empirical result combining known techniques - but machine learning academics should really know better than to contribute to the over-hyping of results. For example, talking about "dreams" and "hallucinations" is not helpful - it doesn't make the work more accessible and only adds unnecessary hype.
bassman9000, about 7 years ago
"Our agent consists of three components that work closely together: Vision (V), Memory (M), and Controller (C)"

Next web frameworks are going to be smart!
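For readers following the thread, the way the three quoted components hand data to each other in a single rollout can be sketched roughly as below. The loop order (encode the observation, pick an action from the latent vector and hidden state, step the environment, update the hidden state) follows the rollout pseudocode in the paper; everything else here is a hypothetical toy stand-in, not the authors' implementation: `ToyVision`, `ToyMemory`, `ToyController`, `ToyEnv`, the vector sizes, and the reward function are all assumptions made for illustration.

```python
import random

Z_DIM, H_DIM = 4, 3  # toy sizes chosen for this sketch only


class ToyVision:
    """Stand-in for V (a VAE encoder in the paper): observation -> latent z."""

    def encode(self, obs):
        # Hypothetical compression: average the observation in Z_DIM chunks.
        chunk = len(obs) // Z_DIM
        return [sum(obs[i * chunk:(i + 1) * chunk]) / chunk for i in range(Z_DIM)]


class ToyMemory:
    """Stand-in for M (an MDN-RNN in the paper): carries a hidden state h."""

    def initial_state(self):
        return [0.0] * H_DIM

    def forward(self, a, z, h):
        # Hypothetical recurrence: leaky average of the action and latent mean.
        signal = a + sum(z) / len(z)
        return [0.9 * h_i + 0.1 * signal for h_i in h]


class ToyController:
    """Stand-in for C: a single linear map from [z, h] to a scalar action."""

    def __init__(self, weights):
        self.weights = weights

    def action(self, z, h):
        return sum(w * x for w, x in zip(self.weights, z + h))


class ToyEnv:
    """Hypothetical environment: reward is highest when the action is near 0.5."""

    def __init__(self, steps=10, seed=0):
        self.steps, self.rng = steps, random.Random(seed)

    def reset(self):
        self.t = 0
        return [self.rng.random() for _ in range(8)]

    def step(self, a):
        self.t += 1
        obs = [self.rng.random() for _ in range(8)]
        return obs, -abs(a - 0.5), self.t >= self.steps


def rollout(env, vision, memory, controller):
    """Run one episode and return the cumulative reward."""
    obs = env.reset()
    h = memory.initial_state()
    total, done = 0.0, False
    while not done:
        z = vision.encode(obs)            # V: compress what the agent sees
        a = controller.action(z, h)       # C: act on latent + memory state
        obs, reward, done = env.step(a)
        total += reward
        h = memory.forward(a, z, h)       # M: update memory with what happened
    return total


score = rollout(ToyEnv(), ToyVision(), ToyMemory(), ToyController([0.1] * (Z_DIM + H_DIM)))
```

Only the controller's weights would be tuned against `score` in a setup like this; the toy V and M above have nothing to train.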
make3, about 7 years ago
The original interactive blog post is also really awesome: https://worldmodels.github.io/
minimaxir, about 7 years ago
The post talks about running "video" on a remote server for the RL training, but not how to take that image and visualize it locally (which would be helpful for debugging failing models).

Let's say I wanted to run a Twitch stream of RL training on a remote server (and stream directly from the server to Twitch). What is the intended way to render the video in real time remotely?
BrandonSmithJ, about 7 years ago
Is this similar to Dyna-Q learning, but with modeling/simulation being handled by the RNN?

It looks like the VAE is just used to create a feature vector, so the main difference seems to be in the MDN-RNN - which is taking the place of the usual state/action simulation in Dyna-Q.
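For anyone who has not seen Dyna-Q, a minimal tabular sketch may make the comparison easier to follow. It shows the three ingredients the comment refers to: a direct Q-learning update from real experience, a learned model of transitions, and extra "planning" updates replayed from that model. The corridor environment, the hyperparameters, and the function name `dyna_q` are hypothetical choices for illustration; nothing here comes from the World Models paper, where, per the comment's reading, a learned MDN-RNN would stand in for the lookup-table `model`.

```python
import random
from collections import defaultdict


def dyna_q(n_states=6, episodes=30, planning_steps=10,
           alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Dyna-Q on a hypothetical 1-D corridor; the goal is the last state."""
    rng = random.Random(seed)
    actions = (-1, +1)          # move left / move right
    goal = n_states - 1
    q = defaultdict(float)      # Q[(state, action)], defaults to 0.0
    model = {}                  # learned model: (state, action) -> (reward, next_state)

    def env_step(s, a):
        # Deterministic corridor with walls at both ends; reward 1 at the goal.
        s2 = min(max(s + a, 0), goal)
        return (1.0 if s2 == goal else 0.0), s2

    def greedy(s):
        best = max(q[(s, a)] for a in actions)
        return rng.choice([a for a in actions if q[(s, a)] == best])

    def update(s, a, r, s2):
        # One-step Q-learning backup.
        target = r + gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (target - q[(s, a)])

    for _ in range(episodes):
        s = 0
        while s != goal:
            a = rng.choice(actions) if rng.random() < epsilon else greedy(s)
            r, s2 = env_step(s, a)          # real experience
            update(s, a, r, s2)             # direct RL update
            model[(s, a)] = (r, s2)         # model learning
            for _ in range(planning_steps): # planning on simulated experience
                ps, pa = rng.choice(list(model))
                pr, ps2 = model[(ps, pa)]
                update(ps, pa, pr, ps2)
            s = s2
    return q


q_values = dyna_q()
policy = {s: max((-1, +1), key=lambda a: q_values[(s, a)]) for s in range(5)}
```

After training, `policy` maps each non-goal state to the action with the higher learned value. The planning loop is the part that differs from plain Q-learning: it reuses remembered transitions instead of asking the environment again.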
hmate9, about 7 years ago
This post's author is fantastic. They break things down and explain everything very nicely.
flyingcircus3, about 7 years ago
Who decides what is the correct information to learn? What will prevent a bad actor from providing subject material that teaches people to bring harm to themselves or others? Post Traumatic Stress Disorder sounds, at least to the layman, like this very design pattern, but one that obviously reinforces undesirable subjects.