Note that it isn't being created from whole cloth, it is trained on videos of the places and then it is generating the frames:<p>"To improve autoregressive stability for this research preview, what we’re sharing today can be considered a narrow distribution model: it's pre-trained on video of the world, and post-trained on video from a smaller set of places with dense coverage. The tradeoff of this post-training is that we lose some generality, but gain more stable, long-running autoregressive generation."<p><a href="https://odyssey.world/introducing-interactive-video" rel="nofollow">https://odyssey.world/introducing-interactive-video</a>