科技回声

6 条评论

Jack000大约 3 年前

2022: Deepmind releases paper on bootstrapped meta-learning and scaling RL agents<p>2023: RL agent trained for multi-task learning solves majority of perfect information games. It's a scaled up decision transformer. Scaling laws for RL agents are discovered, similar to language models.<p>2024: Large scale RL agents are combined with frozen vision and language models via cross-attention, can be prompted one-shot with language/vision tokens to solve novel tasks.<p>2025: RL agents enter the real world - first pre-trained in diverse synthetic environments, then via imitation learning from youtube videos, and finally in an online fashion via realtime human interaction.<p>timeline might be optimistic, but one can hope!

评论 #31199341 未加载

评论 #31195681 未加载

评论 #31196319 未加载

maxwells-daemon大约 3 年前

Wow! The ability to ingest the "cross product" of data on the internet and in the real world is huge; I bet a lot of what LMs don't know yet lives in that space. This seems a lot more general-purpose than CLIP, so I'm hopeful for even more impressive downstream applications, eg robotics.

goldenkey大约 3 年前

"I am not affected by this difference" - What The Fuck?!

bobbylarrybobby大约 3 年前

The conversations are scary. They almost don't seem believable -- did I miss the part where they say they're just an example of what a conversation might look like?

评论 #31194804 未加载

jcims大约 3 年前

I would love to hear some of the spine tingling moments these researchers experience when developing and interacting with large models.

razodactyl大约 3 年前

AI. Just casually evolving alongside and using us as their conduit. Lol

Tackling multiple tasks with a single visual language model

6 条评论

Tackling multiple tasks with a single visual language model

6 条评论