Why aren't LLMs trained on action / cause+effect data vs. just analytical stuff?

3 points by purplerabbit 8 months ago
Stupid question, but if we want models that are capable of *doing* things (agents) vs just spitting out interesting content, why isn't anyone training them on data that represents actions?

Models are incredible at generating analytical / blog-ish / Stack Overflow-ish content, but suck at doing things that are complex enough that they require iteration.

For instance: If we want models that can handle complex projects, why don't we record actions taken in the execution of complex projects, and train models on that? Or if we want models that can use a browser competently, why don't we train models on screenshots + action descriptions? (Or is this what was done with o1, which is why it seems to have unprecedented capabilities?)

Is the problem just getting high-quality data? I know we've got internet dumps full of blog-ish content, but no big, easy-to-gather dumps of high-quality information about actions or chains of actions and their effects over time.

(I'm sure there are tons of framing problems in this question -- sorry)

2 comments

dtagames 8 months ago
What you're describing isn't how GPT training works. Mostly, they work on *next token prediction* without having any understanding of what those tokens actually mean. It works well for text and images but it can't lead to a reproducible set of steps.

I wrote an article[0] about it recently that you might enjoy.

[0] Something From Nothing | A Painless Approach to Understanding AI

https://medium.com/gitconnected/something-from-nothing-d755f49d6636
wmf 8 months ago
This is starting to happen; they're calling them Large Action Models.
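
For readers who want something concrete: the question proposes training on recorded actions, and the first reply points to next token prediction as the training objective. The two are not necessarily at odds, since an action log can be flattened into a token sequence and fed to the same objective. The sketch below is only an illustration of that idea, not how any particular model (o1 or a "Large Action Model") is actually trained. The trace format, the token names such as `page:home` and `click:inbox`, and the helper functions are all hypothetical, and a toy bigram counter stands in for a real neural network.

```python
from collections import Counter, defaultdict

# Hypothetical recorded browser sessions: each step is (observation, action).
# The token names are made up for illustration only.
traces = [
    [("page:login_empty", "type:username"),
     ("page:login_filled", "click:submit"),
     ("page:home", "click:inbox")],
    [("page:login_empty", "type:username"),
     ("page:login_filled", "click:submit"),
     ("page:home", "click:settings")],
    [("page:login_empty", "type:username"),
     ("page:login_filled", "click:submit"),
     ("page:home", "click:inbox")],
]


def serialize(trace):
    """Flatten a trace into one token sequence: obs, action, obs, action, ..."""
    tokens = ["<start>"]
    for observation, action in trace:
        tokens.append(observation)
        tokens.append(action)
    tokens.append("<end>")
    return tokens


def train_bigram(sequences):
    """Count how often each token follows each other token."""
    counts = defaultdict(Counter)
    for seq in sequences:
        for prev, nxt in zip(seq, seq[1:]):
            counts[prev][nxt] += 1
    return counts


def predict_next(counts, token):
    """Return the most frequent follower of `token`, or None if unseen."""
    if token not in counts:
        return None
    return counts[token].most_common(1)[0][0]


model = train_bigram([serialize(t) for t in traces])

# Given the home page as the latest observation, predict the next action.
print(predict_next(model, "page:home"))  # click:inbox (seen 2 of 3 times)
```

A bigram counter only looks one token back, so it cannot capture the multi-step iteration the question is asking about; it just shows that "action data" and "next token prediction" can share one data format. Whether enough high-quality traces exist to train a real model this way is the open question raised in the original post.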