All the AI founders (e.g., Dario Amodei) seem to believe that we're nowhere near the end of seeing performance improvements in LLMs as they are trained on more data (i.e., LLM scaling laws) - at least that's what they say publicly, but they obviously have skin in the game. Curious what knowledgeable people think who are not incentivized to make optimistic public statements?<p>What I really want to know is, assuming capital / compute is not a constraint, will be continue to see order of magnitude improvements in LLMs, or is there some kind of "technological" limit you think exists?
As far as I (ex-ML researcher) know, the main technological case that LLM performance will hit a limit is due to the amount of text data available to train on is limited. The ways these scaling laws work is they require 10x or 100x quantity of data to see major improvements.<p>This isn't necessarily going to limit it though. It's possible there are clever approaches to leverage much more data. This could either be through AI-generated data, other modalities (e.g. video) or another approach altogether.<p>This is quite a good accessible post on both sides of this discussion: <a href="https://www.dwarkeshpatel.com/p/will-scaling-work" rel="nofollow">https://www.dwarkeshpatel.com/p/will-scaling-work</a>
Research seems to suggest we need exponential training data volume increases to see meaningful performance gains: <a href="https://arxiv.org/abs/2404.04125" rel="nofollow">https://arxiv.org/abs/2404.04125</a><p>Personally I think we've already hit a ceiling.