TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

© 2025 TechEcho. All rights reserved.

Yann LeCun, Pioneer of AI, Thinks Today's LLMs Are Nearly Obsolete

124 points by alphadelphi, 2 months ago

10 comments

antirez, 2 months ago
As LLMs do things previously thought impossible, LeCun adjusts his statements about them, but his credibility keeps dropping. He started by saying LLMs were just predicting words with a probabilistic model, basically a better Markov chain. It was already pretty clear this wasn't the case, since even GPT-3 could do summarization well enough, and there is no purely probabilistic link between the words of a text and the gist of its content; he was still saying this around the time of GPT-3.5, I believe. Then he adjusted that view when talking publicly with Hinton, saying "I don't deny there is more than just a probabilistic thing...". His new position: no longer merely probabilistic, but they can only regurgitate things they saw in the training set, often explicitly telling people that novel questions could NEVER be solved by LLMs, citing prompts that failed at the time, and so forth. Now reasoning models can solve problems they never saw, and o3 made huge progress on ARC, so he adjusted again: for AGI we will need more. And so on.

So at this point it does not matter what you believe about LLMs: in general, trusting LeCun's word is not a good idea. Add to this that LeCun directs an AI lab that at the same time has the following huge issues:

1. The weakest LLM among the big labs with similar resources (and with smaller resources: DeepSeek).

2. They say they are focusing on open-source models, but their license is among the least open of the available open-weight models.

3. LLMs, and the new AI wave in general, put CNNs, a field LeCun worked on a lot (but didn't start himself), in perspective; it's now just one chapter in a book composed mostly of other techniques.

Btw, other researchers who were on LeCun's side changed sides recently, saying that now "it's different" because of CoT, which is the symbolic reasoning they were babbling about before. But CoT is still autoregressive next-token prediction without any architectural change, so, no, they were wrong too.
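The comment's point about CoT can be made concrete with a toy sketch (the `next_token` function below is a hypothetical stand-in for a trained model's forward pass, not any real API): chain-of-thought output comes from the same autoregressive loop as any other completion, one token at a time, each conditioned only on the prefix.

```python
def next_token(prefix):
    """Stand-in for an LLM forward pass: returns the next token for a prefix.
    A real model would compute a distribution over a vocabulary here; this
    toy version just replays a canned continuation."""
    canned = {
        (): "Let's",
        ("Let's",): "think",
        ("Let's", "think"): "step",
        ("Let's", "think", "step"): "by",
        ("Let's", "think", "step", "by"): "step.",
    }
    return canned.get(tuple(prefix), "<eos>")

def generate(max_tokens=10):
    tokens = []
    for _ in range(max_tokens):
        tok = next_token(tokens)   # fixed amount of compute per token
        if tok == "<eos>":
            break
        tokens.append(tok)         # "reasoning" text is just more tokens
    return " ".join(tokens)

print(generate())  # -> Let's think step by step.
```

Whatever one concludes from it, the loop itself is unchanged by CoT prompting: the "reasoning" lives entirely in the emitted tokens, not in a different architecture.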
gsf_emergency_2, 2 months ago
Recent talk: https://www.youtube.com/watch?v=ETZfkkv6V7Y

LeCun, "Mathematical Obstacles on the Way to Human-Level AI"

Slide ("Why autoregressive models suck"):
https://xcancel.com/ravi_mohan/status/1906612309880930641
djoldman, 2 months ago
The idolatry and drama surrounding LeCun, Hinton, Schmidhuber, etc. is likely a distraction. This includes their various predictions.

More interesting is their research work. JEPA is what LeCun is betting on:

https://ai.meta.com/blog/v-jepa-yann-lecun-ai-model-video-joint-embedding-predictive-architecture/
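For readers unfamiliar with the idea, here is a rough numeric sketch of a JEPA-style objective (all names, shapes, and the linear encoders are illustrative assumptions, not Meta's actual architecture): rather than reconstructing the target signal itself, a predictor is trained to match the target's *embedding* from the context's embedding.

```python
import numpy as np

rng = np.random.default_rng(0)

def encode(x, W):
    # stand-in encoder: a linear map plus nonlinearity into embedding space
    return np.tanh(W @ x)

# toy "two views of the same scene": context x and a slightly perturbed target y
x = rng.normal(size=4)
y = x + 0.1 * rng.normal(size=4)

W_ctx = rng.normal(size=(3, 4))  # context encoder weights
W_tgt = rng.normal(size=(3, 4))  # target encoder (in practice an EMA copy)
P = rng.normal(size=(3, 3))      # predictor operating in embedding space

# JEPA-style loss: predict the target's embedding, not its pixels/tokens
pred = P @ encode(x, W_ctx)
loss = float(np.mean((pred - encode(y, W_tgt)) ** 2))
print(f"embedding-space prediction loss: {loss:.4f}")
```

The design point the blog post emphasizes is exactly this: prediction happens in representation space, so the model is not forced to model every unpredictable detail of the raw input.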
redox99, 2 months ago
LeCun has been very salty about LLMs ever since ChatGPT came out.
csdvrx, 2 months ago
> Returning to the topic of the limitations of LLMs, LeCun explains, "An LLM produces one token after another. It goes through a fixed amount of computation to produce a token, and that's clearly System 1—it's reactive, right? There's no reasoning," a reference to Daniel Kahneman's influential framework that distinguishes between the human brain's fast, intuitive method of thinking (System 1) and the method of slower, more deliberative reasoning (System 2).

Many people believe that "wants" come first and are then followed by rationalizations. It's also a theory supported by medical imaging.

Maybe LLMs are a good emulation of System 2 (their performance suggests they are), and what's missing is System 1, the "reptilian" brain, based on emotions like love, fear, aggression, etc.

For all we know, System 1 could use the same embeddings and just work in parallel, producing tokens that are used to guide System 2.

Personally, I trust my "emotions" and "gut feelings": I believe they are things "not yet rationalized" by my System 2, coming straight from my System 1.

I know it's very unpopular among nerds, but it has worked well enough for me!
bitethecutebait, 2 months ago
there's a bunch of stuff imperative to his thriving that became obsolete to others 15 years ago ... maybe it's time for a few 'sabbatical' years ...
ejang0, 2 months ago
"[Yann LeCun] believes [current] LLMs will be largely obsolete within five years."
GMoromisato, 2 months ago
I remember reading Douglas Hofstadter's Fluid Concepts and Creative Analogies [https://en.wikipedia.org/wiki/Fluid_Concepts_and_Creative_Analogies].

He wrote about Copycat, a program for understanding analogies ("abc is to 123 as cba is to ???"). The program worked at the symbolic level, in the sense that it hard-coded a network of relationships between words and characters. I wonder how close he was to "inventing" an LLM? The insight he needed was that instead of hard-coding patterns, he should have just trained on a vast set of patterns.

Hofstadter focused on Copycat because he saw pattern-matching as the core ability of intelligence. Unlocking that, in his view, would unlock AI. And, of course, pattern-matching is exactly what LLMs are good for.

I think he's right. Intelligence isn't about logic. In the early days of AI, people thought that a chess-playing computer would necessarily be intelligent, but that was clearly a dead end. Logic is not the hard part. The hard part is pattern-matching.

In fact, pattern-matching is all there is: that's a bear, run away; I'm in a restaurant, I need to order; this is like a binary tree, I can solve it recursively.

I honestly can't come up with a situation that calls for intelligence that *can't* be solved by pattern-matching.

In my opinion, LeCun is moving the goal-posts. He's saying LLMs make mistakes and therefore they aren't intelligent and aren't useful. Obviously that's wrong: humans make mistakes and are usually considered both intelligent and useful.

I wonder if there is a necessary relationship between intelligence and mistakes. If you can solve a problem algorithmically (e.g., long division), then there won't be mistakes, but you don't need intelligence (you just follow the algorithm). But if you need intelligence (because no algorithm exists), then there will always be mistakes.
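The Copycat example above can be reduced to a deliberately naive sketch (this is not Copycat's actual mechanism, which used a rich concept network and stochastic search; the `solve_analogy` helper here is purely illustrative): a literal character-for-character mapping learned from one example, which also shows why hard-coded symbolic mappings are brittle.

```python
def solve_analogy(source, target, query):
    """'source is to target as query is to ???' via a literal
    character-for-character mapping learned from the one example."""
    mapping = dict(zip(source, target))
    return "".join(mapping[ch] for ch in query)

print(solve_analogy("abc", "123", "cba"))  # -> 321

# The brittleness is immediate: any symbol outside the hard-coded
# mapping fails outright, which is why Copycat needed its concept
# network, and why "train on a vast set of patterns" was the leap.
try:
    solve_analogy("abc", "123", "abd")
except KeyError:
    print("no mapping for 'd'")
```

The failure case is the interesting part: the symbolic mapping cannot generalize even one character beyond what it was given, which is the gap between this toy and both Copycat and LLMs.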
grandempire, 2 months ago
Is this the guy who tweets all day and gets in online fights?
asdev, 2 months ago
Outside of text generation and search, LLMs have not delivered any significant value.