I worry that "stochastic parrot" was premature: an idea sown early in the technology's development that will now be carried along through any advances made.<p>Basically, there is this innate idea that if the basic building blocks are simple systems with deterministic behavior, then the greater system can never be more than that. I've seen this in spades within the AI community: "It's just matrix multiplication! It's not capable of thinking or feeling!"<p>Which to me always felt more like a hopeful statement than a factual one. These guys have no idea what consciousness is (nobody does), nor any reference point for what exactly "thinking" or "feeling" is. They can't prove I'm not a stochastic parrot any more than they can prove whatever cutting-edge LLM isn't.<p>So while yes, present LLMs likely are just stochastic parrots, the same technology scaled up might bring us a model for which there actually is "something it is like to be" it, and we'll have everyone treating it with reckless carelessness because "it's just a stochastic parrot".
Topical tweet from 2018:<p>> Optimist: AI has achieved human-level performance!<p>> Realist: “AI” is a collection of brittle hacks that, under very specific circumstances, mimic the surface appearance of intelligence.<p>> Pessimist: AI has achieved human-level performance.<p><a href="https://twitter.com/dmimno/status/949302857651671040" rel="nofollow noreferrer">https://twitter.com/dmimno/status/949302857651671040</a>
>"stochastic parrot" is a term coined by Emily M. Bender in the 2021 artificial intelligence research paper "On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?"<p>This might be the first time the term was seen in an ’official’ context, but is it really the origin? It feels like the term has been hovering around for longer, and even Google Trends shows significant search trends way before 2021
Fun fact: philosopher Regina Rini referred to GPT-3 as a "statistical parrot" six months before the Bender et al paper came out: <a href="https://dailynous.com/2020/07/30/philosophers-gpt-3/#rini" rel="nofollow noreferrer">https://dailynous.com/2020/07/30/philosophers-gpt-3/#rini</a>
> They go on to note that because of these limitations, a learning machine might produce results which are "dangerously wrong"<p>I was initially thinking "well, yes, Nobel Prize for Stating the Obvious there", but it looks like the paper was written in the far-distant past of 2021, when LLMs were largely still in their babbling-obvious-nonsense stage, rather than the current state of the art, where they babble dangerously convincing nonsense. So, well, fair enough I suppose.<p>Amazing how fast progress has been there, though it's progress in an arguably rather worrying direction, of course.
LLMs are not stochastic though; they are deterministic and don't even require random numbers, right?<p>The term seems unfortunate in general because the models appear to do more than parroting. LLMs are more like the central pattern generators of the nervous system, able to flexibly create well-coordinated patterns when guided appropriately.
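To make the stochastic-versus-deterministic point concrete, here is a minimal sketch (plain numpy, a stand-in function rather than any real model's API): the forward pass is a deterministic map from context to a distribution over the next token, and randomness only enters if you sample from that distribution instead of taking the argmax.

```python
# Toy sketch: deterministic forward pass, optional stochastic sampling.
import numpy as np

rng = np.random.default_rng(0)
vocab = ["the", "cat", "sat", "mat"]

def next_token_logits(context):
    # Stand-in for a trained model: any fixed function of the context will do here.
    return np.array([0.3 * len(context), 1.0, 2.0, 0.5])

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

probs = softmax(next_token_logits(["the", "cat"]))  # same context -> same distribution, every time

greedy = vocab[int(np.argmax(probs))]               # deterministic decoding: no random numbers needed
sampled = vocab[rng.choice(len(vocab), p=probs)]    # stochastic decoding: depends on the RNG draw
print(greedy, sampled)
```

So greedy decoding really is deterministic; the "stochastic" part comes from the temperature/top-p sampling most deployments layer on top.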
The real question to me is this: in the next decade, as ML researchers roll out progressively more sophisticated systems, we can expect that generative systems, which may actually be "only stochastic parrots", are going to create works that would fool any reasonable human being.<p>At what point does a stochastic parrot fake it till it makes it? Does it even matter? We can imagine that, within 10 years, we'll have a fully synthetic virtual human simulator: a generative AI combined with a knowledge base, language parsing, audio and video recognition, basically a talking head that could join your next technical meeting and look like a full contributor. If that happens, will the Timnits and the Benders of the world admit that, perhaps, systems which are indistinguishable from a human may not just be parrots, or perhaps, that we are just sufficiently advanced parrots?<p>Seen from that perspective, the promoters of "stochastic parrots" would seem to be luddites and close-minded, as well as discouraging legitimate, important, and valuable scientific research.
In the end, it turned out the actual innovation was doing the opposite of what this paper recommended: scaling up the LLM, improving quality by throwing lots of data at it rather than curating, and limiting bias by RLHF rather than by picking the right datasets.<p>The organizations that listened to these people for even some amount of time got hosed. Google managed to oust this flock, but not before its AIs were so lobotomized that they are now widely renowned as the village idiot.<p>Ultimately, this paper is a triumph of branding over science. Read it if you'd like. But if you let these kinds of people into your organization, they'll cripple it. It costs a lot to get them out. Instead, simply never let them in.
I've got another word for it: recipe-fication.<p>Everything we revile about online recipe websites that spend 1000 words about the history of cooking before getting to the point, will be part and parcel of AI-written <i>anything</i>. It won't be properly proofread or edited by a human, because that would defeat the purpose.
Yoshua Bengio, Andrew Ng, Andrej Karpathy, and many other top researchers in the field do not believe these models are stochastic parrots; they believe the models have internal world models and that prompts are methods of probing those world models. "Stochastic parrots" is one of the dumbest takes in AI/ML.
I’d argue that all these models are stochastic parrots because they’re not embodied in any way. There is no way they can actually understand what they are talking about in any sense that is tied back to the physical world.<p>What these LLMs and diffusion models and such actually are is a lossy compression method that permits structured queries. The fact that they can learn structure as well as content allows them to reason as well, but only to the extent that the rules they’re following existed somewhere in the training data and its structure.<p>If one were given access to senses and memory and feedback mechanisms and learned language that way, it might be considered actually intelligent, or even sentient if it exhibited autonomy and value judgments.
A nice paper: "Meaning without reference in large language models"<p>"we argue that LLMs likely capture important aspects of meaning, and moreover work in a way that approximates a compelling account of human cognition in which meaning arises from conceptual role"<p><a href="https://arxiv.org/pdf/2208.02957.pdf" rel="nofollow noreferrer">https://arxiv.org/pdf/2208.02957.pdf</a><p>It reminds me of Quine's meaning holism, which seems related:<p><a href="https://en.wikipedia.org/wiki/Semantic_holism" rel="nofollow noreferrer">https://en.wikipedia.org/wiki/Semantic_holism</a>
TL;DR: the focus on the <i>implementation</i> details, and descriptions like this, is detrimental, even perilous, because such accounts are both accurate and deeply misleading.<p>This is description, but it is neither predictive nor explanatory. It <i>implies</i> a false model rather than providing one.<p>Evergreen:<p><i>Ximm's Law</i>: every critique of AI assumes to some degree that contemporary implementations will not, or cannot, be improved upon.
Lemma: any statement about AI which uses the word "never" to preclude some feature from future realization is false.
From the article: A "stochastic parrot", according to Bender, is an entity "for haphazardly stitching together sequences of linguistic forms … according to probabilistic information about how they combine, but without any reference to meaning."<p>It seems to me that the great success transformers are now enjoying is precisely due to the fact that 'probabilistic information about how they combine' _is_ meaning.
This also relates to vision models. The existence of adversarial attacks (e.g. imperceptible changes in the image drastically changing the output) essentially demonstrates that the model has not reached the point at which the network "understands" the generalized concept it is supposed to distinguish.
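A rough FGSM-style sketch of that idea, on a toy linear "model" rather than a real vision network (and ignoring clipping pixel values back to [0, 1]): a perturbation that is tiny per pixel but aligned with the weight/gradient direction accumulates across many pixels and flips the prediction.

```python
# Toy adversarial example against a linear classifier (FGSM-style step).
import numpy as np

rng = np.random.default_rng(0)
d = 10_000                                   # number of "pixels"
w = rng.choice([-0.01, 0.01], size=d)        # toy linear model: predict class 1 if w @ x > 0
x = rng.uniform(0.0, 1.0, size=d)            # a random "image" with pixels in [0, 1]

clean_score = w @ x                          # typically small in magnitude
eps = 0.05                                   # tiny change per pixel

# Step each pixel by eps against the current prediction, in the direction of w.
x_adv = x - eps * np.sign(w) * np.sign(clean_score)

# The score moves by eps * ||w||_1 = 5, which swamps the clean score and flips the sign,
# even though no single pixel changed by more than 0.05.
print(clean_score, w @ x_adv)
```

The perturbation exploits the geometry of the decision boundary rather than anything about the underlying concept, which is the sense in which the model hasn't "understood" it.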