Teaching ChatGPT to speak my son’s invented language

428 pointsby szopaabout 2 years ago

26 comments

og_kaluabout 2 years ago

In context learning is hands down the biggest breakthrough of LLMs. The flexibility the model displays without updating weights is genuinely mind blowing, bordering on absurd especially if you've trained other kinds of models before.See here - <a href="https://imgur.com/a/w3DAYOi" rel="nofollow">https://imgur.com/a/w3DAYOi</a> from the paper - <a href="https://arxiv.org/abs/2211.09066" rel="nofollow">https://arxiv.org/abs/2211.09066</a>GPT 3.5's (4 is much much better) addition accuracy tanks after 2 digits. However, by approaching arithmetic as an algorithm to be performed and taught similarly to how it's done with people, you can supercharge accuracy to basically 100% for up to 13 digit addition and >90% after.

评论 #35520502 未加载

评论 #35522514 未加载

评论 #35517790 未加载

评论 #35520363 未加载

评论 #35518212 未加载

评论 #35516822 未加载

评论 #35520395 未加载

评论 #35516743 未加载

crdrostabout 2 years ago

Wow, they asked the model to self-evaluate and it just outright cheated:<pre><code> He has three cats. Proposed: h’io’ngkiltrikumrikumrikumri’nguuy Correct: h’io’ngkiltri’ngkumrikumri’nguuy Points: 1 Hypothesis: N/A </code></pre> (Other comments observe that it accidentally compensated for this by getting the sum wrong, haha, d'oh)I have had similar problems with trying to get ChatGPT to do nontrivial things, "here are the rules for this game, do you understand this game, great, let's play it." And then it's like herding cats. "No that's wrong, the game pieces cannot leave the game board," "Oh my apologies you are entirely correct, here is the revised board (proceeds to dump the exact same state of the game board that I told it was wrong)." Eventually it will lie about its own capacities, "As an AI language model I am incapable of selecting a move to play next"... But you have done several already!!! This is literally the ONLY thing you have been doing right and now you refuse?Some other prompts are more successful but it does seem to have a sing-song high school book review style that inclines it to be boring... Very uncanny valley.

评论 #35518722 未加载

评论 #35519012 未加载

评论 #35523078 未加载

评论 #35518750 未加载

评论 #35518880 未加载

评论 #35520367 未加载

graypeggabout 2 years ago

Unrelated to the actual work here (which is interesting!), I just wanted to mention how much of an awesome impact you must be having on your son. You two seem like such a fun curious pair! I spent a lot of time tinkering away alone in my room as a kid and that sucked. The opportunities you’re giving him now to explore are going to inspire a lifetime of creativity.I know you don’t need someone on the internet to tell you that, but I thought I’d give kudos where it’s due :)

评论 #35521137 未加载

anon84873628about 2 years ago

>All of these differences can make it surprising and challenging for someone with an Indo-European language background to learn and use Kłeti.Ironically, Proto-Indo-European is believed to be far more complex than its modern descendants, as described by Wikipedia:>PIE is believed to have had an elaborate system of morphology that included inflectional suffixes (analogous to English child, child's, children, children's) as well as ablaut (vowel alterations, as preserved in English sing, sang, sung, song) and accent. PIE nominals and pronouns had a complex system of declension, and verbs similarly had a complex system of conjugation.So maybe a PIE speaker would have an easier time with Kłeti than we :-)

评论 #35517031 未加载

评论 #35518941 未加载

评论 #35523187 未加载

JCharanteabout 2 years ago

I would like to see this expanded, I think it's a bit unfair to assess its abilities with so few examples. My hypothesis is that a rosetta stone with a thousand examples with a vector database hooked up to it so you don't hit the 32k token context limit would lead to much better performance.

评论 #35516162 未加载

评论 #35516264 未加载

评论 #35517131 未加载

snickererabout 2 years ago

What a wonderful story and a great family. Congratulations!You inspired me to ask GPT to do it the other way around. I asked it to invent a successor to Esperanto, with influences from Slavic, Germanic, and Latin languages. It called its language Euroglossa and wrote a short story in it. Who can understand it?Un tag, en froliko vilaž, un jun chico namen Tomas trovat un misterioz mapo v star bibliotek. Na mapo, skribet: "Skarb de Tri Montes." Tomas decidet da sledit la mapo in aventuro, sperante da otkriti grand skarb.Tomas paket svoi rukzak s neobkhodim stvari, inkluziv kompas, binaukli, i nutrimento. Nachet svoi putovanje, iz vilaž, do la Tri Montes. Po nekoliko dni, on prishel k bazen na pervoj monte. Tam, on otkril zagadka, ki je klyuch za dalsi koraki: "Kogda solntse küsst la luna, dvigat kamen i vstretit un oko."Tomas wartet geduldig bis la sonne küsst la luna in noktchielo. Kvando moment venit, er raskt dvigat un gros stein u otkryvajet secret passaž. Ingressante, on sledit un dunkel tunel, portant un torča por iluminar svoi put. La tunel führt tief in monte, bis Tomas entdeckt un hider kammer s ančient skulpturen i un glänzend tresor.V la hider kammer, Tomas najde un skarbnica poln von stran objekte, inkluziv un animat statuo von un banane, ki parlat und tanzt. Zaskočen, ale smejšeći, Tomas realisat ke la glänzend tresor je ver la amikeco i rado ke li povas dividi kon svoi vilaž. La legenda "Skarb de Tri Montes" je veritabile un symbol za unite de diversa kulturi i lingvoj en un mondo plen da froliko i humor.

评论 #35523329 未加载

replwoacauseabout 2 years ago

I'm more impressed by your son than I am ChatGPT....TBH.

robgaabout 2 years ago

I am curious if the advent of GPT and LLMs allows linguistic theorists to adjudicate where we are with understanding the language instinct and settling the Chomsky vs Pinker vs Others debate.Perhaps it is entirely irrelevant as GLT has learned through billions of examples a child never could. Or perhaps it is totally relevant as it can synthesise billions of examples better than any linguist.

rhn_mk1about 2 years ago

Not trusting the models's self-assessment is the right call, considering that the actual score summed up to 7.5 compared to the self-reported 6.5 :)

评论 #35516307 未加载

droobyabout 2 years ago

Yeah I was think yesterday maybe we can start translating dolphin language.Someone get on that

评论 #35515950 未加载

dgritskoabout 2 years ago

The idea of asking it to produce an "ouroboros prompt" that can be fed back into itself summarizing everything already learned is very clever; definitely going to use that in future ChatGPT sessions of my own.

评论 #35518567 未加载

评论 #35519041 未加载

评论 #35517334 未加载

fcatalanabout 2 years ago

I've been trying a few things, some are very interesting.For example it understands Europanto* perfectly, but when I asked it to produce some it was germanic-only Europanto: English, German, Danish, Swedish... I told it to use more romance words and he came up with pure French. After some more prodding he achieved a decent mix.I also tried to get it to behave like an ersatz Duolingo for Basque and it sorta worked, but it would need some clever working on the prompts to really be usable.(*) Europanto is a joke language that uses random European language vocabulary on top of a generally English grammar.

vintermannabout 2 years ago

Once again illustrating that the powerful thing about ChatGPT is that no matter what you do, it does its best to play along. Its eyes do not glaze over.

评论 #35517366 未加载

评论 #35517379 未加载

评论 #35516631 未加载

DonHopkinsabout 2 years ago

Prompt: Write a poem in Simlish.ChatGPT-4:While it's important to remember that Simlish is a nonsensical language without a proper grammar or vocabulary, I can still attempt to create a playful poem using some known Simlish phrases and sounds. Keep in mind that this poem will not have any real meaning:<pre><code> Sul sul! Vadeesh, plurna zib, Badeesh! Dag dag, cribna drib, Froobi flazzle, noogit fraz, Glarbni weeb, frizzle maz. Lerpna gort, jibbly zor, Slooble wib, kwebna dor, Skibbi plurt, pleeba narn, Grubble veeb, shazzle darn. Wabble gret, minzi glarn, Flartle doo, zibba yarn, Yibbs frazzle, dribna glorp, Vorpti zound, greeble snorp </code></pre> Enjoy the playful sounds and rhythm of the poem, even though it lacks any real meaning!

评论 #35522229 未加载

6510about 2 years ago

Mine is a silent language. I look at people and wait until they guess what I'm thinking or understand what I mean. Sometimes I show them an imaginary sphere that I hold with both hands.edit: I had one more OT thought: This article convinced me even further that children should learn machine code. Adults (we) should stop pretending we are anywhere near as sophisticated at learning languages as they are. I've seen 5 year olds play together and create a hybrid dutch/russian language in 5 minutes. They just kept talking. Within an hour they had a pretty good understanding, within a day it was good enough for what they needed.

i-use-nixos-btwabout 2 years ago

This sounds like a lot of fun! It’s also great that you’re encouraging this - having a passion is great, and following it through to this level is fantastic for development.I have a question. When you ask an LLM to summarise the rules of the language in a concise manner, and it does so, is it… disheartening?I mean, if you put a lot of work into creating something, and it gets described in terms that basically make it feel like a configuration - given a glossary and a set of toggles and fill in the blank settings (e.g. word boundaries), out pops a language.Does seeing that make it feel somewhat less exciting?

DonHopkinsabout 2 years ago

The Future of GPT4 1337 Programming: CyberCode Tutorial with <=\> cYber-C0mb1n4t0r !!! </=>I asked GPT-4 to implement QuickSort and the Y-Combinator in my own personal programming language that I designed — who can guess the influences (besides cannabis and shrooms)?<a href="https://donhopkins.medium.com/the-future-of-gpt4-programming-cybercode-tutorial-with-cyber-c0mb1n4t0r-ddfdf82a84d" rel="nofollow">https://donhopkins.medium.com/the-future-of-gpt4-programming...</a>

lloekiabout 2 years ago

> For example, Kłeti has a sentence structure that is SOV (Subject-Object-Verb), while all the languages I know have a Subject-Verb-Object (SVO) sentence structure."all the languages I know" caveat alright, still Japanese has the verb at the end, and my German is far far away but I seem to recall it has something like that as well (at least the meaningful "verb-like" semantic part of the sentence often being at the end)

评论 #35523166 未加载

dfxm12about 2 years ago

Did it actually speak the language or did it just translate text?I'm not trying to be pedantic; these are two very different tasks.

评论 #35517994 未加载

fernlyabout 2 years ago

Oh I wish I had time to train it on one of my old hobbies, Lojban!<a href="https://lojban.io/" rel="nofollow">https://lojban.io/</a><a href="https://mw.lojban.org/papri/Lojban" rel="nofollow">https://mw.lojban.org/papri/Lojban</a>

评论 #35518498 未加载

评论 #35518402 未加载

syntaxingabout 2 years ago

Super curious, would fine tuning with LoRa on a LLaMa/Alpaca model work better?

评论 #35524411 未加载

arps18about 2 years ago

This is a super amazing stuff! Just blown away with the power of AI in tbe future! I wonder if that still exist to make some of the languages spoken by humans which are on the verge of extinction?

m3kw9about 2 years ago

Not sure if ChatGPT is correct but it does sound good

sage11about 2 years ago

You might enjoy creating personalised bedtime stories for your kid: www.mybabystory.co

jakkosabout 2 years ago

bookmark

评论 #35524294 未加载

marcodiegoabout 2 years ago

I don't have access to ChatGPT4, but in my tests I could observe that it can't do some very simple tasks:<pre><code> - It can't play tic-tac-toe, - It can't play hangman, - It insists that winning on stone-paper-scissor using the chat (playing before me) is a matter of probability. </code></pre> It was also demonstrated that it can't reverse strings.Actually a transformer doesn't accesses 'strings', all it processes are tokens which are then mapped to vectors by whatever embedding is applied. I think it will be extremely difficult for a transformer to do any of these tasks correctly until a successor model is adopted.I don't have much hope of any reasonably complex symbolic processing of anything that it was not trained on. Some of these tasks are easy for a human to perform with paper and pencil and a set of rules; of course a human may get confused, but for that you write programs. Write code is one of GPT's skills but It is not "that" good with code for problems that are not mere small modification of problems it was trained on.EDIT: Could have expressed myself better: I don't have access to chatGPT4; I tested using the "available" chatGPT, I think it is 3.5.A transcript of me trying to play tic-tac-toe with it: <a href="https://pastebin.com/V1CW5hpt" rel="nofollow">https://pastebin.com/V1CW5hpt</a>

评论 #35516686 未加载

评论 #35516456 未加载

评论 #35516393 未加载

评论 #35516391 未加载

评论 #35516859 未加载

评论 #35517233 未加载

评论 #35516748 未加载

评论 #35516492 未加载

评论 #35516378 未加载

26 comments

og_kaluabout 2 years ago

评论 #35520502 未加载

评论 #35522514 未加载

评论 #35517790 未加载

评论 #35520363 未加载

评论 #35518212 未加载

评论 #35516822 未加载

评论 #35520395 未加载

评论 #35516743 未加载

crdrostabout 2 years ago

评论 #35518722 未加载

评论 #35519012 未加载

评论 #35523078 未加载

评论 #35518750 未加载

评论 #35518880 未加载

评论 #35520367 未加载

graypeggabout 2 years ago

评论 #35521137 未加载

anon84873628about 2 years ago

评论 #35517031 未加载

评论 #35518941 未加载

评论 #35523187 未加载

JCharanteabout 2 years ago

评论 #35516162 未加载

评论 #35516264 未加载

评论 #35517131 未加载

snickererabout 2 years ago

评论 #35523329 未加载

replwoacauseabout 2 years ago

I'm more impressed by your son than I am ChatGPT....TBH.

robgaabout 2 years ago

rhn_mk1about 2 years ago

Not trusting the models's self-assessment is the right call, considering that the actual score summed up to 7.5 compared to the self-reported 6.5 :)

评论 #35516307 未加载

droobyabout 2 years ago

Yeah I was think yesterday maybe we can start translating dolphin language.Someone get on that

评论 #35515950 未加载

dgritskoabout 2 years ago

评论 #35518567 未加载

评论 #35519041 未加载

评论 #35517334 未加载

fcatalanabout 2 years ago

vintermannabout 2 years ago

Once again illustrating that the powerful thing about ChatGPT is that no matter what you do, it does its best to play along. Its eyes do not glaze over.

评论 #35517366 未加载

评论 #35517379 未加载

评论 #35516631 未加载

DonHopkinsabout 2 years ago

评论 #35522229 未加载

6510about 2 years ago

i-use-nixos-btwabout 2 years ago

DonHopkinsabout 2 years ago

lloekiabout 2 years ago

评论 #35523166 未加载

dfxm12about 2 years ago

Did it actually speak the language or did it just translate text?I'm not trying to be pedantic; these are two very different tasks.

评论 #35517994 未加载

fernlyabout 2 years ago

评论 #35518498 未加载

评论 #35518402 未加载

syntaxingabout 2 years ago

Super curious, would fine tuning with LoRa on a LLaMa/Alpaca model work better?

评论 #35524411 未加载

arps18about 2 years ago

This is a super amazing stuff! Just blown away with the power of AI in tbe future! I wonder if that still exist to make some of the languages spoken by humans which are on the verge of extinction?

m3kw9about 2 years ago

Not sure if ChatGPT is correct but it does sound good

sage11about 2 years ago

You might enjoy creating personalised bedtime stories for your kid: www.mybabystory.co