Ask HN: Why is GPT-4 or Claude-2 so bad at tic-tac-toe?

2 点作者 zurfer超过 1 年前

I was surprised to learn that GPT-4 can't play tic-tac-toe, but thought people who tried just didn't prompt it correctly.But after trying for 2h to get it work (even with GPT-4V) it seems like a fundamental limitation.I've found a HN submission [1] where someone used a brute-force prompt to get it to play correctly, but as the top/only comment points out, it's a limited action space and enumerating most of it seems moot.I was hoping for a more reasonable prompt. After all humans are able learn tic-tac-toe rapidly.current hypotheses:1) tic-tac-toe requires "spatial reasoning" and LLMs train on sequences (somehow GPT-4V didn't elevate that constraint)2) tic-tac-toe requires "search" of future scenariosWould love to hear what you think/know!---Previous discussion about T3 and GPT-4: https://news.ycombinator.com/item?id=35216614 (7 months ago)[1] https://news.ycombinator.com/item?id=37626918

5 条评论

outdig超过 1 年前

I agree with jqpabc123 except for one point: that it cannot 'reason' and it is not smart. I disagree with this.The reason it cannot play well is because it has very little 'experience' (training data) with it. It's been trained on 'what the game is', it has not been trained how to win.You can think of it a bit like driving. You can know what driving is, but it doesn't make you a driver if you've never driven before.You can ask a genius who's never played before to play tic tac toe with you, tell it the rules, they will likely not win on the first attempt or play optimally. This doesn't mean that person isn't a genius.You said humans are able to 'learn' to play it rapidly. So is GPT, in training mode, it can process a million games in seconds, where a human can't.The problem here, is it simply has no experience.If I told you every time you played tic tac toe against me, you would forget all your experience the next time we played, would you play optimally?

评论 #38134534 未加载

sharemywin超过 1 年前

I think this paper has an interesting way of training problem solving:<a href="https://venturebeat.com/ai/microsoft-unveils-lema-a-revolutionary-ai-learning-method-mirroring-human-problem-solving/" rel="nofollow noreferrer">https://venturebeat.com/ai/microsoft-unveils-lema-a-revoluti...</a>I submitted to HN but nobody seemed to care:<a href="https://news.ycombinator.com/item?id=38128012">https://news.ycombinator.com/item?id=38128012</a>I looks like it basically uses GPT-4 to train a smaller model on problem solving.

jqpabc123超过 1 年前

But after trying for 2h to get it work (even with GPT-4V) it seems like a fundamental limitation.It obviously hasn't been "trained" for tic-tac-toe. The way to train it is using statistics --- present every possible position and the correct response so it can build a database.There is no logic or reasoning involved --- it's all statistics. It's not what we call "smart". Any ability to "reason" is just a statistical illusion.

bell-cot超过 1 年前

The Turing Test sounds cool, and "is he a clever conversationalist?" is a fairly good test - of social intelligence and class, for casual use in human society.But current "AI's" are intelligent kinda like pocket calculators are intelligent.

david927超过 1 年前

The New Yorker wrote a nice article on LLMs and how they work, "What Kind of Mind Does ChatGPT Have?"