Positions chess engines don't understand

407 点作者 diplodocusaur大约 4 年前

17 条评论

When OpenAI trained their Dota model to beat people 1v1 mid SF, they had a contest at The International (the big yearly dota tournament, RIP) like "can you beat OpenAI's new bot?" and had a big bucket of prizes for anyone that could.By the end of the night, the bucket was empty. People learned to cheese the bot by running "out of bounds", so to speak -- normally in a 1v1, you're supposed to stay close to your opponent, since they'll be getting stronger if you leave. But the bot didn't know how to deal with it when you snuck behind it (normally an insane maneuver, almost guaranteed to cost you the game in normal play) and prevented his army (called "creeps") from running to the middle lane. (All you have to do is go wave your hand at them, and they mindlessly chase you.)People did that over and over, and your own army would eventually overwhelm the bot and win. :)

评论 #27190342 未加载

评论 #27191695 未加载

评论 #27190422 未加载

评论 #27193641 未加载

评论 #27190812 未加载

评论 #27190539 未加载

airza大约 4 年前

An astonishing insert:>A good example of human exploitation of the engine's failings is GM Hikaru Nakamura's defeat of Rybka in the following three-minute blitz game from the Internet Chess Club. Nakamura cleverly locks the position so that progress is impossible for Rybka, then he offers two exchange sacrifices to the engine. With the position locked, the engine's rooks have no value, but the engine thinks it has a winning material advantage. With a draw by the 50-move rule approaching, the engine sacrifices a pawn to avoid a draw, but this proves a huge mistake as Nakamura is then able to win the game, an incredible achievement!

评论 #27189770 未加载

评论 #27193669 未加载

评论 #27190515 未加载

评论 #27189852 未加载

matsemann大约 4 年前

When watching the WC games, I've seen it happen that a move wasn't considered as a top move by the engine, but once played the engine realizes it's actually crushing. Something about the heuristics used to prune the vast search space can make it miss sacrifices or seemingly sub-optimal moves that temporarily weakens the perceived position but has a huge payoff in the end. But humans find them. Of course, given enough time and depth the engine will eventually circle back and try the move. But it has no intuition.Also, an engine without an endgame tablebase can be pretty stupid. There are certain rules one can deduct when there are few pieces left, but a min/max engine will search forever, not knowing the patterns.

评论 #27189271 未加载

评论 #27189708 未加载

评论 #27189692 未加载

评论 #27189775 未加载

评论 #27189534 未加载

评论 #27191201 未加载

评论 #27193077 未加载

评论 #27193699 未加载

评论 #27192009 未加载

评论 #27189404 未加载

perihelions大约 4 年前

Here's a particularly extreme example:<a href="https://old.reddit.com/r/chess/comments/ndz2lj/simple_mate_in_93/" rel="nofollow">https://old.reddit.com/r/chess/comments/ndz2lj/simple_mate_i...</a>It's a mate-in-93 puzzle that's fairly accessible to humans, using abstract reasoning. But not chess engines. Comparing against the OP article, the main "technique"/"trick" is zugzwang (#7), but on a dramatic scale.I think this is the kind of position you could use to stress-test a candidate puzzle solver, just because of the shear size of the solution.

dwohnitmok大约 4 年前

Does anybody know if advanced chess/centaur chess (chess play where a human uses a computer for assistance) is still a thing/whether a human+computer combo is a meaningful improvement these days (i.e. last couple of years) over just a computer.I can't find any recent advanced chess tournaments and though I see quotes of people saying that the combo is stronger than a computer alone, I haven't found any recent examples of a top tier engine by itself losing to a human + engine (e.g. Stockfish + human vs Stockfish).

评论 #27190486 未加载

评论 #27189299 未加载

rudi-c大约 4 年前

There's a similar situation with Go where some positions utterly confuse bots trained on playing mostly normal games. There was this interesting research blog post on training a bot specifically to become good at solving one of these weird problems (Igo Hatsuyoron 120, the "hardest go problem ever")<a href="https://blog.janestreet.com/deep-learning-the-hardest-go-problem-in-the-world/" rel="nofollow">https://blog.janestreet.com/deep-learning-the-hardest-go-pro...</a>

评论 #27191426 未加载

mrslave大约 4 年前

Agadmator covered Kramnik v Leko 2002 (in 2018) titled "Invisible to Engines | One Of The Greatest Moves Ever Played" [0] which is worth a watch.[0] <a href="https://www.youtube.com/watch?v=yGnpewUKP88" rel="nofollow">https://www.youtube.com/watch?v=yGnpewUKP88</a>

senkora大约 4 年前

It seems dubious to show that engines are sometimes bad at evaluating positions by giving a position with three black bishops on black squares.

评论 #27189209 未加载

评论 #27189256 未加载

评论 #27189451 未加载

评论 #27189285 未加载

评论 #27189378 未加载

microtherion大约 4 年前

The Hasek study in #2 is bugging me: Why can't Black simply play 1. ... Rh8 ? It looks to me like this would gain the one crucial tempo to win the game. None of the discussion of the study I've found seems to consider this move (And the online analysis engines are useless, as they don't properly understand that the original line is a draw, as the article notes).ETA: Corrected typo, I originally wrote 1. ... Re8, which does not accomplish anything.

评论 #27190692 未加载

评论 #27193321 未加载

zone411大约 4 年前

So I actually checked the problems listed.In the "IQ Test #52" position (FEN: 8/1p1q1k2/1Pp5/p1Pp4/P2Pp1p1/4PpPp/1N3P1P/3B2K1 w - - 0 1) listed in #1 both LC0 and Stockfish play the correct line on my computer in seconds.Both of them also play the right move Ba4+ in the second position of #1 "William Rudolph vs." (FEN: 8/1p1q1k2/1Pp5/p1Pp4/P2Pp1p1/4PpPp/1N3P1P/3B2K1 w - - 0 1) but they take quite a bit longer. Stockfish variants get it quicker.Stockfish solves "Hasek vs." (FEN: r7/7k/5R2/p3p3/Pp1pPp2/1PpP1Pp1/K1P3P1/8 w - - 0 1) listed in #2 quickly. Both Stockfish and LC0 solve "Lazard=F vs." (FEN: q7/8/2p5/B2p2pp/5pp1/2N3k1/6P1/7K w - - 0 1) quickly.Stockfish gets Bh3 from #3 "Veselin Topalov (?) vs. Alexey Shirov" (FEN: 8/8/4kpp1/3p1b2/p6P/2B5/6P1/6K1 b - - 2 47) in seconds (7-piece end game tablebases installed).The next position from #3 "Spassky, Boris V vs. Byrne, R." (FEN: 3B4/1r2p3/r2p1p2/bkp1P1p1/1p1P1PPp/p1P4P/PPB1K3/8 w - - 0 1) is easy for both Stockfish and LC0. They both get 50. c5!! right away.Stockfish also gets the last position from #3 "Stefan Brzozka vs. David Bronstein" in seconds (FEN: 1r6/4k3/r2p2p1/2pR1p1p/2P1pP1P/pPK1P1P1/P7/1B6 b - - 0 48) Rxb3+.Stockfish and LC0 see Kd1 in "Lamford=P vs." from #4 (FEN: 8/8/8/1k3p2/p1p1pPp1/PpPpP1Pp/1P1P3P/QNK2NRR w - - 0 1) but believe Rg2 is also winning.Stockfish gets c8N from "Randviir=J vs." (FEN: 5nr1/2Pp2pk/3Pp1p1/4P1P1/6P1/5K2/8/7n w - - 0 1) in #4 in about 2 minutes on my computer.Stockfish gets Bf5 from "Simkhovich=F vs." (FEN: 8/8/2pk4/8/p1p3B1/PpP5/1P6/r1NK4 w - - 2 2) in #5 in seconds. LC0 also gets it.The next two positions are mentioned as easier for programs and they are:Qe3 from "Deep Blue vs. Garry Kasparov" in #6 (FEN: 1r6/5kp1/RqQb1p1p/1p1PpP2/1Pp1B3/2P4P/6P1/5K2 b - - 14 45) is very easy for both Stockfish and LC0.Both also get "Vladimir Kramnik vs. Peter Leko" (FEN: 6k1/5p1p/P1pb1nq1/6p1/3P4/1BP2PP1/1P1Nb2P/R1B3K1 b - - 0 25) in #6 quickly."Matous=M vs." (FEN: n2Bqk2/5p1p/Q4KP1/p7/8/8/8/8 w - - 0 1) is indeed harder for Stockfish and LC0 than expected. I've confirmed mate in 13 in my mate solver.In "Nigel Short vs. Vladimir Kramnik" (FEN: r3r1k1/1bp1Bppp/pb1p4/1p6/1P6/1BP2P2/P1P2PKP/R3R3 b - - 6 19) from #9, the engines like ...a5 more than ...c6. Hard to say that this move doesn't also win without more analysis.Stockfish wants to play c6! and b4! from "Marwitz=J vs." from #10 (FEN: 2K3k1/1p6/R3p1p1/1rB1P1P1/8/8/1Pb5/8 w - - 0 1) right from the start. LC0 takes longer but gets it as well.In "Anish Giri vs. Maxim Rodshtein" (FEN: 8/5pkp/3p1np1/Rpr5/8/6P1/PB3PKP/8 w - - 6 34) from #10 both Stockfish and LC0 like 34. h4 over 34. a4. More analysis would be needed to see if h4 is not also winning.Last position "IQ Test #16" (FEN: 5k2/4bp2/2B3p1/1P4p1/3R4/3P2PP/2r2PK1/8 b - - 0 1) takes around 10 seconds for Stockfish.In summary, chess engine might not really "understand" these positions but they solve them pretty well.

评论 #27191600 未加载

Andrex大约 4 年前

<a href="https://www.youtube.com/watch?v=Z4F7mUUjt_c" rel="nofollow">https://www.youtube.com/watch?v=Z4F7mUUjt_c</a>Maybe a more realistic scene than the fanbase gives it credit for. :)

评论 #27190899 未加载

TchoBeer大约 4 年前

I think there are some engines (Crystal is the one I'm thinking of) which do well in fortresses; these come at the cost of play strength.

评论 #27189287 未加载

gweinberg大约 4 年前

Has anyone actually checked that a modern chess engine believes black is winning in the Penrose position? I find it very hard to believe.

评论 #27189885 未加载

评论 #27189883 未加载

评论 #27190060 未加载

ngcc_hk大约 4 年前

Politics are about making up rules that others have to follow. May be AI still not good at playing this level of politics.

vmception大约 4 年前

Yeah I would have thought these engines to be alot smarter then this, especially the “AI” ones

johnklos大约 4 年前

Nice try, chess.com. I'm not going to let you use all those cycles stolen by your site to win chess games :P

unnouinceput大约 4 年前

Quote: "Since IBM's Deep Blue defeated World Chess Champion Garry Kasparov in their 1997 match..."Deep Blue lost in 1996. Its upgrade, called Deeper Blue is the one that won the famous match in 1997. Please SamCopland, do your homework.

评论 #27190701 未加载