TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Menace: A Machine-Educable Noughts and Crosses Engine (2016)

21 点作者 lrsjng超过 5 年前

4 条评论

flixic超过 5 年前
This is a great video demonstration by Matt Parker of Parker Square fame: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=R9c-_neaxeU" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=R9c-_neaxeU</a>
INFP超过 5 年前
The rules seem weird - Martin Gardner had the same matchbox self-learning robot for playing hexapawn <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Hexapawn" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Hexapawn</a> It had less states so it could fit on 20+ matchboxes that were filled with candy, but the rules are: 1. If it wins, nothing is changed 2. If it loses, you take the last move that has been made that resulted in the loss, and eat the candy, thus cutting this move from the possible move graph<p>This way every game lost improves the engine 100%, while in this Menace example the draw introduces unnecessary noice by bringing back moves, and the punishment for a loss seems unnecessarily harsh - removing EVERY MOVE played, which may cut out the best strategy
rimliu超过 5 年前
I remember trying to build something similar in my teen years after reading about it in one of the Martin Gardner&#x27;s book. I don&#x27;t remember the details, but the number of boxes needed was definitely nowhere near 304, 30+ IIRC.
MauranKilom超过 5 年前
I&#x27;m not sure what to take away from this. If you already enumerated all possible board positions (modulo flips&#x2F;rotations) <i>and</i> had a long look at each one to determine unique moves (i.e. which moves result in distinct board states), you might as well assign drawn&#x2F;losing&#x2F;winning to each position. It&#x27;s conceptually different but not really more work...<p>Obligatory xkcd: <a href="https:&#x2F;&#x2F;xkcd.com&#x2F;832&#x2F;" rel="nofollow">https:&#x2F;&#x2F;xkcd.com&#x2F;832&#x2F;</a><p>I would also consider it interesting that, in the original version, you could guarantee to win 75% of the time (given knowledge of the algorithm).