AI Cheats at Old Atari Games by Finding Unknown Bugs in the Code

173 points by mtuncer about 7 years ago

9 comments

dpflan about 7 years ago
This reminds me of AI research using NES games. The AI eventually became proficient at completing Mario levels, and along the way it discovered novel strategies for survival, obtaining points, and finishing levels.

> Check out this timestamp to watch the machine "cheat": https://youtu.be/xOCurBYI_gY?t=9m55s

> Researcher's site about the project: http://www.cs.cmu.edu/~tom7/mario/

> The paper: *The First Level of Super Mario Bros. is Easy with Lexicographic Orderings and Time Travel...after that it gets a little tricky.*: http://www.cs.cmu.edu/~tom7/mario/mario.pdf
personjerry about 7 years ago
> It’s not the most powerful or widely used form of AI at the moment, but it is making something of a comeback. The ability to crack Q*bert could be read as a good omen that evolutionary algorithms are going to be very useful in the future.

Wow, that's quite a jump to make.
andyjohnson0 about 7 years ago
The title seems misleading to me. The AI isn't finding bugs by somehow examining the game's source code; it's trying random gameplay and exploiting any advantages that emerge. That it's finding previously unknown bugs seems to be almost entirely down to trying things that human players wouldn't think to do.
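
The "try random gameplay, keep whatever scores better" loop described above can be sketched in a few lines. This is only a toy illustration, not the method from the article: the game, its scoring rule, the planted "bug" (a bonus for three jumps in a row), and every name here (`toy_score`, `mutate`, `search`) are hypothetical.

```python
import random

ACTIONS = ["left", "right", "jump", "noop"]

def toy_score(actions):
    """Hypothetical game: 1 point per 'right', plus an unintended
    bonus whenever 'jump' is pressed three times in a row."""
    score = sum(1 for a in actions if a == "right")
    for i in range(len(actions) - 2):
        if actions[i:i + 3] == ["jump", "jump", "jump"]:
            score += 50  # the planted "bug" a human player would not look for
    return score

def mutate(actions, rng, rate=0.1):
    """Randomly replace each input with probability `rate`."""
    return [rng.choice(ACTIONS) if rng.random() < rate else a for a in actions]

def search(steps=2000, length=30, seed=0):
    """Keep a single best input sequence; accept any mutation that does not score worse."""
    rng = random.Random(seed)
    best = [rng.choice(ACTIONS) for _ in range(length)]
    best_score = toy_score(best)
    for _ in range(steps):
        candidate = mutate(best, rng)
        candidate_score = toy_score(candidate)
        if candidate_score >= best_score:
            best, best_score = candidate, candidate_score
    return best, best_score

if __name__ == "__main__":
    sequence, score = search()
    print(score, sequence)
```

The search never reads `toy_score`'s source; it only sees the number that comes back, which is the point being made about the title.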
nopinsight about 7 years ago
The case is an example of wireheading [1] and illustrates the difficulty of eliciting behaviors we *actually* desire from complex systems we do not fully understand.

[1] https://wiki.lesswrong.com/wiki/Wireheading

Another lesson: Evolutionary algorithms are really hard to control. Using neural networks developed through evolutionary algorithms means that we are employing a mostly opaque (though not entirely black) box created by a mechanism we can't mentally keep track of in detail. Hope that they are not deployed to control any critical systems until we get a much better grasp of them.
raverbashing about 7 years ago
Well, how do you say what's cheating or not? It works and it increases the evaluation score.

In this case one possible workaround to "cheating" would be to reduce the control precision, add some jittering to control inputs, or change the goal function. But I'd say if it's being done solely using the intended controls it's not cheating (as opposed to changing memory or using a debug 'cheat code').

Still, even in real sports some "cheating" is allowed (see Fosbury Flop).
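
One way to read "add some jittering to control inputs" is to occasionally hold the previously executed input for an extra frame, so frame-perfect sequences stop being reliable. The sketch below assumes that reading; the function name `jittered` and the `p_repeat` parameter are made up for illustration and are not from the article or any emulator API.

```python
import random

def jittered(actions, p_repeat=0.25, seed=None):
    """Return the inputs actually executed when, with probability
    `p_repeat`, the previously executed input is held for one more
    frame instead of the input the agent asked for."""
    rng = random.Random(seed)
    executed = []
    previous = None
    for action in actions:
        if previous is not None and rng.random() < p_repeat:
            action = previous  # agent's choice is overridden this frame
        executed.append(action)
        previous = action
    return executed

if __name__ == "__main__":
    plan = ["jump", "left", "jump", "right", "jump", "noop"]
    print(jittered(plan, p_repeat=0.25, seed=0))
```

With `p_repeat=0` the plan passes through unchanged; raising it makes exploits that depend on exact input timing harder to reproduce, at the cost of also degrading ordinary play.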
NicoJuicy about 7 years ago
I always found this a good project to demonstrate AI: https://xviniette.github.io/FlappyLearning/ (based on neuroevolution). Speed it up for faster results.
camgunz about 7 years ago
Can we put AI to work on proving that we live in a simulation? I would never enter/exit my apartment 38 times alternating between forwards, backwards and each side, but an AI would. Maybe then all the walls start flashing and then we'll know!
Semiapies about 7 years ago
So, it's basically working as a goal-oriented fuzzer.
tabtab about 7 years ago
So it can become a dirty cheat just like a human. AI *is* getting more "natural" after all.