TechEcho
Beating the World’s Best at Super Smash Bros. with Deep Reinforcement Learning

202 points by willwhitney about 8 years ago

9 comments

gwern about 8 years ago
Note: it doesn't learn from pixels but from features read directly from RAM, and it has superhuman reaction time, with performance degrading badly when human-like delays are added.

Good discussions on Reddit: https://www.reddit.com/r/MachineLearning/comments/5vh4ae/r_a_new_foe_has_appeared_170206230_beating_the/ https://www.reddit.com/r/smashbros/comments/5vin8x/beating_the_worlds_best_at_super_smash_bros_melee/
brilee about 8 years ago
Video of the AI here, playing as the black Captain Falcon: https://www.youtube.com/watch?v=dXJUlqBsZtE
swanson about 8 years ago
We all know that Mew2King is the first reinforcement learning AI capable of beating Super Smash Bros. pro players.

https://www.youtube.com/watch?v=z-1YfhUFtbY&feature=youtu.be&t=285
jwtadvice about 8 years ago
While the AI might be cheating by taking salient features from RAM rather than from pixel values, this is still an incredible feat. Just a few years ago we did not have generic algorithms that could take even salient features and self-learn policies anywhere near this level, this quickly.
smaili about 8 years ago
As someone who's played for quite a while, I can tell you SSBM is one of the most complex games I've ever come across.
lanius about 8 years ago
I'm impressed it beat the likes of S2J and Zhu. I wonder how it'd fare against the Five Gods?
WhitneyLand about 8 years ago
What's the key insight here compared to previous systems? As far as I can tell, still no one can beat simple non-deterministic games that require some planning.

My favorite example is Ms. Pac-Man because it seems so old and simplistic. It's been tried by a dozen teams and no one can beat a decent human.
cerved about 8 years ago
Civ AI has denounced this research
fiatjaf about 8 years ago
I was expecting a video.