It's clear that there's still a long way to go. Too much manual feature engineering and hand-tuning is needed right now because the reward signal in current reinforcement learning algorithms is so weak, and that cripples OpenAI Five in the later stages of the game.

The cool thing is that brute-forcing computational power seems to get us decently close. I'm optimistic that with renewed interest in the reinforcement learning field, breakthroughs on the algorithmic side are only a matter of time.
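To make the weak-signal problem concrete, here's a minimal sketch (in Python, with invented state fields and weights; none of this is OpenAI's actual code) of the gap between the sparse win/loss reward RL gives you in theory and the dense hand-tuned shaping needed in practice:

```python
from dataclasses import dataclass

@dataclass
class TeamState:
    gold: float
    towers: int
    deaths: int
    game_over: bool = False
    team_won: bool = False

def sparse_reward(s: TeamState) -> float:
    """Pure win/loss signal: zero for an entire ~45-minute game, then +/-1 once."""
    if not s.game_over:
        return 0.0
    return 1.0 if s.team_won else -1.0

def shaped_reward(prev: TeamState, curr: TeamState) -> float:
    """Dense hand-tuned signal at every tick (these weights are made up)."""
    r = 0.001 * (curr.gold - prev.gold)      # reward farming
    r += 0.2 * (curr.towers - prev.towers)   # reward objectives
    r -= 0.5 * (curr.deaths - prev.deaths)   # penalize dying
    if curr.game_over:
        r += 1.0 if curr.team_won else -1.0
    return r
```

Every one of those weights is a human judgment call about what "progress" looks like, which is exactly the manual tuning being complained about.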
While there are certainly broader insights to be made, one tidbit I noticed that interested me had to do with Axe.

Between the two games, we saw Axe played by both the human team and the AI team. When played by the humans, blink-calls were completely shut down by the AI's superhuman counter-initiations. That made enough sense. When Axe was played by the AI, though, I don't recall Axe ever even *attempting* a blink-call. I'm curious whether this might be the result of the AI overfitting to itself: at AI reaction speeds, blink-calls are not a very useful maneuver, and so the AI learns not to perform them.

Against a group of humans, though, Axe's blink-call initiations are arguably the hero's biggest selling point.

We didn't get to see most of the hero pool, but I wonder how much the AI overfitting to AI playstyles will hinder the bots against humans in the future.

Of course, the bots have many other issues which loom larger atm imo, but I was interested enough in this tidbit to point it out.
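If the overfitting theory is right, one standard mitigation is to avoid training purely against the current self and instead sample opponents from a pool of past checkpoints, so strategies that only lose to the newest self don't get forgotten. A hedged sketch, assuming a simple checkpoint pool (the names and the 80/20 split here are illustrative, not a claim about OpenAI's setup):

```python
import random

SELF_PLAY_PROB = 0.8  # fraction of games against the current policy

def sample_opponent(current_policy, checkpoint_pool):
    """Mostly play the latest self, but sometimes an older version,
    so the agent stays robust to styles its newest self never uses
    (e.g. human-speed blink initiations)."""
    if not checkpoint_pool or random.random() < SELF_PLAY_PROB:
        return current_policy
    return random.choice(checkpoint_pool)

# During training, snapshot periodically:
# checkpoint_pool.append(copy.deepcopy(current_policy))
```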
I wish they’d use a better name for their engine than “Five”. Half the time I get confused about whether they’re talking about five players, the Five engine, or 5 something else.

Use a unique name like “Galaxy” that doesn’t represent anything remotely close in the game: spell names, skills, etc. There is a huge amount of stuff going on in the game, and it’s such a heavy cognitive load for an outsider who doesn’t play Dota; it was annoying to keep checking whether they meant the name of the engine or the number five in a game of five vs. five. Or Five vs. 5!? I’m so damn confused.

Same thing here: https://openai.com/five/

Bullet point #2 says: “Defeat five of the world’s top professionals. Five will attempt this live at The International in Vancouver’s Rogers Arena this week!”

It is such a poor choice.
Here are some insights into how OpenAI fine-tuned the rewards and short-term actions of the bots: https://gist.github.com/dfarhi/66ec9d760ae0c49a5c492c9fae93984a

The numbers seem pretty arbitrary to me; that's probably what the blog post is getting at when it mentions why it lost.
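For anyone who doesn't want to click through: the gist is essentially a big hand-tuned table of weights combined linearly. A sketch of that general shape (these particular weights are invented for illustration; see the gist for the real values):

```python
# Illustrative reward-weight table -- values invented, not the gist's.
REWARD_WEIGHTS = {
    "win": 5.0,
    "hero_death": -1.0,
    "last_hit": 0.16,
    "tower_kill": 1.0,
    "gold_gained": 0.006,  # per unit of gold
}

def total_reward(event_counts: dict) -> float:
    """Linear combination of per-tick event counts and their weights."""
    return sum(REWARD_WEIGHTS[k] * n for k, n in event_counts.items())
```

When the training signal is a linear mix like this, any weight that's slightly off quietly biases the whole playstyle, which is presumably why the numbers feel arbitrary.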
One of the clear weaknesses of the current OpenAI Five is its warding, with wards oftentimes being placed inside the base. Perhaps the amount of vision that the team has could serve as a short-term reward. Likewise, it currently does not engage in much counter-warding with sentries.

The game vs. Pain clearly demonstrated how humans can use wards to gain an information advantage over bots that otherwise had a great chance of winning the game.
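One way to operationalize that suggestion: reward *gains* in team vision rather than vision itself, so a ward dropped in the already-visible base earns nothing. A hedged sketch (the `visible_fraction` field is hypothetical):

```python
VISION_WEIGHT = 0.0005  # kept small so it doesn't dominate combat rewards

def vision_reward(prev_state, curr_state) -> float:
    """Hypothetical shaping term: pay out only for newly revealed map.
    A ward placed in the base, where the team already has vision,
    changes visible_fraction by ~0 and so earns ~0 reward."""
    return VISION_WEIGHT * (curr_state.visible_fraction
                            - prev_state.visible_fraction)
```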
I hope they continue the project over the next year. I am really curious whether they will be able to teach the AI to be better at late-game strategies and long-term planning. (Item builds might be an interesting challenge in this area as well.)
Someone asked me, "How does OpenAI know the MMR of its bots?"

I don't know. I assume it's similar to how AlphaGo measures its Elo rating. But the strange thing is, this is hundreds of years of self-play, not a public pool of humans playing against each other. How does MMR in simulation translate to MMR in real matches?

Before you point out that an Elo-style rating is still possible, consider that Dota MMR is a bit different: every game you win, you get +25; every game you lose, you get -25. This changes at the very high / very low levels, or if the matchups are wildly imbalanced, but that's the general setup. Or it was, a few years ago.

Does anyone have a guess?
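For reference, that fixed +/-25 is exactly what a standard Elo update produces with a K-factor of 50 whenever the two sides are rated equally; a quick worked example:

```python
def elo_update(rating_a: float, rating_b: float,
               a_won: bool, k: float = 50.0) -> float:
    """Standard Elo update for player A. With k=50 and equal ratings,
    the expected score is 0.5, so the change is exactly +/-25 --
    matching Dota's usual fixed MMR swing for balanced matches."""
    expected_a = 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))
    return rating_a + k * ((1.0 if a_won else 0.0) - expected_a)

print(elo_update(4000, 4000, a_won=True))   # 4025.0
print(elo_update(4000, 4000, a_won=False))  # 3975.0
```

So Dota's flat system behaves like Elo that ignores the rating gap; the question of how self-play ratings calibrate against the human ladder still stands.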
Progress is much further along than I expected. At the same time, I wonder how well the bots adjust to the meta; Dota 2 changes every time a new patch rolls out. I also wonder if bots could make some of the more unconventional picks work. As an example, I love Medusa, and she's one of the least-picked heroes in pro Dota.
Question for the OpenAI team: have you ever thought about applying Five to other games, as-is?

I'd suggest Total War: Warhammer II (it has an interesting competitive scene, with both tactical combat and strategic gameplay). It's very different from Dota 2, but it would be super intriguing to see how Five performed and how fast it could learn.

If you built Five the right way, it should be able to learn almost any other game.

I can also imagine you offering this to gaming companies in the near future, so that they could ship a decent computer AI instead of the crap they usually offer now :)

(source: I used to be a videogamer in my teens, and I occasionally still play some strategy games, though not as often as I would like to :D)