> AlphaZero and AlphaGo Zero used a single machine with 4 first-generation TPUs and 44 CPU cores. A first generation TPU is roughly similar in inference speed to commodity hardware such as an NVIDIA Titan V GPU, although the architectures are not directly comparable.

> The amount of training the network needs depends on the style and complexity of the game, taking approximately 9 hours for chess, 12 hours for shogi, and 13 days for Go.

How much would that much computing power cost on something like AWS? That's a lot of hardware, but if you're only renting it for 9 hours... the beefiest EC2+GPU instance Amazon currently offers is the p3.16xlarge, which has 8 Tesla V100 GPUs and 64 (virtual) CPUs, for $25/hour on-demand. My understanding is that a V100 is slightly more powerful than a Titan V, so does that mean you could run the chess training (at least the AlphaZero side) for $225? That seems impossibly cheap.

EDIT: pacala below pointed out that the hardware listed was just for running AlphaZero against Stockfish, not for training it. Digging through the preprint itself, they say that for training they used:

> During training only, 5,000 first-generation tensor processing units (TPUs) (19) were used to generate self-play games, and 16 second-generation TPUs were used to train the neural networks.

So that would be... a lot more.
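
To put a rough number on "a lot more", here's a back-of-envelope sketch in Python. Everything in it is an assumption, not a figure from the paper: it reuses the TPU ≈ V100 equivalence and the ~$25/hour p3.16xlarge price from above, and it ignores the 16 second-generation learner TPUs as negligible next to the 5,000 self-play TPUs.

```python
# Back-of-envelope AWS cost estimate for AlphaZero's *training* hardware.
# Assumptions (mine, not the paper's):
#   * 1 first-gen TPU ~ 1 V100 for inference (the rough equivalence above)
#   * p3.16xlarge (8x V100) at the ~$25/hour on-demand rate mentioned
#   * the 16 second-gen learner TPUs are ignored (tiny next to 5,000)
import math

TPUS_SELF_PLAY = 5_000        # first-gen TPUs generating self-play games
GPUS_PER_INSTANCE = 8         # V100s in a p3.16xlarge
INSTANCE_HOURLY_USD = 25      # approximate on-demand price
TRAINING_HOURS = {"chess": 9, "shogi": 12, "go": 13 * 24}

instances = math.ceil(TPUS_SELF_PLAY / GPUS_PER_INSTANCE)   # 625 instances
for game, hours in TRAINING_HOURS.items():
    cost = instances * INSTANCE_HOURLY_USD * hours
    print(f"{game:>5}: ~${cost:,.0f}")

# chess: ~$140,625    shogi: ~$187,500    go: ~$4,875,000
```

So under those (very loose) equivalences, the chess run alone would be on the order of $140k on-demand, not $225, and the 13-day Go run would be in the millions.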