Inside the (deep) mind of AlphaZero

by Albert Silver
12/7/2018 – It was a long time coming, but the wait is over. After nearly a full year, being ping-ponged from one peer reviewer to the next, the final paper on AlphaZero is out, shedding light on a number of hitherto unknown or misunderstood elements in its construction, not to mention some clarifications and corrections. These include sample code to help implement their work and all the games of the match against Stockfish, of which 20 were specially chosen by GM Matthew Sadler. | Graphic: Deep Mind


Full AlphaZero paper is published

When AlphaZero was first announced late last year, it is no exaggeration to say it caused feelings of shock and awe. After all, a new paradigm had been ushered into the somewhat stodgy world of computer chess, challenging decades of accepted truths and promising wondrous things for players all around the world.

Here was a program that eschewed conventional wisdom on how one should be built, challenging even the most basic premise: faster is better. Far from matching the speed of Stockfish, the standard it was tested against, it searched a good 900 times fewer positions per second, yet was still stronger by some margin.

Accompanying this eye-opening news was a tantalising pre-paper that shared many of its intimate details with those who could understand them and were willing to do the work of implementation. Still, there were many who cried foul, protesting that the test match had been grossly unfair: AlphaZero ran on a ‘supercomputer’ while Stockfish did not, and Stockfish, they claimed, had been nothing short of crippled.

AlphaZero: Shedding new light on the grand games of chess, shogi and Go 

Match conditions

The final paper, published in Science, a serious journal that demands the utmost scrutiny and peer review before accepting a submission, brings a number of corrections regarding the match conditions as well as clarifications on the hardware. In the pre-paper, the hardware ascribed to Stockfish had been 64 threads generating 70 million positions per second, with 32MB (megabytes) for hash tables. That last detail caused no shortage of outrage, since such a minuscule amount could barely benefit it. Then there was the matter of the 100-game match at one minute per move, and finally, last but not least, the mysterious four TPUs that AlphaZero ran on. While many today can appreciate what a strong GPU brings to the table, a TPU is hard to quantify.

The final paper brings a number of changes, and it is unclear whether the original conditions were as stated or simply misreported. Whatever the case, the games shared on the DeepMind website are different from those in the pre-paper, and while there is no shortage of brilliancies (that is unchanged), they are different brilliancies.

In the final paper, the match was not only rerun, with roughly the same result (a +104 Elo performance), but under much better conditions for Stockfish, putting to rest the complaints that it had been crippled. This time Stockfish ran 44 threads on 44 cores (two 22-core 2.2GHz Intel Xeon Broadwell CPUs), with a 32GB hash, Syzygy endgame tablebases, and a time control of three hours per game plus 15 seconds per move. Furthermore, Stockfish 8 was not the only version tested; Stockfish 9 was given its chance as well. The relative difference in speed was maintained at roughly 900 to 1, so that much did not change, though the authors now measured the overall average nodes per second across whole games rather than just at the start position, as had been done in the pre-paper. All in all, they report the totals of 1,000 games, though only 210 are actually published on the website.
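To put a figure like +104 Elo in context, the standard logistic Elo model converts a rating edge into an expected match score. A minimal sketch, assuming the conventional formula (the authors may compute their figure differently):

```python
import math

# Standard logistic Elo model (an assumption here; the paper's exact
# computation may differ): expected score for a given rating difference.
def expected_score(elo_diff):
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))

def elo_from_score(score):
    # Inverse: rating difference implied by an average match score (0 < score < 1).
    return -400.0 * math.log10(1.0 / score - 1.0)

print(round(expected_score(104), 3))  # roughly 0.645, i.e. about 64.5% of the points
```

In other words, a +104 Elo edge corresponds to scoring roughly 64–65% of the available points over a long match.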

As to AlphaZero and its first-generation TPUs, the authors help narrow down their power by explaining that, while not identical, the inference performance is equivalent to a Titan V. The Titan V is without question a superb professional-grade GPU, but its performance is nearly identical to that of the newly released Nvidia RTX 2080 Ti, a $1,200 GPU. Powerful? Without question, but hardly a supercomputer unless compared to machines from years back.


Furthermore, the authors tested a variety of conditions, not just bookless play. They tried allowing Stockfish to use an opening book while AlphaZero did not, ran a TCEC-style match using the exact same openings TCEC used in a superfinal a couple of years back, and even played time-handicap matches with AlphaZero getting one-third, or even one-tenth, of the time Stockfish got. Have you ever wondered how AlphaZero would have fared in the TCEC superfinal against Stockfish? Here is the result.

More importantly, all the games for these matches have been released — over 200 games, including a fine selection by Sadler who took the liberty of choosing those he felt were not to be missed.

The article brings much more detailed explanations, as well as graphs to aid understanding

Shogi fans were not overlooked either. Not only were 100 games played by the shogi version of AlphaZero published, but ten of them were chosen by Yoshiharu Habu, the 'Kasparov' of shogi.

One knowledgeable aficionado who went over them was flabbergasted. As he explained, “I've been looking at some of the shogi games...and they are utterly impenetrable. All known joseki (openings) and king-safety principles are thrown out the window! In some of these games, the king doesn't just sit undeveloped in the center but does the chess equivalent of heading out to the middle of the board in the middle game before coming back to the corner for safety and then winning. Astounding!”

In the Science publication where the AlphaZero paper appears, additional commentary was provided by luminaries such as Murray Campbell, a leader in AI research and one of the key names behind Deep Blue, as well as an editorial by Garry Kasparov, who gave his own perspective on it, noting:

(...) I admit that I was pleased to see that AlphaZero had a dynamic, open style like my own. The conventional wisdom was that machines would approach perfection with endless dry maneuvering, usually leading to drawn games. But in my observation, AlphaZero prioritizes piece activity over material, preferring positions that to my eye looked risky and aggressive. Programs usually reflect priorities and prejudices of programmers, but because AlphaZero programs itself, I would say that its style reflects the truth. This superior understanding allowed it to outclass the world's top traditional program despite calculating far fewer positions per second. It's the embodiment of the cliché, 'work smarter, not harder'.

AlphaZero shows us that machines can be the experts, not merely expert tools. Explainability is still an issue — it's not going to put chess coaches out of business just yet. But the knowledge it generates is information we can all learn from.

Be sure to read the entire editorial.

Openings

In the pre-paper, numerous fascinating graphs had been published on the opening preferences of AlphaZero as it evolved, as well as its results in test matches against Stockfish. This time the statistics are presented more visually, with colour bars making it easy to see where it won or lost more often.

There is also a fascinating breakdown of its favourite six-ply sequence in self-play as it evolved: in other words, what it would play as the best opening for both sides over six plies (half-moves). AlphaZero was trained for a total of 700 thousand steps (think of these as lessons in its evolution), and here we can see what it thought was ideal after just 50 thousand steps, then 143 thousand steps, and so forth until the pinnacle of its opening play… get ready to grimace: the Berlin.

The Berlin as the logical evolution of theory?

Some might see AlphaZero's final word on openings being the Berlin as a sign of regression. After all, after 608 thousand steps it had thought the classic Ruy Lopez was ideal.

What we learned

For developers and programmers this was a godsend, as it finally put to rest a large number of questions regarding the parameters used in training and playing, and delivered some truly eye-opening revelations. For those wondering about the exact implementation, DeepMind has provided what it calls sample pseudocode, enough to show how some of the algorithms might be coded. Among the more exciting items on a technical level was a formula by which the exploration rate of the search grows with the number of nodes visited per move: the deeper it looked, the wider the search became.
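DeepMind's released pseudocode expresses this widening through an exploration rate that grows logarithmically with a node's visit count. A minimal sketch in that spirit (the constant values match the published pseudocode; the helper names here are illustrative):

```python
import math

# Constants as given in DeepMind's published pseudocode (pb_c_base, pb_c_init).
PB_C_BASE = 19652
PB_C_INIT = 1.25

def exploration_rate(parent_visits):
    # Grows logarithmically with the parent's visit count, so the search
    # explores more broadly the more simulations it has run.
    return math.log((parent_visits + PB_C_BASE + 1) / PB_C_BASE) + PB_C_INIT

def ucb_score(parent_visits, child_visits, child_prior, child_value):
    # Selection score for a child node: value estimate plus a prior-weighted
    # exploration bonus that shrinks as the child is visited more often.
    pb_c = exploration_rate(parent_visits) * math.sqrt(parent_visits) / (child_visits + 1)
    return child_value + pb_c * child_prior
```

For small searches the rate stays close to 1.25, but over hundreds of thousands of simulations it rises noticeably, which is what makes the search widen as it deepens.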

So does this wrap up AlphaZero for good now? Hardly. As Demis Hassabis was quick to point out recently, a new AlphaZero has been developed that is stronger than the one referenced in the paper. Be ready for new announcements!


GM King analysis

Grandmaster Daniel King analyses several of the new games from AlphaZero for his PowerPlay Show.


Replay all AlphaZero's games

 



Born in the US, he grew up in Paris, France, where he completed his Baccalaureat, and after college moved to Rio de Janeiro, Brazil. He had a peak rating of 2240 FIDE, and was a key designer of Chess Assistant 6. In 2010 he joined the ChessBase family as an editor and writer at ChessBase News. He is also a passionate photographer with work appearing in numerous publications.
Discussion and Feedback

Lyricist 12/7/2018 01:07
Promising but still unconvincing. The Deep Mind company is suspiciously dancing around it and the author of this article is dancing around it too.
celeje 12/7/2018 11:40
@ rokko: They had to run new games because the reviewers insisted on it, so it's the same paper but with revisions the reviewers forced them to do to patch over obvious flaws & get it accepted.
celeje 12/7/2018 11:35
This article is full of factual errors, just like the last one.
morphic6 12/7/2018 11:21
The playing field is changing, accept it people!
IvankoH 12/7/2018 11:18
they make big business of everything
let the engine go 1/1 and then see it: Stockfish 10 vs AlphaZero
Tom Box 12/7/2018 11:03
It is a great shame that Stockfish was not allowed to play at its full strength. I presume this was because of public relations.
rokko 12/7/2018 09:51
Are you sure that this paper provides details about the first match?

From another website I got the impression that this was a new match held in January 2018.