The future is here – AlphaZero learns chess

by Albert Silver
12/6/2017 – Imagine this: you tell a computer system how the pieces move — nothing more. Then you tell it to learn to play the game. And a day later — yes, just 24 hours — it has figured it out to the level that beats the strongest programs in the world convincingly! DeepMind, the company that recently created the strongest Go program in the world, turned its attention to chess, and came up with this spectacular result.

ChessBase 18 - Mega package ChessBase 18 - Mega package

Winning starts with what you know
The new version 18 offers completely new possibilities for chess training and analysis: playing style analysis, search for strategic themes, access to 6 billion Lichess games, player preparation by matching Lichess games, download Chess.com games with built-in API, built-in cloud engine and much more.

More...

DeepMind and AlphaZero

About three years ago, DeepMind, a company owned by Google that specializes in AI development, turned its attention to the ancient game of Go. Go had been the one game that had eluded all computer efforts to become world class, and even up until the announcement was deemed a goal that would not be attained for another decade! This was how large the difference was. When a public challenge and match was organized against the legendary player Lee Sedol, a South Korean whose track record had him in the ranks of the greatest ever, everyone thought it would be an interesting spectacle, but a certain win by the human. The question wasn’t even whether the program AlphaGo would win or lose, but how much closer it was to the Holy Grail goal. The result was a crushing 4-1 victory, and a revolution in the Go world. In spite of a ton of second-guessing by the elite, who could not accept the loss, eventually they came to terms with the reality of AlphaGo, a machine that was among the very best, albeit not unbeatable. It had lost a game after all.AlphaGo logo

The saga did not end there. A year later a new updated version of AlphaGo was pitted against the world number one of Go, Ke Jie, a young Chinese whose genius is not without parallels to Magnus Carlsen in chess. At the age of just 16 he won his first world title and by the age of 17 was the clear world number one. That had been in 2015, and now at age 19, he was even stronger. The new match was held in China itself, and even Ke Jie knew he was most likely a serious underdog. There were no illusions anymore. He played superbly but still lost by a perfect 3-0, a testimony to the amazing capabilities of the new AI.

Many chess players and pundits had wondered how it would do in the noble game of chess. There were serious doubts on just how successful it might be. Go is a huge and long game with a 19x19 grid, in which all pieces are the same, and not one moves. Calculating ahead as in chess is an exercise in futility so pattern recognition is king. Chess is very different. There is no questioning the value of knowledge and pattern recognition in chess, but the royal game is supremely tactical and a lot of knowledge can be compensated for by simply outcalculating the opponent. This has been true not only of computer chess, but humans as well.

However, there were some very startling results in the last few months that need to be understood. DeepMind’s interest in Go did not end with that match against the number one. You might ask yourself what more there was to do after that? Beat him 20-0 and not just 3-0? No, of course not. However, the super Go program became an internal litmus test of a sorts. Its standard was unquestioned and quantified, so if one wanted to test a new self-learning AI, and how good it was, then throwing it at Go and seeing how it compared to the AlphaGo program would be a way to measure it.

A new AI was created called AlphaZero. It had several strikingly different changes. The first was that it was not shown tens of thousands of master games in Go to learn from, instead it was shown none. Not a single one. It was merely shown the rules, without any other information. The result was a shock. Within just three days its completely self-taught Go program was stronger than the version that had beat Lee Sedol, a result the previous AI had needed over a year to achieve. Within three weeks it was beating the strongest AlphaGo that had defeated Ke Jie. What is more: while the Lee Sedol version had used 48 highly specialized processors to create the program, this new version used only four!

Graph showing the relative evolution of AlphaZero : Source: DeepMind

AlphaZero learns Chess

Approaching chess might still seem unusual. After all, although DeepMind had already shown near revolutionary breakthroughs thanks to Go, that had been a game that had yet to be ‘solved’. Chess already had its Deep Blue 20 years ago, and today even a good smartphone can beat the world number one. What is there to prove exactly?

Garry Kasparov is seen chatting with Demis Hassabis, founder of DeepMind | Photo: Lennart Ootes

It needs to be remembered that Demis Hassabis, the founder of DeepMind has a profound chess connection of his own. He had been a chess prodigy in his own right, and at age 13 was the second highest rated player under 14 in the world, second only to Judit Polgar. He eventually left the chess track to pursue other things, like founding his own PC video game company at age 17, but the link is there. There was still a burning question on everyone’s mind: just how well would AlphaZero do if it was focused on chess? Would it just be very smart, but smashed by the number-crunching engines of today where a single ply is often the difference between winning or losing? Or would something special come of it?

Professor David Silver explains how AlphaZero was able to progress much quicker when it had to learn everything on its own as opposed to analzying large amounts of data. The efficiency of a principled algorithm was the most important factor.

A new paradigm 

On December 5 the DeepMind group published a new paper at the site of Cornell University called "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm", and the results were nothing short of staggering. AlphaZero had done more than just master the game, it had attained new heights in ways considered inconceivable. The test is in the pudding of course, so before going into some of the fascinating nitty-gritty details, let’s cut to the chase. It played a match against the latest and greatest version of Stockfish, and won by an incredible score of 64 : 36, and not only that, AlphaZero had zero losses (28 wins and 72 draws)!

Stockfish needs no introduction to ChessBase readers, but it's worth noting that the program was on a computer that was running nearly 900 times faster! Indeed, AlphaZero was calculating roughly 80 thousand positions per second, while Stockfish, running on a PC with 64 threads (likely a 32-core machine) was running at 70 million positions per second. To better understand how big a deficit that is, if another version of Stockfish were to run 900 times slower, this would be equivalent to roughly 8 moves less deep. How is this possible?

The paper "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" at Cornell University

The paper explains:

“AlphaZero compensates for the lower number of evaluations by using its deep neural network to focus much more selectively on the most promising variations – arguably a more “human-like” approach to search, as originally proposed by Shannon. Figure 2 shows the scalability of each player with respect to thinking time, measured on an Elo scale, relative to Stockfish or Elmo with 40ms thinking time. AlphaZero’s MCTS scaled more effectively with thinking time than either Stockfish or Elmo, calling into question the widely held belief that alpha-beta search is inherently superior in these domains.”

This diagram shows that the longer AlphaZero had to think, the more it improved compared to Stockfish

In other words, instead of a hybrid brute-force approach, which has been the core of chess engines today, it went in a completely different direction, opting for an extremely selective search that emulates how humans think. A top player may be able to outcalculate a weaker player in both consistency and depth, but it still remains a joke compared to what even the weakest computer programs are doing. It is the human’s sheer knowledge and ability to filter out so many moves that allows them to reach the standard they do. Remember that although Garry Kasparov lost to Deep Blue it is not clear at all that it was genuinely stronger than him even then, and this was despite reaching speeds of 200 million positions per second. If AlphaZero is really able to use its understanding to not only compensate 900 times fewer moves, but surpass them, then we are looking at a major paradigm shift.

How does it play?

Since AlphaZero did not benefit from any chess knowledge, which means no games or opening theory, it also means it had to discover opening theory on its own. And do recall that this is the result of only 24 hours of self-learning. The team produced fascinating graphs showing the openings it discovered as well as the ones it gradually rejected as it grew stronger!

Professor David Silver, lead scientist behind AlphaZero, explains how AlphaZero learned openings in Go, and gradually began to discard some in favor of others as it improved. The same is seen in chess.

In the diagram above, we can see that in the early games, AlphaZero was quite enthusiastic about playing the French Defense, but after two hours (this so humiliating) began to play it less and less.

The Caro-Kann fared a good deal better, and held a prime spot in AlphaZero's opening choices until it also gradually filtered it out. So what openings did AlphaZero actually like or choose by the end of its learning process? The English Opening and the Queen's Gambit!

The paper also came accompanied by ten games to share the results. It needs to be said that these are very different from the usual fare of engine games. If Karpov had been a chess engine, he might have been called AlphaZero. There is a relentless positional boa constrictor approach that is simply unheard of. Modern chess engines are focused on activity, and have special safeguards to avoid blocked positions as they have no understanding of them and often find themselves in a dead end before they realize it. AlphaZero has no such prejudices or issues, and seems to thrive on snuffing out the opponent’s play. It is singularly impressive, and what is astonishing is how it is able to also find tactics that the engines seem blind to.

 

In this position from Game 5 of the ten published, this position arose after move 20...Kh8. The completely disjointed array of Black’s pieces is striking, and AlphaZero came up with the fantastic 21.Bg5!! After analyzing it and the consequences, there is no question this is the killer move here, and while my laptop cannot produce 70 million positions per second, I gave it to Houdini 6.02 with 9 million positions per second. It analyzed it for one full hour and was unable to find 21.Bg5!!

A screenshot of Houdini 6.02 after an hour of analysis

Here is another little gem of a shot, in which AlphaZero had completely stymied Stockfish positionally, and now wraps it up with some nice tactics. Look at this incredible sequence in game nine:

 

Here AlphaZero played the breathtaking 30. Bxg6!! The threat is obviously 30...fxg6 31. Qxe6+, but how do you continue after the game's 30...Bxg5 31. Qxg5 fxg6?

 

Here AlphaZero continued with 32. f5!! and after 32...Rg8 33. Qh6 Qf7 34. f6 obtained a deadly bind, and worked it into a win 20 moves later. Time to get a thesaurus for all the references synonymous of 'amazing'. 

What lies ahead

So where does this leave chess, and what does it mean in general? This is a game-changer, a term that is so often used and abused, and there is no other way of describing it. Deep Blue was a breakthrough moment, but its result was thanks to highly specialized hardware whose purpose was to play chess, nothing else. If one had tried to make it play Go, for example, it would have never worked. This completely open-ended AI able to learn from the least amount of information and take this to levels hitherto never imagined is not a threat to ‘beat’ us at any number of activities, it is a promise to analyze problems such as disease, famine, and other problems in ways that might conceivably lead to genuine solutions.  

For chess, this will likely lead to genuinely breakthrough engines following in these footsteps. That is what happened in Go. For years and years, Go programs had been more or less stuck where they were, unable to make any meaningful advances, and then came along AlphaGo. It wasn't because AlphaGo offered some inspiration to 'try harder', it was because just as here, a paper was published detailing all the techniques and algorithms developed and used so that others might follow in their footsteps. And they did. Literally within a couple of months, new versions of top programs such as Crazy Stone, began offering updated engines with Deep Learning, which brought hundreds (plural) of Elo in improvement. This is no exaggeration.

Within a couple of months, the revolutionary techniques used to create AlphaGo began to appear in top PC programs of Go

The paper on chess offers similar information allowing anyone to do what they did. Obviously they won't have the benefit of the specialized TPUs, a processor designed especially for this deep learning training, but nor are they required to do so. It bears remembering that this was also done without the benefit of many of the specialized programming techniques and tricks in chess programming. Who is to say they cannot be combined for even greater results? Even the DeepMind team think it bears investigating:

"It is likely that some of these techniques could further improve the performance of AlphaZero; however, we have focused on a pure self-play reinforcement learning approach and leave these extensions for future research."

Replay the ten games between AlphaZero and Stockfish 8 (70 million NPS)

Links


Born in the US, he grew up in Paris, France, where he completed his Baccalaureat, and after college moved to Rio de Janeiro, Brazil. He had a peak rating of 2240 FIDE, and was a key designer of Chess Assistant 6. In 2010 he joined the ChessBase family as an editor and writer at ChessBase News. He is also a passionate photographer with work appearing in numerous publications, and the content creator of the YouTube channel, Chess & Tech.

Discuss

Rules for reader comments

 
 

Not registered yet? Register

Enqwert Enqwert 12/7/2017 11:15
Alpha zero changes opening choises at it evolves, it remembers so it has opening book, a big advantage. Endgame table base and hardware differences also matter. A better match should be held.
dofski dofski 12/7/2017 11:10
@ketchuplover
"Now I want to see a program named AlphaGiri which is designed to only draw chess games and have it battle AlphaZero"
That is interesting.
Suppose AlphaGiri gets 1 point for a draw and 0 for a win or loss. Whereas AlphaZero gets 1, 1/2 or 0 as usual.

For AlphaZero to beat AlphaGiri it must go for a win as going for a draw loses and going for a loss draws. Whereas AlphaGiri will try always to draw.
fgkdjlkag fgkdjlkag 12/7/2017 11:04
Obviously there is an upper elo limit, I'm not sure why this is considered interesting or significant?

@Busho, the point is that the lack of an opening book for stockfish (when alphago trained itself with 700,000-1 million games), may have enabled it to get to the superior position in which Bg5 was possible. Stockfish was starting out with 1 minute for each of its opening moves.

Does the knowledge we now have from AlphaGo, AlphaZero, and AlphaShogi give us insight into which of the 3 is the more complex game?
dofski dofski 12/7/2017 10:44
@psamant
Are you suggesting that the levelling off in all three games is a kind of artefact a function of the ELO method. The paper says the graphs were produced during "self play reinforcement learning". If this means different copies of the program playing against each other, I think it is possible that the ELO of these copies could always increase in tandem indefinitely (when pitted against Stockfish). A very high ELO when tested against Stockfish is possible.

Yet there is a levelling off in all three games. To me it seems either there is some kind of ceiling to ELO of any machine including AlphaZero which would be a very interesting discovery OR the AlphaZero program needs to be improved to increase to shoot to another level. This seems against the concept in a way that the machine is learning by itself, maybe it can be made to learn by itself better.
ketchuplover ketchuplover 12/7/2017 10:36
Now I want to see a program named AlphaGiri which is designed to only draw chess games and have it battle AlphaZero.
Schnabelwolke Schnabelwolke 12/7/2017 09:58
The complete wisdom of chess in barely four hours sounds like a massive feat but I am not totally convinced yet. To me it looks like the guys from google could have taken some short cuts to get into the big press. Think about this: Take some reasonable open source chess program and let it run on considerably strong hardware a thousand times against poor Stockfish (deprived of its opening book), intelligently (and artificially) varying the moves. Quiet likely that there will arise, say, some 28 wins. And also quit likely some astounding stategic maneuvers will emergy by that (all that equally spectacular but less successful tries will quickly be forgotten again ...). Take an intelligent storing system to keep your successes and ready you are to fight against Stockfish (bare of its opening book). If run massively in parallel all this could be (and apparently has been) accomplished in just four hours.

Taken all this into account the whole story sound very much less unlikely to me.
LetoAtreides82 LetoAtreides82 12/7/2017 09:54
Igor the opening data is not from games played between AZ and Stockfish, it's from it's learning phase in which it played against itself.
vecter vecter 12/7/2017 09:53
So many butthurt people in the comments. Why?

It's over, AI chess is the future of chess.
ketchuplover ketchuplover 12/7/2017 09:44
At one time I had a book titled "The World's Worst Joke Book" or something similar to that. One anecdote was that of a man asking the strongest computer does God exist. The reply was it does now.

Be afraid. Be afraid^infinity
schack schack 12/7/2017 09:38
You have radically misrepresented the relative computing strength of the two computers.

AlphaZero generated the self-play games on 5000 1st gen TPUs and self-trained on 64 2nd gen TPUs. It then took the results of that training and used 4 TPUs to play.

I dont' have direct knowledge of what each of these TPUs can do. I have read that each one might be rated at 160Tflops. Even four of these TPUs is serious, serious hardware. The best consumer GPUs are at 11 or 12 Tflops, meaning that you need 60 top GPUs to match this hardware.

In contrast, Stockfish ran on 64 threads (what cpu? what ghz?) with 1 gb of hash. This is woefully misconfigured. Using the formula given by Robert Houdart in his Houdini helpme files, Stockfish should have had 42gb of ram. That's a heck of a difference.
Odomok Odomok 12/7/2017 07:35
We as human beings should be proud that the machine favours the English, discovered by ourselves. What if the machine would have said that 1g4 …2 c3 is the best opening?
reddawg07 reddawg07 12/7/2017 06:46
So what's next? I hope it follows in the footsteps of IBM's Watson. What can it offer beyond being a better chess or Go player. Houdini or Stockton is more than enough chessplaying ability for us humans. AlphaZero needs to learn other stuff that is essential for humans to tackle problems coming in the near future like the impact of climate change and energy.
cloudmann cloudmann 12/7/2017 06:43
I hate to spoil the Alphazero party, but in Game #3 listed on the website, would Stockfish 8 have REALLY played 49. -- Rf8? That was the move that lost the game for SF8. However, I don't think SF8 would have played that move. My analysis shows that the move would have probably been 49.-- Kf8 with an eval of (0.00) rather than SF8's eval of (+5.95) for 49.--Rf8. I would love to see SF8's hash on this move.
Bella-2 Bella-2 12/7/2017 05:13
Better not adjust this algorithm to solve mankind's problems. It could be that it came to the conclusion that mankind itself is the problem and therefore should be erased.
jsaldea12 jsaldea12 12/7/2017 05:10
Zvi, the position is not illegal and THERE IS A SOLUTION. JSALDEA

Zvi Mendlowitz 4 hours ago
@jsaldea12
The position is illegal. Also, there is no solution.

I am waiting for AZ to learn a programming language... Then it could make its own AI... which would make its own AI... the first one in 24 hours, the second one in 12 hours, the third one in 6 hours... after 48 hours - singularity :-)
jsaldea12jsaldea12 5 hours ago
• Dec. 7. 2017 (7:15PM)

Dr. Demis Hassalbis
Inventor Deep Mind Alpha Go

Sir:

Please see if Alpha Go can solve this 30 years in the making chess puzzle of mine. Would appreciate response from Deep Mind. White mates black in 9 moves.
Position: White: Pa2, Ka3, Pa4, Ba7, Ba8, Pb5, Pc2, Pd6,Pe2, Nf3, Ng1, Pg5
Black Pc3. Kc4, Pc5, B-c8, N-d5,Pd7, Pe3, P-g3, P-f6, Pg7, Ph6


Regards.
Toff Toff 12/7/2017 04:03
se non è vero... Let's prove it.
Igor Freiberger Igor Freiberger 12/7/2017 04:02
KingZor: no, the PLOTS in Table 2 are based on database results, the 0/0/0 data for each opening are games played by AZ and Stockfish.

AZ seems to be extremely strong, but the paper has flaws that let me question if its achievements were got under the described conditions.

Regarding other problems it could solve, please remember all this AI is based on situations with known endpoints within a delimited universe of variables (a closed problem). To, for example, find a heal for a disease is far more complex as it is an open problem —although AI could help on this, of course.
fons fons 12/7/2017 03:47
When I read AlphaZero learned chess from scratch in 4 hours and then trashed Stockfish I also wondered if -somehow- it was April fools. This is pretty amazing.

Quote from 33rdsquare: While AlphaZero remains a game playing domain-constrained AI, this work does appear to be a big step towards full AGI. (Artificial general intelligence.)

So yeah: those guys with their crazy predictions about general AI in the next few decades might actually be right. Start preparing for the singularity.


(In the comments of a previous article I also wondered how AlphaGo would do in chess based on the fact that you can't play chess on general principles alone. Although that was before I learned about AlphaGoZero and without knowledge of how AlphaGo actually worked apart from how it was described in popular science articles. Now we know the answer. AlphaZero does calculate (80 thousand positions per second), so I guess that's enough to deal with those pesky tactics.)
psamant psamant 12/7/2017 02:46
@dofski
"...why does AlphaZero approach asymptotically just about the same level as Stockfish about 3400 when it is learning by playing itself. Why not 3000 or 5000 or 10000. Does the 3400 represent some upper limit which is difficult to surpass"
Very interesting observation! Perhaps someone here can come up with a hypothesis for this? Is there some upper limit to ELO rating possibility, specially in the changed circumstances where having more opponents with high rating is no longer a constraint, considering that we have abundant computer programs with 3000 or more of rating levels.
lajosarpad lajosarpad 12/7/2017 02:39
Now it's time to write and execute a Conditional Functional Dependency search on the database of the engine. It might find rules like:

"double isolated pawns are powerful outposts for knight pairs"

whereas we, humans could only reach the conclusion that:

"double isolated pawns are bad... with a few exceptions"

A huge learning phase on cause-effect relationships would yield a lot of staggering aggregate data.
ltoime@gmail.com ltoime@gmail.com 12/7/2017 02:15
It is intriguing how different the Elo learning curves look for the different games. In Shogi, it fascinatingly didn’t always increase - there were occasional small sustained dips.
What happens when AZ plays against itself for a week, or a month, or a year?

Stay tuned.
A7fecd1676b88 A7fecd1676b88 12/7/2017 01:51
So a super computer can learn and figure out good moves, in a very simple game, if the rules don't change. Shocking.
That is not the real world, now is it?
For example, weather models will sometimes have chaos, and so the rules are variable. Humans can handle that. Computers ? Good luck
Yawn. A nothing burger.
dofski dofski 12/7/2017 01:51
Figures 1 and 2 in the paper show that the ELO difference between AlphaZero and Stockfish is "quite low" it seems roughly 100 or less. This seems to account for the score in the match between the two.

But in Figure 1 why does AlphaZero approach asymptotically just about the same level as Stockfish about 3400 when it is learning by playing itself. Why not 3000 or 5000 or 10000. It seems a funny coincidence.

Does the 3400 represent some upper limit which is difficult to surpass.
franis franis 12/7/2017 01:08
What is a proof that Bg5 is better than Be3 or b4?
KingZor KingZor 12/7/2017 01:04
Igor Freiberger, table 2 refers to human games. Interesting that there were only two draws in Shogi out of 100 games, which is about the same frequency as among the Japanese masters. A game about as complex as chess, if not more, but with much more decisive games.
Zvi Mendlowitz Zvi Mendlowitz 12/7/2017 12:55
@jsaldea12
The position is illegal. Also, there is no solution.

I am waiting for AZ to learn a programming language... Then it could make its own AI... which would make its own AI... the first one in 24 hours, the second one in 12 hours, the third one in 6 hours... after 48 hours - singularity :-)
jsaldea12 jsaldea12 12/7/2017 12:16
• Dec. 7. 2017 (7:15PM)

Dr. Demis Hassalbis
Inventor Deep Mind Alpha Go

Sir:

Please see if Alpha Go can solve this 30 years in the making chess puzzle of mine. Would appreciate response from Deep Mind. White mates black in 9 moves.
Position: White: Pa2, Ka3, Pa4, Ba7, Ba8, Pb5, Pc2, Pd6,Pe2, Nf3, Ng1, Pg5
Black Pc3. Kc4, Pc5, B-c8, N-d5,Pd7, Pe3, P-g3, P-f6, Pg7, Ph6

Regards.

Jose S. Aldea
Scientist-inventor
savantKing99 savantKing99 12/7/2017 11:40
it sound too good to be true. And yes of course it is a big step forwards. But after all it is just a simulation. And again AlphaZero will not pass the turing test.
It is just rubbish!! If you read the paper it says that it learned chess by itself in just 24 hours. But in the mean time it played like 1 million games with himself. So it created a openings book!! I don't know how to describe it. But Google just hide a phase in the program that it looks likes that it is self teaching. Apart all form this, of course it is genius. But has nothing to do with AI. Sorry
celeje celeje 12/7/2017 11:02
@bertman: The comment by @rokko below about comparing positions/sec is correct. AZ was not running on a slower computer. It is just calculating in a different way. If you must compare the hardware, note that AZ was running on a machine with 4 TPUs. Google itself claims its TPU is up to 30x higher performance than a contemporary CPU, so AZ is running on the equivalent of up to 120 CPUs. Please change your second paragraph in the section 'A New Paradigm', which is highly misleading at best.
Bojan KG Bojan KG 12/7/2017 10:57
I am fascinated by us humans, we are able to create wonders but at same time we can not cope with ancient virus like rabbies. If this algorithm is able to solve anything by learning on its own then this is one of the most important scientific breakthroughs in history of mankind. The claim that AplhaZero "has beaten" chess is most ridiculous statement I have heard of for many many years.
rokko rokko 12/7/2017 10:49
Finding Bg5 is indeed impressive and shows, like many well-known examples, that usual chess programmes prune too aggressively.

But SF's computer power was NOT great (70000 kN/s is a top desktop computer and 1GB of hash table is a 2010 laptop) and comparing numbers between programmes shows the difference in their approach but not the computing power.

AlphaZero seems to have played 700,000 games, i.e. the output of many years of professional chess, to "train". This is as if SF did not only have an opening book but the MegaDatabase (as a lot of it is old or amateur) as its disponsal.
Bammer Bammer 12/7/2017 10:45
I do not get your conclusion regarding openings: Does this mean the French and the CaroKann are not more than equalizing defenses? Disappointing. A quick look at the ECO classifications in the paper itself also makes me wonder why there is a line of the Bf4 Queens Gambit given as Reti Opening, a Grünfeld as a KingsIndian and a D15 Line ChebanenkoSlav is a D06 QG. Seems like there wasn't a chess player around to do a minimum of proof reading.
pcst pcst 12/7/2017 10:04
I think even Best Program still have Bug. test half second per game it will see it can solve problem of test bug mode.
Busho Busho 12/7/2017 09:57
To everyone complaining that Stockfish was handicapped, consider this: AlphaZero found 21.Bg5!!, and Stockfish couldn't find it even though it had significantly more compute power. Also, consider this: AlphaZero started with ZERO knowledge and was able to self-learn opening theory in 24 hours. It took centuries for humans to gain this kind of insight into chess openings.

This technology can be a game changer. No doubt in my mind.
Bojan KG Bojan KG 12/7/2017 09:51
Complete rubbish - what exactly does cracking or crushing chess mean? Total number of chess games is 10 to the power of 500, number of chess games up to 40 moves is 10 to the power of 120. Of course many of these do not make sense but they are there anyway. Can you just imagine computer power needed to solve chess in its entirety? Even 8-piece tablebase is currently out of reach let alone complete chess. When Komodo, SF and Houdini were developed many, as today, thought these are unbeatable and play perfect chess and it turned out to be big mistake. Every credit to the team who developed this algorithm but cracking/crushing chess? Give me a break.
celeje celeje 12/7/2017 09:44
@fgkdjlkag: I relied on the article in which Nakamura & Larry Kaufman both commented on there being no opening book. (The article does not mention anything about endgame tablebases, though.) I'm guessing the chess journalist interviewing them may have told them the match conditions, so if he's wrong, they'd both have gotten the wrong info.
Igor Freiberger Igor Freiberger 12/7/2017 09:14
Frederic and Albert: are you sure this paper is serious? A number of details are not coherent. Table 1 of the paper says there was a 100-game match between AZ and Stockfish with no defeats by AZ. But Table 2 shows results for the 12 most common openings with 50 games as White and 50 games as Black for each one. Three conclusion here: (1) There was a total of 1200 games between AZ and Stockfish, with 24 losses of AZ; (2) Opening choice was not free, they set both to play a defined number of games for each line; (3) AZ was already self-played tons of learning games when it faced Stockfish. So it was already “learnt” openings and have its own “book” (in form of theta parameters) while Stockfish was completely vulnerable on this part, a huge handcap. There was other points suspect, but I will not invest so much time analysing all this. AZ may be a relevant advance in AI, but the paper does not proves that.
pcst pcst 12/7/2017 09:13
I wolud like to test AlphaZero learns chess where can I test Bug of program. Or maybe AlphaZero learns chess can make to Thai chess program many player likes to test. Thanks for read my comment.
Busho Busho 12/7/2017 09:10
Looks like there are lots of Stockfish lovers here. It will take them a while to realize what just happened: decades of human hours spent on optimization and refinement of search algos & positional heuristics was surpassed by a single self-learning algo left alone to figure it all out in 24 hours.

For me, after the initial feeling of utter amazement, a very deep seated fear has settled in. Like Frederic said, it is the beginning of Skynet. So, now I guess we have to create a time machine, and then go back in time to make sure that Demis continues his chess career and doesn't switch to computer science. Just kidding.

I think this is great news, and let's hope this technology lives up to its promise and solves real world problems and not just chess problems.
Bank2010 Bank2010 12/7/2017 09:10
I had an argument with someone about when chess will be solved. I predicted in 10 years, he said more than 50 years. Now we are wrong. Chess will be solved whenever DeepMind wants.