Computers choose: who was the strongest player?

by ChessBase
10/30/2006 – Who is the best chess player of all time? Entire books have been devoted to the subject, but all have one major flaw: they are mainly subjective. Necessarily so, since there is no direct way of comparing Morphy to Fischer, Lasker to Kasparov. Or is there? Two scientists from Slovenia try it with computers and statistics. The results might surprise you.

ChessBase 18 - Mega package ChessBase 18 - Mega package

Winning starts with what you know
The new version 18 offers completely new possibilities for chess training and analysis: playing style analysis, search for strategic themes, access to 6 billion Lichess games, player preparation by matching Lichess games, download Chess.com games with built-in API, built-in cloud engine and much more.

More...

Who was The Strongest Chess Player of All Time?

Computer Analysis of World Chess Champions

By Matej Guid and Ivan Bratko
University of Ljubljana, Faculty of Computer and Information Science, Artificial Intelligence Laboratory, Ljubljana, Slovenia. This article is based on a paper by the same authors published in the ICGA Journal; full reference is given below.

Who is the best chess player of all time? Chess players are often interested in this question to which there is no well founded, objective answer, because it requires a comparison between chess players of different eras who never met across the board. With the emergence of high-quality chess programs a possibility of such an objective comparison arises. However, so far computers were mostly used as a tool for statistical analysis of the players' results.

Our approach was different: we were interested in the chess players' quality of play regardless of the game score, which we evaluated with the help of computer analyses of individual moves made by each player. A method to assess the difficulty of positions was designed, in order to take into account the differences in players' styles and to compensate for the fact that calm positional players in their typical games have less chance to commit gross tactical errors than aggressive tactical players. We also give a carefully chosen methodology for using computer chess programs for evaluating the true strength of chess players.

The fourteen classic-version World Champions, from the first World Chess Championship in 1886 to the present, were evaluated. Matches for the title of »World Chess Champion«, in which players contended for or were defending the title, were selected for analysis. Several different criteria were designed. The basis for evaluation was the difference between the position values resulting from the moves played by the human and the moves chosen as best by the chess program. We also calculated the average number of blunders and observed how would the players perform providing they would all deal with equally complex positions. Our analyses, among other things, also clearly show that the percentage of best moves played depends on analysed position itself and that is in very high correlation with the difference of best two moves evaluations (according to the computer): the bigger the difference between best two moves evaluations – the easier it is to find the best move. By observing the average material quantity during the games, we tried to determine players inclination to simplify positions.

Generally, our computer analysis seems to have produced sensible results that can be nicely interpreted by a chess expert. Anyway, many will find some of the results quite surprising. The winner according to the main criterion, where we measured average deviations between evaluations of played moves and best evaluated moves according to the computer, is Jose Raul Capablanca, the 3rd World Champion.
As we did in the study, this result should be interpreted in the light of the comparatively low complexity of positions in Capablanca's games. Anyway, he was also on top according to other criteria where we measured quality of play and was only beaten according to one criterion (albeit a very important one), namely the quality of play provided that all players dealt with equally complex positions, by the present World Champion, Vladimir Kramnik. Both Capablanca and Kramnik distinctly deviated from the rest of the players.

Separatelly, we conducted analysis of the latest World Championship match between Kramnik and Topalov. The results (mean loss per move: Kramnik 0.1220, Topalov 0.1328; only the games with classical time control were taken into account) tell that Kramnik's play was somewhat better, while the overall quality of play in the match was on quite decent level (comparing to other World Championship matches), although the record (Kramnik's 0.0903 in the London 2000 match against Kasparov) was really not in danger. Note that the lower this measure the better the performance.

The chess program Crafty was used to perform the analyses. We needed an open source program in order to slightly modify it, as is described in the article. One may argue that Crafty is weaker than at least some of the fourteen World Champions who were taken into consideration. However, altogether more than 37,000 positions were evaluated and even if evaluations are not always perfect, for our analysis they just need to be sufficiently accurate on average since small occasional errors cancel out through statistical averaging. Anyway, we would like to encourage other researchers that might have access to source code of the strongest commercial chess programs, to modify and run them in the way we proposed, in order to obtain a comparison between various different engines as well.

The basic criterion for evaluating World Champions was the average difference between moves played and best evaluated moves by computer analysis. According to this analysis, the winner was the third World Champion, Jose Raul Capablanca. This result should be interpreted in the light of the comparatively low complexity of positions in Capablanca's games which is quite in line with the known assessments in the chess literature of his style. For example, Garry Kasparov in his set of books My Great Predecessors when commenting Capablanca's games speculates that Capablanca occasionally did not even bother to calculate deep tactical variations. The Cuban simply preferred to play moves that were clear and positionally so strongly justified that calculation of variations was simply not necessary. He also describes Capablanca with the following words: “He contrived to win the most important tournaments and matches, going undefeated for years (of all the champions he lost the fewest games).” and “his style, one of the purest, most crystal-clear in the entire history of chess, astonishes one with his logic.”

The results of blunder-rate measurement are similar. We expected positional players to perform better by this criterion than tactical players, since in quiet positions there were less opportunities to blunder. Note the excelent result of Tigran Petrosian who is widely renowned as a pure positional player. In compliance with this observation, Steinitz, who lived in an era of tactical romantic chess, took clearly last place.

Capablanca is renowned for playing a 'simple' chess and avoiding complications, while it is common that Steinitz and Tal faced many 'wild' positions in their games. The results of complexity measurement clearly coincide with this common opinion.

The method for assesing the complexity of positions is described in detail in the original article (see also the reference below). Graph of errors made by players at different levels of complexity clearly indicates the validity of the chosen measure of complexity of positions; the players made little errors in simple positions, and the error rate increased with increasing complexity.

We used the above mentioned metric of position complexity to determine the distribution of moves played across different intervals of complexity, based on positions that players had faced themselves. This, in turn, largely defines their style of play. For example, Capablanca had much less dealing with complex situations compared to Tal, who is to be regarded as a tactical player.

The main deficiency of the two criteria, as detailed in the previous subsections, is in the observation that there are several types of players with specific properties, to whom the criteria do not directly apply. It is reasonable to expect that positional players in average commit fewer errors due to the somewhat less complex positions in which they find themselves as a result of their style of play, than tactical players. The latter, on average, deal with more complex positions, but are also better at handling them and use this advantage to achieve excellent results in competition. We wanted to determine how players would perform when facing equally complex positions. The winner was the fourteenth World Champion Vladimir Kramnik. Kramnik also had the best performance of all the matches; his average error in his match against Kasparov (London, 2000) was only 0.0903. We also tried to determine how well the players would play, should they all play in the style of Capablanca, Tal, etc. It is interesting to notice that Kasparov would outperform Karpov, providing they both played in Tal's style.

The percentage of best moves played alone does not actually describe the quality of a player as much as one might expect. In certain types of position it is much easier to find a good move than in others. Experiments showed that the percentage of best moves played is highly correlated to the difference in evaluations of the best and second-best move in a given position. The greater the difference, the better was the percentage of player's success in making the best move.

Based on that observation, another criterion was the expected number of best moves played providing that all players dealt with positions with equal difference between the best two moves, as was described in the previous section. It represents another attempt to bring the champions to a common denominator. See the results right below.

Kramnik, Fischer, and Alekhine had the highest percentage of best moves played, but also the above-mentioned difference was high. In contrast, Capablanca, who was right next regarding the percentage of the best move played, on average dealt with the smallest difference between the best two moves. The winner by this criterion was once again Capablanca. He and Kramnik again clearly outperformed the others.

The purpose of calculating the average material quantity, that is the sum of the numerically expressed values of all pieces on board, was not to determine the quality of play, but to collect additional information on a player's style of play. We mainly tried to observe a player's inclination to simplify positions.

Among the players who stand out from the others, Kramnik obviously dealt with less material on board (remember his early queen exchanges in his Berlin Wall games against Kasparov?). The opposite could be said for Steinitz, Spassky, and Petrosian.

The authors

Matej Guid has received B.Sc. degree in computer science at the University of Ljubljana. He works on his Ph.D. thesis at Artificial intelligence Laboratory, Faculty of Computer and Information Science of Ljubljana University. His present research includes machine learning, computer chess, heuristic programming and robot learning by experimentation. At the present time he works on programming an automated Chess Tutor that will be able to intelligently comment chess games in a language comprehensible to humans. Chess has been one of Matej's favourite hobbies since he was a little kid, he was also a junior champion of Slovenia a couple of times.
Ivan Bratko is professor of computer science at University of Ljubljana, Slovenia. He is head of Artificial intelligence Laboratory, Faculty of Computer and Information Science of Ljubljana University, and has conducted research in machine learning, knowledge-based systems, qualitative modelling, intelligent robotics, heuristic programming and computer chess (do you know the famous Bratko-Kopec test?). Professor Bratko has published over 200 scientific papers and a number of books, including the best-selling Prolog Programming for Artificial Intelligence. Chess is one of his favourite hobbies.

The original paper was published in the ICGA Journal, Vol 29, No. 2, June 2006, pages 65-73. It was also presented at the 5th International Conference on Computers and Games, May 29-31 2006, Turin, Italy, and was published in the proceedings book.


Reports about chess: tournaments, championships, portraits, interviews, World Championships, product launches and more.

Discuss

Rules for reader comments

 
 

Not registered yet? Register