Centipawn Analysis: evaluating strength with an engine

by Albert Silver
11/1/2022 – Using an engine to find out how many moves in a game match its top choice, is but one way to measure a move's quality with an engine. Another key way is what analysts such as Ken Regan use: a tool to compare the engine's evaluation of a move with its own choice of best move, and see by how much the evaluations differ. Now you can do this too using the Centipawn Analysis tool in ChessBase 16!

ChessBase 18 - Mega package ChessBase 18 - Mega package

Winning starts with what you know
The new version 18 offers completely new possibilities for chess training and analysis: playing style analysis, search for strategic themes, access to 6 billion Lichess games, player preparation by matching Lichess games, download Chess.com games with built-in API, built-in cloud engine and much more.

More...

The Let's Check option is a fascinating way to compare your moves with the top choices of other engines, but imagine you played the second best move every time, yet only a small 0.05 centipawn difference from the top best move. Let's Check will understandably just say you had a 0% Engine Correlation, since none of your moves match the top choice, while failing to give a more complete picture of your play. That is where the Centipawn Analysis comes in. With it, you will enjoy a more thorough evaluation of each player's performance in a game or tournament.

How Centipawn Analysis works

Centipawn Analysis is a feature that can be executed as an independent game review with an engine, or be included as part of the Tactical Analysis. In a nutshell, it analyzes every move to see what the best engine move was and its evaluation and then the move played and its evaluation, and compares the relative centipawn loss, if any.

If you open a single game in ChessBase, you can also run a pure and quick analysis with it by going to the Analysis menu and selecting Centipawn Analysis.

For example, let's suppose this is the position being evaluated:

 

Aronian played 17. f3 here which the engine gives as second best, albeit by a minuscule margin:

Assuming no other engine disagrees with this ranking, Let's Check will report this as a miss since it does not match an engine's top choice. In contrast Centipawn Analysis won't care about the overall ranking of the move, and only the difference in evaluation, which in this case is 0.07 centipawns.

ChessBase 16 - Mega package Edition 2022

Your key to fresh ideas, precise analyses and targeted training!
Everyone uses ChessBase, from the World Champion to the amateur next door. It is the program of choice for anyone who loves the game and wants to know more about it. Start your personal success story with ChessBase and enjoy the game even more.

Like Let's Check, the Centipawn Analysis will provide a report at the end of the game, informing you of the average error rate during the game. Here is a comparison of their evaluations of the same game. First we have the Let's Check summary:

Base on pure top engine matches, Anand's result was solid, but nothing special. However when looking at it with the Centipawn Analysis we get a very different impression:

It tells us that Vishy's play was near flawless and his average error rate was exceptionally low. What is the Weighted Error Value compared to the Centipawn Analysis seen in the image above? ChessBase has an algorithm that highlights moves of greater value, and moves of lower value. Suppose there was a sequence of forced recaptures. These might be played 'flawlessly', but they were not really representative of a higher standard of play. In the Weighted Error Value those moves are ignored. There are other factors, but the point is the Weighted Error Value takes other things into consideration when giving its final verdict.

One of the biggest values is that it is effective at evaluating a game that was lost via ten moves yielding 0.15 centipawns, accumulating to a total of -1.50, or nine perfect moves and one blunder costing 1.50. The centipawn analysis will give the same error rate in the end, since losing one way or the other makes little difference in the end, but it will bring up this breakdown at the summary at the end all the same.

So how good is 0.04? It that really 'exceptionally low' as I stated? To put that in perspective, an elite grandmaster tournament is usually won with a tournament average error rate in the 0.12-0.14 range. How do I know this?

Analyzing a player or a tournament

ChessBase allows to do more than just find the report of a single game, it allows you to run a batch of such analyses on a group of games and tell you the average performance of everyone. Let's imagine we want to analyze the complete Sinquefield Cup recently finished and see how the players did overall. 

First we open up a list of all the games of the tournament, then highlight the same you want analyzed.

Now go to the Analysis tab and click on Centipawn Analysis. Although it will give a full report in each and every game, it will do more than that. 

It will provide you with a full summary on the run, as well as charts depicting their relative centipawn error rate performance compared to their Elo performance.

But if you want to be able to work with the data you need not content yourself with this. In the folder where the actual database is located, an Excel spreadsheet will be located with detailed breakdowns.

Important: Be warned it will create the Excel file named after the database you did the analysis on, so if you were to run the centipawn analysis on a subset of games of Mega 2022 for example, then each time you ran the centipawn analysis, it would overwrite the previous Excel file with the same name.

Fat Fritz 2

Fat Fritz 2.0 is the successor to the revolutionary Fat Fritz, which was based on the famous AlphaZero algorithms. This new version takes chess analysis to the next level and is a must for players of all skill levels.

Conclusion

Centipawn is a fascinating atemporal way of testing the pure quality of play based on engine analysis, and can be used to test a game, even yours, to see how you are doing. As always, be careful about trying to reach verdicts on single examples, since even I have short wins that enjoy a score below 0.10, whilst fully aware I could never maintain that as an average.

Links

 


Born in the US, he grew up in Paris, France, where he completed his Baccalaureat, and after college moved to Rio de Janeiro, Brazil. He had a peak rating of 2240 FIDE, and was a key designer of Chess Assistant 6. In 2010 he joined the ChessBase family as an editor and writer at ChessBase News. He is also a passionate photographer with work appearing in numerous publications, and the content creator of the YouTube channel, Chess & Tech.

Discuss

Rules for reader comments

 
 

Not registered yet? Register