Let's Check: the elite are better than you know

by Albert Silver
10/17/2022 – There are several ways to check a player's performance using an engine. One is to simply ask an engine to analyze every move and highlight every disagreement however small. Another is to use the tool in ChessBase and Fritz known as Let's Check. Here are the results from the recent Sinquefield Cup including a 100% match and the curiously high results by...

There are three main ways today to evaluate a player using an engine. This is not to be confused with annotating a game.

The most sophisticated one is used by analysts such as Dr. Ken Regan, who compile the average error rates of players per level. For example, I might lose an average of 0.15 pawns per move compared to the engine's best, while a top GM might lose only 0.02. His system goes deeper than this, but that is still the foundation on which it rests, and it will be better at catching 'smart cheaters' than a more basic system such as those below.
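To make the idea of an average loss per move concrete, here is a minimal sketch in Python. It is only an illustration of the foundation described above and not Dr. Regan's actual model. The function name, the use of centipawns (hundredths of a pawn), and the sample evaluations are all assumptions made for the example.

```python
def average_loss_cp(best_evals_cp, played_evals_cp):
    """Average evaluation loss per move, in centipawns.

    best_evals_cp[i]   -- engine evaluation after its own best move (assumed input)
    played_evals_cp[i] -- engine evaluation after the move actually played
    Both are taken from the point of view of the player being measured.
    """
    if len(best_evals_cp) != len(played_evals_cp):
        raise ValueError("both lists must cover the same moves")
    if not best_evals_cp:
        raise ValueError("need at least one move")
    # A played move cannot count as better than the engine's best, so clamp at zero.
    losses = [max(0, best - played)
              for best, played in zip(best_evals_cp, played_evals_cp)]
    return sum(losses) / len(losses)


# Hypothetical four-move sample: two moves match the engine, two lose 0.30 pawns each.
best = [30, 50, 20, 40]
played = [30, 20, 20, 10]
print(average_loss_cp(best, played) / 100, "pawns lost per move on average")
```

With these made-up numbers the average works out to 15 centipawns, which is the 0.15 pawns per move used in the example above.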

The simplest is just to analyze a game with an engine and ask it to highlight every move it disagrees with, however small the difference. The obvious risk is that in some positions there might be three roughly equal moves, and three different engines might each prefer a different one.

Imagine you are analyzing with only Stockfish, and it says that five moves out of ten are not a match. This might overlook that two of the moves that don't match its choices are chosen by another top engine such as Komodo Dragon 3. In other words, only five match Stockfish, but seven in all match top engine choices. That is the underlying point of Let's Check. When you analyze a game with it, it will not only tell you what a variety of engines thought of each move, it will also give you a summary called Engine Correlation at the top, showing the percentage of times a player's moves matched the top choice of an engine.

However, unlike a plain engine comparison, it won't compare with just one engine's top move. It will compare with several, and if the move matches the top choice of any of those engines, it counts as a match for Engine Correlation.
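The matching rule can be sketched in a few lines of Python. This is an illustration of the idea as described here, not the actual Let's Check code, and the function name, the data layout and the placeholder moves are assumptions. It reproduces the five-versus-seven example above.

```python
def engine_correlation(played, engine_choices):
    """Percentage of played moves that match the top choice of at least one engine.

    played         -- list of the moves played, in order
    engine_choices -- dict mapping an engine name to its list of top choices,
                      one per played move (hypothetical layout)
    """
    if not played:
        raise ValueError("need at least one move")
    matched = 0
    for i, move in enumerate(played):
        # One agreeing engine is enough for the move to count as a match.
        if any(choices[i] == move for choices in engine_choices.values()):
            matched += 1
    return 100.0 * matched / len(played)


# Ten placeholder moves. Stockfish agrees with the first five,
# Komodo Dragon 3 agrees with two of the moves Stockfish rejected.
played = ["m1", "m2", "m3", "m4", "m5", "m6", "m7", "m8", "m9", "m10"]
stockfish = ["m1", "m2", "m3", "m4", "m5", "x", "x", "x", "x", "x"]
dragon = ["y", "y", "y", "y", "y", "m6", "m7", "y", "y", "y"]

only_stockfish = engine_correlation(played, {"Stockfish": stockfish})
both_engines = engine_correlation(
    played, {"Stockfish": stockfish, "Komodo Dragon 3": dragon}
)
print(only_stockfish, both_engines)
```

Against Stockfish alone the sample scores five matches out of ten, while with both engines it scores seven out of ten.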


Sinquefield Cup

Recently there were several claims about high Engine Correlation matches between Hans Niemann's games and the Let's Check choices, so out of curiosity I ran a complete Let's Check on all the games in the recent Sinquefield Cup and I must say the results were unexpected.

The first result to come out was that one player did actually obtain a 100% match. This was not the result of some ultra-short draw, since Let's Check will ignore theory moves, and games with too few moves played. For example, a game that was 28 moves long but had 20 moves of theory will not be eligible for an Engine Correlation result. Who is this engine-matching wonder? Wesley So.

In his game against Ian Nepomniachtchi, the American player achieved a 100% Engine Correlation score. However, he was not the star performer overall in terms of such measurements, since it was his only game over 80%. No, one player managed to score three times in excess of 90% engine correlation. Aha! I hear you cry out. We have him! So who is this chess engine-like god?

Levon Aronian had several of the highest quality games according to Let's Check

Meet Levon Aronian, late-bloomer extraordinaire, who had an engine correlation of 92% against Caruana (who himself has a 96% correlation in that same game) over 45 moves, 91% against Wesley So in 43 moves, and 91% against Magnus Carlsen in 36 moves. Plus two more games with over 80%.

He was not quite alone though: none other than Ian Nepomniachtchi had two games over 90% as well, plus several over 80%, showing the quality of play that led him to win the Candidates this year. Note that he had an average 78% engine correlation for the entire Candidates, 11% more than second-best Caruana.

The burning question on your mind, dear reader, is what about Hans? In terms of engine correlation, Hans was the worst. His best game, with an 88% match over 55 moves, was in round seven against Maxime Vachier-Lagrave. In his game against Carlsen it was a modest 68%, but of course Magnus was playing dreadfully that day, and had only 37%.

The mythical 100%

So how rare is 100% after all? It is rare, but not as rare as you might think. I ran some random checks through games in 1999-2000 as I was curious about Kasparov and Kramnik. All in all I had some 150 eligible games, maybe fewer, yet it turned up a higher-than-expected number of perfect matches.

For example, the rapid games at the Amber tournament had several 100% matches, including Jeroen Piket in one, and Kramnik in another. And against Topalov no less... Memories of Toiletgate. There were also two(!) by Kasparov in Bosnia in 1999, another in Bosnia in 2000, one more by Kramnik in the World Knockout event against Korchnoi over 41 moves and later one by Michael Adams against Vlad in that same event.

However, there is a caveat that must be mentioned when using such tools. It is eminently possible to game the system to show a 100% match where it normally might not. You see, when doing a Let's Check analysis within Fritz, you have the option of providing your own engine, and then telling it to only use it for moves that did not match engine choices. In other words, you are trying to find an engine it will match. And if it does, the engine correlation will improve.

 
Kasparov vs I. Sokolov, (Bosnia 2000)

Originally, this game was only a 90% match, with no engine choosing Garry Kasparov's 16.cxd5 for example. After trying several, I found an engine that chose it, and entered it as another Let's Check choice. Now the tally reads:

So yes, the results can absolutely be manipulated by the unscrupulous. A telltale sign might be in the engines listed. If a new game shows Stockfish 14+, Komodo 12+ and so on, it should be fine, but if you see some very old engines or odd names for that same new game, be on your guard, as they may have been used only to get an extra match. 
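The effect of this kind of manipulation is easy to see in a small Python sketch. It is a toy model under stated assumptions and not the way Fritz stores its results: the function names, the engine names and the placeholder moves are all hypothetical. An extra engine consulted only on the unmatched moves can never lower the score, it can only raise it.

```python
def correlation(played, engine_choices):
    """Percentage of played moves matched by at least one engine.

    engine_choices maps an engine name to a dict {move index: top choice},
    so an engine may have been consulted on only some of the moves.
    """
    matched = sum(
        any(choices.get(i) == move for choices in engine_choices.values())
        for i, move in enumerate(played)
    )
    return 100.0 * matched / len(played)


def unmatched(played, engine_choices):
    """Indices of the played moves that no engine chose."""
    return [
        i for i, move in enumerate(played)
        if not any(choices.get(i) == move for choices in engine_choices.values())
    ]


# Ten placeholder moves; the regular engine agrees with all but the last one.
played = [f"move{i}" for i in range(1, 11)]
engines = {"Engine A": {i: played[i] for i in range(9)}}
before = correlation(played, engines)

# Consult a hand-picked extra engine on the unmatched moves only,
# and assume it happens to agree with the player there.
missing = unmatched(played, engines)
engines["Old Engine X"] = {i: played[i] for i in missing}
after = correlation(played, engines)
print(before, "->", after)
```

In this made-up case a single agreeing engine on a single move turns a 90% result into 100%, which is why the list of engines behind a result deserves a look.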


Regardless, here is the signature win by Kasparov with notes from Mega Database:

 
1.e4 e5 2.Nf3 Nf6 3.Nxe5 d6 4.Nf3 Nxe4 5.d4 d5 6.Bd3 Be7 7.0-0 Nc6 8.c4 Nb4 9.Be2 0-0 10.Nc3 Bf5 11.a3 Nxc3 12.bxc3 Nc6 13.Re1 Bf6 14.Bf4 Ne7?! 14...Na5 15.cxd5 Qxd5 16.Bf1 b6 17.Ne5 Rad8 18.g4 Be4 19.Qe2 Bxe5 20.Bxe5 Nb3 21.Ra2 Bf3 22.Qe3 Na5 23.Rc2 f6 24.Bxc7 Anand,V-Sokolov,I/Chess@iceland rapidplay Group B, Kopavo 2000/1-0 (56) 15.Qb3 b6 15...Rb8 16.Be5 16.cxd5 Nxd5 17.Be5 Bg4?! 17...c6 18.c4 Nc7 19.Bd3 18.Rad1 Be7?! 19.h3 Bh5 20.g4!± Bg6 21.Bg3 21.Bb5!? 21...Nf6 21...a5 22.a4 Nf6 23.Ne5 22.Ne5 Ne4? 22...Be4 23.Bf3+- 23.Nxg6 hxg6 24.Bf3 Nxg3 25.Bxa8 Qxa8 26.fxg3 Qf3 27.Rxe7 Qxg3+ 28.Kf1 Qxh3+ 29.Ke2 Qxg4+ 30.Kd2 Qg5+ 31.Re3+- 23...Nxg3 23...Bd6 24.Bxe4 Bxe4 25.Rxe4+- 24.Nc6! 24.fxg3 Qd6 24...Qd6 25.Nxe7+ 25.Rxe7?! Qf4 26.Qd5 25...Kh8 26.Bxa8 26.Bxa8 Rxa8 27.Nxg6+ 27.fxg3?? Qxg3+ 28.Kf1 Qf3+ 29.Kg1 Qg3+ 30.Kf1 Qf3+ 31.Kg1 Qg3+= 27...Qxg6 27...fxg6 28.Qf7 h5 29.Re7 Rg8 30.Rd3+- 27...hxg6 28.Qxf7 Rf8 28...Qc6 29.fxg3 Qxc3 30.Kg2+- 29.Re8 Ne2+ 30.Kf1+- 28.fxg3+- 1–0
Kasparov,G (2851) vs Sokolov,I (2637), 1–0, Sarajevo Bosnia 30th, 2000, round 5, ECO C42

Conclusion

Does this in any way invalidate the use of a tool such as Let's Check? Of course not, but like all such tools, it must be used with good sense and judgement. The fact that modern elite players can rattle off multiple games with such extraordinarily high engine matches is a testament to the increasing overall quality of the chess players, since the engines they are matching today are also hundreds of Elo stronger than the engines of a decade ago. These players are also studying and learning from the engines, and that increase in pure ability is a consequence of it.

 


Born in the US, Albert Silver grew up in Paris, France, where he completed his Baccalaureat, and after college moved to Rio de Janeiro, Brazil. He had a peak rating of 2240 FIDE, and was a key designer of Chess Assistant 6. In 2010 he joined the ChessBase family as an editor and writer at ChessBase News. He is also a passionate photographer with work appearing in numerous publications, and the content creator of the YouTube channel, Chess & Tech.
