Let's Check: the elite are better than you know

There are three main ways today to evaluate a player using an engine. This is not to be confused with annotating a game.

The most sophisticated one is used by analysts such as Dr. Ken Regan, who compile the average error rates of players at each level. For example, I might lose an average of 0.15 pawns per move compared to the engine's best, while a top GM might lose only 0.02. His system goes deeper than this, but that is still the foundation on which it rests, and it will be better at catching 'smart cheaters' than a more basic system such as the one below.
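To make the idea of an average error rate concrete, here is a minimal Python sketch. It is not Dr. Regan's actual model, only the basic averaging described above. The function name `average_loss` and the evaluation numbers are invented for illustration, and it assumes you already have, for each move, the engine's evaluation of its own best move and of the move actually played, both in pawns from the mover's point of view.

```python
from statistics import mean


def average_loss(best_evals, played_evals):
    """Average evaluation lost per move, in pawns.

    best_evals[i]   -- engine evaluation after its own best move at move i
    played_evals[i] -- engine evaluation after the move actually played
    Both are assumed to be from the point of view of the player to move.
    """
    if not best_evals or len(best_evals) != len(played_evals):
        raise ValueError("need two non-empty lists of equal length")
    # A played move never counts as better than the engine's best,
    # so negative differences are clamped to zero.
    losses = [max(0.0, best - played)
              for best, played in zip(best_evals, played_evals)]
    return mean(losses)


# Invented numbers: three moves, losing 0.00, 0.10 and 0.30 pawns.
print(round(average_loss([0.30, 0.10, 0.50], [0.30, 0.00, 0.20]), 3))
```

A real system would also have to decide which moves to count at all and how to weigh positions that are already decided, which this sketch ignores.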

The simplest is just to analyze a game with an engine and ask it to highlight every move it disagrees with, however small the difference. The obvious risk is that in some positions there may be three roughly equal moves, and three engines might each prefer a different one.

Imagine you are analyzing with only Stockfish, and it says that five moves out of ten are not a match. This might overlook that two of the moves that don't match its choices are chosen by another top engine such as Komodo Dragon 3. In other words, only five match Stockfish, but seven in all match top engine choices. That is the underlying point of Let's Check. When you analyze a game with it, it will not only tell you what a variety of engines thought of each move, it will also give you a summary called Engine Correlation at the top, showing the percentage of times a player's moves matched the top choice of an engine.

However, unlike a plain engine comparison, it won't compare with just one engine's top move. It will compare with the top moves of several engines, and if the move played matches the choice of any of them, it counts as a match for Engine Correlation.
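The counting rule can be sketched in a few lines of Python. This is only an illustration of the "match any engine" idea as described here, not the actual Let's Check implementation. The function name `engine_correlation`, the placeholder move labels (`m1`, `x6` and so on) and the engine choices are all made up to reproduce the five-out-of-ten versus seven-out-of-ten example.

```python
def engine_correlation(played_moves, engine_choices):
    """Percentage of played moves that match the top choice of ANY engine.

    played_moves   -- list of moves as strings
    engine_choices -- dict mapping an engine name to its list of top
                      choices, one per played move
    """
    if not played_moves:
        raise ValueError("no moves to compare")
    matches = 0
    for i, move in enumerate(played_moves):
        # One agreeing engine is enough for the move to count as a match.
        if any(choices[i] == move for choices in engine_choices.values()):
            matches += 1
    return 100.0 * matches / len(played_moves)


# Placeholder labels instead of real moves; "x.." and "y.." are other choices.
played = ["m1", "m2", "m3", "m4", "m5", "m6", "m7", "m8", "m9", "m10"]
stockfish = ["m1", "m2", "m3", "m4", "m5", "x6", "x7", "x8", "x9", "x10"]
komodo = ["y1", "y2", "y3", "y4", "y5", "m6", "m7", "y8", "y9", "y10"]

print(engine_correlation(played, {"Stockfish": stockfish}))  # 50.0
print(engine_correlation(played, {"Stockfish": stockfish,
                                  "Komodo Dragon 3": komodo}))  # 70.0
```

Note that adding an engine to the pool can only keep the percentage the same or raise it, never lower it.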

Sinquefield Cup

Recently there were several claims about high Engine Correlation matches between Hans Niemann's games and the Let's Check choices, so out of curiosity I ran a complete Let's Check on all the games in the recent Sinquefield Cup and I must say the results were unexpected.

The first result to come out was that one player did actually obtain a 100% match. This was not the result of some ultra-short draw, since Let's Check will ignore theory moves, and games with too few moves played. For example, a game that was 28 moves long but had 20 moves of theory will not be eligible for an Engine Correlation result. Who is this engine-matching wonder? Wesley So.

In his game against Ian Nepomniachtchi, the American player achieved a 100% Engine Correlation score. However, he was not the star performer overall in terms of such measurements, since it was his only game over 80%. No, one player managed to score three times in excess of 90% engine correlation. Aha! I hear you cry out. We have him! So who is this chess engine-like god?

Levon Aronian had several of the highest quality games according to Let's Check

Meet Levon Aronian, late-bloomer extraordinaire, who had an engine correlation of 92% against Caruana (who himself had a 96% correlation in that same game) over 45 moves, 91% against Wesley So in 43 moves, and 91% against Magnus Carlsen in 36 moves. Plus two more games with over 80%.

He was not quite alone though: none other than Ian Nepomniachtchi had two such games over 90% as well, plus several over 80%, showing the quality of play that led him to win the Candidates this year. Note that he had an average 78% engine correlation for the entire Candidates, 11% more than second-best Caruana.

The burning question on your mind, dear reader, is what about Hans? In terms of engine correlation, Hans was the worst. His best game, with an 88% match over 55 moves, was in round seven against Maxime Vachier-Lagrave. In his game against Carlsen it was a modest 68%, but of course Magnus was playing dreadfully that day, and had only 37%.

The mythical 100%

So how rare is 100% after all? It is rare, but not as rare as you might think. I ran some random checks through games from 1999-2000, as I was curious about Kasparov and Kramnik. All in all I had some 150 eligible games, maybe fewer, yet it turned up a higher-than-expected number of perfect matches.

For example, the rapid games of the Amber tournament had several 100% perfect games, including Jeroen Piket in one, and Kramnik in another. And against Topalov no less... Memories of Toiletgate. There were also two(!) by Kasparov in Bosnia in 1999, another in Bosnia in 2000, one more by Kramnik in the World Knockout event against Korchnoi over 41 moves, and later one by Michael Adams against Vlad in that same event.

However, there is a caveat that must be mentioned when using such tools. It is eminently possible to game the system to show a 100% match where it normally might not. You see, when doing a Let's Check analysis within Fritz, you have the option of providing your own engine, and then telling it to only use it for moves that did not match engine choices. In other words, you are trying to find an engine that the move will match. And if it does, the engine correlation will improve.
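A small Python sketch shows why this works. It only models the bookkeeping and does not use any real Fritz or Let's Check interface. The helper names `correlation` and `apply_extra_engine`, the placeholder move labels and the numbers are hypothetical.

```python
def correlation(matched):
    """Percentage of moves flagged as matching at least one engine."""
    return 100.0 * sum(matched) / len(matched)


def apply_extra_engine(played_moves, matched, extra_choices):
    """Consult an extra engine ONLY on moves that are still unmatched.

    played_moves  -- list of moves as strings
    matched       -- list of booleans, True if some engine already agreed
    extra_choices -- dict mapping a move index to the extra engine's
                     choice for that position
    Returns a new list of match flags; existing matches are never lost.
    """
    return [
        was_matched or extra_choices.get(i) == move
        for i, (move, was_matched) in enumerate(zip(played_moves, matched))
    ]


# Hypothetical game: ten counted moves, nine already matched.
played = ["m1", "m2", "m3", "m4", "m5", "m6", "m7", "m8", "m9", "m10"]
matched = [True, True, True, False, True, True, True, True, True, True]
print(correlation(matched))  # 90.0

# An engine is found that happens to pick the one unmatched move.
updated = apply_extra_engine(played, matched, {3: "m4"})
print(correlation(updated))  # 100.0
```

Because the extra engine is only ever asked about the misses, every engine you try can add matches but none can take one away, so repeated attempts ratchet the score upward.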

Kasparov vs I. Sokolov, (Bosnia 2000)

Originally, this game was only a 90% match, with no engine choosing Garry Kasparov's 16.cxd5 for example. After trying several, I found an engine that chose it, and entered it as another Let's Check choice. Now the tally reads:

So yes, the results can absolutely be manipulated by the unscrupulous. A telltale sign might be in the engines listed. If a new game shows Stockfish 14+, Komodo 12+ and so on, it should be fine, but if you see some very old engines or odd names for that same new game, be on your guard, as they may have been used only to get an extra match.

Regardless, here is the signature win by Kasparov with notes from Mega Database:

1.e4 e5 2.Nf3 Nf6 3.Nxe5 d6 4.Nf3 Nxe4 5.d4 d5 6.Bd3 Be7 7.0-0 Nc6 8.c4 Nb4 9.Be2 0-0 10.Nc3 Bf5 11.a3 Nxc3 12.bxc3 Nc6 13.Re1 Bf6 14.Bf4 Ne7?! (14...Na5 15.cxd5 Qxd5 16.Bf1 b6 17.Ne5 Rad8 18.g4 Be4 19.Qe2 Bxe5 20.Bxe5 Nb3 21.Ra2 Bf3 22.Qe3 Na5 23.Rc2 f6 24.Bxc7 Anand,V-Sokolov,I/Chess@iceland rapidplay Group B, Kopavo 2000/1-0 (56)) 15.Qb3 b6 (15...Rb8 16.Be5) 16.cxd5 Nxd5 17.Be5 Bg4?! (17...c6 18.c4 Nc7 19.Bd3) 18.Rad1 Be7?! 19.h3 Bh5 20.g4!± Bg6 21.Bg3 (21.Bb5!?) 21...Nf6 (21...a5 22.a4 Nf6 23.Ne5) 22.Ne5 Ne4? (22...Be4) 23.Bf3+- (23.Nxg6 hxg6 24.Bf3 Nxg3 25.Bxa8 Qxa8 26.fxg3 Qf3 27.Rxe7 Qxg3+ 28.Kf1 Qxh3+ 29.Ke2 Qxg4+ 30.Kd2 Qg5+ 31.Re3+-) 23...Nxg3 (23...Bd6 24.Bxe4 Bxe4 25.Rxe4+-) 24.Nc6! (24.fxg3 Qd6) 24...Qd6 25.Nxe7+ (25.Rxe7?! Qf4 26.Qd5) 25...Kh8 26.Bxa8 (26.Bxa8 Rxa8 27.Nxg6+ (27.fxg3?? Qxg3+ 28.Kf1 Qf3+ 29.Kg1 Qg3+ 30.Kf1 Qf3+ 31.Kg1 Qg3+=) 27...Qxg6 (27...fxg6 28.Qf7 h5 29.Re7 Rg8 30.Rd3+-; 27...hxg6 28.Qxf7 Rf8 (28...Qc6 29.fxg3 Qxc3 30.Kg2+-) 29.Re8 Ne2+ 30.Kf1+-) 28.fxg3+-) 1–0

White	EloW	Black	EloB	Res	Year	ECO	Event
Kasparov,G	2851	Sokolov,I	2637	1–0	2000	C42	Sarajevo Bosnia 30th

Conclusion

Does this in any way invalidate the use of a tool such as Let's Check? Of course not, but as with all such tools, it must be used with good sense and judgement. The fact that modern elite players can rattle off multiple games with such extraordinarily high engine matches is a testament to the increasing overall quality of the chess players, since the engines they are matching today are also hundreds of Elo points stronger than the engines of a decade ago. These players are also studying and learning from the engines, and that increase in pure ability is a consequence of it.
