From SF wiki:
--------------------------------------------------
Results of Stockfish vs StockfishSyzygy (10+0.1, 1t, 16MB, UHO_XXL_+0.90_+1.19.pgn):
Elo: -1.26 +/- 1.46, nElo: -2.64 +/- 3.05
LOS: 4.46 %, DrawRatio: 57.40 %, PairsRatio: 0.96
Games: 50000, Wins: 13671, Losses: 13853, Draws: 22476, Points: 24909.0 (49.82 %)
Ptnml(0-2): [130, 5293, 14349, 5085, 143], WL/DD Ratio: 1.37
--------------------------------------------------
Stockfish 17.1 with 6men TB on very fast M2 SSD. With longer time control probably less gain.
Brand-new Stockfish tablebase test
Moderator: Ras
-
Jouni
- Posts: 3693
- Joined: Wed Mar 08, 2006 8:15 pm
- Full name: Jouni Uski
-
Joerg Oster
- Posts: 983
- Joined: Fri Mar 10, 2006 4:29 pm
- Location: Germany
- Full name: Jörg Oster
Re: Brand-new Stockfish tablebase test
I wonder if the mostly already decided openings play a role here ...Jouni wrote: ↑Mon Oct 27, 2025 1:47 pm From SF wiki:
--------------------------------------------------
Results of Stockfish vs StockfishSyzygy (10+0.1, 1t, 16MB, UHO_XXL_+0.90_+1.19.pgn):
Elo: -1.26 +/- 1.46, nElo: -2.64 +/- 3.05
LOS: 4.46 %, DrawRatio: 57.40 %, PairsRatio: 0.96
Games: 50000, Wins: 13671, Losses: 13853, Draws: 22476, Points: 24909.0 (49.82 %)
Ptnml(0-2): [130, 5293, 14349, 5085, 143], WL/DD Ratio: 1.37
--------------------------------------------------
Stockfish 17.1 with 6men TB on very fast M2 SSD. With longer time control probably less gain.
Also note, this is foremost testing the WDL part of the bases. How much are the DTZ files?
Especially when it comes to those rather difficult rook or queen endgames?!
Edit and P.S.: Of course, modern engines simply became so strong that endgame bases make no measurable difference anymore.
However, it always has been difficult to measure endgame knowledge ...
Jörg Oster
-
Jouni
- Posts: 3693
- Joined: Wed Mar 08, 2006 8:15 pm
- Full name: Jouni Uski
Re: Brand-new Stockfish tablebase test
In my own test playing directly endings common thing is, that TBs have negative effect to play until may be 1000 games. Then they are stored to RAM to speed up access.
Jouni
-
syzygy
- Posts: 5781
- Joined: Tue Feb 28, 2012 11:56 pm
Re: Brand-new Stockfish tablebase test
I suspect the speed "penalty" has become very small now on modern computers with modern SSDs (and sufficient RAM), and with Stockfish now having a much heavier (NNUE) evaluation than in 2013.
-
Ajedrecista
- Posts: 2138
- Joined: Wed Jul 13, 2011 9:04 pm
- Location: Madrid, Spain.
Re: Brand-new Stockfish tablebase test.
Hello Jörg:
PairsRatio comes from pentanomial(0-2) again: (143 + 5085)/(5293 + 130) ~ 0.9640 ~ 0.96. In other words, the ratio of won pairs and lost pairs. Sign(PairsRatio - 1) = Sign(Elo) = Sign(normalized Elo) = Sign(LOS - 50%), of course.
WL/DD is the ratio of win-lose pairs and all-draw pairs in 1 at pentanomial(0-2). Once the number of draws in 1 at pentanomial(0-2) is calculated: 22476 - 5085 - 5293 = 12098, then there are 12098/2 = 6049 all-draw pairs and 14349 - 6049 = 8300 win-lose pairs, thus WL/DD Ratio = 8300/6049 ~ 1.3721 ~ 1.37. There were 6049 out of 25000 pairs (almost 24.2%) with two draws despite the almost one-sided openings!
Regards from Spain.
Ajedrecista.
I find amazing that 'mostly already decided openings' still led to circa 44.95% of draws. Please note that the value of DrawRatio of 57.40% refers to 1 at pentanomial(0-2): 14349/(130 + 5293 + 14349 + 5085 + 143) = 14349/25000 = 57.396% ~ 57.40%. I have just realized about it. Before that, I thought that DrawRatio in these stats was draws/games.Joerg Oster wrote: ↑Mon Oct 27, 2025 2:09 pmI wonder if the mostly already decided openings play a role here ...Jouni wrote: ↑Mon Oct 27, 2025 1:47 pm From SF wiki:
--------------------------------------------------
Results of Stockfish vs StockfishSyzygy (10+0.1, 1t, 16MB, UHO_XXL_+0.90_+1.19.pgn):
Elo: -1.26 +/- 1.46, nElo: -2.64 +/- 3.05
LOS: 4.46 %, DrawRatio: 57.40 %, PairsRatio: 0.96
Games: 50000, Wins: 13671, Losses: 13853, Draws: 22476, Points: 24909.0 (49.82 %)
Ptnml(0-2): [130, 5293, 14349, 5085, 143], WL/DD Ratio: 1.37
--------------------------------------------------
Stockfish 17.1 with 6men TB on very fast M2 SSD. With longer time control probably less gain.![]()
[...]
PairsRatio comes from pentanomial(0-2) again: (143 + 5085)/(5293 + 130) ~ 0.9640 ~ 0.96. In other words, the ratio of won pairs and lost pairs. Sign(PairsRatio - 1) = Sign(Elo) = Sign(normalized Elo) = Sign(LOS - 50%), of course.
WL/DD is the ratio of win-lose pairs and all-draw pairs in 1 at pentanomial(0-2). Once the number of draws in 1 at pentanomial(0-2) is calculated: 22476 - 5085 - 5293 = 12098, then there are 12098/2 = 6049 all-draw pairs and 14349 - 6049 = 8300 win-lose pairs, thus WL/DD Ratio = 8300/6049 ~ 1.3721 ~ 1.37. There were 6049 out of 25000 pairs (almost 24.2%) with two draws despite the almost one-sided openings!
Regards from Spain.
Ajedrecista.
-
jp
- Posts: 1485
- Joined: Mon Apr 23, 2018 7:54 am
Re: Brand-new Stockfish tablebase test.
Um, how do you see that (from the quoted results)?Ajedrecista wrote: ↑Mon Oct 27, 2025 7:49 pm...
I find amazing that 'mostly already decided openings' still led to circa 44.95% of draws. ...
-
Ajedrecista
- Posts: 2138
- Joined: Wed Jul 13, 2011 9:04 pm
- Location: Madrid, Spain.
Re: Brand-new Stockfish tablebase test.
Hello:
'Mostly already decided openings' was a quote by Jörg. I guess that the name of the opening book tells that the exit positions of the book are evaluated between +0.90 and +1.19 by SF at a certain depth. I remember that a eval of +1 at move 32 (much later than the exit positions of the book) means 50% of chances of a win, unless recent changes that I am not aware.
Regards from Spain.
Ajedrecista.
Draws/Games = 22476/50000 ~ 44.95%.jp wrote: ↑Tue Oct 28, 2025 4:25 amUm, how do you see that (from the quoted results)?Ajedrecista wrote: ↑Mon Oct 27, 2025 7:49 pm...
I find amazing that 'mostly already decided openings' still led to circa 44.95% of draws. ...
'Mostly already decided openings' was a quote by Jörg. I guess that the name of the opening book tells that the exit positions of the book are evaluated between +0.90 and +1.19 by SF at a certain depth. I remember that a eval of +1 at move 32 (much later than the exit positions of the book) means 50% of chances of a win, unless recent changes that I am not aware.
Regards from Spain.
Ajedrecista.