
SPCC: Testrun of Stockfish 210117 finished

Posted: Wed Jan 20, 2021 11:37 am
by pohl4711
AB-testrun of Stockfish 210117 finished - a bad regression...

https://www.sp-cc.de

(You may have to clear your browser cache or reload the website.)

Re: SPCC: Testrun of Stockfish 210117 finished

Posted: Wed Jan 20, 2021 11:42 am
by Ozymandias
It falls inside the margin of error, but ncm testing also indicates that something went wrong on the 11th. Not the worst we've seen lately, though.

Re: SPCC: Testrun of Stockfish 210117 finished

Posted: Wed Jan 20, 2021 11:49 am
by pohl4711
Ozymandias wrote: Wed Jan 20, 2021 11:42 am It falls inside the margin of error, but ncm testing also indicates that something went wrong on the 11th. Not the worst we've seen lately, though.
Stockfish 210111 is strong in my testing. The regression came later. Only 2 patches followed, one of them a "non-functional" patch. So the regression must come from the latest patch, from 210117 ("Add penalty for doubled pawns in agile structure").

Re: SPCC: Testrun of Stockfish 210117 finished

Posted: Wed Jan 20, 2021 2:49 pm
by Jouni
NCM has its peak rating on 8.1. with "Update copyright years". Sadly, that is not possible again before 2022 :)

Re: SPCC: Testrun of Stockfish 210117 finished

Posted: Wed Jan 20, 2021 5:08 pm
by RubiChess
pohl4711 wrote: Wed Jan 20, 2021 11:49 am
Ozymandias wrote: Wed Jan 20, 2021 11:42 am It falls inside the margin of error, but ncm testing also indicates that something went wrong on the 11th. Not the worst we've seen lately, though.
Stockfish 210111 is strong in my testing. The regression came later. Only 2 patches followed, one of them a "non-functional" patch. So the regression must come from the latest patch, from 210117 ("Add penalty for doubled pawns in agile structure").
"Add penalty for doubled pawns in agile structure" is a patch to the handcrafted evaluation and should have almost no effect when NNUE is used.
I guess it is just some bad luck combined with the error bars.

Andreas
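
To illustrate the point about error bars, here is a minimal sketch of how an Elo difference and its approximate 95% interval can be estimated from a match result. The win/draw/loss counts are hypothetical and are not taken from the SPCC or NCM results. The function names, the logistic Elo model and the normal approximation are assumptions made for this example, not the method either test site actually uses.

```python
import math

def elo_from_score(score):
    """Convert a mean score in (0, 1) to an Elo difference (logistic model)."""
    return -400.0 * math.log10(1.0 / score - 1.0)

def elo_with_error(wins, draws, losses, z=1.96):
    """Return (elo, lower, upper) using a normal approximation of the mean score."""
    n = wins + draws + losses
    score = (wins + 0.5 * draws) / n
    # Per-game variance of the result (1, 0.5 or 0) around the mean score.
    var = (
        wins * (1.0 - score) ** 2
        + draws * (0.5 - score) ** 2
        + losses * (0.0 - score) ** 2
    ) / n
    stderr = math.sqrt(var / n)
    lower = elo_from_score(score - z * stderr)
    upper = elo_from_score(score + z * stderr)
    return elo_from_score(score), lower, upper

# Hypothetical 5000-game match: a small plus score for the newer version.
elo, lo, hi = elo_with_error(wins=1020, draws=3000, losses=980)
print(f"Elo difference: {elo:+.1f} (95% interval {lo:+.1f} to {hi:+.1f})")
```

With these made-up counts the interval spans zero, so a difference of a few Elo between two development versions cannot be told apart from noise at this sample size.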