Re: Playing the endgame like a boss !!
Posted: Sun Mar 17, 2019 9:27 am
I do not know if NN are going to dominate but it is clear that stockfish goes in the wrong way and it is going to lose the first place.
I believe that the way to test only by many games is not the correct way to continue to get better.
I think that first step if you have an engine should be to a build a test suite from games of the engine when the engine does not find the right move.
Testing a new patch should be done first in 1000 positions that the engine failed to find the right move.
If there is no improvement then it is a waste of resources to test at short time control or long time control because improvement in elo means also improvement in the move choice of the engine in part of the cases.
There should be for every patch that pass a list of positions when the patch improve the move choice of the engine in order to help other developers.
It does not happen in the stockfish framework and people who look at the results of the tests see only that the version after the new patch passed SPRT test.
I believe that the way to test only by many games is not the correct way to continue to get better.
I think that first step if you have an engine should be to a build a test suite from games of the engine when the engine does not find the right move.
Testing a new patch should be done first in 1000 positions that the engine failed to find the right move.
If there is no improvement then it is a waste of resources to test at short time control or long time control because improvement in elo means also improvement in the move choice of the engine in part of the cases.
There should be for every patch that pass a list of positions when the patch improve the move choice of the engine in order to help other developers.
It does not happen in the stockfish framework and people who look at the results of the tests see only that the version after the new patch passed SPRT test.