Is there something like this out there? It could be made by looking at positions where both SF and LC0 with 7-piece tablebases take more than 30 minutes on high-end hardware to switch their best move to a single different move and then also evaluate other all other moves as being at least 2.0 worse. Or alternatively, only one of them switches but then "proves" it by playing against the other engine's defense. Of course, the problem with making one is hardware resources.
Tests set are not great to evaluate strengths but it then would be interesting to see how new versions and new programs do on it.
Automatically created test set of very difficult moves
Moderators: hgm, Rebel, chrisw
-
- Posts: 343
- Joined: Sun Aug 25, 2019 8:33 am
- Full name: .
-
- Posts: 12542
- Joined: Wed Mar 08, 2006 8:57 pm
- Location: Redmond, WA USA
Re: Automatically created test set of very difficult moves
A simple way to do it is to load all the TCEC games into EPD records and then scan for a differenece of opinion as to the eval. When there is a sudden rise in eval and the other engine does not see that for a while, that is probably a good test position. An additional check would be to see that the engine that saw the rise in eval went on to win. If both of those conditions are true, emit the position and the key move.
Since TCEC is on such high-end hardware, it is probably a very good move. And there are no terrible programs in the tournament
Since TCEC is on such high-end hardware, it is probably a very good move. And there are no terrible programs in the tournament
Taking ideas is not a vice, it is a virtue. We have another word for this. It is called learning.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.
-
- Posts: 343
- Joined: Sun Aug 25, 2019 8:33 am
- Full name: .
Re: Automatically created test set of very difficult moves
That does sound like a good way to do it.