### Re: replace the evaluation by playing against yourself

Posted: Tue Jan 30, 2018 11:12 am
Dann Corbit wrote:
Uri Blass wrote:
zullil wrote:
Uri Blass wrote:I wonder if anybody has tried this.

You start with chess engine A and do what you like with the search algorithm.

When you reach the evaluation step, you simply have Stockfish play against itself at a small depth, such as depth 6, and return a number based on the result of the game (0.00 if the game is drawn, otherwise mate in X for the side that won).
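
A minimal sketch of that leaf evaluation, assuming a hypothetical `play_game(position, depth)` callback that plays the engine against itself and returns the result from White's point of view together with the number of plies played. Neither the callback nor the `MATE` constant comes from a real engine API; both are placeholders for illustration.

```python
from typing import Callable, Tuple

MATE = 32000  # assumed base value for mate scores (hypothetical)

# Hypothetical callback: (position, depth) -> (result, plies)
# result is +1 if White won, 0 for a draw, -1 if Black won.
Playout = Callable[[str, int], Tuple[int, int]]


def playout_eval(position: str, play_game: Playout, depth: int = 6) -> int:
    """Score a leaf by the outcome of one fixed-depth self-play game."""
    result, plies = play_game(position, depth)
    if result == 0:
        return 0  # drawn game -> 0.00
    # A quicker win scores higher, in the spirit of "mate in X".
    return result * (MATE - plies)
```

The callback is injected so that the sketch stays independent of how the self-play game is actually run.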
So every leaf is evaluated by a SF vs. SF game? Seems rather impractical, to say the least.
I do not see the problem.
The program may be weaker and may reach smaller depths, but it may still see some ideas that normal Stockfish does not see.
I think if searching for a draw, a simpler approach is to simply recognize failure to make progress.

E.g. a score of 1000 seems great, but if you get it ten plies in a row, it is almost certainly a draw (or the current plan is a draw at best).
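
A rough sketch of such a check, assuming the search keeps a list of the scores seen on consecutive plies. The function name, the window of ten and the tolerance parameter are illustrative choices, and the result is only a hint that the plan may be stuck, not proof of a draw.

```python
from typing import Sequence


def looks_stalled(scores: Sequence[int], window: int = 10, tolerance: int = 0) -> bool:
    """Return True if the last `window` scores stay within `tolerance` of each other."""
    if len(scores) < window:
        return False  # not enough history to judge
    recent = scores[-window:]
    return max(recent) - min(recent) <= tolerance
```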

I think there are many cases where a big score that repeats more than 10 times is not a draw.

Here is an example

The score of 3.66 repeats from depth 62 to depth 80, and it is not a draw.

After 40.fxg5 it seems that hxg5 is forced.

### Re: replace the evaluation by playing against yourself

Posted: Tue Jan 30, 2018 11:14 am
Will another method of detection be more efficient?
I think a case like that will be very hard to resolve by any means.

### Re: replace the evaluation by playing against yourself

Posted: Wed Jan 31, 2018 9:52 am
Isn't this a kind of Monte Carlo playout?
Since AlphaZero's success I've been playing around with the idea of using playouts to evaluate the board.

What I found is that in the random games you can't select the moves completely at random. You have to guide the moves with some evaluation (I'm using quiescence now) and then select one of the first N moves, using a kind of probability distribution that favors the moves closest to the best one in evaluation.
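
One way to sketch that selection step, assuming each move already has a score (for example from a quiescence search). The softmax-style weighting, the `temperature` parameter and the function name are my assumptions, not the scheme actually used in the experiment.

```python
import math
import random
from typing import Optional, Sequence, Tuple


def pick_playout_move(
    scored_moves: Sequence[Tuple[str, float]],
    top_n: int = 3,
    temperature: float = 50.0,
    rng: Optional[random.Random] = None,
) -> str:
    """Pick one of the best `top_n` moves, favoring scores close to the best."""
    if not scored_moves:
        raise ValueError("no moves to choose from")
    rng = rng or random.Random()
    # Keep only the N highest-scoring moves.
    best = sorted(scored_moves, key=lambda ms: ms[1], reverse=True)[:top_n]
    top_score = best[0][1]
    # The weight decays exponentially with the gap to the best score.
    weights = [math.exp((score - top_score) / temperature) for _, score in best]
    return rng.choices([move for move, _ in best], weights=weights, k=1)[0]
```

A small `temperature` makes the playout almost always follow the best-scored move, while a large one spreads the choice more evenly over the top N.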

Then, after a few thousand games, the statistics for the position become clearer. I found that I could easily detect the correct moves in zugzwang situations, and I suppose that fortresses are handled well too. I haven't tried that yet.

Also, in a 1-ply search, I'm selecting the move with the best LOS (likelihood of superiority), not the move with the best percentage.

Of course, this is still extremely inefficient and tactically blind, and I'm still trying to build a 2-ply search, but it is a nice experiment.

### Re: replace the evaluation by playing against yourself

Posted: Wed Jan 31, 2018 2:24 pm
asanjuan wrote:Also, in a 1 ply search, I'm selecting the move with the best LOS, not the move with the best percentage.
Have you tried selecting the move with the lowest loss percentage? AlphaZero gave the impression of going, first and foremost, for positions in which it could see no way of losing, which may explain why it lost no game in the 100-game match.

### Re: replace the evaluation by playing against yourself

Posted: Thu Feb 01, 2018 3:36 pm
Ovyron wrote:
asanjuan wrote:Also, in a 1 ply search, I'm selecting the move with the best LOS, not the move with the best percentage.
Have you tried selecting the move with the lowest loss percentage? AlphaZero gave the impression of going, first and foremost, for positions in which it could see no way of losing, which may explain why it lost no game in the 100-game match.
Not yet. But I would say that LOS is more precise. For example, given practically the same loss percentage, LOS easily picks the winner:

a2a3 W=38 D=26 L=37 eval=0,504950 los=0,545812
e2e3 W=40 D=23 L=37 eval=0,515000 los=0,633364
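
For reference, a small sketch of how such numbers can be computed from win/draw/loss counts. The `eval` column matches (W + D/2) / N. For LOS I assume the common normal-approximation formula, 0.5 * (1 + erf((W - L) / sqrt(2 * (W + L)))); the program that produced the lines above may compute it slightly differently, and the function names are mine.

```python
import math


def score_fraction(wins: int, draws: int, losses: int) -> float:
    """Expected score: a win counts 1, a draw 0.5, a loss 0."""
    games = wins + draws + losses
    return (wins + 0.5 * draws) / games


def los(wins: int, losses: int) -> float:
    """Likelihood of superiority (normal approximation; draws are ignored)."""
    if wins + losses == 0:
        return 0.5  # no decisive games, so no evidence either way
    return 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * (wins + losses))))


for move, w, d, l in [("a2a3", 38, 26, 37), ("e2e3", 40, 23, 37)]:
    print(f"{move} eval={score_fraction(w, d, l):.6f} los={los(w, l):.6f}")
```

With these counts the formula gives a LOS of about 0.546 for a2a3 and about 0.634 for e2e3, close to the posted values but not identical to them, and it ranks e2e3 first in the same way.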