Experimental Test: Leela Chess Zero v0.21.1 41715 vs Stockfish 240319 10m/40

jp · Post by jp » Wed Mar 27, 2019 10:55 pm

mwyoung wrote: ↑Wed Mar 27, 2019 10:05 pm [White "Lc0 v0.21.1"]
[Black "Stockfish 240319 64 POPCNT"]
[Result "1-0"]

How are you adjudicating the games? Both sides showing a huge eval. for some number moves?
I guess that's it.

mwyoung · Post by **mwyoung** » Wed Mar 27, 2019 11:39 pm

jp wrote: ↑Wed Mar 27, 2019 10:55 pm
mwyoung wrote: ↑Wed Mar 27, 2019 10:05 pm [White "Lc0 v0.21.1"]
[Black "Stockfish 240319 64 POPCNT"]
[Result "1-0"]
How are you adjudicating the games? Both sides showing a huge eval. for some number moves?
I guess that's it.

It is set to auto in the GUI. Setting is Resign Late....

jp · Post by jp » Thu Mar 28, 2019 1:09 am

That's a boost for Lc, because it often doesn't convert winning endgames.
But playing to mate or a 6-man TB win would take much more time, and maybe that's not what you want to test anyway. (We already know Lc is bad at endgames, and maybe these tests are more about opening advantages, etc.)

mwyoung · Post by **mwyoung** » Thu Mar 28, 2019 1:12 am

jp wrote: ↑Thu Mar 28, 2019 1:09 am That's a boost for Lc, because it often doesn't convert winning endgames.
But playing to mate or a 6-man TB win would take much more time, and maybe that's not what you want to test anyway. (We already know Lc is bad at endgames, and maybe these tests are more about opening advantages, etc.)

You are incorrect. I only set this setting after I confirmed it did not affect the results. Check the tapes.....

jp · Post by jp » Thu Mar 28, 2019 1:14 am

I should say that's a potential boost. Whether it affects results depends whether it gets those winning endgames it cannot convert or drawing endgames it cannot hold. e.g. It would have affected the TCEC results. The games are shown on the Lc blog. It failed to win a 7-man endgame where SF's eval was +150 or something. (But the time cost here may not be worth it anyway. As long as we know what the settings are for each test, it's fine.)

Experimental Test: Leela Chess Zero v0.21.1 41715 vs Stockfish 240319 10m/40

Re: Experimental Test: Leela Chess Zero v0.21.1 41715 vs Stockfish 240319 10m/40

Re: Experimental Test: Leela Chess Zero v0.21.1 41715 vs Stockfish 240319 10m/40

Re: Experimental Test: Leela Chess Zero v0.21.1 41715 vs Stockfish 240319 10m/40

Re: Experimental Test: Leela Chess Zero v0.21.1 41715 vs Stockfish 240319 10m/40

Re: Experimental Test: Leela Chess Zero v0.21.1 41715 vs Stockfish 240319 10m/40