Chess reinforcement learning by AlphaGo Zero methods

Post by **brianr** » Fri Mar 09, 2018 1:16 pm

Depends on your point of view. I've been doing computer chess since 1971, but not very well. Tinker is consistently in the top of the bottom third of engines. Hah. For me, it is about the journey.

Trying to understand the AZ approach and watching my GPU chew on the NN until things train a bit more is satisfying enough. The Leela Chess team is far ahead, much like Fishtest with Stockfish, and it holds real promise with a more crafted NN and a tuning approach fueled by crowd-sourced horsepower.

Post by **Henk** » Fri Mar 09, 2018 1:31 pm

4673 output nodes and (8? *) 19 * 64 input nodes make each network slow.
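To put rough numbers on that, here is a minimal Python sketch of the arithmetic, using only the figures quoted above. The factor of 8 is the post's own "(8? *)" guess, so both readings are computed. The single dense layer is purely an assumption for illustration (an AZ-style network is convolutional, so its real parameter count is different), and the names `input_nodes` and `dense_weights` are made up for this example.

```python
# Figures quoted in the post; HISTORY is the uncertain "(8? *)" factor.
BOARD_SQUARES = 64
PLANES = 19
HISTORY = 8
OUTPUT_NODES = 4673


def input_nodes(planes, squares=BOARD_SQUARES, history=1):
    """Number of input nodes for `history` stacked positions."""
    return history * planes * squares


def dense_weights(n_in, n_out):
    """Weights in one hypothetical fully connected layer (no biases)."""
    return n_in * n_out


single = input_nodes(PLANES)                     # 19 * 64
stacked = input_nodes(PLANES, history=HISTORY)   # 8 * 19 * 64

print("input nodes, one position:", single)
print("input nodes, 8 positions: ", stacked)
print("weights, one position -> policy:", dense_weights(single, OUTPUT_NODES))
print("weights, 8 positions -> policy: ", dense_weights(stacked, OUTPUT_NODES))
```

Even under the smaller reading, wiring every input straight to every output would already take several million weights, which gives a feel for why each evaluation is costly.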

Post by **brianr** » Fri Mar 09, 2018 3:11 pm

Yup, which is why I started looking at tic-tac-toe.
That NN is very fast with the AZ approach.
Then I looked at Othello, thanks to https://github.com/suragnair/alpha-zero-general

Its NN is considerably slower, but the game is far more complex.
Of course, it is another major complexity jump to chess.

Mangling a saying: don't bring a knife NN brain to a gunfight (chess or Go).
