Don't you already have Zeta running most of the engine in the gpu?
But, It has been estimated that AGZ would take about 1,700 years of "typical" cpu/gpu time. Google burned about $25M.
There is a distributed effort for Go making quite good progress
https://github.com/gcp/leela-zero
GCP's prior work on "plain" Leela both in terms of playing and portability is most impressive.
Neural networks for chess position evaluation- request
Moderators: hgm, Rebel, chrisw
-
- Posts: 536
- Joined: Thu Mar 09, 2006 3:01 pm
-
- Posts: 2662
- Joined: Wed Mar 10, 2010 10:18 pm
- Location: Hamburg, Germany
- Full name: Srdja Matovic
Re: Neural networks for chess position evaluation- request
Yes, but in this design it was not intended to query a NN during search....Don't you already have Zeta running most of the engine in the gpu?
maybe i have to ponder on this a bit.
UpsBut, It has been estimated that AGZ would take about 1,700 years of "typical" cpu/gpu time. Google burned about $25M.
I just had the numbers of Giraffe in my mind, 3 days to one week for training on 12 core resp. 4 core machine.
Hmm,There is a distributed effort for Go making quite good progress
maybe the chess community needs an commercial sponsor for the compute cycles for such an NN project.
--
Srdja
-
- Posts: 2662
- Joined: Wed Mar 10, 2010 10:18 pm
- Location: Hamburg, Germany
- Full name: Srdja Matovic
Re: Neural networks for chess position evaluation- request
So far i have implemented parallel search and a dummy NN in Zeta,Yes, but in this design it was not intended to query a NN during search....Quote:
Don't you already have Zeta running most of the engine in the gpu?
maybe i have to ponder on this a bit.
but it turns out that plain GPU assist approach like LC0,
seems the way to go.
My parallel search scales bad above 32 workers,
so it would be more efficient to do the search on CPU,
and use the GPU for neural network evaluation only,
that would utilize both of them better.
--
Srdja
-
- Posts: 72
- Joined: Mon Mar 07, 2016 4:41 pm
- Location: Zürich, Switzerland
Re: Neural networks for chess position evaluation- request
How do you arrive at this conclusion? Is there any rule of thumb how to design the NN architecture? What would you do in 8x8 american checkers?Rémi Coulom wrote:I took a longer look at what you did. Max pooling is used for image recognition, but makes little sense for chess. You'll need several layers of convolutions before reducing the resolution. I would simply stack a dozen 3x3 convolutions, and then directly go to a couple scalar layers, without any max pooling.