Announcing lczero


Guenther
Posts: 4718
Joined: Wed Oct 01, 2008 6:33 am
Location: Regensburg, Germany
Full name: Guenther Simon

Re: Announcing lczero

Post by Guenther »

Jhoravi wrote:
gladius wrote:Training from SF self-play games seems to be working well. Here is a self-play game on the latest network:


It is starting to understand chess :). Still, a long, long ways to go of course.

The weights are available for download from https://github.com/glinscott/lczero-wei ... _64.txt.gz if you want to try at home. It has working UCI support, so it could even play against other engines now!
It's working!! The first dozen moves are not random anymore! The proof is that White's dark-squared bishop retreated twice when threatened by Black's h6 and g5 pawn moves, and likewise the knight on f3 moved to e5 when threatened by g4.

Interestingly, the remaining moves go back to random, which suggests that learning concentrates on the opening phase first and then moves forward, meaning it may master the endgame last.
Are you sure? When I saw this game it looked like the opening was fed,
and as soon as the given opening was over the blunderfest started...
https://rwbc-chess.de

[Trolls n'existent pas...]
gladius
Posts: 568
Joined: Tue Dec 12, 2006 10:10 am
Full name: Gary Linscott

Re: Announcing lczero

Post by gladius »

Guenther wrote:
Jhoravi wrote:
gladius wrote:Training from SF self-play games seems to be working well. Here is a self-play game on the latest network:


It is starting to understand chess :). Still, a long, long ways to go of course.

The weights are available for download from https://github.com/glinscott/lczero-wei ... _64.txt.gz if you want to try at home. It has working UCI support, so it could even play against other engines now!
It's working!! The first dozen moves are not random anymore! The proof is that White's dark-squared bishop retreated twice when threatened by Black's h6 and g5 pawn moves, and likewise the knight on f3 moved to e5 when threatened by g4.

Interestingly, the remaining moves go back to random, which suggests that learning concentrates on the opening phase first and then moves forward, meaning it may master the endgame last.
Are you sure? When I saw this game it looked like the opening was fed,
and as soon as the given opening was over the blunderfest started...
The opening wasn’t fed, the network had learned it. That’s what I mean when I say it’s starting to understand chess. No opening books used :)
Guenther
Posts: 4718
Joined: Wed Oct 01, 2008 6:33 am
Location: Regensburg, Germany
Full name: Guenther Simon

Re: Announcing lczero

Post by Guenther »

gladius wrote:
Guenther wrote: Are you sure? When I saw this game it looked like the opening was fed,
and as soon as the given opening was over the blunderfest started...
The opening wasn’t fed, the network had learned it. That’s what I mean when I say it’s starting to understand chess. No opening books used :)
Ok, thanks for the confirmation.
gladius
Posts: 568
Joined: Tue Dec 12, 2006 10:10 am
Full name: Gary Linscott

Re: Announcing lczero

Post by gladius »

And to show that the training process is actually increasing strength, the version that trained overnight is now beating the previous best 49-13-38, or ~131 Elo.

Here is an example game, with the new network as black.
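
For readers who want to check the ~131 figure quoted above, here is a minimal sketch of the usual logistic Elo conversion from a match score. It assumes the result 49-13-38 is read as wins-losses-draws (that ordering reproduces the quoted number); the function name is only illustrative and is not part of lczero.

```python
import math


def elo_difference(wins: int, losses: int, draws: int) -> float:
    """Estimate the Elo gap implied by a match result (logistic model)."""
    games = wins + losses + draws
    if games == 0:
        raise ValueError("no games played")
    # Each draw counts as half a point.
    score = (wins + 0.5 * draws) / games
    if score <= 0.0 or score >= 1.0:
        raise ValueError("Elo difference is unbounded for a 0% or 100% score")
    return -400.0 * math.log10(1.0 / score - 1.0)


if __name__ == "__main__":
    # Assumed reading of the post: 49 wins, 13 losses, 38 draws.
    print(round(elo_difference(49, 13, 38)))
```

With 100 games this is only a rough estimate, since the error bars on such a short match are wide.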
Rebel
Posts: 7313
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: Announcing lczero

Post by Rebel »

I am waiting for the Steinitz gambit to pop up as best for white :wink:
Gian-Carlo Pascutto
Posts: 1260
Joined: Sat Dec 13, 2008 7:00 pm

Re: Announcing lczero

Post by Gian-Carlo Pascutto »

Guenther wrote: Ok, thanks for the confirmation.
To clarify more: the current training data is a big set of SF-SF games where a book was used, in order to be able to debug the training and the search.

It seems the network can already "remember" many book lines from its training. It did not invent them from self-play. But it does not have a book itself either.
pferd
Posts: 134
Joined: Thu Jul 24, 2014 2:49 pm

Re: Announcing lczero

Post by pferd »

This seems like a very interesting project.

I am playing some 5 minute games against it right now and it moves instantly every single time. Is this the expected behaviour?
Uri Blass
Posts: 10805
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: Announcing lczero

Post by Uri Blass »

pferd wrote:This seems like a very interesting project.

I am playing some 5 minute games against it right now and it moves instantly every single time. Is this the expected behaviour?
Looking at the games, it seems that something is wrong.

Google reached a level above GM strength.
I understand that they had a hardware advantage, but still
I think it should be possible to get to at least a 2000 level by similar methods, and the level I see here, with simple blunders that lose pieces, is not even close.

Beating the old version by a result that is not close to 100-0 is too slow progress.

Google got something that gets better if you give it more time.
If I understand correctly, lczero does not get better with more time, because it plays immediately, so the time control is not relevant.
Eelco de Groot
Posts: 4661
Joined: Sun Mar 12, 2006 2:40 am
Full name: Eelco de Groot

Re: Announcing lczero

Post by Eelco de Groot »

Uri Blass wrote:
pferd wrote:This seems like a very interesting project.

I am playing some 5 minute games against it right now and it moves instantly every single time. Is this the expected behaviour?
Looking at the games, it seems that something is wrong.

Google reached a level above GM strength.
I understand that they had a hardware advantage, but still
I think it should be possible to get to at least a 2000 level by similar methods, and the level I see here, with simple blunders that lose pieces, is not even close.

Beating the old version by a result that is not close to 100-0 is too slow progress.

Google got something that gets better if you give it more time.
If I understand correctly, lczero does not get better with more time, because it plays immediately, so the time control is not relevant.


I think there is no search yet. That would explain why the program moves instantly. The eval is in theory as strong as a quiescence search (just my guess), but even for Alpha Zero that is not yet GM level, I suppose. And the neural network is much, much smaller, because the community does not have the massive resources, especially the floating-point hardware, that Google threw at this. It simply is not available yet for us simple amateurs.

If lczero had learned to play 2. c4 in Indian openings or other d4 openings, totally from scratch without even knowing some piece values, that would have been a miracle, nothing short of that. I almost believed it, though :) But what Gian-Carlo is saying explains it better and is much, much more realistic.

lczero is obviously learning.

You cannot expect it to duplicate Alpha Zero. Somebody said it would take 1700 years on his slow laptop to duplicate Alpha Zero (if I remember that right; I should look it up). And there aren't even learning clients yet, at least that is what Gary said a few days ago.
Debugging is twice as hard as writing the code in the first
place. Therefore, if you write the code as cleverly as possible, you
are, by definition, not smart enough to debug it.
-- Brian W. Kernighan
gladius
Posts: 568
Joined: Tue Dec 12, 2006 10:10 am
Full name: Gary Linscott

Re: Announcing lczero

Post by gladius »

pferd wrote:This seems like a very interesting project.

I am playing some 5 minute games against it right now and it moves instantly every single time. Is this the expected behaviour?
This is because the default number of playouts is set to 800, and there is no time management.

You can increase the number of playouts by passing the -p argument on the command line, like -p20000, and the engine will think for longer, and play stronger.
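
As a rough illustration of the -p option described above, here is a small Python helper that builds the engine command line. The binary path ./lczero and the helper name are assumptions made for this sketch; only the -p<N> form and the default of 800 come from the post, and a real invocation may need further arguments that are not shown in this thread.

```python
from typing import List

DEFAULT_PLAYOUTS = 800  # default number of playouts stated in the post


def build_engine_command(binary: str, playouts: int = DEFAULT_PLAYOUTS) -> List[str]:
    """Return an argv list that asks the engine for a fixed playout count."""
    if playouts <= 0:
        raise ValueError("playouts must be positive")
    # The post writes the flag and its value together, e.g. -p20000.
    return [binary, f"-p{playouts}"]


if __name__ == "__main__":
    # "./lczero" is a hypothetical path to the engine binary.
    print(" ".join(build_engine_command("./lczero", 20000)))
```

Because there is no time management, the playout count is the only knob here: a larger value means longer thinking per move regardless of the clock.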