Interestingly, the new AlphaGo uses residual neural networks. These supposedly do not suffer from layer saturation (the ResNet paper linked below calls it the degradation problem), which is the phenomenon that performance stops improving, or even degrades, as more layers are added to the network.
https://arxiv.org/pdf/1512.03385.pdf
It might be interesting to try this in Giraffe. I think the author reported that adding more layers did not improve Giraffe. Perhaps converting to a residual neural network can fix this.
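To make the residual idea concrete, here is a minimal sketch in plain Python. It is not Giraffe's or AlphaGo's actual architecture: the function names are made up for illustration, the layers are tiny fully connected ones, and the block is simplified (the ResNet paper uses convolutions, batch normalization, and applies the final ReLU after the addition). It only shows why extra residual blocks are "safe": if a block learns nothing (all weights zero), a plain block destroys the signal, while a residual block falls back to the identity.

```python
def dense_relu(x, weights, bias):
    """One fully connected layer followed by ReLU."""
    return [
        max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weights, bias)
    ]


def plain_block(x, params):
    """Two stacked layers: y = F(x)."""
    (w1, b1), (w2, b2) = params
    return dense_relu(dense_relu(x, w1, b1), w2, b2)


def residual_block(x, params):
    """The same two layers plus a skip connection: y = x + F(x)."""
    fx = plain_block(x, params)
    return [xi + fi for xi, fi in zip(x, fx)]


def zero_params(n):
    """Hypothetical 'untrained' weights for which F(x) = 0."""
    zeros = [[0.0] * n for _ in range(n)]
    return (zeros, [0.0] * n), (zeros, [0.0] * n)


x = [0.5, -1.0, 2.0]
deep_plain, deep_residual = x, x
for _ in range(10):  # stack ten blocks that have learned nothing
    deep_plain = plain_block(deep_plain, zero_params(3))
    deep_residual = residual_block(deep_residual, zero_params(3))

print(deep_plain)     # the input signal is wiped out
print(deep_residual)  # the input signal passes through unchanged
```

So a deeper residual net can, at worst, behave like the shallower one, which is the argument for why stacking more layers should not hurt.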
Note that the aim should be to have an eval which is _much_ better than Stockfish's, to offset the fact that NNs are inherently slower to evaluate.
We are doomed - AlphaGo Zero, learning only from basic rules
Moderators: hgm, Rebel, chrisw
-
- Posts: 2272
- Joined: Mon Sep 29, 2008 1:50 am
Re: We are doomed - AlphaGo Zero, learning only from basic r
Ideas=science. Simplification=engineering.
Without ideas there is nothing to simplify.
-
- Posts: 5228
- Joined: Thu Mar 09, 2006 9:40 am
- Full name: Vincent Lejeune
Re: We are doomed - AlphaGo Zero, learning only from basic r
24-minute video from DeepMind: https://www.youtube.com/watch?v=WXHFqTvfFSw
-
- Posts: 1610
- Joined: Fri Mar 01, 2013 5:28 pm
- Location: USA
Re: We are doomed - AlphaGo Zero, learning only from basic r
Nice heads-up, good video.
Vinvin wrote: 24-minute video from DeepMind: https://www.youtube.com/watch?v=WXHFqTvfFSw
"Without change, something sleeps inside us, and seldom awakens. The sleeper must awaken." (Dune - 1984)
Lonnie
-
- Posts: 10300
- Joined: Thu Mar 09, 2006 12:37 am
- Location: Tel-Aviv Israel
Re: We are doomed - AlphaGo Zero, learning only from basic r
I do not think that there is an engine that learned all the human knowledge of the past century.
Daniel Shawul wrote: I wasn't actually aware of that test, only of the one given in Giraffe's paper on page 25: https://arxiv.org/pdf/1509.01549v1.pdf
PK wrote: The only direct comparison of Giraffe eval with a decently strong engine that I am aware of:
http://talkchess.com/forum/viewtopic.ph ... 39&t=64096
Peter's test also confirms that Giraffe's eval is close to Stockfish's, but it is not equally efficient due to the 10x slowdown incurred by the NN evaluation. So it seems Giraffe has already learned (probably not tabula rasa?) all the human chess knowledge of the past century...
Code:
Giraffe (1s)               2400  258570   9641
Giraffe (0.5s)             2400  119843   9211
Giraffe (0.1s)             2400   24134   8526
Stockfish 5                3387  108540  10505
Senpai 1.0                 3096   86711   9414
Texel 1.04                 2995  119455   8494
Arasan 17.5                2847   79442   7961
Scorpio 2.7.6              2821  139143   8795
Crafty 24.0                2801  296918   8541
GNU Chess 6 / Fruit 2.1    2685   58552   8307
Sungorus 1.4               2309  145069   7729
Knowledge also includes knowledge of tactical patterns, where you do not need to calculate all the opponent's legal moves like a stupid computer.
For example:
[D]8/2q5/8/4k3/8/8/7Q/4K3 b - - 0 1
Humans know that Black loses the queen without calculating all the legal moves of the black king, because they know the pattern.
I do not know if Giraffe or Stockfish knows it from the evaluation alone.
Of course the situation is different in the following position, where you know that Black saves the queen without calculating Black's possible legal moves.
[D]8/4q3/8/4k3/8/8/4Q3/4K3 b - - 0 1
Another example is the following:
[D]4k3/4Q3/8/6B1/8/8/8/4K3 b - - 0 1
You memorize the pattern: a queen defended by another piece, directly next to the black king on the last rank. You do not need to calculate Black's moves; you simply know it is mate because you remember the pattern.
I do not know if there are chess programs that know it, and that also know that the following is, of course, the same pattern:
[D]8/8/8/6Qk/5P2/8/8/4K3 b - - 0 1
but that the following is not the same pattern:
[D]5B2/8/7Q/7k/8/8/8/4K3 b - - 0 1
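For comparison, this is roughly what "calculating like a stupid computer" looks like for the three mate-pattern positions above. It is only a sketch under a strong assumption: Black has nothing but a lone king, so the only legal replies are king moves. The function names are made up for illustration; a real engine would use a full move generator.

```python
BISHOP = [(1, 1), (1, -1), (-1, 1), (-1, -1)]
ROOK = [(1, 0), (-1, 0), (0, 1), (0, -1)]
KING = BISHOP + ROOK  # one-step offsets in all eight directions
KNIGHT = [(1, 2), (2, 1), (2, -1), (1, -2), (-1, -2), (-2, -1), (-2, 1), (-1, 2)]
SLIDERS = {"B": BISHOP, "R": ROOK, "Q": BISHOP + ROOK}


def parse_fen(fen):
    """Return {(file, rank): piece} from the board field of a FEN string."""
    board = {}
    for row_index, row in enumerate(fen.split()[0].split("/")):
        rank, file = 7 - row_index, 0  # FEN lists rank 8 first
        for ch in row:
            if ch.isdigit():
                file += int(ch)
            else:
                board[(file, rank)] = ch
                file += 1
    return board


def white_attacks(board, target):
    """True if a white piece attacks `target`. The black king is removed
    first, so it cannot 'hide' behind the square it is standing on."""
    blockers = {sq: p for sq, p in board.items() if p != "k"}
    for (f, r), piece in blockers.items():
        if piece == "P" and target in ((f - 1, r + 1), (f + 1, r + 1)):
            return True
        if piece == "N" and any((f + df, r + dr) == target for df, dr in KNIGHT):
            return True
        if piece == "K" and any((f + df, r + dr) == target for df, dr in KING):
            return True
        for df, dr in SLIDERS.get(piece, []):
            sq = (f + df, r + dr)
            while 0 <= sq[0] < 8 and 0 <= sq[1] < 8:
                if sq == target:
                    return True  # also covers "this white piece is defended"
                if sq in blockers:
                    break
                sq = (sq[0] + df, sq[1] + dr)
    return False


def lone_king_is_mated(fen):
    """Assumes Black has only a king: mate = in check and no safe king move."""
    board = parse_fen(fen)
    kf, kr = next(sq for sq, p in board.items() if p == "k")
    if not white_attacks(board, (kf, kr)):
        return False
    for df, dr in KING:
        sq = (kf + df, kr + dr)
        if 0 <= sq[0] < 8 and 0 <= sq[1] < 8 and not white_attacks(board, sq):
            return False  # found an escape square (or a safe capture)
    return True


print(lone_king_is_mated("4k3/4Q3/8/6B1/8/8/8/4K3 b - - 0 1"))
print(lone_king_is_mated("8/8/8/6Qk/5P2/8/8/4K3 b - - 0 1"))
print(lone_king_is_mated("5B2/8/7Q/7k/8/8/8/4K3 b - - 0 1"))
```

The first two positions come out as mate, while in the third the king still has g4, which is why it is not the same pattern. A human sees this at a glance; the code has to try every king move.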
Uri
-
- Posts: 2851
- Joined: Wed Mar 08, 2006 10:01 pm
- Location: Irvine, CA, USA
Re: We are doomed - AlphaGo Zero, learning only from basic r
That was 24 minutes? It seemed more like 2:40 to me.
Vinvin wrote: 24 minutes video from DeepMind : https://www.youtube.com/watch?v=WXHFqTvfFSw
Deasil is the right way to go.
-
- Posts: 6995
- Joined: Thu Aug 18, 2011 12:04 pm
Re: We are doomed - AlphaGo Zero, learning only from basic r
[a bit off-topic] I used to think so, but not any longer. Google CRISPR/Cas9, for instance, and consider the potential.
Leo wrote: I very much agree. I am tired of the bombast from Google about the great things they are going to do for humanity. AI is overrated.
Cardoso wrote: For that one I think they will take millennia, or simply never.
They can't cure migraines or diabetes, much less stop aging.
And I think some people expect too much from science.
My mother had a severe skin disease on her feet called hyperkeratosis, with deep cracks in the skin which hurt badly. She was treated by the best doctor in the field in the country (Portugal) with very aggressive medications, and none of the several treatments worked. Desperate, my mother tried a plant called "malvas" in Portuguese; after 2 weeks she was much better, and after 6 weeks she was completely cured and the problem went away entirely.
I think the human body is too complex for science.
Even a single cell is tremendously complex.
-
- Posts: 5228
- Joined: Thu Mar 09, 2006 9:40 am
- Full name: Vincent Lejeune
Re: We are doomed - AlphaGo Zero, learning only from basic r
Sorry, I mixed it up with the first game demo here: https://www.youtube.com/watch?v=-Wh4CfsWDyM
Dirt wrote: That was 24 minutes? It seemed more like 2:40 to me.
Vinvin wrote: 24 minutes video from DeepMind : https://www.youtube.com/watch?v=WXHFqTvfFSw
-
- Posts: 5228
- Joined: Thu Mar 09, 2006 9:40 am
- Full name: Vincent Lejeune
Re: We are doomed - AlphaGo Zero, learning only from basic r
If I read the Giraffe paper correctly ( https://arxiv.org/pdf/1509.01549.pdf ), it uses 3 network layers.
Daniel Shawul wrote: Giraffe already did that with the NN evaluation. It used hand-selected features as inputs (presence and location of pieces, etc.) and was able to compete with Stockfish's eval. It is most likely possible to have a Giraffe-zero, at least for the evaluation only, i.e. it will learn everything the chess world knows about good static evaluation (not search) from self-play games only, in a couple of hours.
Vinvin wrote: I hope for such an experiment for chess: starting a very deep learning run with only basic rules, piece coordinates and piece interactions.
Not even a "piece value" concept hardcoded.
In the doc about AlphaGo Zero (Here), it uses 12 layers!
nature.com wrote: (2) AlphaGo Lee is the program that defeated Lee Sedol 4–1 in March 2016. It was previously unpublished, but is similar in most regards to AlphaGo Fan (12). However, we highlight several key differences to facilitate a fair comparison. First, the value network was trained from the outcomes of fast games of self-play by AlphaGo, rather than games of self-play by the policy network; this procedure was iterated several times, an initial step towards the tabula rasa algorithm presented in this paper. Second, the policy and value networks were larger than those described in the original paper, using 12 convolutional layers of 256 planes, and were trained for more iterations. This player was also distributed over many machines using 48 TPUs, rather than GPUs, enabling it to evaluate neural networks faster during search.
-
- Posts: 362
- Joined: Thu Mar 16, 2006 7:39 pm
- Location: Portugal
- Full name: Alvaro Cardoso
Re: We are doomed - AlphaGo Zero, learning only from basic r
We are doomed all right, but it's not because of AlphaGo Zero, or Google, or anything of the kind.
If we are doomed, it's for other reasons that 99% of us don't give a damn about, even when advised or warned.
We adults often complain it's difficult to raise our kids; we complain they don't respond to our best efforts, they don't listen to us, they don't care about our advice.
Well, adults behave the same way: they also don't respond to the best advice, they also insist on doing things their own way, and of course the result can't be good. So in this respect many adults didn't really grow up. We have too many mental barriers to sound advice.
Sorry if I sound too generic and cryptic, but I wouldn't like to give further details. Just look at the news tonight and think about today's society and its problems and the chaos families live in, and maybe you will agree with me.
-
- Posts: 12038
- Joined: Mon Jul 07, 2008 10:50 pm
Re: We are doomed - AlphaGo Zero, learning only from basic r
Does anyone know whether AlphaGo Zero has hit a plateau, or is it still gaining Elo points?