Stockfish 10 was released 29.11.2018


Uri Blass
Posts: 10282
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: Stockfish 10 was released 29.11.2018

Post by Uri Blass »

Zenmastur wrote: Wed Dec 11, 2019 7:12 pm
Uri Blass wrote: Wed Dec 11, 2019 6:16 pm
I think that one of the problems with testing in the Stockfish framework is that games are adjudicated as draws when the evaluation is 0.00 for some moves and there is no progress, instead of being played on until a draw by the 50-move rule.

It means that even if some patch makes an improvement, it will probably not pass the tests in the framework because the games are adjudicated as draws too early.
I'm not sure that's the problem.

The bounds used for simplification tests are [-3.00,1.00]. This allows too many regressions. It would be much more balanced if they changed the bounds to [-2.00,2.00]. Even [-2.50,1.50] would help.
I do not believe that this is the problem.
Simplifications usually pass with a score above 50%, and there is no proof that Stockfish without the simplifications is better.
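
To make the disagreement about bounds concrete, here is a rough Python sketch of how likely a patch of a given true strength is to pass an SPRT with bounds [elo0, elo1]. It uses the standard diffusion approximation (no overshoot), and it assumes alpha = beta = 0.05 and that the bounds are in the same Elo units as the patch's true value. Both are assumptions for illustration and may not match what the framework actually uses; the function name is made up.

```python
import math

def sprt_pass_probability(true_elo, elo0, elo1, alpha=0.05, beta=0.05):
    """Approximate chance that an SPRT(elo0, elo1) accepts a patch worth true_elo.

    Diffusion approximation: overshoot is ignored and the score is treated
    as linear in Elo, which is reasonable for differences of a few Elo.
    """
    lower = math.log(beta / (1.0 - alpha))   # LLR bound for rejecting the patch
    upper = math.log((1.0 - beta) / alpha)   # LLR bound for accepting the patch
    # Normalized drift: -1 at elo0, 0 at the midpoint, +1 at elo1.
    h = 2.0 * (true_elo - (elo0 + elo1) / 2.0) / (elo1 - elo0)
    if abs(h) < 1e-12:
        return -lower / (upper - lower)
    return (1.0 - math.exp(-h * lower)) / (math.exp(-h * upper) - math.exp(-h * lower))

# Compare the three bound choices mentioned above for a few true values.
for bounds in [(-3.0, 1.0), (-2.5, 1.5), (-2.0, 2.0)]:
    row = ", ".join(
        f"{elo:+.0f} elo: {sprt_pass_probability(elo, *bounds):.2f}"
        for elo in (-2.0, -1.0, 0.0, 1.0)
    )
    print(bounds, row)
```

Under this approximation a patch whose true value sits at the midpoint of the bounds (-1 Elo for [-3.00,1.00]) passes about half the time, and moving the bounds to the right lowers the pass chance for the same patch. It says nothing about how many submitted simplifications actually have negative true value, which is the point in dispute.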
Zenmastur
Posts: 919
Joined: Sat May 31, 2014 8:28 am

Re: Stockfish 10 was released 29.11.2018

Post by Zenmastur »

Uri Blass wrote: Wed Dec 11, 2019 11:43 pm
I do not believe that this is the problem.
Simplifications usually pass with more than 50% and there is no proof that stockfish without the simplifications is better.
Well, if you look at the simplifications and their regressions found here: https://nextchessmove.com/dev-builds
You can see that many of them are regressions and that over time they seem to lose almost as much Elo as the rest of the patches gain. This has recently led to several months of basically no change in SF's Elo.
Only 2 defining forces have ever offered to die for you.....Jesus Christ and the American Soldier. One died for your soul, the other for your freedom.
Michel
Posts: 2272
Joined: Mon Sep 29, 2008 1:50 am

Re: Stockfish 10 was released 29.11.2018

Post by Michel »

Zenmastur wrote: Wed Dec 11, 2019 11:58 pm
Well, if you look at the simplifications and their regressions found here: https://nextchessmove.com/dev-builds
You can see that many of them are regressions and that over time they seem to lose almost as much Elo as the rest of the patches gain. This has recently led to several months of basically no change in SF's Elo.
Error bars and perhaps occasional bad runs.

There have been tests where a bunch of simplifications were reverted and there was no Elo gain. There is no evidence whatsoever that simplifications in SF cause Elo loss.
Ideas=science. Simplification=engineering.
Without ideas there is nothing to simplify.
Zenmastur

Re: Stockfish 10 was released 29.11.2018

Post by Zenmastur »

Michel wrote: Thu Dec 12, 2019 12:23 am
Error bars and perhaps occasional bad runs.

There have been tests where a bunch of simplifications were reverted and there was no Elo gain. There is no evidence whatsoever that simplifications in SF cause Elo loss.
You mean other than the regression tests?
Uri Blass

Re: Stockfish 10 was released 29.11.2018

Post by Uri Blass »

Zenmastur wrote: Wed Dec 11, 2019 11:58 pm
Well, if you look at the simplifications and their regressions found here: https://nextchessmove.com/dev-builds
You can see that many of them are regressions and that over time they seem to lose almost as much Elo as the rest of the patches gain. This has recently led to several months of basically no change in SF's Elo.
There are not enough games to know whether any single simplification is a regression or an improvement, but you can get an unbiased estimate of the average value of the simplifications made since Stockfish 10.

These are only the first numbers; for that purpose you need to take the remaining numbers from the link and calculate the average.
At least for these first numbers, the average looks positive to me.

209.73->207.78(-1.95 elo) 1.12.2018 simplification
208.88->206.03(-2.85 elo) 6.12.2018 simplification
208.75->211.58(2.83 elo) 16.12.2018 simplification
214.03->216.00(1.97 elo) 16.12.2018 simplification
214.25->213.08(-1.17 elo) 24.12.2018 simplification
212.66->213.98(1.32 elo) 27.12.2018 simplification
209.88->210.54(0.66 elo) 4.1.2019 simplification
211.45->215.12(3.67 elo) 10.1.2019 simplification
215.12->212.84(-2.28 elo) 14.1.2019 simplification
212.84->212.17(-0.67 elo) 14.1.2019 simplification
212.17->216.75(4.58 elo) 17.1.2019 simplification
215.25->217.07(1.82 elo) 22.1.2019 simplification
216.10->215.10(-1 elo) 29.1.2019 simplification
215.10->221.39(6.29 elo) 31.1.2019 simplification
217.75->219.64(1.89 elo) 8.2.2019 simplification
219.64->220.48(0.84 elo) 21.2.2019 simplification
220.48->220.45(-0.03 elo) 21.2.2019 simplification
218.64->219.64(1 elo) 27.2.2019 simplification
220.93->218.38->220.45(-0.48 elo) 5.3 simplifications
219.49->220.93(+1.44 elo) 10.3 simplification
219.87->218.20(-1.67 elo) 20.3 simplification
221.09->218.53(-2.56 elo) 24.3 simplification
217.85->218.81(0.96 elo) 4.4 simplification
223.36->220.86->221.82(-1.54 elo) 13.4 simplifications
219.64->219.14->218.61->219.15(-0.49 elo) 16.4 simplifications
219.15->220.30(1.15 elo) 17.4 simplification
218.51->218.61(0.1 elo) 19.4 simplification
221.37->220.81->225.70(4.33 elo) 9.5 simplifications
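
The sum and average of the rows listed above can be checked mechanically. A minimal Python sketch, with the per-row differences copied from the list (rows that cover several simplifications count once, using the stated net change):

```python
# Elo changes as listed above, one value per row, in the same order.
deltas = [
    -1.95, -2.85, 2.83, 1.97, -1.17, 1.32, 0.66,
    3.67, -2.28, -0.67, 4.58, 1.82, -1.00, 6.29,
    1.89, 0.84, -0.03, 1.00, -0.48, 1.44, -1.67,
    -2.56, 0.96, -1.54, -0.49, 1.15, 0.10, 4.33,
]

total = sum(deltas)
mean = total / len(deltas)
print(f"rows={len(deltas)} total={total:+.2f} mean={mean:+.2f}")
```

For these 28 rows the total comes to about +18.2 Elo, or roughly +0.65 per row. Since each underlying measurement carries an error of more than 3.6 Elo, that total is likely within the combined statistical error, so it supports "no evidence of a loss" more than it proves a gain.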
Zenmastur

Re: Stockfish 10 was released 29.11.2018

Post by Zenmastur »

I have no clue what all this is supposed to mean.
Michel

Re: Stockfish 10 was released 29.11.2018

Post by Michel »

Zenmastur wrote: Thu Dec 12, 2019 1:42 pm
I have no clue what all this is supposed to mean.
That says enough about your understanding. Please refrain from further commenting.
Zenmastur

Re: Stockfish 10 was released 29.11.2018

Post by Zenmastur »

Michel wrote: Thu Dec 12, 2019 2:57 pm
That says enough about your understanding. Please refrain from further commenting.
Please refrain from being a complete ass! I'll comment on any post that pleases me.

My problem with the above is that there are no labels or source given, and about a third have no date.

The simplifications from June 1st 2019 to December 1st 2019 have lost almost 25 Elo, at least according to https://nextchessmove.com/dev-builds.

If that's not a regression then I don't know what a regression is.

Regards,

Zenmastur
Uri Blass

Re: Stockfish 10 was released 29.11.2018

Post by Uri Blass »

Zenmastur wrote: Thu Dec 12, 2019 1:42 pm
I have no clue what all this is supposed to mean.
This is about the data at the link.

I will explain one line, and you can understand the other lines by the same logic:
209.73->207.78(-1.95 elo) 1.12.2018 simplification

The following lines are from the link
https://nextchessmove.com/dev-builds
20181201-0929 20000 11146 433 8421 +207.78 +/- 3.64 Simplification
20181129-1517 20000 11271 478 8251 +209.73 +/- 3.69 Non Functional

209.73 is the Elo difference from Stockfish 7 before the simplification.
207.78 is the Elo difference from Stockfish 7 after the simplification.

-1.95 is the estimated Elo change from the simplification (note that the statistical error of each measurement is above 3.6 Elo).
1.12.2018 is the date of the simplification.
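
The two rows quoted from the link can be reproduced with the usual logistic Elo formula. The sketch below assumes the columns after the build date are games, wins, losses and draws; that reading is not stated on the page excerpt but it reproduces the reported Elo values. The function name is illustrative, and combining the two error margins in quadrature assumes the two runs are independent.

```python
import math

def elo_from_wld(wins, losses, draws):
    """Logistic Elo difference implied by a win/loss/draw record."""
    games = wins + losses + draws
    score = (wins + 0.5 * draws) / games
    return 400.0 * math.log10(score / (1.0 - score))

# Rows from the link: 20181129-1517 (before) and 20181201-0929 (after).
before = elo_from_wld(11271, 478, 8251)
after = elo_from_wld(11146, 433, 8421)
change = after - before

# Assumption: the two +/- margins are independent, so they add in quadrature.
combined_error = math.hypot(3.69, 3.64)

print(f"before={before:.2f} after={after:.2f}")
print(f"change={change:.2f} +/- {combined_error:.2f}")
```

The combined margin is a little over 5 Elo, so a single measured change of about -2 Elo cannot distinguish a small regression from a small gain. That is why only the sum or average over many simplifications is informative.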


The idea is that you can get an unbiased estimate of the Elo that Stockfish gains from simplifications by summing all these numbers.
I did not calculate the sum of all of them, but at least the sum of the numbers that I wrote, which covers only part of the simplifications, is above 0.

Maybe somebody can continue and calculate the sum of all the numbers (there are many simplifications after the one of 9.5.2019 for which I did not write the numbers, but you can get them from the link).
Uri Blass

Re: Stockfish 10 was released 29.11.2018

Post by Uri Blass »

Zenmastur wrote: Thu Dec 12, 2019 7:01 pm
Please refrain from being a complete ass! I'll comment on any post that pleases me.

My problem with the above is that there are no labels or source given, and about a third have no date.

The simplifications from June 1st 2019 to December 1st 2019 have lost almost 25 Elo, at least according to https://nextchessmove.com/dev-builds.

If that's not a regression then I don't know what a regression is.

Regards,

Zenmastur
I did not calculate the estimated total Elo from simplifications between 1.6.2019 and 1.12.2019, but there are many simplifications, so I believe the statistical error in the estimate of their combined value is high.

There may be a reduction larger than the statistical error, but if you did not fix the test period in advance on 1.6.2019 and only chose it on 1.12.2019, the statistic is misleading: even a sum of zero-Elo changes can be expected to exceed the statistical error if you are free to choose the dates and pick dates where you are unhappy with the results.
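
The effect of choosing the dates after seeing the results can be illustrated with a small simulation. This is a sketch under stated assumptions, not a model of the real data: every simplification is given a true value of exactly 0 Elo, each measured change is independent Gaussian noise with a standard deviation of 2.6 Elo, and there are 30 simplifications in the period. All of these numbers and function names are made up for illustration.

```python
import random
import statistics

def worst_window_sum(deltas):
    """Most negative sum over any contiguous run of deltas (Kadane's algorithm)."""
    best = current = deltas[0]
    for d in deltas[1:]:
        current = min(d, current + d)
        best = min(best, current)
    return best

def simulate(n_patches=30, sigma=2.6, trials=2000, seed=1):
    """Compare a window fixed in advance with a window picked after the fact."""
    rng = random.Random(seed)
    fixed, cherry_picked = [], []
    for _ in range(trials):
        # True change is 0 for every patch; only measurement noise remains.
        deltas = [rng.gauss(0.0, sigma) for _ in range(n_patches)]
        fixed.append(sum(deltas))                      # dates chosen beforehand
        cherry_picked.append(worst_window_sum(deltas)) # dates chosen to look worst
    return statistics.mean(fixed), statistics.mean(cherry_picked)

mean_fixed, mean_cherry = simulate()
print(f"average sum, window fixed in advance: {mean_fixed:+.2f} elo")
print(f"average sum, worst window picked later: {mean_cherry:+.2f} elo")
```

With the window fixed in advance the average measured sum stays near zero, while the worst window picked afterwards shows a clearly negative sum even though no patch lost anything. A drop measured over hand-picked dates therefore needs a much larger margin before it counts as evidence of a regression.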