Testing Stockfish 11-03-13. 480 Games.

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Rebel, chrisw

Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Tomcass »

TESTING STOCKFISH DEVELOPMENT 020316: : 1920 GAMES
Timestamp: 1456945856 Bench: 8576437

SUMMARY OF 1.920 GAMES.

Against Komodo 9.3 (3259) = 57.19% SugaR 2.0 (3285)= 50.42% Houdini 4 Pro (3136)= 69.69% Gull 3XP (3103) = 74.90%

FIXED TIME CONTROL 4 MINUTES + 0 SECONDS (960 GAMES)
62.92%

INCREMENTAL TIME CONTROL 2 MINUTES + 2 SECONDS (960 GAMES)
63.18%

Global score for Stockfish 020316 after 1.920 games: 1.210.5 – 709.5= 63.05%
AVERAGE ELO OF OPONENTS= 3.196
ESTIMATED ELO FOR STOCKFISH DEV 070216 = 3.287


Congratulations to the Stockfish Team for this substantial step ahead.

MY RANKING OF SELECTED ENGINES

1.- Stockfish Development 020316= 3.287
2.- SugaR 2.0= 3.285
3.- Stockfish Development 150116 IPMAN= 3.283
4.- SugaR Pro v1.3= 3.280
5.- Stockfish 7 = 3.268
6.- Komodo 9.3 = 3.259
7.- Komodo 9.2 = 3.251.
8.- Komodo 9.1= 3.247
9.- DON 190216 = 3.242
10.- Stockfish 6= 3.219
11.- Komodo 9= 3.209
12.- Houdini 4= 3.136
13.- Gull 3 Development 150216= 3.120
14.- Gull 3 XP= 3.103

Best regards to the computer chess community. :wink:

Tom.
User avatar
Ozymandias
Posts: 1535
Joined: Sun Oct 25, 2009 2:30 am

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Ozymandias »

Not that I want to rain on your parade:

Code: Select all

     Program                   Elo    +    -   Games   Score   Av.Op.  Draws
   1 Stockfish 160113 x64    : 3315    6    6  7000    72.5 %   3138   35.9 %
   2 Stockfish 160302 x64    : 3312    7    7  7000    72.2 %   3138   36.4 % (new)
   3 Stockfish 160129 x64    : 3312    6    6  7000    72.2 %   3138   36.7 %
   4 Stockfish 151214 x64    : 3306    6    6  7000    71.5 %   3138   36.9 %
   5 Stockfish 151205 x64    : 3305    6    6  7000    71.4 %   3138   37.4 %
   6 Stockfish 151222 x64    : 3304    6    6  7000    71.3 %   3138   37.2 %
   7 Stockfish 7 160102      : 3304    6    6  7000    71.3 %   3138   36.9 %
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Tomcass »

Ozymandias wrote:Not that I want to rain on your parade:

Code: Select all

     Program                   Elo    +    -   Games   Score   Av.Op.  Draws
   1 Stockfish 160113 x64    : 3315    6    6  7000    72.5 %   3138   35.9 %
   2 Stockfish 160302 x64    : 3312    7    7  7000    72.2 %   3138   36.4 % (new)
   3 Stockfish 160129 x64    : 3312    6    6  7000    72.2 %   3138   36.7 %
   4 Stockfish 151214 x64    : 3306    6    6  7000    71.5 %   3138   36.9 %
   5 Stockfish 151205 x64    : 3305    6    6  7000    71.4 %   3138   37.4 %
   6 Stockfish 151222 x64    : 3304    6    6  7000    71.3 %   3138   37.2 %
   7 Stockfish 7 160102      : 3304    6    6  7000    71.3 %   3138   36.9 %
No problem, Juan.

Different tests, at different time controls, against different rivals, running in a different hardware ... will probaly give different outputs. This is the case.

Un abrazo!

Tom.
felix
Posts: 81
Joined: Mon Feb 24, 2014 6:10 pm
Location: Mexicali, República de Baja California

Re: For Tom- or Maybe Anyone Who Knows!!

Post by felix »

I believe some contempt=10 is used for stockfish to "avoid draws" on that test

so is a "weaker" stockfish

Regards
Be careful with your words, once they are said, they can be only forgiven, not forgotten
User avatar
Ozymandias
Posts: 1535
Joined: Sun Oct 25, 2009 2:30 am

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Ozymandias »

felix wrote:I believe some contempt=10 is used for stockfish to "avoid draws" on that test

so is a "weaker" stockfish

Regards
Where did you read that?
felix
Posts: 81
Joined: Mon Feb 24, 2014 6:10 pm
Location: Mexicali, República de Baja California

Re: For Tom- or Maybe Anyone Who Knows!!

Post by felix »

http://spcc.beepworld.de/endless-roundrobin.htm

Current participants: Stockfish 160120 and Komodo 9.3, Stockfish Contempt is set to +10 (reduce early 3fold-draws of Stockfish)

I don't know if is used for the last tests.

Regards
Be careful with your words, once they are said, they can be only forgiven, not forgotten
User avatar
Ozymandias
Posts: 1535
Joined: Sun Oct 25, 2009 2:30 am

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Ozymandias »

That's a different project, he runs several.
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Tomcass »

TESTING STOCKFISH DEVELOPMENT 100316: : 960 GAMES
Timestamp: 1457648766 Bench: 8261839

6 real cores
Ponder: Off.
GUI: Fritz 14
Book: Perfect 2014c – Limit 5 moves. Sedat –
No tablebases. No RTB used.
Hash 512
Relative Speed: 29.54
Knodes per second: 14.177

Time Control 4+0
Stockfish 100316 64 POPCNT_6 - Komodo 9.3 64-bit_nob_St_6 34.5 - 25.5 +13/=43/-4 57.50%
Stockfish 100316 64 POPCNT_6 - SugaR 2.0 64 POPCNT_X6 31.0 - 29.0 +4/=54/-2 51.67%
Stockfish 100316 64 POPCNT_6 - Houdini 4 Pro x64_St_Ct0_X6 40.0 - 20.0 +23/=34/-3 66.67%
Stockfish 100316 64 POPCNT_6 - Gull 3 x64 XP_X6 43.0 - 17.0 +27/=32/-1 71.67%

Time Control 2+2
Stockfish 100316 64 POPCNT_6 - Komodo 9.3 64-bit_nob_St_6 36.0 - 24.0 +15/=42/-3 60.00%
Stockfish 100316 64 POPCNT_6 - SugaR 2.0 64 POPCNT_X6 31.5 - 28.5 +3/=57/-0 52.50%
Stockfish 100316 64 POPCNT_6 - Houdini 4 Pro x64_St_Ct0_X6 44.0 - 16.0 +29/=30/-1 73.33%
Stockfish 100316 64 POPCNT_6 - Gull 3 x64 XP_X6 41.0 - 19.0 +24/=34/-2 68.33%

Score after 480 Games using 6 cores= 301.0 – 179.0 = 62.71%
480 Games: (No games available due to an incidence with Mediafire) Sorry.


8 real cores
Ponder: Off
GUI: Fritz 14
Book: Perfect 2014c – Limit 5 moves. Sedat –
No tablebases. No RTB used.
Hash 1024
Relative Speed: 53.22
Knodes per Second: 25.543

Fixed Time Control 4+0
Stockfish 100316 64 BMI2_8 - Komodo 9.3 64-bit_St_nob_X8 31.0 - 29.0 +10/=42/-8 51.67%
Stockfish 100316 64 BMI2_8 - SugaR 2.0 64 BMI2_X8 29.0 - 31.0 +2/=54/-4 48.33%
Stockfish 100316 64 BMI2_8 - Houdini 4 Pro x64_Ct0_8 44.5 - 15.5 +33/=23/-4 74.17%
Stockfish 100316 64 BMI2_8 - Gull 3 x64 BMI2_X8 48.5 - 11.5 +37/=23/-0 80.83%

Incremental Time Control 2+2
Stockfish 100316 64 BMI2_8 - Komodo 9.3 64-bit_St_nob_X8 34.5 - 25.5 +17/=35/-8 57.50%
Stockfish 100316 64 BMI2_8 - SugaR 2.0 64 BMI2_X8 30.5 - 29.5 +4/=53/-3 50.83%
Stockfish 100316 64 BMI2_8 - Houdini 4 Pro x64_Ct0_8 40.0 - 20.0 +20/=40/-0 66.67%
Stockfish 100316 64 BMI2_8 - Gull 3 x64 BMI2_X8 46.5 - 13.5 +34/=25/-1 77.50%

Score after 480 games using 8 cores: 304.5 – 175.5= 63.44%
480 Games: (No games available due to an incidence with Mediafire) Sorry

Global score for Stockfish 100316 after 960 games: 605.5 – 354.5= 63.07%
AVERAGE ELO OF OPONENTS= 3.196
ESTIMATED ELO FOR STOCKFISH DEV 070216 = 3.288


This test has confirmed the excellent evolution of SF DEVELOPMENT. This score is almost the same obtained by the current leader in my tests, SF DEVELOPMENT 0202316.

Best regards,

Tom.
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Tomcass »

... sorry, there has been a typo in my previous post that I have not been allowed to edit. This is the correct text:

Global score for Stockfish 100316 after 960 games: 605.5 – 354.5= 63.07%
AVERAGE ELO OF OPONENTS= 3.196
ESTIMATED ELO FOR STOCKFISH DEV 100316 = 3.288


This test has confirmed the excellent evolution of SF DEVELOPMENT. This score is almost the same obtained by the current leader in my tests, SF DEVELOPMENT 020316.

Best regards,

Tom.

... I would like to suggest to allow edition of posts for 30 minutes after posting instead of the 15 current minutes. Thank you!.
Kohflote
Posts: 219
Joined: Wed Sep 19, 2007 11:07 am
Location: Singapore

Re: For Tom- or Maybe Anyone Who Knows!!

Post by Kohflote »

Dear Tom,

Will you continue the test till 1920 games? Thank you.

Best regards,
Koh, Kah Huat