FGRL 10 min + 6 sec Rating list - Komodo 11.2

fastgm · Post by **fastgm** » Sun Jul 23, 2017 12:36 pm

Rating list - 10 minutes + 6 seconds

Komodo 11.2 (+5 to Komodo 11.01)

http://www.fastgm.de

Progress:

     Engine                &#58;    Elo   Error  Played   (%)       W      D      L     D&#40;%)   CFS
 ----------------------------------------------------------------------------------------------
   1 Komodo 11.2           &#58;   3235     10    2700   71.30    1271   1308    121   48.44    72
   3 Komodo 11.01          &#58;   3230      9    3900   76.40    2207   1545    148   39.62    94
   4 Komodo 10.4           &#58;   3219      9    3300   71.58    1578   1568    154   47.52    57
   6 Komodo 10.3           &#58;   3205      8    4200   71.52    2037   1934    229   46.05    88
   7 Komodo 10.2           &#58;   3199      7    5400   70.81    2574   2500    326   46.30    66
   8 Komodo 10.1           &#58;   3196      8    4200   77.85    2485   1569    146   37.36    73
   9 Komodo 9.42           &#58;   3193      8    4200   76.60    2397   1640    163   39.05    54
  10 Komodo 10             &#58;   3192     10    3000   75.77    1670   1206    124   40.20   100
  12 Komodo 9              &#58;   3128      7    4500   67.18    1882   2282    336   50.71    75
  16 Komodo 8              &#58;   3078      7    5400   64.49    2169   2627    604   48.65    86
  20 Komodo 7a             &#58;   3035      8    3600   58.21    1111   1969    520   54.69    96
  22 Komodo TCECr          &#58;   3022      9    3000   61.95    1122   1473    405   49.10    98

.

pohl4711 · Post by **pohl4711** » Sun Jul 23, 2017 12:50 pm

Ouch! Only +5 Elo and Komodo 11.2 lost both 300 games head-to-head against Stockfish 8 (141.5-158.5) and against Houdini 5 (149-151).
Disappointing.

Modern Times · Post by **Modern Times** » Sun Jul 23, 2017 2:04 pm

Our blitz results are showing +6 but with only 600 games so far you can't rely on that. But yes in our test so far it scored 47% against Stockfish 8 and 49.5% against Houdini 5. I like Andreas's testing because of the large number of games and consequent low error margins.

I'm running this at chess960 so will see how it goes there.

JJJ · Post by **JJJ** » Sun Jul 23, 2017 2:15 pm

Komodo 11.01 vs Stockfish 8 : 45,8% 300 games
Komodo 11.2 vs Stockfish 8 : 47,2% 300 games

Komodo 11.01 vs Houdini 5 : 48,7% 300 games
Komodo 11.2 vs Houdini 5 : 49,7% 300 game

I see a progress here. I think it needs more game for both version to know better.

Also, maybe this time Komodo won more elo at bullet than in mid time control or long time control.

Modern Times · Post by **Modern Times** » Sun Jul 23, 2017 2:20 pm

JJJ wrote: Komodo 11.01 vs Stockfish 8 : 45,8% 300 games
Komodo 11.2 vs Stockfish 8 : 47,2% 300 games

Komodo 11.01 vs Houdini 5 : 48,7% 300 games
Komodo 11.2 vs Houdini 5 : 49,7% 300 game

I see a progress here.

Yes agreed.

majortom · Post by **majortom** » Sun Jul 23, 2017 2:28 pm

fastgm wrote:Rating list - 10 minutes + 6 seconds

Komodo 11.2 (+5 to Komodo 11.01)

http://www.fastgm.de

Nice test Andreas as always!
Waiting for the 60'+15" test of K11.2.
I found the everage ELO gain = 6 per version since K9.42 according at Fastgm's 10 minutes + 6 seconds Rating list:

lkaufman · Post by **lkaufman** » Sun Jul 23, 2017 8:05 pm

JJJ wrote:Komodo 11.01 vs Stockfish 8 : 45,8% 300 games
Komodo 11.2 vs Stockfish 8 : 47,2% 300 games

Komodo 11.01 vs Houdini 5 : 48,7% 300 games
Komodo 11.2 vs Houdini 5 : 49,7% 300 game

I see a progress here. I think it needs more game for both version to know better.

Also, maybe this time Komodo won more elo at bullet than in mid time control or long time control.

It is possible that we fixed some things that matter more in bullet chess than in long tc chess. But I note that the early CEGT 40/20 results are encouraging, so perhaps there is no problem other than the general rule that rating gains contract with greater TC due to more draws.

Gusev · Post by **Gusev** » Sun Jul 23, 2017 8:25 pm

It is also quite possible that the openings are more drawish in some tests than in others.

lkaufman wrote:
JJJ wrote:Komodo 11.01 vs Stockfish 8 : 45,8% 300 games
Komodo 11.2 vs Stockfish 8 : 47,2% 300 games

Komodo 11.01 vs Houdini 5 : 48,7% 300 games
Komodo 11.2 vs Houdini 5 : 49,7% 300 game

I see a progress here. I think it needs more game for both version to know better.

Also, maybe this time Komodo won more elo at bullet than in mid time control or long time control.
It is possible that we fixed some things that matter more in bullet chess than in long tc chess. But I note that the early CEGT 40/20 results are encouraging, so perhaps there is no problem other than the general rule that rating gains contract with greater TC due to more draws.

FGRL 10 min + 6 sec Rating list - Komodo 11.2

FGRL 10 min + 6 sec Rating list - Komodo 11.2

Re: FGRL 10 min + 6 sec Rating list - Komodo 11.2

Re: FGRL 10 min + 6 sec Rating list - Komodo 11.2

Re: FGRL 10 min + 6 sec Rating list - Komodo 11.2

Re: FGRL 10 min + 6 sec Rating list - Komodo 11.2

Re: FGRL 10 min + 6 sec Rating list - Komodo 11.2

Re: FGRL 10 min + 6 sec Rating list - Komodo 11.2

Re: FGRL 10 min + 6 sec Rating list - Komodo 11.2