Komodo-Dragon-2 vs Stockfish 14 at knight odds

lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Post by lkaufman »

Rebel wrote: Wed Sep 22, 2021 7:15 am
lkaufman wrote: Wed Sep 22, 2021 2:43 am So bishops are indeed worth more than knights (at least when the bishop pair is broken for the side losing the bishop), no surprise there. But it is interesting that Stockfish lost much more than Komodo from this: SF's score was nearly cut in half going from knight odds to bishop odds! Regarding rook odds, it is roughly a class (200 elo) larger handicap than knight odds, so a field in the 2500 to 2530 range for opponents might be more balanced, but anyway it will be interesting.
At the moment I am doing queen odds, just to be complete.

The rook epd is not good, see:

Code:

rnbqkb1r/ppp1pppp/3p4/3nP3/3P4/8/PPP2PPP/1NBQKBNR w KQkq - 0 4; v=-526
r1bqkb1r/pppnpppp/3p1n2/8/2PP4/2N5/PP2PPPP/2BQKBNR w KQkq - 2 4; v=-529
rnbqk1nr/ppp1ppbp/3p2p1/8/3PP3/5N2/PPP2PPP/1NBQKB1R w KQkq - 0 4; v=-536
r1bqkb1r/ppp1pppp/2n2n2/3p4/2PP4/4P3/PP3PPP/1NBQKBNR w KQkq - 1 4; v=-538
The castling flags are wrong, so the positions are ignored by cutechess.

Does somebody have a good rook odds epd of (at least) 100 positions?
At queen odds, if you keep the same field, I don't think that either Dragon or Stockfish will get more than a few draws in 700 games, maybe not even that. Probably you need engines about a thousand elo lower than these for a reasonably close match at queen odds even at this bullet tc.
Komodo rules!
Chessqueen
Posts: 5577
Joined: Wed Sep 05, 2018 2:16 am
Location: Moving
Full name: Jorge Picado

Post by Chessqueen »

At an average of 1 second per move, Stockfish 14 will perform much better than it does at 5 seconds per move: the longer the TC you give the lower-rated opponents, the better they will perform against Stockfish 14. Therefore, testing at an average of 1 second per move does NOT give us a good measurement of Stockfish 14 vs Komodo Dragon 2 playing odds :roll:
Do NOT worry and be happy, we all live a short life :roll:
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Post by Rebel »

lkaufman wrote: Wed Sep 22, 2021 7:31 am
At queen odds, if you keep the same field, I don't think that either Dragon or Stockfish will get more than a few draws in 700 games, maybe not even that. Probably you need engines about a thousand elo lower than these for a reasonably close match at queen odds even at this bullet tc.
True, but we need those anyway when we move to the lower regions (in steps of 200 elo less) for a good comparison.
90% of coding is debugging, the other 10% is writing bugs.
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Post by Rebel »

QUEEN odds matches

Komodo-Dragon-2

Code:

QUEEN odds match Komodo-Dragon-2 vs a pool of 2700-2730 elo rated engines
Time Control : Time control : 40/40
Games        : 700

Results from file all.pgn:

No. Name             Win Draw Loss Unf.  Score Games       %
------------------------------------------------------------
  1 Benjamin 1.0    +100   =0   -0   *0  100.0   100  100.0%
  2 Dumb 1.8        +100   =0   -0   *0  100.0   100  100.0%
  3 ProDeo 2.2       +99   =1   -0   *0   99.5   100   99.5%
  4 k2 099           +99   =1   -0   *0   99.5   100   99.5%
  5 Velvet 1.2.0     +98   =1   -1   *0   98.5   100   98.5%
  6 Fruit 2.1        +98   =0   -2   *0   98.0   100   98.0%
  7 Zahak 5.0        +96   =3   -1   *0   97.5   100   97.5%
  8 Komodo-Dragon 2   +4   =6 -690   *0    7.0   700    1.0%
Stockfish 14

Code:

QUEEN odds match Stockfish 14 vs a pool of 2700-2730 elo rated engines
Time Control : Time control : 40/40
Games        : 700

Results from file all.pgn:

No. Name          Win Draw Loss Unf.  Score Games       %
---------------------------------------------------------
  1 Benjamin 1.0 +100   =0   -0   *0  100.0   100  100.0%
  2 Dumb 1.8     +100   =0   -0   *0  100.0   100  100.0%
  3 Fruit 2.1    +100   =0   -0   *0  100.0   100  100.0%
  4 ProDeo 2.2   +100   =0   -0   *0  100.0   100  100.0%
  5 Velvet 1.2.0 +100   =0   -0   *0  100.0   100  100.0%
  6 Zahak 5.0    +100   =0   -0   *0  100.0   100  100.0%
  7 k2 099       +100   =0   -0   *0  100.0   100  100.0%
  8 Stockfish 14   +0   =0 -700   *0    0.0   700    0.0%
Party time!

100 victories over the almighty Stockfish. Print it. And hang copies on the walls of the living room.

Next, rook odds. I wrote a small util that fixes the castling rights.
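
A castling-rights fixer of this kind could look roughly like the Python sketch below. It is not Rebel's utility, whose source is not shown in the thread; the function name `fix_castling` is made up for illustration, and it assumes standard chess (king on e1/e8, rooks on the a- and h-files), so Chess960 positions are not handled.

```python
def fix_castling(epd: str) -> str:
    """Drop castling flags that the piece placement cannot support."""
    fields = epd.split(" ")
    board, castling = fields[0], fields[2]

    def expand(rank: str) -> str:
        # Turn a FEN rank such as "1NBQKBNR" into 8 characters, "." = empty.
        return "".join("." * int(c) if c.isdigit() else c for c in rank)

    ranks = board.split("/")
    rank8, rank1 = expand(ranks[0]), expand(ranks[7])

    # A flag survives only if the king and the matching rook are at home.
    allowed = {
        "K": rank1[4] == "K" and rank1[7] == "R",
        "Q": rank1[4] == "K" and rank1[0] == "R",
        "k": rank8[4] == "k" and rank8[7] == "r",
        "q": rank8[4] == "k" and rank8[0] == "r",
    }
    kept = "".join(f for f in "KQkq" if f in castling and allowed[f])
    fields[2] = kept or "-"
    return " ".join(fields)


# First position from the faulty rook odds epd quoted earlier in the thread.
bad = "rnbqkb1r/ppp1pppp/3p4/3nP3/3P4/8/PPP2PPP/1NBQKBNR w KQkq - 0 4; v=-526"
print(fix_castling(bad))
```

For that position the a1 rook is missing, so the `Q` flag is dropped and the castling field becomes `Kkq`; everything after the castling field is passed through unchanged.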
Ferdy
Posts: 4833
Joined: Sun Aug 10, 2008 3:15 pm
Location: Philippines

Post by Ferdy »

Rebel wrote: Wed Sep 22, 2021 7:15 am Does somebody have a good rook odds epd of (at least) 100 positions?
Just in case you still need more, or somebody else needs it, I uploaded a rook odds epd of around 280K positions. It was built from a regular start epd file by randomly removing either the white queen's rook or the white king's rook and then rebuilding the epd.

Tested on cutechess-cli with different SF versions.

Code:

Score of first vs second: 0 - 505 - 0  [0.000] 505
...      first playing White: 0 - 505 - 0  [0.000] 505
...      White vs Black: 0 - 505 - 0  [0.000] 505
Elo difference: -inf +/- nan, LOS: 0.0 %, DrawRatio: 0.0 %
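
The recipe described above (remove one white rook at random, then rebuild the epd) can be sketched in Python as follows. This is only an illustration, not Ferdy's actual script: `make_rook_odds` is a hypothetical name, and the sketch assumes standard chess positions in which White's rooks, if present, still stand on a1 and h1.

```python
import random


def make_rook_odds(epd: str, rng: random.Random) -> str:
    """Remove White's a1 or h1 rook (picked at random) and drop its castling flag."""
    fields = epd.split(" ")
    ranks = fields[0].split("/")

    # Expand the first rank to 8 squares so a square can be edited by index.
    rank1 = list("".join("." * int(c) if c.isdigit() else c for c in ranks[7]))

    # Candidate rooks: (file index, castling flag that depends on that rook).
    candidates = [(i, flag) for i, flag in ((0, "Q"), (7, "K")) if rank1[i] == "R"]
    if not candidates:
        return epd  # no home rook to remove; leave the position untouched
    index, flag = rng.choice(candidates)
    rank1[index] = "."

    # Re-compress runs of empty squares back into digits.
    packed, empty = "", 0
    for square in rank1:
        if square == ".":
            empty += 1
        else:
            packed += (str(empty) if empty else "") + square
            empty = 0
    ranks[7] = packed + (str(empty) if empty else "")

    fields[0] = "/".join(ranks)
    fields[2] = fields[2].replace(flag, "") or "-"
    return " ".join(fields)


start = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1"
print(make_rook_odds(start, random.Random(1)))
```

Dropping the castling flag together with the rook is what keeps the rebuilt epd acceptable to cutechess-cli; passing in a seeded `random.Random` makes the choice of rook reproducible.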
AdminX
Posts: 6339
Joined: Mon Mar 13, 2006 2:34 pm
Location: Acworth, GA

Post by AdminX »

Thanks Ferdy!
"Good decisions come from experience, and experience comes from bad decisions."
__________________________________________________________________
Ted Summers
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Post by Rebel »

ROOK ODDS

Stockfish 14

Code:

ROOK odds match Stockfish 14 vs a pool of 2700-2730 elo rated engines
Time Control : Time control : 40/40
Games        : 700

Results from file all.pgn:

No. Name          Win Draw Loss Unf.  Score Games       %
---------------------------------------------------------
  1 Stockfish 14 +109  =45 -546   *0  131.5   700   18.8%
  2 k2 099        +84   =5  -11   *0   86.5   100   86.5%
  3 Benjamin 1.0  +82   =5  -13   *0   84.5   100   84.5%
  4 ProDeo 2.2    +84   =1  -15   *0   84.5   100   84.5%
  5 Dumb 1.8      +80   =8  -12   *0   84.0   100   84.0%
  6 Zahak 5.0     +76   =7  -17   *0   79.5   100   79.5%
  7 Velvet 1.2.0  +73  =11  -16   *0   78.5   100   78.5%
  8 Fruit 2.1     +67   =8  -25   *0   71.0   100   71.0%

Total Games:     700
White Wins:      109 (15.6%)
Black Wins:      546 (78.0%)
Draws:            45 (6.4%)
Unfinished:        0 (0.0%)

Estimated ratings for this elo 2715 pool

   # PLAYER          :  RATING  POINTS  PLAYED   (%)
   1 k2 099          :  2811.5    86.5     100    87
   2 Benjamin 1.0    :  2783.2    84.5     100    85
   3 ProDeo 2.2      :  2783.2    84.5     100    85
   4 Dumb 1.8        :  2776.6    84.0     100    84
   5 Zahak 5.0       :  2723.5    79.5     100    80
   6 Velvet 1.2.0    :  2713.0    78.5     100    79
   7 Fruit 2.1       :  2642.9    71.0     100    71
   8 Stockfish 14    :  2486.0   131.5     700    19
Komodo-Dragon 2

Code:

ROOK odds match Komodo Dragon 2 vs a pool of 2700-2730 elo rated engines
Time Control : Time control : 40/40
Games        : 700

Results from file all.pgn:

No. Name             Win Draw Loss Unf.  Score Games       %
------------------------------------------------------------
  1 Komodo-Dragon 2 +154  =49 -497   *0  178.5   700   25.5%
  2 Benjamin 1.0     +78   =9  -13   *0   82.5   100   82.5%
  3 Dumb 1.8         +76  =10  -14   *0   81.0   100   81.0%
  4 ProDeo 2.2       +74   =8  -18   *0   78.0   100   78.0%
  5 k2 099           +76   =4  -20   *0   78.0   100   78.0%
  6 Zahak 5.0        +73   =6  -21   *0   76.0   100   76.0%
  7 Velvet 1.2.0     +66   =4  -30   *0   68.0   100   68.0%
  8 Fruit 2.1        +54   =8  -38   *0   58.0   100   58.0%

Total Games:     700
White Wins:      154 (22.0%)
Black Wins:      497 (71.0%)
Draws:            49 (7.0%)
Unfinished:        0 (0.0%)

Estimated ratings for this elo 2715 pool

   # PLAYER             :  RATING  POINTS  PLAYED   (%)
   1 Benjamin 1.0       :  2816.7    82.5     100    83
   2 Dumb 1.8           :  2799.1    81.0     100    81
   3 k2 099             :  2766.8    78.0     100    78
   4 ProDeo 2.2         :  2766.8    78.0     100    78
   5 Zahak 5.0          :  2747.0    76.0     100    76
   6 Velvet 1.2.0       :  2677.1    68.0     100    68
   7 Fruit 2.1          :  2601.6    58.0     100    58
   8 Komodo-Dragon 2    :  2545.0   178.5     700    26
Stockfish : 18.8%
Komodo : 25.5%


Once again Komodo had the better survival instinct.
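
To put those percentages on a rating scale, the standard logistic Elo formula can be applied to the raw scores, as in the Python sketch below. This is a back-of-the-envelope check and not the method behind the "Estimated ratings" tables above; the tool used for those is not named in the thread, and `elo_diff` is a name chosen here for illustration.

```python
import math


def elo_diff(points: float, games: int) -> float:
    """Rating gap implied by a score, using the logistic Elo model."""
    p = points / games
    if p <= 0.0:
        return -math.inf  # no points at all: the gap is unbounded
    if p >= 1.0:
        return math.inf
    return -400.0 * math.log10(1.0 / p - 1.0)


# Rook odds scores from the tables above (points out of 700 games).
for name, points in (("Stockfish 14", 131.5), ("Komodo-Dragon 2", 178.5)):
    print(f"{name}: {elo_diff(points, 700):+.0f} elo vs the pool average")
```

This gives roughly -254 for Stockfish 14 and -186 for Komodo-Dragon 2 relative to the pool. Subtracting those gaps from the 2715 pool average does not reproduce the table ratings (2486.0 and 2545.0) exactly, presumably because the rating tool fits all players jointly. A 0% score, as in the queen odds match for Stockfish, maps to minus infinity, which is why cutechess-cli reports "-inf" for such results.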

All games at - http://rebel13.nl/odds-2700.zip

-----------------------------

Next, the whole circus again with 2500 engines.
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Post by lkaufman »

Really strange that Stockfish scored higher giving rook odds than giving bishop odds, though still much worse than Komodo (which, as one would expect, scored much higher giving bishop odds than rook odds). I wonder why this should be? I think it is related to how the programs evaluate the initial positions: Stockfish doesn't think rook odds is so bad, relatively speaking. But I don't see why misevaluating the position should make the results agree with the misevaluation.
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Post by Rebel »

I remember Richard Lang's programs had a special option: when in a lost position, move into survival mode, wildly attack the king, and put everything on a passed pawn. Maybe SF has something similar.

Meanwhile, knight odds with 2500 elo rated engines has started; the difference from the 2700 engines is amazing.

http://rebel13.nl/a/grl.htm
Chessqueen
Posts: 5577
Joined: Wed Sep 05, 2018 2:16 am
Location: Moving
Full name: Jorge Picado

Post by Chessqueen »

lkaufman wrote: Wed Sep 22, 2021 7:31 am
At queen odds, if you keep the same field, I don't think that either Dragon or Stockfish will get more than a few draws in 700 games, maybe not even that. Probably you need engines about a thousand elo lower than these for a reasonably close match at queen odds even at this bullet tc.
To get an even score with Dragon 2 at queen odds, I tested more than 1000 games at an average of 1 second per move against this field:

462 Casper rev4 64-bit 1579 +24 −24 50.2% −2.6 23.9% 624
60.5%
463 PolarChess 1.3 1574 +25 −25 49.3% +3.5 16.9% 629
56.3%
464‑465 Darky 0.5d 1571 +24 −24 43.5% +56.2 20.8% 677
49.7%
464‑465 Storm 0.6 1571 +21 −21 38.3% +95.3 15.0% 925
74.3%
466 Damas 9 1560 +25 −25 47.8% +17.1 17.6% 626
76.1%
467 IQ23.003 1547 +21 −21 33.6% +118.6 32.6% 854
72.6%
468 Cicada 0.1 64-bit 1536 +24 −25 44.1% +48.6 19.7% 636
65.6%


To get an even score with Dragon 2 at rook odds, I tested more than 1000 games at an average of 1 second per move against this field:

357 Kurt 0.9.2.2 64-bit 2166 +21 −21 47.7% +18.0 24.2% 828
56.9%
358 ProChess 1.02AD 2164 +19 −19 48.4% +6.3 22.1% 1043
51.8%
359 Chesley r323 64-bit 2163 +22 −22 48.1% +14.4 20.8% 804
51.3%
360 Micah 1.0 64-bit 2162 +25 −25 44.0% +45.9 25.4% 566
73.2%
361 KnockOut 0.7.1 2153 +15 −15 51.1% −8.0 26.6% 1574
81.4%