New engine releases 2024 ... 6+3

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

27. Stockfish 20240121 NN dev (replaces Obisidan 10.0 NN)
28. Midnight 9 NN ... still running (replaces Counter 5.5 NN)

If nothing is to do the updates from Nov. 2023 follow

29. Alexandria 5.1.0 NN ... soon (replaced Avalanche 2.1.0 NN)
30. Devre 5.0 NN ... soon
31. Fritz 19 (Gingko) NN ... soon

Code: Select all

                Name                 Games     Win     Draw     Lose       Pts         S-B         %    wins-55m  lost-55m  AV-wins  AV-draws  AV-all

01. Stockfish 20240121 NN dev     :  1000  :  586+  :  414=  :    0-  :   793.0  : 375070.00  :  79.30%     68        0        75       86       80
02. RubiChess 20240112 NN         :  1000  :  422+  :  561=  :   17-  :   702.5  : 329513.50  :  70.25%     49        0        79       83       82
03. Caissa 1.16 NN                :  1000  :  405+  :  575=  :   20-  :   692.5  : 323483.50  :  69.25%     27        0        86       80       82
04. Seer 2.8.0 NN                 :  1000  :  367+  :  594=  :   39-  :   664.0  : 309011.50  :  66.40%     23        0        86       82       84
05. Clover 6.1 NN                 :  1000  :  352+  :  607=  :   41-  :   655.5  : 304864.50  :  65.55%     25        2        87      100       95
06. Arasan 24.1 NN                :  1000  :  237+  :  646=  :  117-  :   560.0  : 260917.75  :  56.00%     29        3        84       87       86
07. Stormphrax 4.0.0 NN           :  1000  :  239+  :  577=  :  184-  :   527.5  : 239363.50  :  52.75%      3       22        91      109      100
08. Starzix 4.0 NN                :  1000  :  183+  :  646=  :  171-  :   506.0  : 233814.00  :  50.60%      6       13        89       94       91
09. Minic 3.40 NN                 :  1000  :  161+  :  660=  :  179-  :   491.0  : 228275.75  :  49.10%      3       17        99       92       91
10. Akimbo 0.8.0 NN               :  1000  :  165+  :  642=  :  193-  :   486.0  : 223509.75  :  48.60%      6       19        95       92       91
-----------------------------------------------------------------------------------------------------------------------------------------------------
11. Pawn 3.0 NN                   :  1000  :  157+  :  653=  :  190-  :   483.5  : 223551.75  :  48.35%      4       11        83       89       87
12. Velvet 6.0.0 NN               :  1000  :  172+  :  622=  :  206-  :   483.0  : 222455.75  :  48.30%     22        0        75       89       88
13. Altair 6.0.0 NN               :  1000  :  142+  :  604=  :  254-  :   444.0  : 202615.25  :  44.40%      1       37        97       91       88
14. Wasp 6.63 NN dev              :  1000  :  136+  :  614=  :  250-  :   443.0  : 203824.75  :  44.30%     18        6        82       81       83
15. Texel 1.11 NN                 :  1000  :  138+  :  605=  :  257-  :   440.5  : 202681.25  :  44.05%     22        2        77       91       89
16. Wasp 6.50 NN                  :  1000  :  130+  :  617=  :  253-  :   438.5  : 199997.25  :  43.85%     28        1        78       82       84
17. Lizard 10.1 NN                :  1000  :  128+  :  618=  :  254-  :   437.0  : 201350.50  :  43.70%      8       22        87       97       92
18. Renegade 1.0.0 NN             :  1000  :   85+  :  557=  :  358-  :   363.5  : 166528.50  :  36.35%      2       44        90      101       93
19. Clarity 4.1.0 NN              :  1000  :   63+  :  577=  :  360-  :   351.5  : 163771.50  :  35.15%      0       30        95       85       85
20. Avalanche 2.1.0 NN            :  1000  :   45+  :  459=  :  496-  :   274.5  : 126864.75  :  27.45%      0       76        92       90       84
-----------------------------------------------------------------------------------------------------------------------------------------------------
21. Counter 5.5 NN                :  1000  :   43+  :  440=  :  517-  :   263.0  : 122936.50  :  26.30%      3       40        85      104       95


White Wins =  2.878 ( 27.41% )
Draws      =  6.144 ( 58.51% )
Black Wins =  1.478 ( 14.08% )
Average    = 175.54 ( 87,77 moves )

Code: Select all

   # Player                       :      Elo  Games  Score%  won  draw  lost  Points  Draw%  Error   OppAvg   OppE   OppD
   1 Stockfish 20240121 NN dev    :  3588.80   1000    79.3  586   414     0   793.0   41.4  16.54  3334.55  12.44   20.0
   2 RubiChess 20240112 NN        :  3503.24   1000    70.3  422   561    17   702.5   56.1  13.42  3338.83  12.60   20.0
   3 Caissa 1.16 NN               :  3494.69   1000    69.3  405   575    20   692.5   57.5  13.66  3339.26  12.59   20.0
   4 Seer 2.8.0 NN                :  3470.97   1000    66.4  367   594    39   664.0   59.4  13.06  3340.44  12.62   20.0
   5 Clover 6.1 NN                :  3464.07   1000    65.5  352   607    41   655.5   60.7  12.58  3340.79  12.64   20.0
   6 Arasan 24.1 NN               :  3390.17   1000    56.0  237   646   117   560.0   64.6  12.14  3344.48  12.66   20.0
   7 Stormphrax 4.0.0 NN          :  3365.96   1000    52.8  239   577   184   527.5   57.7  12.12  3345.69  12.66   20.0
   8 Starzix 4.0 NN               :  3350.04   1000    50.6  183   646   171   506.0   64.6  11.91  3346.49  12.67   20.0
   9 Minic 3.40 NN                :  3338.96   1000    49.1  161   660   179   491.0   66.0  11.67  3347.04  12.69   20.0
  10 Akimbo 0.8.0 NN              :  3335.26   1000    48.6  165   642   193   486.0   64.2  11.44  3347.23  12.70   20.0
  11 Pawn 3.0 NN                  :  3333.41   1000    48.4  157   653   190   483.5   65.3  12.11  3347.32  12.66   20.0
  12 Velvet 6.0.0 NN              :  3333.04   1000    48.3  172   622   206   483.0   62.2  11.93  3347.34  12.67   20.0
  13 Altair 6.0.0 NN              :  3304.11   1000    44.4  142   604   254   444.0   60.4  11.81  3348.78  12.68   20.0
  14 Wasp 6.63 NN dev             :  3303.36   1000    44.3  136   614   250   443.0   61.4  11.92  3348.82  12.67   20.0
  15 Texel 1.11 NN                :  3301.49   1000    44.0  138   605   257   440.5   60.5  11.53  3348.91  12.69   20.0
  16 Wasp 6.50 NN                 :  3300.00   1000    43.9  130   617   253   438.5   61.7  11.83  3348.99  12.68   20.0
  17 Lizard 10.1 NN               :  3298.88   1000    43.7  128   618   254   437.0   61.8  11.82  3349.05  12.68   20.0
  18 Renegade 1.0.0 NN            :  3242.73   1000    36.4   85   557   358   363.5   55.7  12.49  3351.85  12.65   20.0
  19 Clarity 4.1.0 NN             :  3233.25   1000    35.1   63   577   360   351.5   57.7  12.71  3352.33  12.63   20.0
  20 Avalanche 2.1.0 NN           :  3168.84   1000    27.4   45   459   496   274.5   45.9  14.30  3355.55  12.56   20.0
  21 Counter 5.5 NN               :  3158.51   1000    26.3   43   440   517   263.0   44.0  14.38  3356.06  12.55   20.0

White advantage = 54.99 +/- 2.09
Draw rate (equal opponents) = 75.73 % +/- 0.62
Stockfish in all stats on rank 1.
Move-average wins with 75 with so many short wins is just fantastic.
Same move-average Velvet 6.0.0 NN have.

OK, I wonder about the move-average for draws if I compare the results with SF16 and my FCP-Tourney-2024 with 40 moves in 20 minutes.

Best
Frank

Ah, here the first results from Midnight 9 NN ...

Code: Select all

Midnight 9 NN - Wasp 6.50 NN                14.0	-	10.0		58.33%		
Midnight 9 NN - Wasp 6.63 NN dev            14.5	-	10.5		58.00%		
Midnight 9 NN - Akimbo 0.8.0 NN              9.0	-	14.0		39.13%		
Midnight 9 NN - Clarity 4.1.0 NN            14.5	-	 9.5		60.42%		
Midnight 9 NN - Caissa 1.16 NN               4.5	-	19.5		18.75%		
Midnight 9 NN - Pawn 3.0 NN                 11.5	-	12.5		47.92%		
Midnight 9 NN - Texel 1.11 NN                9.5	-	14.5		39.58%		
Midnight 9 NN - RubiChess 20240112 NN        2.5	-	21.5		10.42%		
Midnight 9 NN - Avalanche 2.1.0 NN          14.0	-	10.0		58.33%		
Midnight 9 NN - Lizard 10.1 NN              13.5	-	10.5		56.25%		
Midnight 9 NN - Minic 3.40 NN               10.0	-	14.0		41.67%		
Midnight 9 NN - Renegade 1.0.0 NN           13.0	-	11.0		54.17%		
Midnight 9 NN - Arasan 24.1 NN              10.5	-	13.5		43.75%		
Midnight 9 NN - Seer 2.8.0 NN                7.5	-	16.5		31.25%		
Midnight 9 NN - Velvet 6.0.0 NN              9.5	-	14.5		39.58%		
Midnight 9 NN - Stormphrax 4.0.0 NN         11.5	-	12.5		47.92%		
Midnight 9 NN - Starzix 4.0 NN               7.0	-	17.0		29.17%		
Midnight 9 NN - Clover 6.1 NN                7.0	-	17.0		29.17%		
Midnight 9 NN - Altair 6.0.0 NN             11.0	-	12.0		47.83%		
Midnight 9 NN - Stockfish 20240121 NN dev    3.5	-	20.5		14.58%
	
198.0	-	281.0		41.34%		
479 out of 1000 games played
Level: 4 Minutes/Game + 2 Seconds/Move
Should be around 3280 Elo!
The final results are tomorrow available!

Best
Frank
Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

28. Midnight 9 NN ... ready (replaces Counter 5.5 NN)
29. Alexandria 5.1.0 NN ... still running (replaces Avalanche 2.1.0 NN)
30. Fritz 19 NN (Gingko) ... soon (replaces Wasp 6.63 NN dev)
31. Devre 5.0 NN

Code: Select all

                Name                 Games     Win     Draw     Lose       Pts         S-B         %    wins-55m  lost-55m  AV-wins  AV-draws  AV-all

01. Stockfish 20240121 NN dev     :  1000  :  579+  :  421=  :    0-  :   789.5  : 374447.75  :  78.95%     62         0       75       86       80
02. RubiChess 20240112 NN         :  1000  :  417+  :  565=  :   18-  :   699.5  : 329439.00  :  69.95%     44         0       79       84       82
03. Caissa 1.16 NN                :  1000  :  399+  :  580=  :   21-  :   689.0  : 323484.50  :  68.90%     27         0       85       79       81
04. Seer 2.8.0 NN                 :  1000  :  357+  :  605=  :   38-  :   659.5  : 308336.25  :  65.95%     22         0       86       82       83
05. Clover 6.1 NN                 :  1000  :  336+  :  623=  :   41-  :   647.5  : 303277.00  :  64.75%     25         2       86       99       94
06. Arasan 24.1 NN                :  1000  :  229+  :  651=  :  120-  :   554.5  : 259960.75  :  55.45%     30         3       83       85       85
07. Stormphrax 4.0.0 NN           :  1000  :  222+  :  594=  :  184-  :   519.0  : 237884.50  :  51.90%      2        22       90      108       99
08. Starzix 4.0 NN                :  1000  :  178+  :  651=  :  171-  :   503.5  : 234252.25  :  50.35%      4        13       91       93       91
09. Minic 3.40 NN                 :  1000  :  150+  :  666=  :  184-  :   483.0  : 226808.00  :  48.30%      2        18       97       91       90
10. Akimbo 0.8.0 NN               :  1000  :  160+  :  645=  :  195-  :   482.5  : 223287.00  :  48.25%      5        19       95       92       91
-----------------------------------------------------------------------------------------------------------------------------------------------------
11. Pawn 3.0 NN                   :  1000  :  151+  :  656=  :  193-  :   479.0  : 222928.25  :  47.90%      5        10       83       87       86
12. Velvet 6.0.0 NN               :  1000  :  162+  :  632=  :  206-  :   478.0  : 221928.50  :  47.80%     23         0       73       88       88
13. Wasp 6.63 NN dev              :  1000  :  128+  :  616=  :  256-  :   436.0  : 202601.00  :  43.60%     17         6       82       80       83
14. Altair 6.0.0 NN               :  1000  :  122+  :  624=  :  254-  :   434.0  : 200776.25  :  43.40%      1        38       95       91       88
15. Texel 1.11 NN                 :  1000  :  125+  :  616=  :  259-  :   433.0  : 201751.25  :  43.30%     21         2       78       92       89
16. Lizard 10.1 NN                :  1000  :  118+  :  620=  :  262-  :   428.0  : 199720.75  :  42.80%      9        24       84       97       91
17. Wasp 6.50 NN                  :  1000  :  114+  :  626=  :  260-  :   427.0  : 197356.50  :  42.70%     22         1       78       82       84
18. Midnight 9 NN                 :  1000  :   88+  :  606=  :  306-  :   391.0  : 182514.00  :  39.10%      6        19       88       86       86
19. Renegade 1.0.0 NN             :  1000  :   69+  :  567=  :  364-  :   352.5  : 164537.50  :  35.25%      2        44       88      101       93
20. Clarity 4.1.0 NN              :  1000  :   52+  :  584=  :  364-  :   344.0  : 162322.25  :  34.40%      0        30       96       84       84
-----------------------------------------------------------------------------------------------------------------------------------------------------
21. Avalanche 2.1.0 NN            :  1000  :   34+  :  472=  :  494-  :   270.0  : 126667.75  :  27.00%      0        76       92       88       83


White Wins =  2.844 ( 27.09% )
Draws      =  6.310 ( 60.10% )
Black Wins =  1.346 ( 12.82% )
Average    = 173.86 ( 86,93 moves )

Code: Select all

   # Player                       :      Elo  Games  Score%  won  draw  lost  Points  Draw%  Error   OppAvg   OppE   OppD
   1 Stockfish 20240121 NN dev    :  3592.18   1000    79.0  579   421     0   789.5   42.1  15.92  3343.39  12.25   20.0
   2 RubiChess 20240112 NN        :  3508.03   1000    70.0  417   565    18   699.5   56.5  13.50  3347.60  12.37   20.0
   3 Caissa 1.16 NN               :  3499.13   1000    68.9  399   580    21   689.0   58.0  13.80  3348.04  12.36   20.0
   4 Seer 2.8.0 NN                :  3474.83   1000    66.0  357   605    38   659.5   60.5  13.08  3349.26  12.39   20.0
   5 Clover 6.1 NN                :  3465.21   1000    64.8  336   623    41   647.5   62.3  13.17  3349.74  12.39   20.0
   6 Arasan 24.1 NN               :  3394.03   1000    55.5  229   651   120   554.5   65.1  11.79  3353.30  12.46   20.0
   7 Stormphrax 4.0.0 NN          :  3367.80   1000    51.9  222   594   184   519.0   59.4  11.59  3354.61  12.47   20.0
   8 Starzix 4.0 NN               :  3356.41   1000    50.4  178   651   171   503.5   65.1  12.09  3355.18  12.44   20.0
   9 Minic 3.40 NN                :  3341.36   1000    48.3  150   666   184   483.0   66.6  11.88  3355.93  12.45   20.0
  10 Akimbo 0.8.0 NN              :  3340.99   1000    48.3  160   645   195   482.5   64.5  11.68  3355.95  12.46   20.0
  11 Pawn 3.0 NN                  :  3338.42   1000    47.9  151   656   193   479.0   65.6  11.63  3356.08  12.47   20.0
  12 Velvet 6.0.0 NN              :  3337.69   1000    47.8  162   632   206   478.0   63.2  11.87  3356.11  12.45   20.0
  13 Wasp 6.63 NN dev             :  3306.70   1000    43.6  128   616   256   436.0   61.6  11.84  3357.66  12.46   20.0
  14 Altair 6.0.0 NN              :  3305.21   1000    43.4  122   624   254   434.0   62.4  11.51  3357.74  12.47   20.0
  15 Texel 1.11 NN                :  3304.47   1000    43.3  125   616   259   433.0   61.6  11.85  3357.77  12.46   20.0
  16 Lizard 10.1 NN               :  3300.75   1000    42.8  118   620   262   428.0   62.0  11.76  3357.96  12.46   20.0
  17 Wasp 6.50 NN                 :  3300.00   1000    42.7  114   626   260   427.0   62.6  11.61  3358.00  12.47   20.0
  18 Midnight 9 NN                :  3272.89   1000    39.1   88   606   306   391.0   60.6  11.51  3359.35  12.47   20.0
  19 Renegade 1.0.0 NN            :  3243.09   1000    35.3   69   567   364   352.5   56.7  12.19  3360.84  12.44   20.0
  20 Clarity 4.1.0 NN             :  3236.36   1000    34.4   52   584   364   344.0   58.4  12.78  3361.18  12.41   20.0
  21 Avalanche 2.1.0 NN           :  3174.41   1000    27.0   34   472   494   270.0   47.2  13.92  3364.28  12.35   20.0

White advantage = 57.90 +/- 2.10
Draw rate (equal opponents) = 76.56 % +/- 0.61
Wow, Midnight 9 NN with very strong results.
Great improvments!!

Best
Frank
Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

Hi there,

I overworked the log file.

Very slowly my little blitz tournament is getting better and more interesting.
Not many new releases at the moment, so I have a chance to add more of the "older" releases.

Note: Highest priority the new releases have.
It need a time to add all of the stronger older releases.

Max. I can add 41 different engines in this tournament.

At the moment the move average for the tournament is great!
The draw rate is also good!

After the last engine I added, Velvet is again number 1 with a move average of wins ... 73 moves.

Here is the log file and the rules again!

But all can be downloaded at
https://www.amateurschach.de/fling/index.html

Best
Frank

Code: Select all

-----------------------------------------------------------------------------------------------------------------------------

System Wasp-1 = Intel® Core™ i9-10900k = 10 cores, 4.4Ghz overclocked, game in 4 minutes + 2 seconds
System Wasp-2 = Intel® Core™ i7-1185g7 =  4 cores, 3.0Ghz overclocked, game in 6 minutes + 3 seconds, rather rarely used

-----------------------------------------------------------------------------------------------------------------------------

Start point November 2023 - continuously
Games available: https://www.amateurschach.de/fling/index.html
January 25th, 2024 = v28

-----------------------------------------------------------------------------------------------------------------------------

// todo

33. Uralochka 3.40a NN                                     executable = November 03rd, 2023   Test on Wasp-1 = soon
    Ivan Maklyakov, RUS
    https://gitlab.com/freemanzlat/uralochka3/-/releases

32. Berserk 12.1 NN                                        executable = November 12th, 2023   Test on Wasp-1 = soon
    Jay Honnold, USA
    https://github.com/jhonnold/berserk/releases

31. Devre 5.0 NN                                           executable = November 12th, 2023   Test on Wasp-1 = soon
    Ömer Faruk Tutkun, TUR
    https://github.com/OmerFarukTutkun/Devre

30. Fritz 19 NN (Gingko)                                   executable = September 11th, 2023  Test on Wasp-1 = soon
    Fritz 19 NN (Gingko) replaced Wasp 6.63 NN dev         released   = November 21st, 2023
    Frank Schneider, GER
    https://shop.chessbase.com/en/products/fritz_19

29. Alexandria 5.1.0 NN                                    executable = November 26th, 2023   Test on Wasp-1 = still running
    Alexandria 5.1.0 NN replaces Avalanche 2.1.0 NN
    PGG and Contributors
    https://github.com/PGG106/Alexandria

-----------------------------------------------------------------------------------------------------------------------------

// ready

28. Midnight 9 NN                                          executable = January 22nd, 2024    Test on Wasp-1
    Midnight 9 NN replaces Counter 5.5 NN
    Archishmaan Peyyet, USA
    https://github.com/archishou/MidnightChessEngine

27. Stockfish 20240121 NN dev                              executable = January 21st, 2024    Test on Wasp-1
    the Stockfish developers
    Stockfish 24012111 NN dev replaces Obsidian 10.0 NN, have a look under 21.
    https://abrok.eu/stockfish/
    https://github.com/official-stockfish/Stockfish/releases
    https://stockfishchess.org/

    Note:
    Newer engine releases should be stronger as 3200 Elo.
    I changed that from 3150 to 3200 Elo.
    Counter 5.5 NN and Avalanche 2.10 NN will be replaced.

26. Altair 6.0.0 NN                                        executable = December 05th, 2023    Test on Wasp-1
    Alexander Tian, CHN
    https://github.com/Alex2262/AltairChessEngine

25. Clover 6.1 NN                                          executable = December 12th, 2023    Test on Wasp-1
    Luca-Mihnea Metehau, ROM
    https://github.com/lucametehau/CloverEngine

23. Starzix 4.0 NN                                         executable = January 20th, 2024     Test on Wasp-1
(2) Starzix 4.0 NN replaces Starzix 3.0 NN
    Ricardo Pinto, POR
    https://github.com/zzzzz151/Starzix

    Note:
    A newer version 4.0 NN is available                    executable = January 22nd, 2024     // no test

22. Stormphrax 4.0.0 NN                                    executable = December 17th, 2023    Test on Wasp-1
    Conor Anstey, GBR
    https://github.com/Ciekce/Stormphrax

20. Velvet 6.0.0 NN                                        executable = December 21st, 2023    Test on Wasp-1
    Martin Honert, GER
    https://github.com/mhonert/velvet-chess/

16. Seer 2.8.0 NN                                          executable = December 31st, 2023    Test on Wasp-1
    Connor McMonigle, USA
    https://github.com/connormcmonigle/seer-nnue/

15. Arasan 24.1 NN                                         executable = January 14th, 2024     Test on Wasp-1
    Jon Dart, USA
    https://www.arasanchess.org/index.shtml
    https://github.com/jdart1/arasan-chess

    - without *.rc file

14. Renegade 1.0.0 NN                                      executable = January 13th, 2024     Test on Wasp-1
    Krisztián Peőcz, HUN
    https://github.com/pkrisz99/Renegade/

13. Minic 3.40 NN                                          executable = January 14th, 2024     Test on Wasp-1
    Vivien CLAUZON, FRA
    https://github.com/tryingsomestuff/Minic

    Note:
    A special version 3.41 NN is available                 released   = January 17th, 2024     // no test

12. Lizard 10.1 NN                                         released   = January 13th, 2024     Test on Wasp-1
(2) Lizard 10.1 NN replaces Lizard 10.0 NN
    Liam McGuire, USA
    https://github.com/liamt19/Lizard

09. RubiChess 20240112 NN                                  executable = January 12th, 2024     Test on Wasp 1
    Andreas Matthies, GER
    https://github.com/Matthies/RubiChess/

08. Texel 1.11 NN                                          executable = January 12th, 2024     Test on Wasp-1
    Peter Österlund, SWE
    https://github.com/peterosterlund2/texel

07. Pawn 3.0 NN                                            executable = January 12th, 2024     Test on Wasp-1
    Rui Coelho, POR
    https://github.com/ruicoelhopedro/pawn

06. Counter 5.5 NN                                         released   = January 12th, 2024     Test on Wasp-1
    Vadim Chizhov, RUS
    https://github.com/ChizhovVadim/CounterGo/

05. Caissa 1.16 NN                                         executable = January 11th, 2024     Test on Wasp-2
    Michal Witanowski, POL
    https://github.com/Witek902/Caissa

04. Clarity 4.1.0 NN                                       executable = January 07th, 2024     Test on Wasp-2
    Joseph Pasfield, USA
    https://github.com/Vast342/Clarity

01. Akimbo 0.8.0 NN                                        executable = January 02nd, 2024     Test on Wasp-2
    Jamie Whiting, GBR
    https://github.com/jw1912/akimbo

A)  Wasp 6.50 NN                                           released   = February 28th, 2023    Test on Wasp-2
    John Stanback, USA                                                                         Reference = 3300 Elo
    http://www.waspchess.com/

-----------------------------------------------------------------------------------------------------------------------------

// watching, bugs, replaces, updated

24. RukChess 3.0.18 NN                                     executable = December 17th, 2023    no test for the moment
    Ilya Rukavishnikov, RUS
    https://github.com/Ilya-Ruk/RukChess

21. Obsidian 10.0 NN                                       released   = January 17th, 2024     Test on Wasp=1
    Gabriele Lombardo, ITA  
    https://github.com/gab8192/Obsidian

    Engine support syzygy first time and lost around 28 points / 56 draw games!

    - does not give mate with KR vs. K ... games ended with draw, 50 moves-rule
    - does not give mate with KNB vs. K ... games ended with draw, 50 moves-rule
    - does not give mate with KQ vs. K ... games ended with draw, 50-moves-rule
    - does not give mate with KQ vs. KR ... games ended with draw, 50-moves-rule
    - does not give mate in some other endgame constellations!

    Maybe an update will be available?
    If not, I will stop the test very soon and replaced the engine.

19. Mida 2.3 NN                                            executable = December 27th, 2023    no test for the moment
    Giacomo Porpiglia, ITA
    https://github.com/GiacomoPorpiglia/Mida

18. Drofa 4.1.0 NN                                         executable = December 24th, 2023    no test for the moment
    Alexander Litov & Rhys Rustad-Elliott, RUS & CAN
    https://github.com/justNo4b/Drofa

17. Peacekeeper 2.20 NN                                    executable = December 24th, 2023    no test for the moment
    Kyle Zhang, USA
    https://github.com/Sazgr/peacekeeper

    Test not possible!
    - does not want to checkmate. Winning games runs often over 150 moves
    - during the games the Shredder GUI does nothing and the matches hangs

11. Avalanche 2.1.0 NN                                     executable = January 13th, 2024     Test on Wasp 2
    Yinuo Huang, CHN
    https://github.com/SnowballSH/Avalanche

    - during the games the Shredder GUI does nothing and the matches hangs (4 times)

10. Kuma 1.2 NN                                            released   = January 12th, 2024     no test for the moment
    Kato Daichi, JPN
    https://github.com/kato-daichi/kuma

03. Lizard 10.0 NN                                         released   = January 04th, 2024     Test on Wasp-2
(1) Liam McGuire, USA
    https://github.com/liamt19/Lizard

02. Starzix 3.0 NN                                         executable = January 03rd, 2024     Test on Wasp-2
(1) Ricardo Pinto, POR
    https://github.com/zzzzz151/Starzix

B)  Wasp 6.63 NN dev                                       executable = December 02nd, 2024    Test on Wasp-2
    John Stanback, USA
    http://www.waspchess.com/



Frank Quisinsky, Germany / Gutweiler

Code: Select all

Conditions:

GUI:              Shredder 12 by Stefan Meyer-Kahlen

System Wasp-1:    Intel® Core™ i9-10900k = 10 cores, 4.4Ghz overclocked
Time control:     4 Minutes/Game + 2 Seconds/Move
Operating system: Windows 10

System Wasp-2:    Intel® Core™ i7-1185g7 =  4 cores, 3.0Ghz overclocked
Time control:     6 Minutes/Game + 3 Seconds/Move
Operating system: Windows 11
                  Dell Ultrabook is rather rarely used

Elo:              Wasp 6.50 NN is playing with ~3300 Elo (reference)
Tournament type:  Everyone against everyone, 50 games per match
                  Max. 41 engines = 2.000 games per engine
                  If more as 41 engine = A new engine replaces the last place.
                  An engine should reach approx. 3200 Elo

Endgame 5-man:    syzygy
Opening book:     feobos-6m-v2.1.bkt for Shredder GUI, based on 3.698 lines (balanced)
Cores:            1 core, Hyperthreading = off, Large-Pages = off
Hashtables:       128Mb
Contempt:         Default engine settings changed to contempt = 0 (if possible)
Other settings:   resign = off, ponder = off, learning = off

Replay games:     Every draw less than 20 moves



Frank Quisinsky, Germany / Gutweiler
Krzysztof Grzelak
Posts: 1584
Joined: Tue Jul 15, 2014 12:47 pm

Re: New engine releases 2024 ... 6+3

Post by Krzysztof Grzelak »

Hi Frank,
I have a question for you. It possible to watch the tournament live without interruption on your website?
Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

Hi Krzysztof,

I will not open a second replay zone for the blitz-tournament.
The 66+6 tournament is live, every 10 minutes the finished games go into the replay zone.

But what I can do for the blitz-tournament is ...
Add a live mode for the results of the still running round-robin, if you like.
At the moment Alexandria 5.1.0 NN is still running.
So you can always see the current results for this round-robin and the next and future round-robin I will start.

So, at the end of the round robin you can download all the files with games in *.pgn.

Today I added the *.txt files from the Blitz tournament to my replay zone from the 66+6 tournament (at the end of the site).
https://www.amateurschach.de/fling/index.html

Best
Frank

Give me a short hint if you wish a live-mode for the results.
I am thinking that should be enough!
Krzysztof Grzelak
Posts: 1584
Joined: Tue Jul 15, 2014 12:47 pm

Re: New engine releases 2024 ... 6+3

Post by Krzysztof Grzelak »

Hi Frank,
Thank you for your answer.I thought about watching it like at TCEC.
Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

Hi Krzysztof,

ah, OK I understand ...
Move-by-move in live mode isn't possible with Shredder GUI.
I can do it for one game only and have to start the GUI after the game again.
Not for 16 still running games possible.

Best
Frank
Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

https://www.amateurschach.de/fling/index.html

In my Replay-Zone can be found now a link to current tested engine.
Furthermore, all other links to that tournament.

At the moment: Alexandria 5.1.0 NN
Results updated every 5 minutes.

Best
Frank
Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

30. Fritz 19 NN (Gingko) ... still running (replaces Wasp 6.63 NN dev)

All the links to my two tournaments (66+6 / 4+2):
https://www.amateurschach.de/fling/index.html

Or direct the results in live-mode, updated every 5 minutes!
https://www.amateurschach.de/fling/test-30.html

31. Devre 5.0 NN ... no test (I believe not over 3200 Elo)
32. Berserk 12.1 NN ... very soon
33. Fire 9.2 NN ... soon
34. Uralochka 3.40a NN ... soon
35. Dragon 3.3 NN (Komodo) ... soon

All that are releases from November 23, Dragon is from October 23.
If nothing to do I add from time to time the "older releases".

Here the results from Alexandria 5.1.0 NN ...

Code: Select all

                Name                 Games     Win     Draw     Lose       Pts         S-B         %    wins-55m  lost-55m  AV-wins  AV-draws  AV-all

01. Stockfish 20240121 NN dev     :  1000  :  567+  :  433=  :    0-  :   783.5  : 372800.00  :  78.35%     53        0        76       86       81
02. RubiChess 20240112 NN         :  1000  :  394+  :  585=  :   21-  :   686.5  : 324698.50  :  68.65%     36        0        81       84       83
03. Caissa 1.16 NN                :  1000  :  370+  :  608=  :   22-  :   674.0  : 317822.50  :  67.40%     24        0        86       80       82
04. Seer 2.8.0 NN                 :  1000  :  323+  :  637=  :   40-  :   641.5  : 301949.75  :  64.15%     18        0        87       83       84
05. Clover 6.1 NN                 :  1000  :  312+  :  641=  :   47-  :   632.5  : 298101.50  :  63.25%     20        2        88      100       95
06. Alexandria 5.1.0 NN           :  1000  :  268+  :  656=  :   76-  :   596.0  : 280041.50  :  59.60%      9        6        89       95       93
07. Arasan 24.1 NN                :  1000  :  200+  :  668=  :  132-  :   534.0  : 251983.00  :  53.40%     22        4        84       87       86
08. Stormphrax 4.0.0 NN           :  1000  :  198+  :  605=  :  197-  :   500.5  : 231024.00  :  50.05%      2       22        90      107       99
09. Starzix 4.0 NN                :  1000  :  157+  :  659=  :  184-  :   486.5  : 227933.25  :  48.65%      4       13        91       92       90
10. Minic 3.40 NN                 :  1000  :  129+  :  680=  :  191-  :   469.0  : 222557.25  :  46.90%      1       20        98       92       90
-----------------------------------------------------------------------------------------------------------------------------------------------------
11. Akimbo 0.8.0 NN               :  1000  :  136+  :  656=  :  208-  :   464.0  : 217472.00  :  46.40%      4       19        97       92       91
12. Pawn 3.0 NN                   :  1000  :  130+  :  661=  :  209-  :   460.5  : 216137.25  :  46.05%      3       13        84       88       87
13. Velvet 6.0.0 NN               :  1000  :  139+  :  638=  :  223-  :   458.0  : 213958.75  :  45.80%     20        0        74       88       89
14. Texel 1.11 NN                 :  1000  :  100+  :  638=  :  262-  :   419.0  : 198285.50  :  41.90%     14        2        80       93       91
15. Altair 6.0.0 NN               :  1000  :  102+  :  633=  :  265-  :   418.5  : 196303.00  :  41.85%      1       38        96       91       88
16. Wasp 6.63 NN dev              :  1000  :  108+  :  618=  :  274-  :   417.0  : 195869.00  :  41.70%     12        7        83       80       83
17. Lizard 10.1 NN                :  1000  :  101+  :  616=  :  283-  :   409.0  : 192155.00  :  40.90%      5       25        86       97       91
18. Wasp 6.50 NN                  :  1000  :   89+  :  633=  :  278-  :   405.5  : 190044.75  :  40.55%     14        1        80       82       85
19. Midnight 9 NN                 :  1000  :   79+  :  593=  :  328-  :   375.5  : 176473.50  :  37.55%      4       19        89       89       87
20. Renegade 1.0.0 NN             :  1000  :   64+  :  555=  :  381-  :   341.5  : 160520.50  :  34.15%      1       44        90      101       93
-----------------------------------------------------------------------------------------------------------------------------------------------------
21. Clarity 4.1.0 NN              :  1000  :   40+  :  575=  :  385-  :   327.5  : 157042.50  :  32.75%      0       32        92       84       84


White Wins =  2.769 ( 26.37% )
Draws      =  6.494 ( 61.85% )
Black Wins =  1.237 ( 11.78% )
Average    = 175.78 ( 87,89 moves )

Code: Select all

   # Player                       :      Elo  Games  Score%  won  draw  lost  Points  Draw%  Error   OppAvg   OppE   OppD
   1 Stockfish 20240121 NN dev    :  3599.74   1000    78.3  567   433     0   783.5   43.3  15.69  3359.38  11.99   20.0
   2 RubiChess 20240112 NN        :  3511.41   1000    68.7  394   585    21   686.5   58.5  12.88  3363.79  12.14   20.0
   3 Caissa 1.16 NN               :  3501.12   1000    67.4  370   608    22   674.0   60.8  13.46  3364.31  12.11   20.0
   4 Seer 2.8.0 NN                :  3475.11   1000    64.2  323   637    40   641.5   63.7  12.09  3365.61  12.17   20.0
   5 Clover 6.1 NN                :  3468.08   1000    63.3  312   641    47   632.5   64.1  12.49  3365.96  12.15   20.0
   6 Alexandria 5.1.0 NN          :  3440.14   1000    59.6  268   656    76   596.0   65.6  12.10  3367.36  12.17   20.0
   7 Arasan 24.1 NN               :  3394.18   1000    53.4  200   668   132   534.0   66.8  11.30  3369.65  12.21   20.0
   8 Stormphrax 4.0.0 NN          :  3369.74   1000    50.0  198   605   197   500.5   60.5  11.71  3370.88  12.19   20.0
   9 Starzix 4.0 NN               :  3359.55   1000    48.6  157   659   184   486.5   65.9  11.31  3371.38  12.21   20.0
  10 Minic 3.40 NN                :  3346.80   1000    46.9  129   680   191   469.0   68.0  11.09  3372.02  12.22   20.0
  11 Akimbo 0.8.0 NN              :  3343.15   1000    46.4  136   656   208   464.0   65.6  11.45  3372.20  12.21   20.0
  12 Pawn 3.0 NN                  :  3340.59   1000    46.0  130   661   209   460.5   66.1  11.44  3372.33  12.21   20.0
  13 Velvet 6.0.0 NN              :  3338.76   1000    45.8  139   638   223   458.0   63.8  11.27  3372.42  12.22   20.0
  14 Texel 1.11 NN                :  3310.06   1000    41.9  100   638   262   419.0   63.8  11.88  3373.86  12.19   20.0
  15 Altair 6.0.0 NN              :  3309.69   1000    41.9  102   633   265   418.5   63.3  11.85  3373.88  12.19   20.0
  16 Wasp 6.63 NN dev             :  3308.57   1000    41.7  108   618   274   417.0   61.8  11.92  3373.93  12.18   20.0
  17 Lizard 10.1 NN               :  3302.62   1000    40.9  101   616   283   409.0   61.6  12.00  3374.23  12.18   20.0
  18 Wasp 6.50 NN                 :  3300.00   1000    40.5   89   633   278   405.5   63.3  11.53  3374.36  12.20   20.0
  19 Midnight 9 NN                :  3277.32   1000    37.5   79   593   328   375.5   59.3  12.49  3375.50  12.15   20.0
  20 Renegade 1.0.0 NN            :  3250.89   1000    34.1   64   555   381   341.5   55.5  12.65  3376.82  12.15   20.0
  21 Clarity 4.1.0 NN             :  3239.72   1000    32.8   40   575   385   327.5   57.5  13.00  3377.38  12.13   20.0

White advantage = 58.25 +/- 2.06
Draw rate (equal opponents) = 77.50 % +/- 0.57
Best
Frank
Frank Quisinsky
Posts: 6888
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: New engine releases 2024 ... 6+3

Post by Frank Quisinsky »

Hi there,

I will add max. 41 engines in the tournament = 2.000 games for each engine.
If there are more than 41 engines, a newer version will replace the last rank.
At the moment I keep all engines with more than 3200 Elo in the list.

In the last days I replaced 4 engines (two not stronger than 3200 Elo, one dev and one engine with a bug).
The results for all the others are now clearer.

During a test run I replayed the draws under 20 moves.
The opening book isn't optimised for this, but the 3,698 lines are interesting for creating a test set.
I need more results on those 3,698 lines and the tournament on the new releases helps a bit.

Interesting is my replay zone?!
Here you can find the statistics of both tournaments (66+6 / 4+2).
It is really fun to replay the 66+6 games (2.5 hours for one game) from time to time for engines you like.

:-)

Yesterday I made some updates in my Replay-Zone, I try to make it better.

Replay-Zone (Start):
https://www.amateurschach.de/fling/index.html

New for the 4+2 tournament is:
Results in live mode with Elo calculation for the running engine. What we have been doing in engine testing for about 15 years. In the past the computer chess people liked it. A bit old-school testing I think with the UCI-Mother GUI ... Shredder!

If you like to create EAS stats with the 4+2 game database with the tools by Stefan ...
The games are sorted with Hiarcs-Chess-Explorer 1.2.1 and can be found in the download file (at the moment over 20Mb). This blitz-tournament is very young ... a youngster.

Best
Frank