Not read the readme from Dragon (honestly).
With personality "aggr" NN is not working.
Thanks to Peter Martan.
I changed from Dragon 3.3 NN (aggr.) to the same I am using for FCP-Tourney-2024 ...
Dragon 3.3 (Komodo) with Contempt = 0
Thanks Peter!
Best
Frank
FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Moderator: Ras
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Hi there,
two updates:
Seer 2.8.0 NN replaced Seer 2.7.0 NN
Andscacs 0.95 replaced Andscacs 0.95.123 (I forgot that the last release version has a better move-average).
No more engine updates possible!
Tourney can run now for a while.
I created a result-site, more or less an overview!
https://www.amateurschach.de/main/_ma.htm
Best
Frank
two updates:
Seer 2.8.0 NN replaced Seer 2.7.0 NN
Andscacs 0.95 replaced Andscacs 0.95.123 (I forgot that the last release version has a better move-average).
No more engine updates possible!
Tourney can run now for a while.
I created a result-site, more or less an overview!
https://www.amateurschach.de/main/_ma.htm
Best
Frank
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Hi there,
Round 01 landed: 1. Dragon, 2. Seer, 3. RubiChess ... 44. Fizbo
Move-average = 89,5
draws = 60,89
1. https://www.amateurschach.de/fling/index.html ... cockpit / replay-zone
2. https://www.amateurschach.de/main/_ma.htm ... flight plan /overview to results
Passengers (engines), prepare for takeoff ...
Estimated arrival time for round 02 = January 09th, 2024
Wind = 35-70km/h, temperature = 9,5, Air humidity = 83%
Satisfaction of the tournament organisers: 70% ... go Wasp v6.63 go, the big brother v6.50 was better in round-01.
Best
Frank
Round 01 landed: 1. Dragon, 2. Seer, 3. RubiChess ... 44. Fizbo
Move-average = 89,5
draws = 60,89
1. https://www.amateurschach.de/fling/index.html ... cockpit / replay-zone
2. https://www.amateurschach.de/main/_ma.htm ... flight plan /overview to results
Passengers (engines), prepare for takeoff ...
Estimated arrival time for round 02 = January 09th, 2024
Wind = 35-70km/h, temperature = 9,5, Air humidity = 83%
Satisfaction of the tournament organisers: 70% ... go Wasp v6.63 go, the big brother v6.50 was better in round-01.
Best
Frank
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Hi there,
50 first dates ... results from date 02:
1. Dragon, 2. CSTal, 3. RubiChess ... 44. Andscacs
Move-average = 88,55 (I await 87-88)
Draw-quote = 60,78% (around 2% higher as I await)
Now round 03 is still running ...
Wind = 15km/h, temperature = -3,6, Air humidity = 57%
Satisfaction of the tournament organisers: 75,75% ... go Wasp v6.63 go, the big brother v6.50 only with a very light advantage.
Best
Frank
50 first dates ... results from date 02:
1. Dragon, 2. CSTal, 3. RubiChess ... 44. Andscacs
Move-average = 88,55 (I await 87-88)
Draw-quote = 60,78% (around 2% higher as I await)
Now round 03 is still running ...
Wind = 15km/h, temperature = -3,6, Air humidity = 57%
Satisfaction of the tournament organisers: 75,75% ... go Wasp v6.63 go, the big brother v6.50 only with a very light advantage.
Best
Frank
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Hi there,
50 first dates ... results from date 03:
1. Dragon, 2. Seer, 3. Uralochka ... 44. Xiphos
Move-average = 88,66
Draw-quote = 61,10%
Now round 04 is still running ...
Wind = 20km/h, temperature = -1,6, Air humidity = 66%
Satisfaction of the tournament organisers: 80,00% ... go Wasp 6.63 go ... but it is very hard to beat the strong release version 6.50 with a longer time-control. Stockfish 200731 dev lost only one game in round 03 ... vs. Wasp!
Very interesting to see ...
Stockfish 16 NN is around 5-10 Elo stronger with the time-control 40/20 as Dragon 3.3 NN.
So, the different from Stockfish 200731 dev (without NN) to Stockfish 16 NN with the time-control 66+6 + 6-man is around 100 Elo, not 150, not 200, not 300 or 400 (what I read here often) ... the different is 100 Elo. And with longer time-controls as 66+6 the difference continues to fall. But let us wait of more games. I can test later in detail if SF17 is available, means I can start a round-robin vs. this group of 44 engines.
Again the links:
Best start point is my Replay-Zone:
https://www.amateurschach.de/fling/index.html
The new result-page:
https://www.amateurschach.de/main/_ma.htm
Best
Frank
50 first dates ... results from date 03:
1. Dragon, 2. Seer, 3. Uralochka ... 44. Xiphos
Move-average = 88,66
Draw-quote = 61,10%
Now round 04 is still running ...
Wind = 20km/h, temperature = -1,6, Air humidity = 66%
Satisfaction of the tournament organisers: 80,00% ... go Wasp 6.63 go ... but it is very hard to beat the strong release version 6.50 with a longer time-control. Stockfish 200731 dev lost only one game in round 03 ... vs. Wasp!
Very interesting to see ...
Stockfish 16 NN is around 5-10 Elo stronger with the time-control 40/20 as Dragon 3.3 NN.
So, the different from Stockfish 200731 dev (without NN) to Stockfish 16 NN with the time-control 66+6 + 6-man is around 100 Elo, not 150, not 200, not 300 or 400 (what I read here often) ... the different is 100 Elo. And with longer time-controls as 66+6 the difference continues to fall. But let us wait of more games. I can test later in detail if SF17 is available, means I can start a round-robin vs. this group of 44 engines.
Again the links:
Best start point is my Replay-Zone:
https://www.amateurschach.de/fling/index.html
The new result-page:
https://www.amateurschach.de/main/_ma.htm
Best
Frank
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Hi there,
results from round 04:
1. Dragon, 2. CSTal, 3. Caissa ... 44. Shredder
Move-average = 88,61
Draw-quote = 60,41
Now round 05 is still running ...
Wind = 5km/h, temperature = -3,9, Air humidity = 70%
Satisfaction of the tournament organisers: 65,00% ... the fun factor is clearly higher with a game in x minutes per game + x seconds per move. But I don't like the time control because it seems that different engines have not a good time-management. With such a long time control you can see better than with a blitz time control. So in the end ... I am more a fan of x moves in x minutes.
Stockfish 200731 dev (without NN) is playing a strong round again. After 172 games only 6 lost games. The difference to Dragon 3.3 NN with such a long time control is at the moment with Ordo calculation only 96 Elo. If available I will add Stockfish 17 NN later. I am more and more sure that the Neural-Network advantage is not more than 120-130 Elo with such a long time control.
It seems that the current Seer 2.8.0 NN version get a booster for longer time-controls ... must be observed.
Replay-Zone in live mode, games, link to the current tournament-table and so on ...
https://www.amateurschach.de/fling/index.html
Best
Frank
results from round 04:
1. Dragon, 2. CSTal, 3. Caissa ... 44. Shredder
Move-average = 88,61
Draw-quote = 60,41
Now round 05 is still running ...
Wind = 5km/h, temperature = -3,9, Air humidity = 70%
Satisfaction of the tournament organisers: 65,00% ... the fun factor is clearly higher with a game in x minutes per game + x seconds per move. But I don't like the time control because it seems that different engines have not a good time-management. With such a long time control you can see better than with a blitz time control. So in the end ... I am more a fan of x moves in x minutes.
Stockfish 200731 dev (without NN) is playing a strong round again. After 172 games only 6 lost games. The difference to Dragon 3.3 NN with such a long time control is at the moment with Ordo calculation only 96 Elo. If available I will add Stockfish 17 NN later. I am more and more sure that the Neural-Network advantage is not more than 120-130 Elo with such a long time control.
It seems that the current Seer 2.8.0 NN version get a booster for longer time-controls ... must be observed.
Code: Select all
FCP-Tourney-2024-MA after round 04
03.784 games
# Player : Elo Games Score% won draw lost Points Draw% Error OppAvg OppE OppD
1 Dragon 3.3 NN (Komodo) : 3492.49 172 73.5 81 91 0 126.5 52.9 32.84 3293.71 29.31 43.0
2 Seer 2.8.0 NN : 3470.87 172 71.2 76 93 3 122.5 54.1 32.72 3294.22 29.31 43.0
3 CSTal 2.00 NN : 3465.61 172 70.6 72 99 1 121.5 57.6 32.17 3294.34 29.32 43.0
4 RubiChess 20230918 NN : 3447.58 172 68.6 64 108 0 118.0 62.8 31.48 3294.76 29.34 43.0
5 Caissa 1.15 NN : 3442.53 172 68.0 63 108 1 117.0 62.8 29.49 3294.88 29.39 43.0
6 Uralochka 3.40a NN : 3427.63 172 66.3 60 108 4 114.0 62.8 29.46 3295.22 29.39 43.0
7 Revenge 3.0 NN : 3417.87 172 65.1 58 108 6 112.0 62.8 28.66 3295.45 29.41 43.0
7 Clover 6.1 NN : 3417.87 172 65.1 56 112 4 112.0 65.1 29.46 3295.45 29.39 43.0
9 Igel 3.5.0 NN : 3415.45 172 64.8 52 119 1 111.5 69.2 29.59 3295.50 29.38 43.0
10 Rebel EAS NN : 3413.04 172 64.5 54 114 4 111.0 66.3 29.88 3295.56 29.38 43.0
10 Alexandria 5.1.0 NN : 3413.04 172 64.5 57 108 7 111.0 62.8 29.44 3295.56 29.39 43.0
12 Arasan 24.0 NN : 3398.73 172 62.8 48 120 4 108.0 69.8 29.38 3295.89 29.39 43.0
13 Stockfish 200731 dev : 3396.37 172 62.5 49 117 6 107.5 68.0 28.25 3295.95 29.42 43.0
14 SlowChess Blitz 2.9 NN : 3380.00 172 60.5 44 120 8 104.0 69.8 28.94 3296.33 29.40 43.0
15 Carp 3.0.1 NN : 3370.76 172 59.3 43 118 11 102.0 68.6 28.01 3296.54 29.42 43.0
16 Fritz 19 NN (Gingko) : 3354.75 172 57.3 45 107 20 98.5 62.2 28.58 3296.92 29.41 43.0
16 Minic 3.39 NN : 3354.75 172 57.3 41 115 16 98.5 66.9 27.77 3296.92 29.43 43.0
18 Fire 9.2 NN : 3350.21 172 56.7 34 127 11 97.5 73.8 26.95 3297.02 29.45 43.0
18 Altair 6.0.0 NN : 3350.21 172 56.7 42 111 19 97.5 64.5 27.49 3297.02 29.43 43.0
20 Velvet 6.0.0 NN : 3327.65 172 53.8 37 111 24 92.5 64.5 27.05 3297.55 29.44 43.0
21 Wasp 6.50 NN : 3323.16 172 53.2 33 117 22 91.5 68.0 26.60 3297.65 29.45 43.0
22 Wasp 6.63 NN dev : 3318.68 172 52.6 36 109 27 90.5 63.4 27.44 3297.76 29.43 43.0
23 Nemorino 6.11 NN dev : 3300.79 172 50.3 34 105 33 86.5 61.0 26.47 3298.17 29.46 43.0
24 BlackCore 6.0 NN : 3291.85 172 49.1 29 111 32 84.5 64.5 26.67 3298.38 29.45 43.0
25 Texel 1.10 NN : 3287.37 172 48.5 32 103 37 83.5 59.9 27.14 3298.48 29.44 43.0
26 Devre 4.0 NN : 3285.14 172 48.3 22 122 28 83.0 70.9 27.91 3298.54 29.42 43.0
27 Chess.cpp 4.0 NN : 3267.19 172 45.9 30 98 44 79.0 57.0 27.38 3298.95 29.44 43.0
28 Pawn 2.0 NN : 3255.92 172 44.5 21 111 40 76.5 64.5 28.02 3299.21 29.42 43.0
29 Marvin 6.2.0 NN : 3249.13 172 43.6 13 124 35 75.0 72.1 27.23 3299.37 29.44 43.0
30 Tucano 11.00.1 NN : 3242.31 172 42.7 16 115 41 73.5 66.9 27.52 3299.53 29.43 43.0
31 Hakkapeliitta 3.0 NNSV : 3200.61 172 37.5 12 105 55 64.5 61.0 30.47 3300.50 29.36 43.0
32 Mantissa 3.7.2 NN : 3179.05 172 34.9 12 96 64 60.0 55.8 29.25 3301.00 29.39 43.0
33 Midnight 8 NN : 3176.61 172 34.6 15 89 68 59.5 51.7 29.21 3301.06 29.39 43.0
34 Booot 6.4 : 3171.72 172 34.0 13 91 68 58.5 52.9 29.63 3301.17 29.38 43.0
35 Xiphos 0.6 : 3166.79 172 33.4 14 87 71 57.5 50.6 30.68 3301.29 29.36 43.0
35 DanaSah 9.1 NN : 3166.79 172 33.4 12 91 69 57.5 52.9 30.69 3301.29 29.36 43.0
37 Winter 2.0 NN : 3156.83 172 32.3 7 97 68 55.5 56.4 31.02 3301.52 29.35 43.0
38 Nalwald 18 NN : 3154.31 172 32.0 7 96 69 55.0 55.8 30.84 3301.58 29.36 43.0
39 Andscacs 0.95 : 3144.14 172 30.8 10 86 76 53.0 50.0 31.74 3301.81 29.33 43.0
40 Shredder 13 : 3141.57 172 30.5 13 79 80 52.5 45.9 29.91 3301.87 29.38 43.0
40 Laser 1.7 : 3141.57 172 30.5 11 83 78 52.5 48.3 32.07 3301.87 29.33 43.0
42 Chiron 5.01 : 3138.98 172 30.2 12 80 80 52.0 46.5 32.25 3301.93 29.32 43.0
43 Hiarcs 15.2 (aggr.) : 3136.39 172 29.9 11 81 80 51.5 47.1 31.54 3301.99 29.34 43.0
44 Fizbo 2.0 NN : 3117.85 172 27.9 7 82 83 48.0 47.7 33.83 3302.43 29.29 43.0
White advantage = 53.34 +/- 3.36
Draw rate (equal opponents) = 83.21 % +/- 1.11
https://www.amateurschach.de/fling/index.html
Best
Frank
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Hi there
first time I looked at the move average stats after round 04:
It looks much better than in my 40/20 tournament.
I am adding engines with very different styles to the tournament.
Interesting is the "superstar" from FCP-Tourney-2020 ... Booot 6.4.
Booot 6.4 can beat the move average for wins of Velvet and Texel!
The style of Booot 7.2 NN is very boring for me, longest move-average in my 40/20 tournament and a completely different program with neural network to classical eval. These statistics could not be more different.
All in all... the move average stats look normal.
Three engines will be disqualified, the first after round 6, the second after round 8 and the third after round 10.
In danger at the moment are Andscacs 0.95 and Clover 6.1 NN (engines with highest move average for wins and draws). End of the day the tournament ended with 41 engines.
Have a look at the move average for Revenge, SlowChess.
Here I made configuration mistakes in my 40/20 tournament, both are not playing with contempt=0. Now everything looks wonderful.
But most interesting in my opinion is Seer.
It seems that the engine gets a big boost in playing strength with longer time controls. Of course, not enough games for such statements.
The most fun I have is watching games:
When I have time, I watch and have seen a lot of nice games. Honestly, I have the most fun with the DanaSah games and I really like seeing Hakkapeliitta again. From Revenge I saw three of the fast wins live. Wasp is stronger in the endgames, can see more with the long time control.
Enough, it's a really nice tournament, but I often think that 30% of the engines have bad time management. Start playing Blitz with the +6 after about 60-70 moves. Blitz should be start 10-15 moves later.
Best
Frank
first time I looked at the move average stats after round 04:
It looks much better than in my 40/20 tournament.
I am adding engines with very different styles to the tournament.
Interesting is the "superstar" from FCP-Tourney-2020 ... Booot 6.4.
Booot 6.4 can beat the move average for wins of Velvet and Texel!
The style of Booot 7.2 NN is very boring for me, longest move-average in my 40/20 tournament and a completely different program with neural network to classical eval. These statistics could not be more different.
All in all... the move average stats look normal.
Three engines will be disqualified, the first after round 6, the second after round 8 and the third after round 10.
In danger at the moment are Andscacs 0.95 and Clover 6.1 NN (engines with highest move average for wins and draws). End of the day the tournament ended with 41 engines.
Have a look at the move average for Revenge, SlowChess.
Here I made configuration mistakes in my 40/20 tournament, both are not playing with contempt=0. Now everything looks wonderful.
But most interesting in my opinion is Seer.
It seems that the engine gets a big boost in playing strength with longer time controls. Of course, not enough games for such statements.
The most fun I have is watching games:
When I have time, I watch and have seen a lot of nice games. Honestly, I have the most fun with the DanaSah games and I really like seeing Hakkapeliitta again. From Revenge I saw three of the fast wins live. Wasp is stronger in the endgames, can see more with the long time control.
Enough, it's a really nice tournament, but I often think that 30% of the engines have bad time management. Start playing Blitz with the +6 after about 60-70 moves. Blitz should be start 10-15 moves later.
Best
Frank
Code: Select all
Name Games Win Draw Lose Pts S-B % wins-55m lost-55m AV-wins AV-draws AV-all
01. Dragon 3.3 NN (Komodo) : 172 : 81+ : 91= : 0- : 126.5 : 10481.00 : 73.55% 3 0 81 91 86
02. Seer 2.8.0 NN : 172 : 76+ : 93= : 3- : 122.5 : 9808.75 : 71.22% 2 0 89 81 85
03. CSTal 2.00 NN : 172 : 72+ : 99= : 1- : 121.5 : 9748.50 : 70.64% 1 0 89 78 82
04. RubiChess 20230918 NN : 172 : 64+ : 108= : 0- : 118.0 : 9582.00 : 68.60% 1 0 85 82 83
05. Caissa 1.15 NN : 172 : 63+ : 108= : 1- : 117.0 : 9509.25 : 68.02% 2 0 86 92 90
06. Uralochka 3.40a NN : 172 : 60+ : 108= : 4- : 114.0 : 9129.75 : 66.28% 4 0 85 95 91
07. Clover 6.1 NN : 172 : 56+ : 112= : 4- : 112.0 : 8941.00 : 65.12% 0 1 94 102 99
08. Revenge 3.0 NN : 172 : 58+ : 108= : 6- : 112.0 : 8906.50 : 65.12% 7 0 80 85 84
09. Igel 3.5.0 NN : 172 : 52+ : 119= : 1- : 111.5 : 9110.50 : 64.83% 5 0 88 80 82
10. Alexandria 5.1.0 NN : 172 : 57+ : 108= : 7- : 111.0 : 8911.25 : 64.53% 0 2 90 99 96
-----------------------------------------------------------------------------------------------------------------------------------------------------
11. Rebel EAS NN : 172 : 54+ : 114= : 4- : 111.0 : 8863.00 : 64.53% 2 0 82 80 81
12. Arasan 24.0 NN : 172 : 48+ : 120= : 4- : 108.0 : 8746.50 : 62.79% 1 0 92 98 96
13. Stockfish 200731 dev : 172 : 49+ : 117= : 6- : 107.5 : 8679.00 : 62.50% 4 0 79 95 91
14. SlowChess Blitz 2.9 NN : 172 : 44+ : 120= : 8- : 104.0 : 8421.25 : 60.47% 1 0 83 89 88
15. Carp 3.0.1 NN : 172 : 43+ : 118= : 11- : 102.0 : 8127.25 : 59.30% 3 0 83 90 88
16. Minic 3.39 NN : 172 : 41+ : 115= : 16- : 98.5 : 7716.50 : 57.27% 0 0 101 96 97
17. Fritz 19 NN (Gingko) : 172 : 45+ : 107= : 20- : 98.5 : 7698.75 : 57.27% 0 0 83 88 87
18. Fire 9.2 NN : 172 : 34+ : 127= : 11- : 97.5 : 7822.00 : 56.69% 0 0 84 89 89
19. Altair 6.0.0 NN : 172 : 42+ : 111= : 19- : 97.5 : 7669.00 : 56.69% 0 1 98 100 98
20. Velvet 6.0.0 NN : 172 : 37+ : 111= : 24- : 92.5 : 7283.50 : 53.78% 3 0 79 84 85
-----------------------------------------------------------------------------------------------------------------------------------------------------
21. Wasp 6.50 NN : 172 : 33+ : 117= : 22- : 91.5 : 7201.75 : 53.20% 1 0 89 76 82
22. Wasp 6.63 NN dev : 172 : 36+ : 109= : 27- : 90.5 : 7033.00 : 52.62% 2 0 92 74 82
23. Nemorino 6.11 NN dev : 172 : 34+ : 105= : 33- : 86.5 : 6778.50 : 50.29% 0 1 94 97 93
24. BlackCore 6.0 NN : 172 : 29+ : 111= : 32- : 84.5 : 6625.50 : 49.13% 0 3 100 95 93
25. Texel 1.10 NN : 172 : 32+ : 103= : 37- : 83.5 : 6400.50 : 48.55% 3 0 78 99 94
26. Devre 4.0 NN : 172 : 22+ : 122= : 28- : 83.0 : 6516.25 : 48.26% 0 0 89 95 94
27. Chess.cpp 4.0 NN : 172 : 30+ : 98= : 44- : 79.0 : 6008.75 : 45.93% 0 3 95 78 81
28. Pawn 2.0 NN : 172 : 21+ : 111= : 40- : 76.5 : 5972.25 : 44.48% 0 1 101 93 91
29. Marvin 6.2.0 NN : 172 : 13+ : 124= : 35- : 75.0 : 5968.00 : 43.60% 1 1 95 86 87
30. Tucano 11.00.1 NN : 172 : 16+ : 115= : 41- : 73.5 : 5741.00 : 42.73% 0 3 92 89 88
-----------------------------------------------------------------------------------------------------------------------------------------------------
31. Hakkapeliitta 3.0 NNSV : 172 : 12+ : 105= : 55- : 64.5 : 5172.00 : 37.50% 1 1 82 91 88
32. Mantissa 3.7.2 NN : 172 : 12+ : 96= : 64- : 60.0 : 4584.25 : 34.88% 0 2 97 80 82
33. Midnight 8 NN : 172 : 15+ : 89= : 68- : 59.5 : 4573.25 : 34.59% 0 9 88 94 91
34. Booot 6.4 : 172 : 13+ : 91= : 68- : 58.5 : 4564.75 : 34.01% 2 2 74 80 83
35. Xiphos 0.6 : 172 : 14+ : 87= : 71- : 57.5 : 4466.25 : 33.43% 1 0 93 98 92
36. DanaSah 9.1 NN : 172 : 12+ : 91= : 69- : 57.5 : 4302.00 : 33.43% 2 4 82 85 85
37. Winter 2.0 NN : 172 : 7+ : 97= : 68- : 55.5 : 4413.00 : 32.27% 1 1 87 89 90
38. Nalwald 18 NN : 172 : 7+ : 96= : 69- : 55.0 : 4256.50 : 31.98% 0 5 102 95 91
39. Andscacs 0.95 : 172 : 10+ : 86= : 76- : 53.0 : 4079.00 : 30.81% 1 1 96 104 98
40. Shredder 13 : 172 : 13+ : 79= : 80- : 52.5 : 3992.50 : 30.52% 0 6 95 95 91
-----------------------------------------------------------------------------------------------------------------------------------------------------
41. Laser 1.7 : 172 : 11+ : 83= : 78- : 52.5 : 3964.25 : 30.52% 0 0 77 95 91
42. Chiron 5.01 : 172 : 12+ : 80= : 80- : 52.0 : 3872.50 : 30.23% 0 4 90 91 88
43. Hiarcs 15.2 (aggr.) : 172 : 11+ : 81= : 80- : 51.5 : 3879.50 : 29.94% 0 2 86 92 93
44. Fizbo 2.0 NN : 172 : 7+ : 82= : 83- : 48.0 : 3783.00 : 27.91% 0 2 99 87 87
White Wins = 989 ( 26.14% )
Draws = 2.286 ( 60.41% )
Black Wins = 509 ( 13.45% )
Average = 177.22 ( 88,61 moves )
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
Hi there,
If I look in my blitz results of newer releases, Starzix and the last Akimbo are interesting for this tournament. That's right, two people asked for possible updates. Unfortunately, any changes here need too much time. I will not change anything. After round 10 I have three free places in the tournament table. I will use one of them for the next Stockfish release. Might be interesting to see how much more Elo SF can give with neural-network and longer time controls.
This tournament = state at the end of 2023 with engines I think the move average is low. In addition some of the older and in my opinion interesting engines with classic eval.
Interesting can be engines with different playing styles and a low move average to test other / new releases.
Example: 7 with around 3400 or higher, 7 with 3300 or higher, 7 with 3200 or higher for testing new releases with a test set of positions.
Example: Texel, Velvet and Wasp don't play the same way. Only with a quick view we can think it. Velvet is stronger than Texel and Wasp for attacking moves in the middle of the board, Texel has better statistics with white pieces than Wasp and Wasp has better statistics with black pieces than Texel. Furthermore is Wasp most aggressive with pawn moves, open the position.
Revenge, SlowChess and Uralochka are also attackers with different styles but stronger than Texel, Wasp and Velvet. Important are the programs where I can find no weaknesses ... the all-rounders ... like RubiChess.
The problem is ...
The attackers like open positions and often the pawn structures after attacking chess are not good. So many games are lost in the late middlegames. This group of engines must have weaknesses and they are often to be found in the earlier endgames.
To see ... wow, Wasp has so many quick wins is good for first time players or for people like me who work on opening analysis or like quick wins. But such engines often have weaknesses in endgames. Another good example is the older Spark or even Booot 6.4 / 6.5.
Important to have Caissa or Seer in the group, no attackers, but very strong in the late midgame and early endgame. What I like to write is, possible to find out a strong group of 21 Engines, can build the reference for testing all the other new releases with longer time controls.
With engines that like to produced "chewing gum draws" I lost too much time in testing with time controls like x moves in x minutes. So I also have to produce blitz games, because the information I am looking for I cannot read in rating list systems like CEGT or CCRL. Stefan Pohl tests with unbalanced positions and only the strongest.
Again, I will nothing change in that 66+6 tournament!
And again ... move-average is really important because much more games for interesting statistics can be produced with engine comes with a low move-average.
So if I have a test-group of engines (21 reference engines) all of the engines like to produce "No chewing-gums draws" it's not important what the new releases do if they have to play against the group. End of the day I lost not to many time for testing and can produced some interesting stats later. So the 21 reference engines must have a low move-average and must have different playing styles. Not easy to find out it.
Best
Frank
If I look in my blitz results of newer releases, Starzix and the last Akimbo are interesting for this tournament. That's right, two people asked for possible updates. Unfortunately, any changes here need too much time. I will not change anything. After round 10 I have three free places in the tournament table. I will use one of them for the next Stockfish release. Might be interesting to see how much more Elo SF can give with neural-network and longer time controls.
This tournament = state at the end of 2023 with engines I think the move average is low. In addition some of the older and in my opinion interesting engines with classic eval.
Interesting can be engines with different playing styles and a low move average to test other / new releases.
Example: 7 with around 3400 or higher, 7 with 3300 or higher, 7 with 3200 or higher for testing new releases with a test set of positions.
Example: Texel, Velvet and Wasp don't play the same way. Only with a quick view we can think it. Velvet is stronger than Texel and Wasp for attacking moves in the middle of the board, Texel has better statistics with white pieces than Wasp and Wasp has better statistics with black pieces than Texel. Furthermore is Wasp most aggressive with pawn moves, open the position.
Revenge, SlowChess and Uralochka are also attackers with different styles but stronger than Texel, Wasp and Velvet. Important are the programs where I can find no weaknesses ... the all-rounders ... like RubiChess.
The problem is ...
The attackers like open positions and often the pawn structures after attacking chess are not good. So many games are lost in the late middlegames. This group of engines must have weaknesses and they are often to be found in the earlier endgames.
To see ... wow, Wasp has so many quick wins is good for first time players or for people like me who work on opening analysis or like quick wins. But such engines often have weaknesses in endgames. Another good example is the older Spark or even Booot 6.4 / 6.5.
Important to have Caissa or Seer in the group, no attackers, but very strong in the late midgame and early endgame. What I like to write is, possible to find out a strong group of 21 Engines, can build the reference for testing all the other new releases with longer time controls.
With engines that like to produced "chewing gum draws" I lost too much time in testing with time controls like x moves in x minutes. So I also have to produce blitz games, because the information I am looking for I cannot read in rating list systems like CEGT or CCRL. Stefan Pohl tests with unbalanced positions and only the strongest.
Again, I will nothing change in that 66+6 tournament!
And again ... move-average is really important because much more games for interesting statistics can be produced with engine comes with a low move-average.
So if I have a test-group of engines (21 reference engines) all of the engines like to produce "No chewing-gums draws" it's not important what the new releases do if they have to play against the group. End of the day I lost not to many time for testing and can produced some interesting stats later. So the 21 reference engines must have a low move-average and must have different playing styles. Not easy to find out it.
Best
Frank
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
It is a completely new way of testing engines that I will be taking.
Not with my current tournaments, but what I want to do in the future.
To have a strong test set with 50 balanced positions (working on it, need more stats here).
To build a team of 21 engines that can test the others, new releases.
When new engine releases, produce games with very long move averages ... must not be bad. It is not my intention to give such information. Every engine is interesting to me. But I am very sure I can see more in the stats if I build a test team of engines (different styles, perfect move-average and so one) vs. all the new releases.
More and more powerful engines are available.
I have lost the overview for a while.
So I need a new plan to find out more about it, and that very quickly.
I've been doing this for too long and I've got tester's disease. I don't want to miss anything interesting.
So, I am working on it and have no problems to give the information why I do this and that.
Not with my current tournaments, but what I want to do in the future.
To have a strong test set with 50 balanced positions (working on it, need more stats here).
To build a team of 21 engines that can test the others, new releases.
When new engine releases, produce games with very long move averages ... must not be bad. It is not my intention to give such information. Every engine is interesting to me. But I am very sure I can see more in the stats if I build a test team of engines (different styles, perfect move-average and so one) vs. all the new releases.
More and more powerful engines are available.
I have lost the overview for a while.
So I need a new plan to find out more about it, and that very quickly.
I've been doing this for too long and I've got tester's disease. I don't want to miss anything interesting.
So, I am working on it and have no problems to give the information why I do this and that.
-
- Posts: 6888
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: FCP-Tourney-2024-MA ... game in 66 min. + 6 sec. / move
To the test-team ...
I am thinking on ... all with a good move-average and interesting and quiet different playing-styles. All are very danger not easy to beat ... OK DanaSah lose more games as the others fast. But the style is just fantastic.
Newer releases must have a difficult life.

01. RubiChess 20240112 NN
02. Seer 2.8.0 NN
03. Caissa 1.16 NN
04. Revenge 3.0 NN
05. Uralochka 3.40 NN
06. Igel 3.5.0 NN
07. Arasan 24.1 NN
08. SlowChess 2.9 NN
09. Fritz 19 NN (Gingko) NN ... first Fritz version with a really interesting style of play
10. Fire 9.2 NN
11. Starzix 4.0 NN
12. Wasp 6.50 NN
13. Velvet 6.0.0 NN
14. Texel 1.11 NN
15. Pawn 3.0 NN
16. Nemorino 6.11 NN
17. chess.cpp 4.0 NN ... like the engine a lot, very balanced style, often with interesting ideas in late midgames
18. Marvin 6.20 NN
19. DanaSah 9.1 NN
20. Hiarcs 15.2
21. Booot 6.4
---
Yes, not in the field are Rebel or CSTal or Dragon ...
For different reasons! Rebel or CSTal produced from time to time GUI hang-ups.
To add Dragon made no sense, must not have the strongest engines in the field.
---
Now a new Stormphrax 5.0.0 NN is out ...
And have to play vs. this 21 engines the same set of balanced opening positions = 2100 games
The results goes in a rating list without Elo. I am working only with points. Ratings are so boring!
Now a new Marvin 7.00 NN is out ...
So Marvin have to play vs. the same 21 engines, in one match vs. Marvin 6.20 NN.
The results goes in a rating list.
The 21 "Team-worker-engines" are not in the rating-list!
Used only as reference test-engines!
Later in the rating list I will add some interesting stats, not Elo.
I have here different ideas what I can do and working on it.
Thats the idea I had since sommer 2022 after I closed for some reasons my KI-Rating List.
Time control will be 40 in 8 + 2 seconds (time control I like most for testing engines).
I can start with it middle of the year.
I am not sure to 100% with the test-group ... but I am thinking exactly on this group of engines.
Furthermore, I need to different points more results and time for thinking about it.
Best
Frank
I am thinking on ... all with a good move-average and interesting and quiet different playing-styles. All are very danger not easy to beat ... OK DanaSah lose more games as the others fast. But the style is just fantastic.
Newer releases must have a difficult life.

01. RubiChess 20240112 NN
02. Seer 2.8.0 NN
03. Caissa 1.16 NN
04. Revenge 3.0 NN
05. Uralochka 3.40 NN
06. Igel 3.5.0 NN
07. Arasan 24.1 NN
08. SlowChess 2.9 NN
09. Fritz 19 NN (Gingko) NN ... first Fritz version with a really interesting style of play
10. Fire 9.2 NN
11. Starzix 4.0 NN
12. Wasp 6.50 NN
13. Velvet 6.0.0 NN
14. Texel 1.11 NN
15. Pawn 3.0 NN
16. Nemorino 6.11 NN
17. chess.cpp 4.0 NN ... like the engine a lot, very balanced style, often with interesting ideas in late midgames
18. Marvin 6.20 NN
19. DanaSah 9.1 NN
20. Hiarcs 15.2
21. Booot 6.4
---
Yes, not in the field are Rebel or CSTal or Dragon ...
For different reasons! Rebel or CSTal produced from time to time GUI hang-ups.
To add Dragon made no sense, must not have the strongest engines in the field.
---
Now a new Stormphrax 5.0.0 NN is out ...
And have to play vs. this 21 engines the same set of balanced opening positions = 2100 games
The results goes in a rating list without Elo. I am working only with points. Ratings are so boring!
Now a new Marvin 7.00 NN is out ...
So Marvin have to play vs. the same 21 engines, in one match vs. Marvin 6.20 NN.
The results goes in a rating list.
The 21 "Team-worker-engines" are not in the rating-list!
Used only as reference test-engines!
Later in the rating list I will add some interesting stats, not Elo.
I have here different ideas what I can do and working on it.
Thats the idea I had since sommer 2022 after I closed for some reasons my KI-Rating List.
Time control will be 40 in 8 + 2 seconds (time control I like most for testing engines).
I can start with it middle of the year.
I am not sure to 100% with the test-group ... but I am thinking exactly on this group of engines.
Furthermore, I need to different points more results and time for thinking about it.
Best
Frank