



Moderator: Ras
Code: Select all
Score of Leorik-2.2.8theta vs Leorik 2.2.7: 3018 - 2932 - 2424 [0.505] 8374
... Leorik-2.2.8theta playing White: 1711 - 1296 - 1181 [0.550] 4188
... Leorik-2.2.8theta playing Black: 1307 - 1636 - 1243 [0.461] 4186
... White vs Black: 3347 - 2603 - 2424 [0.544] 8374
Elo difference: 3.6 +/- 6.3, LOS: 86.8 %, DrawRatio: 28.9 %
Code: Select all
Rank EAS-Score sacs shorts draws moves Engine/player
-------------------------------------------------------------------
1 112519 13.71% 17.29% 13.33% 71 Leorik-2.2.8theta
2 62636 06.43% 12.54% 20.43% 76 Leorik 2.2.7
Code: Select all
Rank EAS-Score sacs shorts draws moves Engine/player
-------------------------------------------------------------------
1 169571 16.88% 30.78% 13.76% 68 Leorik-2.2.8theta
2 86431 09.35% 04.67% 10.68% 98 zahak-5.0
3 66035 08.41% 11.21% 16.96% 83 Inanis-1.1.1
4 51044 09.73% 06.19% 20.00% 85 odonata-0.6.2
5 48340 07.56% 11.76% 24.18% 82 dumb-1.9
6 42417 09.52% 12.38% 29.58% 87 Supernova-2.4
7 38761 04.76% 08.33% 22.68% 86 blunder-8.5.5
Given his verdict and that the version with this set of weights is stronger than my strongest build from the master branch I think I can declare this experiment a success. I will honor 2.2.8theta for being the first version that I, with tuning alone, couldn't find a clear improvement on and instead of publishing any of the contenders that are roughly in the same ballpark strenghtwise I will go and release Leorik 2.3 with the original 2.2.8theta weights.Bottom line is this Leorik gave me nothing but problems to solve. This Leorik is far more anti human than any of the others. The moves also looked like the moves a grandmaster would play. The difference is astonishing. Congrats!
I wasn't planning it at that time but now there is going to be one. And hopefully not the last update ever - but even if it should be: With the eval being completely derived from selfplay I feel like Leorik is a well rounded package now that I wouldn't have to feel regrets about "abandoning".
Isn't a PSQT per ECO position overkill? If I recognize a individual position I could just play the known best move. This is basically just an opening book. Or do you mean that when a game started with a certain opening the entire game is now played on a set of PSQTs optimized for that opening? That could work but sometimes an engine is asked to evaluate a position with no history of moves. In that case I wouldn't know which PSQT to pick. It would be nice to have some more generic way of describing a position on different axes (e.g. open-closed) and use interpolation again like with the 'phase' value that is used to interpolate between the midgame and endgame tables. (Tapered Eval)Mike Sherwin wrote: ↑Tue Dec 20, 2022 3:10 pm Even if you have satisfied all your goals with Leorik and decide to move on to something else there is one more capability I like to see added. The ability to recognise ECO positions and load piece square tables trained on that position. You might only include a relatively few early positions with Leorik but give the user the ability to add/train additional positions.![]()
I mean train starting from the most popular 2 ply positions like e4 e5, e4 c5, e4 c6 ect. In different starting pawn structures Leorik will do better if it has pstbl's optimised for them. A simple test would be to differentiate between only 1. e4 and 1. d4. That means only three tables, 1 generic like you have now, 2 e4 and 3 d4.lithander wrote: ↑Wed Dec 21, 2022 12:01 pmIsn't a PSQT per ECO position overkill? If I recognize a individual position I could just play the known best move. This is basically just an opening book. Or do you mean that when a game started with a certain opening the entire game is now played on a set of PSQTs optimized for that opening? That could work but sometimes an engine is asked to evaluate a position with no history of moves. In that case I wouldn't know which PSQT to pick. It would be nice to have some more generic way of describing a position on different axes (e.g. open-closed) and use interpolation again like with the 'phase' value that is used to interpolate between the midgame and endgame tables. (Tapered Eval)Mike Sherwin wrote: ↑Tue Dec 20, 2022 3:10 pm Even if you have satisfied all your goals with Leorik and decide to move on to something else there is one more capability I like to see added. The ability to recognise ECO positions and load piece square tables trained on that position. You might only include a relatively few early positions with Leorik but give the user the ability to add/train additional positions.![]()
Code: Select all
# PLAYER : RATING POINTS PLAYED (%)
1 Inanis-1.1.1 : 2767.0 323.5 620 52
2 odonata-0.6.2 : 2744.0 298.5 618 48
3 Leorik-2.3 : 2741.3 1960.5 3716 53
4 zahak-5.0 : 2730.0 295.5 620 48
5 dumb-1.9 : 2703.0 325.0 620 52
6 blunder-8.5.5 : 2700.0 255.0 620 41
7 Supernova-2.4 : 2687.0 258.0 618 42
Code: Select all
Rank EAS-Score sacs shorts draws moves Engine/player
-------------------------------------------------------------------
1 132596 15.02% 22.27% 13.46% 68 Leorik-2.3
2 57194 11.00% 04.00% 18.79% 85 zahak-5.0
3 52975 06.99% 05.38% 17.33% 83 odonata-0.6.2
4 50228 09.28% 13.50% 27.03% 76 dumb-1.9
5 45085 06.67% 10.00% 24.06% 84 Supernova-2.4
6 44044 04.19% 10.23% 22.95% 78 Inanis-1.1.1
7 34561 06.85% 09.59% 26.24% 81 blunder-8.5.5