2950 Elo?! That's much much stronger than all my tests indicated. The opponents were around 2800 Elo which is what the results auf the gauntlet I posted with the announcement above indicated. Also in selfplay 2.4 was only 70 Elo stronger than 2.3... I have no explanation! Let's see were it settles in the end.Modern Times wrote: ↑Sun Mar 26, 2023 9:01 am It is on the CCRL Blitz list already, just made the cut-off, however judging by the average opponent Elo it is much stronger than the tester anticipated! It will require more games against stronger opponents.
Devlog of Leorik
Moderator: Ras
-
- Posts: 915
- Joined: Sun Dec 27, 2020 2:40 am
- Location: Bremen, Germany
- Full name: Thomas Jahn
Re: Devlog of Leorik
-
- Posts: 72
- Joined: Sun Nov 01, 2020 5:27 pm
- Full name: Richard Porti
Re: Devlog of Leorik
I am curious, did you measure ELO differences based on the hardware Leorik runs on?lithander wrote: ↑Sun Mar 26, 2023 12:44 pm2950 Elo?! That's much much stronger than all my tests indicated. The opponents were around 2800 Elo which is what the results auf the gauntlet I posted with the announcement above indicated. Also in selfplay 2.4 was only 70 Elo stronger than 2.3... I have no explanation! Let's see were it settles in the end.Modern Times wrote: ↑Sun Mar 26, 2023 9:01 am It is on the CCRL Blitz list already, just made the cut-off, however judging by the average opponent Elo it is much stronger than the tester anticipated! It will require more games against stronger opponents.

-
- Posts: 915
- Joined: Sun Dec 27, 2020 2:40 am
- Location: Bremen, Germany
- Full name: Thomas Jahn
Re: Devlog of Leorik
Well, of course the engine is less strong on a Raspberry Pi than on my Ryzen 5900X where it searches 10x more nodes per second.

But when I set up a Gauntlet to estimate Elo all engines run on the same hardware. In my case that was an Intel 9700k with 8 cores. I run 7 games in parallel and the PC wasn’t used for anything else during that time. And Leorik 2.4 did not surprise me in any way. It basically confirmed the +70 Elo my selfplay matches between 2.4 and 2.3 predicted. Also I ran 15k Games to limit the influence of pure luck. I was really pretty confident about expecting ~2800… I wonder if there’s anything wrong with my methodology.
But to be honest +230 Elo sounds way to good to be true!

I can’t wait to see where it places after more games or in the 4040 list and on the CEGT. This has suddenly become super exciting, haha

-
- Posts: 1062
- Joined: Tue Apr 28, 2020 10:03 pm
- Full name: Daniel Infuehr
Re: Devlog of Leorik
Its really cool to see someone pushing C#
Wonder if you get a big performance gain by updating to .net 8.0?
I see you are on net6.0 https://github.com/lithander/Leorik/blo ... ore.csproj
Wonder if you get a big performance gain by updating to .net 8.0?
I see you are on net6.0 https://github.com/lithander/Leorik/blo ... ore.csproj
Worlds-fastest-Bitboard-Chess-Movegenerator
Daniel Inführ - Software Developer
Daniel Inführ - Software Developer
-
- Posts: 440
- Joined: Thu Apr 26, 2012 1:51 am
- Location: Oak Park, IL, USA
- Full name: Erik Madsen
Re: Devlog of Leorik - *New* Version 2.4
That sounds great... and a bit confusing. How can one gauntlet yield 2800 Elo and another 2950 Elo? Either way, great progress!
Erik Madsen | My C# chess engine: https://www.madchess.net
-
- Posts: 3699
- Joined: Thu Jun 07, 2012 11:02 pm
Re: Devlog of Leorik
Maybe it is a good blitzer, 2+1 is quite different to 40/30 (assuming that is 40 moves in 30 minutes and not 30 seconds)
-
- Posts: 915
- Joined: Sun Dec 27, 2020 2:40 am
- Location: Bremen, Germany
- Full name: Thomas Jahn
Re: Devlog of Leorik
In cutechess-cli tc=40/30 means 40 moves in 30 seconds! So definitely blitz games. (Otherwise that 15k games gauntlet would have taken several months on my hardware^^)Modern Times wrote: ↑Mon Mar 27, 2023 10:00 am Maybe it is a good blitzer, 2+1 is quite different to 40/30 (assuming that is 40 moves in 30 minutes and not 30 seconds)
-
- Posts: 3699
- Joined: Thu Jun 07, 2012 11:02 pm
Re: Devlog of Leorik
OK - I'm running some 40/15 (minutes) games with it. At that time control it is nowhere near 2950, it does seem to be around 2800 but early days.
-
- Posts: 965
- Joined: Fri Aug 21, 2020 1:25 am
- Location: Planet Earth, Sol system
- Full name: Michael J Sherwin
Re: Devlog of Leorik
Have you tried LMP yet?
Leorik.Engine - Leorik-2.4 : 12.5/20 6-1-13 (=01=1=====1==1=11===) 63% +92
Not enough games, I know I know but hey that is your Job. 
Leorik.Engine - Leorik-2.4 : 12.5/20 6-1-13 (=01=1=====1==1=11===) 63% +92
Code: Select all
//moves after the PV are searched with a null-window around alpha expecting the move to fail low
if (remaining > 1 && playState.PlayedMoves > 1)
{
// LMP
if (eval + 300 <= alpha && playState.PlayedMoves > 10 + remaining) break;
int R = 0;
//non-tactical late moves are searched at a reduced depth to make this test even faster!
if (!inCheck && playState.Stage >= Stage.Quiets && !next.InCheck())
R += 2;
//when not in check moves with a negative SEE score are reduced further
if (!inCheck && _see.IsBad(current, ref move))
R += 2;
if (EvaluateNext(ply, remaining - R, alpha, alpha + 1, moveGen) <= alpha)
continue;
//...but if against expectations the move does NOT fail low we research it at full depth!
}

-
- Posts: 965
- Joined: Fri Aug 21, 2020 1:25 am
- Location: Planet Earth, Sol system
- Full name: Michael J Sherwin
Re: Devlog of Leorik
I let it run for 80 more games. The final result was still good. There should be room for improvement.
Leorik.Engine - Leorik-2.4 : 53.5/100 25-18-57 (=01=1=====1==1=11===0010=1=1===01===01==0==110=01=11=0=1=1==000=1=1=1==1=0=0=0=======110=====1===0==) 54% +28
Leorik.Engine - Leorik-2.4 : 53.5/100 25-18-57 (=01=1=====1==1=11===0010=1=1===01===01==0==110=01=11=0=1=1==000=1=1=1==1=0=0=0=======110=====1===0==) 54% +28