jefk wrote: ↑Thu Jan 02, 2025 4:17 am
So your idea of 'success' is being on the Ccrl list ? Get real. Learning programming/improving programming skills is a fine goal, but making a chess engine elo 2500 imo isn't a success; maybe it would be for some AI, not for humans anymore.
Giraffe is a UCI chess engine written by Matthew Lai, one of AlphaZero's developers.
Thanks to Silvian for testing LazySMP 9.0. It proves that my engine has a rating above 2400. Please note that Giraffe supports Transposition Table but LazySMP does not.
LazySMP wrote: ↑Fri Jan 03, 2025 4:09 amHi Rasmus! Are you sure CT800 rating is around 2700?
Yes.
LazySMP 9.0 beat CT800 V1.46 in 30 moves
King safety has always been an engine weakness, so losses like this can happen, and if they do, the game is shorter naturally because it doesn't go into an endgame. However, it's not about how quickly the engine loses if it does, but how often, with varying openings of course. If you say LazySMP 9.0 should be around 2500, that's 200 Elo difference, so LazySMP should get about 2500 points in a 10000 games match - including quite some wins, obviously.
I usually test at 10s per game for each side, no increment. With an eight core CPU plus hyperthreading, running 14 games in parallel isn't an issue with a proper engine driver; I'm using c-chess-cli under Linux. CPU turbo boost should be switched off to get a more constant CPU speed. 10k games take something like 3 hours or so.
LazySMP wrote: ↑Fri Nov 15, 2024 7:45 am
What is your engine rating in CCRL?
Around 2700.
Hi Rasmus! Are you sure CT800 rating is around 2700?
LazySMP 9.0 beat CT800 V1.46 in 30 moves
You play ONE game vs ct800 and you claim your "thing" is better than ct800 ? How "funny" guy you are .
Remember another guy who test with 1-2 games and never follow experts advices , but I can't remember his name exactly
Welll ..........
Ras wrote: ↑Fri Jan 03, 2025 7:37 am
King safety has always been an engine weakness, so losses like this can happen, and if they do, the game is shorter naturally because it doesn't go into an endgame.
You're right. BTW, king safety is the most important part of the evaluation. I am not familiar with your engine code, otherwise I would have helped you to fix its weakness. I think that you don't spend as much time on improving the evaluation function as you do on improving the search function.
Daniel Anulliero wrote: ↑Fri Jan 03, 2025 11:35 am
You play ONE game vs ct800 and you claim your "thing" is better than ct800 ?
No, my point is CCRL rating may be wrong. Please see LazySMP 5.0 rating which is 2070.
LazySMP wrote: ↑Fri Jan 03, 2025 8:17 pmYou're right. BTW, king safety is the most important part of the evaluation.
There is ofc king safety code in there, which is why in the game, the CT800 castles properly and sets up a standard pawn structure. What it doesn't see is king attacks coming in. It's not that I havn't tried tons of stuff, with danger zones, scoring attackers and whatnot - but it never worked. The software architecture is also limited by hardware constraints - the UCI version is only the testbed, the actual project is the microcontroller version with 1MB ROM for code/data and 192kB RAM for dynamic data (variables, stack, hash tables) with only 16kB RAM remaining as future reserve.
No, my point is CCRL rating may be wrong.
It's consistent with my own tests, usually within 20 Elo. However, I test with six different engines, 10000 games each, plus 50000 games against the previous CT800 version, and an 80000 opening position book to avoid duplicate games. Unless you even start to ramp up your testing, all you see is error margins.
Ras wrote: ↑Fri Jan 03, 2025 8:28 pm
There is ofc king safety code in there, which is why in the game, the CT800 castles properly and sets up a standard pawn structure. What it doesn't see is king attacks coming in. It's not that I haven't tried tons of stuff, with danger zones, scoring attackers and whatnot - but it never worked.
You're right. It's too hard! I remember many years ago when Joerg Oster tuned values for piece check and attack unit factors, king safety went down.
Please see this: https://github.com/official-stockfish/S ... dd82d0fadf
Joerg Oster wrote:
A middle ground patch of two successful tuning patches, one at STC, the other at LTC, which now passed both.
const int QueenCheck = 52; const int QueenCheck = 62;
const int RookCheck = 45; const int RookCheck = 57;
const int BishopCheck = 5; const int BishopCheck = 48;
const int KnightCheck = 17; const int KnightCheck = 78;