I've run my own test during the night at the 10s+0.1s I normally use. It seems Rustic 4 and Leorik are now in the same ballpark:
Score of Rustic Alpha 3.43.100 vs Leorik 0.2.3: 715 - 778 - 507 [0.484]
... Rustic Alpha 3.43.100 playing White: 396 - 360 - 244 [0.518] 1000
... Rustic Alpha 3.43.100 playing Black: 319 - 418 - 263 [0.451] 1000
... White vs Black: 814 - 679 - 507 [0.534] 2000
Elo difference: -10.9 +/- 13.1, LOS: 5.2 %, DrawRatio: 25.4 %
2000 of 2000 games finished.
With regard to speed, they also seem to be similar, but I'm seeing a few things I can't explain. See the time-to-depth test below:
Rustic:
position fen r3k2r/p1ppqpb1/bn2pnp1/3PN3/1p2P3/2N2Q1p/PPPBBPPP/R3K2R w KQkq - 0 1
go depth 10
info score cp 72 depth 1 seldepth 11 time 0 nodes 1032 nps 0 pv e2a6 b4c3
info score cp 72 depth 2 seldepth 11 time 0 nodes 2420 nps 0 pv e2a6 b4c3
info score cp 44 depth 3 seldepth 17 time 1 nodes 10792 nps 10792000 pv d5e6 e7e6 e2a6 e6e5
info score cp 44 depth 4 seldepth 19 time 5 nodes 37884 nps 7576800 pv d5e6 e7e6 e2a6 e6e5
info score cp 20 depth 5 seldepth 19 time 23 nodes 185680 nps 8073043 hashfull 2 pv e2a6 e6d5 c3d5 b6d5 e4d5 e7e5
info score cp 2 depth 6 seldepth 20 time 98 nodes 645425 nps 6585969 hashfull 15 pv d5e6 e7e6 e2a6 h3g2 f3g2 e6e5
info score cp -9 depth 7 seldepth 22 time 486 nodes 3139224 nps 6459309 hashfull 71 pv e2a6 e6d5 c3d5 f6d5 e5d3 f7f5 e1g1 f5e4
info score cp -39 depth 8 seldepth 23 time 1822 nodes 10474182 nps 5748728 hashfull 279 pv e2a6 e6d5 e1g1 h3g2 f3g2 e7e5 c3d5 f6d5 e4d5 e5d5 d2b4 d5g2 g1g2 g7b2
info score cp -26 depth 9 seldepth 25 time 5927 nodes 34466920 nps 5815239 hashfull 716 pv e2a6 e6d5 c3d5 b6d5 a6b7 h3g2 f3g2 a8b8 e4d5 b8b7
info score cp -37 depth 10 seldepth 27 time 24585 nodes 135415089 nps 5508037 hashfull 999 pv e2a6 e6d5 c3b5 d5e4 b5c7 e8d8 f3g3 h3g2 h1g1 f6h5 d2b4 h5g3 b4e7 d8c7 h2g3 g7e5 g1g2 e5b2
bestmove e2a6
quit
Leorik:
position fen r3k2r/p1ppqpb1/bn2pnp1/3PN3/1p2P3/2N2Q1p/PPPBBPPP/R3K2R w KQkq - 0 1
go depth 10
info string Search scheduled to take 715827857ms!
info depth 1 score cp 39 nodes 8370 nps 139500 time 60 pv d5e6
info depth 2 score cp 39 nodes 12941 nps 142208 time 91 pv d5e6 e7e6
info depth 3 score cp -1 nodes 25405 nps 267421 time 95 pv d5e6 e7e6 e2a6
info depth 4 score cp -1 nodes 57046 nps 538169 time 106 pv d5e6 e7e6 e2a6 e6e5
info depth 5 score cp -31 nodes 299820 nps 1774082 time 169 pv e2a6 b4c3 d2c3 e6d5 e5g4
info depth 6 score cp -31 nodes 848063 nps 2546735 time 333 pv e2a6 b4c3 d2c3 e6d5 e5g4 h3g2
info depth 7 score cp -46 nodes 4096299 nps 5520618 time 742 pv d5e6 e7e6 e2a6 h3g2 f3g2 e6e5 c3b5
info depth 8 score cp -47 nodes 15748792 nps 6483652 time 2429 pv e2a6 e6d5 c3b5 h3g2 b5c7 e8d8
info depth 9 score cp -34 nodes 45256103 nps 7416601 time 6102 pv e2a6 e6d5 c3b5 h3g2 b5c7 e8d8
info depth 10 score cp -47 nodes 159081047 nps 6210949 time 25613 pv e2a6 e6d5
bestmove e2a6
quit
Some observations:
- It seems Rustic is faster with regard to time-to-depth.
- It looks like this is caused by the fact that Rustic searches less nodes.
- Still, Leorik often outcalculates Rustic by 1-2 play in the late middle-game, and sometimes even 3 ply in the end-game
- If an engine searches 1-3 ply deeper than another engine this should give you MUCH more strength than +10 Elo. I'll try to run a test with depth settings of 6 and 8 or something like that with Rustic, and see what the difference is. (The higher the depth, the less the Elo difference will be, obviously.)
- What's up with Leorik's strange NPS that needs to "rev up" first? It starts out really strange and at some point exceeds Rustic's nps, which seems to stabilize at 5.5M nodes/sec at higher depths. (Longer time to search, so the average per position is more reliable.)
- What abouv Leorik's super-short PV?
You did help me to discover a massive bug though. Refactoring the printing routines (adding the Display trait in many places) left me with an extra space after "pv". I'll have to fix that ASAP. Printing that extra space costs time you know.
Does Leorik 0.2.3 which you have sent me have the staged move generation that you posted earlier in the thread?