CoronaVirusChess has a blitz rating on the nightmare.nl server of 2777, from the monthly blitz tourneys. Strength is of the same order as Arasan, Wasp and similar, I think. It seems to generally hold its own against the HCE’s (not including SF of course), but can’t handle the NNUE’s.connor_mcmonigle wrote: ↑Fri Apr 30, 2021 5:13 pmThanks for testing chris! I'm quite happy to see I've resolved the TM issues. What's CoronavirusChess' estimated strength and what was the result of the match? Seer is expected to underperform at STC somewhat, but in my testing at 1m+1s against a variety of opponents it looks to be around 3180-3220 elo CCRL.chrisw wrote: ↑Fri Apr 30, 2021 1:33 pm500 40/10 games without timing issues says it's fixed.connor_mcmonigle wrote: ↑Thu Apr 29, 2021 10:50 pm I've released a new minor version of Seer which hopefully resolves the cyclical TM issues people were experiencing: https://github.com/connormcmonigle/seer ... tag/v2.0.1. More information can be found in the release description. Sorry testers.
The TM problems seemed to only affect Windows which explains why I didn't experience any issues in my testing. Guenther has confirmed that Seer no longer loses on time in his limited testing.
Can you confirm that the TM issues are now resolved? Thanks in advance.chrisw wrote: ↑Wed Apr 28, 2021 11:16 am Cutechess, 40/10 games. Note timing problems and plenty of "illegal move" terminations when Seer2 actually winning (I assume because of time problems). I had to delete many PGNs because of talkchess post limits, the one's deleted came from the middle part of the run. Final part of run will contain 40+ unterminated games because of my interrupting CuteChess.
...
First impressions from a quick look at the games, Seer2 very aggressive style, possibly a little premature, eg it likes king attacks (a lot), but launches them a little too quickly, maybe, often giving material or a pawn. Sometimes this works and sometimes not.
To delve a little deeper, I set up game in 15 and watched. First game shows immediately the king side "attack" tendency. At first I thought this was premature, not yet developed, white would get the centre etc. In fact at one point it turned into something looking quite dangerous, but defences were adequate. Very coffee-house style, which is really nice. Maybe that comes from the training set? Humans on LiChess?
[pgn][Event "My Tournament"]
[Site "?"]
[Date "2021.04.30"]
[Round "1"]
[White "Corona-Virus-Chess-1.018"]
[Black "seer_znver201"]
[Result "1-0"]
[ECO "A05"]
[GameDuration "00:39:39"]
[GameEndTime "2021-04-30T12:53:20.606 Romance Summer Time"]
[GameStartTime "2021-04-30T12:13:40.774 Romance Summer Time"]
[Opening "Reti Opening"]
[PlyCount "93"]
[Termination "adjudication"]
[TimeControl "40/900+2"]
1. Nf3 {book} Nf6 {book} 2. c4 {book} d6 {book} 3. Nc3 {book} Nbd7 {book}
4. g3 {book} e5 {book} 5. Bg2 {book} c6 {book} 6. e4 {book} Be7 {-0.21/19 19s}
7. O-O {+0.14/25 24s} h5 {-0.19/20 19s} 8. h3 {+0.18/25 24s} Nh7 {+0.17/21 20s}
9. d4 {+0.19/29 24s} g5 {+0.29/21 20s} 10. d5 {+0.36/27 24s} g4 {-0.12/21 20s}
11. hxg4 {+0.36/29 25s} hxg4 {-0.22/21 20s} 12. Nh2 {+0.28/30 26s}
Nhf6 {-0.16/20 20s} 13. Nxg4 {+0.66/31 27s} Nxg4 {-0.96/23 95s}
14. Qxg4 {+0.61/30 26s} Kf8 {-0.53/21 23s} 15. Qe2 {+0.66/29 24s}
Bg5 {-0.67/23 87s} 16. f4 {+0.88/26 24s} exf4 {-0.98/22 79s}
17. gxf4 {+0.91/28 28s} Bh4 {-1.66/21 16s} 18. Rf3 {+0.58/26 29s}
Nf6 {-1.29/19 16s} 19. e5 {+0.63/30 27s} Ng4 {-0.65/21 23s}
20. e6 {+0.58/29 28s} Bf6 {-1.62/19 16s} 21. Rd3 {+0.55/29 32s}
Qb6+ {-1.55/21 33s} 22. Kf1 {+0.48/29 24s} Nh2+ {-1.15/23 73s}
23. Ke1 {+0.51/24 2.1s} fxe6 {-0.69/20 64s} 24. dxc6 {+0.59/29 28s}
bxc6 {-0.28/23 56s} 25. Ne4 {+0.51/28 29s} Be7 {-0.40/23 48s}
26. Nxd6 {+0.49/30 25s} Ba6 {0.00/22 41s} 27. Be3 {+0.87/31 30s}
Qa5+ {-0.92/20 12s} 28. Qd2 {+1.00/31 29s} Qh5 {-0.71/19 8.4s}
29. Qc3 {+1.00/32 31s} Rd8 {-0.13/19 8.6s} 30. Bc5 {+1.13/30 33s}
Rh7 {-1.79/22 36s} 31. Ne4 {+1.48/31 29s} Rxd3 {-2.82/23 28s}
32. Bxe7+ {+1.72/28 25s} Rxe7 {-2.37/18 5.9s} 33. Qxd3 {+1.66/29 29s}
e5 {-2.93/19 22s} 34. f5 {+2.74/29 30s} Bc8 {-4.71/20 15s}
35. Qd8+ {+3.64/33 33s} Re8 {-3.79/18 3.6s} 36. Qf6+ {+3.95/35 30s}
Kg8 {-7.06/22 11s} 37. Qxc6 {+4.18/36 37s} Qh4+ {-4.62/19 3.7s}
38. Kd2 {+4.03/34 34s} Rd8+ {-1.92/16 2.3s} 39. Kc3 {+4.58/37 32s}
Bxf5 {-5.22/16 2.1s} 40. Rg1 {+4.80/36 29s} Kf8 {-3.80/17 2.0s}
41. Qc5+ {+5.44/32 29s} Qe7 {-13.55/24 35s} 42. Qf2 {+5.64/31 28s}
Qf7 {-14.42/23 17s} 43. Qh4 {+8.02/30 27s} Rd4 {-15.83/26 48s}
44. Qh8+ {+8.56/32 28s} Qg8 {-16.17/27 75s} 45. Qh6+ {+8.87/33 27s}
Qg7 {-19.95/24 25s} 46. Qxh2 {+9.07/33 25s} Bg4 {-16.20/24 70s}
47. Bf3 {+12.70/33 29s, White wins by adjudication: user decision} 1-0
[/pgn]
Second game, Seer2 attacking, again one pawn less, then another pawn, then another pawn, has centre, bishop pair and development. Looks good. Trades to QRPPP vs RRBNPPPPP, Seer alternating between seeing a draw, or seeing +3. Mine thinks draw. And draw it is ...
Very nice, and really quite a coffee-house style, albeit anecdotally on two games only. Like it.
....
The 500 fast test games (40/10) with Seer2.01 came out about 52% winrate to mine, but that could go either way after so few games, so, maybe roughly equal?? Draw rate was 21%, iirc, I think that is quite low, which might reflect something or other (both engines a bit manic?).
I think I still have the pgns of the longer time control games, five altogether, again iirc.
Yes, the labels are more absolutely accurate, by definition, at least at the low man-count training iterations, but the actual positions are likely more manic. And then each iteration adds labels from the manic world above itself, and so on. It’s progressive and it must have an effect I’ld have thought.
I believe the attacking style might have something to do with the distribution of positions Seer was trained on (maybe an excessive number of imbalanced positions), but the labels assigned to the positions are completely independent of the originating games' outcomes.
I’m not sure I understood that last bit! A question though. You said you used 0.7 billion training positions, I guess the collective 3-4-5-6 bin for initial train would have 50 million or so of those? Unless I misunderstood, the Seer NN is using king-piece-square as per nodchip-SF NNUE, but with fewer accumulator neurons and a different structure above that (which kind of cross checks with the 60Mb weights file size). Is 50 million enough to train a big initial NNUE, or is the intention to increase the position mass for future runs?
I think Seer tends to be much looser with material as the playout continuations to <N man position it's trained on can sometimes go up to 30 moves deep which means that the network tends to look more for long term advantages than immediate compensation. This definitely backfires sometimes, though
Using a shallow search as a target biases the network towards looking for immediate compensation. Training a network on the WDL outcome of partially randomized self play games also induces a "short term bias" as positions requiring precise, long continuations (such as a king attack or tricky endgame), where a random move can't be afforded, will be undervalued.