What engine breaks even with GMs in blitz?

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: What engine breaks even with GMs in blitz?

Post by Laskos »

lkaufman wrote: Mon Apr 08, 2019 9:04 pm
Laskos wrote: Mon Apr 08, 2019 8:06 am
lkaufman wrote: Mon Apr 08, 2019 6:43 am I ran a twenty game blitz (3' + 2") knight odds match (two ply book for variety, both b1 and g1 knights) between Lc0 11248 (on my 2080) vs. Giraffe (best version, on my 5 GHz i7 laptop), with Giraffe as a proxy for Magnus Carlsen, a good one since it "knows" to simplify when up a piece while some similarly rated A/B engines may not know or appreciate this. "Magnus" won by 12 to 8 (no draws!). So perhaps it's not yet time to bet against the champ if such a match took place, but we're getting close it seems.
I am not sure at 3' + 2'' blitz, Lc0 11248 on 2080 is probably close at Knight odds to Carlsen, but my claim here:
http://www.talkchess.com/forum3/viewtop ... =2&t=69956

was that at longer 45' + 15'' a top GM might not win all 10 games out of 10 being a Knight up, more probably 7-9, 1-3 being drawn or even lost by the human. It would be fun to watch such a match, as the top human will be happy having an upper hand most of the times, while enjoying some 1-3 setbacks at Knight odds in 10 games! The prize could be proportional to (Wins - Draws - Losses) or even (Wins - Draws - 2*Losses), to give incentive to the human to not lose any points. Lc0 should be left with some temperature (say 0.5) for the first 4-5 moves, for diversity and for not playing into prepared openings. Or by providing Lc0 with a small, prepared book.
Since we already know that this same LCO network defeated GM Naroditsky by a wide margin in blitz games averaging around knight odds, and he is probably within a class of Magnus in blitz strength, we already have good reason to think knight odds vs. Carlsen would be close, so this test is mostly just to confirm that we can make meaningful predictions of human results by these simulartions. As for 45' + 15", what CCRL 40/40 rating do you think would come closest to matching Magnus in strength at 45' + 15" ?
I am on the phone now, but that's not that hard, there were games against top humans in 2003-2004 and we can extrapolate tc and hardware. Take some Fritz 8 or Junior 9 on one core CCRL 40/40 level. My estimate it is some Kasparov or Magnus level at 45' + 15''.
User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: What engine breaks even with GMs in blitz?

Post by Laskos »

Laskos wrote: Mon Apr 08, 2019 9:56 pm
lkaufman wrote: Mon Apr 08, 2019 9:04 pm
Laskos wrote: Mon Apr 08, 2019 8:06 am
lkaufman wrote: Mon Apr 08, 2019 6:43 am I ran a twenty game blitz (3' + 2") knight odds match (two ply book for variety, both b1 and g1 knights) between Lc0 11248 (on my 2080) vs. Giraffe (best version, on my 5 GHz i7 laptop), with Giraffe as a proxy for Magnus Carlsen, a good one since it "knows" to simplify when up a piece while some similarly rated A/B engines may not know or appreciate this. "Magnus" won by 12 to 8 (no draws!). So perhaps it's not yet time to bet against the champ if such a match took place, but we're getting close it seems.
I am not sure at 3' + 2'' blitz, Lc0 11248 on 2080 is probably close at Knight odds to Carlsen, but my claim here:
http://www.talkchess.com/forum3/viewtop ... =2&t=69956

was that at longer 45' + 15'' a top GM might not win all 10 games out of 10 being a Knight up, more probably 7-9, 1-3 being drawn or even lost by the human. It would be fun to watch such a match, as the top human will be happy having an upper hand most of the times, while enjoying some 1-3 setbacks at Knight odds in 10 games! The prize could be proportional to (Wins - Draws - Losses) or even (Wins - Draws - 2*Losses), to give incentive to the human to not lose any points. Lc0 should be left with some temperature (say 0.5) for the first 4-5 moves, for diversity and for not playing into prepared openings. Or by providing Lc0 with a small, prepared book.
Since we already know that this same LCO network defeated GM Naroditsky by a wide margin in blitz games averaging around knight odds, and he is probably within a class of Magnus in blitz strength, we already have good reason to think knight odds vs. Carlsen would be close, so this test is mostly just to confirm that we can make meaningful predictions of human results by these simulartions. As for 45' + 15", what CCRL 40/40 rating do you think would come closest to matching Magnus in strength at 45' + 15" ?
I am on the phone now, but that's not that hard, there were games against top humans in 2003-2004 and we can extrapolate tc and hardware. Take some Fritz 8 or Junior 9 on one core CCRL 40/40 level. My estimate it is some Kasparov or Magnus level at 45' + 15''.
Just started a match of 10 games at roughly this level (Arasan 14.3, 2684 CCRL 40/40), but at 15' + 5'' time control, so a bit more favorable for the stronger engine odds taker than 45' + 15''. Already after 3 games, I have a win for Lc0 as White at Knight odds, so I think that my result that we can have 1-3 setbacks (draw or loss) in 10 games of a top GM at Knight odds against a top engine (Lc0) on RTX 2070 or 2080 might be reasonable. Here is the win of Lc0 against an engine roughly the strength of Fritz 8 Bilbao, Junior 9 or Magnus in these conditions.

[pgn][Event "?"] [Site "?"] [Date "2019.04.09"] [Round "3"] [White "lc0_11248"] [Black "Arasan_143"] [Result "1-0"] [FEN "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKB1R w KQkq - 0 1"] [GameDuration "00:49:43"] [GameEndTime "2019-04-09T01:37:47.959 GTB Daylight Time"] [GameStartTime "2019-04-09T00:48:04.014 GTB Daylight Time"] [PlyCount "241"] [SetUp "1"] [Termination "adjudication"] [TimeControl "900+5"] 1. e4 {-7.05/9 20s} e5 {+3.17/20 28s} 2. d4 {-6.65/10 13s} exd4 {+3.25/20 27s} 3. Qxd4 {-6.76/11 12s} Nc6 {+3.20/19 26s} 4. Qe3 {-6.78/11 22s} Bb4+ {+3.12/18 26s} 5. Bd2 {-6.74/12 39s} Nf6 {+3.11/19 25s} 6. Nc3 {-6.76/13 4.4s} O-O {+3.25/19 25s} 7. O-O-O {-6.81/12 9.5s} d6 {+3.23/19 24s} 8. f3 {-5.20/10 35s} Be6 {+3.23/18 24s} 9. h4 {-4.94/10 41s} Nh5 {+3.13/19 23s} 10. Qe1 {-4.90/10 17s} a5 {+3.48/18 23s} 11. g4 {-3.38/13 15s} Nf6 {+3.14/18 23s} 12. h5 {-3.58/14 14s} Nd7 {+2.95/19 22s} 13. Kb1 {-3.46/12 42s} Nde5 {+3.58/18 22s} 14. Qg3 {-2.88/13 12s} Nd4 {+3.42/18 21s} 15. f4 {-2.83/15 17s} Bxg4 {+2.77/20 42s} 16. Rg1 {-2.82/18 25s} g6 {+3.29/18 20s} 17. fxe5 {-1.97/23 11s} Bxd1 {+4.38/17 20s} 18. Bg5 {-1.91/29 9.1s} Bxc2+ {+4.08/15 19s} 19. Kc1 {-3.96/17 28s} Qe8 {+3.74/17 38s} 20. Nd5 {-0.34/28 25s} f6 {+2.66/15 36s} 21. Bxf6 {-0.37/27 7.5s} Bxe4 {+2.24/15 34s} 22. Bc4 {-0.35/26 9.9s} Ne2+ {+1.91/15 33s} 23. Kd1 {-0.37/25 0.91s} Rf7 {+1.67/15 32s} 24. Qh4 {-0.46/23 21s} Nxg1 {+3.75/14 13s} 25. hxg6 {-0.48/29 0.28s} Bxg6 {+1.63/15 30s} 26. Ne7+ {-0.78/31 53s} Qxe7 {+1.31/20 29s} 27. Bxe7 {-0.79/31 12s} d5 {+1.31/19 14s} 28. Bxd5 {-0.80/30 9.2s} Bxe7 {+0.98/19 13s} 29. Qxe7 {-0.80/31 2.3s} Nf3 {+1.11/18 13s} 30. Bxf7+ {-0.40/25 60s} Bxf7 {+0.74/20 13s} 31. e6 {-0.35/26 22s} Bh5 {+0.78/20 13s} 32. Kc1 {-0.33/26 17s} Re8 {+1.19/19 13s} 33. Qf6 {-0.29/24 14s} Rf8 {+0.74/19 13s} 34. Qh6 {-0.28/18 0.57s} Be8 {+0.58/20 12s} 35. e7 {-0.29/16 0.25s} Rf7 {+0.34/21 24s} 36. Qh5 {-0.28/14 0.25s} c5 {+0.38/20 12s} 37. Qd5 {-0.21/25 74s} Nd4 {+0.62/18 11s} 38. Qg5+ {-0.19/25 20s} Rg7 {+0.09/20 11s} 39. Qf6 {-0.16/24 22s} Rg1+ {+0.14/16 11s} 40. Kd2 {-0.18/26 24s} Rg2+ {+0.10/17 11s} 41. Kd1 {-0.17/24 14s} Bh5+ {+0.47/17 11s} 42. Kc1 {-0.14/22 30s} Bf7 {+0.35/17 11s} 43. Qh6 {-0.14/22 0.26s} Rg1+ {-0.06/17 11s} 44. Kd2 {-0.16/20 0.045s} Rg7 {+0.55/17 10s} 45. Qb6 {-0.25/18 7.2s} Rg2+ {+0.14/17 10s} 46. Kd3 {-1.52/19 70s} Be8 {+0.10/17 10s} 47. b4 {-1.63/16 25s} Nc6 {+2.75/15 10s} 48. b5 {-1.41/19 46s} Nb4+ {+2.50/15 9.9s} 49. Ke4 {-1.45/19 13s} Re2+ {+2.50/16 9.8s} 50. Kf3 {-1.51/19 14s} Rxe7 {+2.43/16 19s} 51. Qxc5 {-1.53/18 7.3s} Rf7+ {+2.06/16 18s} 52. Kg3 {-1.28/14 41s} Rd7 {+2.43/15 9.0s} 53. Qb6 {-1.35/14 48s} Rd3+ {+2.06/14 8.8s} 54. Kh2 {-0.59/12 8.3s} Rd2+ {+1.42/16 17s} 55. Kg1 {-0.60/13 8.9s} Rd1+ {+1.30/16 17s} 56. Kh2 {-0.64/14 5.3s} Rd2+ {+1.42/16 8.2s} 57. Kg1 {-0.63/14 11s} Rd1+ {+1.30/17 8.1s} 58. Kh2 {-0.64/15 0.29s} Rd5 {+1.63/16 8.0s} 59. a4 {+0.03/15 14s} Rd2+ {+1.26/16 7.9s} 60. Kg1 {+0.05/14 9.6s} Rd7 {+1.55/17 7.9s} 61. Qxa5 {+0.10/16 8.3s} Nd5 {+1.51/17 7.8s} 62. Qa8 {+0.11/16 5.7s} Nc7 {+1.02/18 7.7s} 63. Qxb7 {+0.16/12 17s} Nxb5 {+1.23/17 7.6s} 64. Qc8 {+0.15/12 7.2s} Nd6 {+0.63/18 15s} 65. Qa8 {+0.11/12 9.4s} Kf8 {+1.02/16 7.4s} 66. a5 {+0.11/10 4.5s} Nb5 {+0.86/16 7.3s} 67. a6 {+0.07/10 8.7s} Na7 {+0.90/17 7.2s} 68. Qe4 {+0.05/10 4.3s} Bg6 {+1.19/16 7.2s} 69. Qe6 {+0.10/9 12s} Re7 {+0.82/18 7.2s} 70. Qb6 {+0.06/10 5.3s} Be4 {+0.86/16 7.1s} 71. Qb8+ {+0.03/9 5.1s} Kg7 {+0.10/19 14s} 72. Qf4 {+0.21/8 4.2s} Bc6 {+1.23/14 6.7s} 73. Kh2 {+0.21/8 6.2s} Kg6 {+0.78/17 6.8s} 74. Qf8 {+0.31/8 5.1s} Rd7 {+0.66/17 6.8s} 75. Kh3 {+0.27/9 7.5s} Rf7 {+1.11/16 6.6s} 76. Qc5 {+0.05/9 5.0s} Kf6 {+0.74/16 6.5s} 77. Qd6+ {-0.04/9 4.2s} Kf5 {+0.74/17 6.5s} 78. Kh2 {-0.17/9 6.8s} Rd7 {+1.47/15 6.5s} 79. Qc5+ {-0.22/9 2.9s} Ke4 {+1.10/16 6.5s} 80. Qc4+ {-0.25/10 4.1s} Ke3 {+1.06/16 6.4s} 81. Kg1 {+0.02/9 5.3s} Bb5 {+1.14/15 6.5s} 82. Qb4 {+0.01/9 3.5s} Kd3 {+1.26/14 6.3s} 83. Kh1 {+0.01/10 5.7s} Kc2 {+1.30/14 6.3s} 84. Qa3 {+0.14/9 4.6s} h5 {+1.30/16 6.4s} 85. Kg2 {+0.18/9 7.2s} Bc6+ {+0.38/15 12s} 86. Kf2 {+0.12/10 7.7s} h4 {+0.71/16 12s} 87. Kg1 {+0.16/10 2.3s} Rg7+ {+0.71/15 5.9s} 88. Kf1 {+0.17/11 5.7s} Bb5+ {+0.34/17 5.9s} 89. Kf2 {+0.19/11 4.2s} Rf7+ {+0.34/16 5.8s} 90. Kg1 {+0.19/11 7.6s} Bc4 {+0.75/15 5.8s} 91. Qb4 {+0.02/9 5.2s} Rc7 {+0.38/15 5.8s} 92. Qa4+ {+0.09/9 4.3s} Kd3 {+0.42/16 5.8s} 93. Qa3+ {-0.05/10 5.0s} Ke4 {+0.42/17 5.7s} 94. Qd6 {-0.10/10 4.4s} Nb5 {+0.75/16 5.8s} 95. Qb6 {-0.16/10 6.3s} Kd5 {+0.99/14 5.8s} 96. Kf2 {-0.15/10 5.6s} Ra7 {+1.58/15 5.7s} 97. Kg2 {-0.10/9 3.5s} Bd3 {+1.58/16 5.8s} 98. Kh3 {-0.14/9 4.8s} Rh7 {+1.58/17 5.7s} 99. Qd8+ {0.00/9 5.5s} Ke4 {+1.58/17 5.6s} 100. Qa8+ {+0.08/9 6.2s} Ke3 {+1.58/17 5.7s} 101. Qe8+ {+0.07/9 5.0s} Kd2 {+1.58/18 5.6s} 102. Qe5 {+0.02/9 3.8s} Na7 {+1.50/17 5.6s} 103. Qb2+ {+0.41/9 4.4s} Ke3 {+1.42/18 5.7s} 104. Qb6+ {+0.38/9 3.3s} Kf3 {+0.63/17 5.6s} 105. Qb3 {+0.35/9 7.2s} Rd7 {+0.02/16 5.6s} 106. Kxh4 {+0.61/10 4.4s} Rh7+ {-0.04/16 5.5s} 107. Kg5 {+0.76/9 0.055s} Ke3 {-0.04/17 5.5s} 108. Qb6+ {+0.60/10 6.3s} Kd2 {-0.04/17 5.6s} 109. Qb2+ {+0.61/10 7.5s} Kd1 {0.00/17 5.5s} 110. Kf4 {+1.04/11 4.5s} Bc2 {-0.37/18 5.6s} 111. Ke3 {+1.56/10 4.5s} Re7+ {-0.37/18 5.5s} 112. Kf3 {+1.47/11 9.0s} Nc6 {0.00/14 5.5s} 113. Qc3 {+1.64/13 1.5s} Rc7 {-0.37/15 5.5s} 114. Qf6 {+2.22/12 6.0s} Rc8 {-1.13/15 5.5s} 115. Kf2 {+2.40/14 1.9s} Bd3 {-0.37/17 5.5s} 116. Ke3 {+3.62/16 5.6s} Kc2 {-0.37/16 5.5s} 117. Qe6 {+4.48/15 3.5s} Rc7 {-3.54/16 5.4s} 118. a7 {+5.50/13 4.7s} Nxa7 {-3.58/19 5.5s} 119. Qa2+ {+5.62/14 3.7s} Kc3 {-3.62/20 5.4s} 120. Qa5+ {+7.18/12 5.4s} Kb3 {-3.66/21 5.4s} 121. Qxc7 {+7.28/12 1.7s, White wins by adjudication: SyzygyTB} 1-0[/pgn]
Ferdy
Posts: 4833
Joined: Sun Aug 10, 2008 3:15 pm
Location: Philippines

Re: What engine breaks even with GMs in blitz?

Post by Ferdy »

lkaufman wrote: Sun Apr 07, 2019 6:32 pm In blitz (let's say 3' + 2" or as close to this as possible), the top engines today are far beyond human level. But how far down the list do we have to go to find engines (and specified hardware) that score evenly against GMs, preferably ones with known identities and ratings? I'm sure there is plenty of data to answer this question as countless games have been played online over the years, but does anyone actually have some data, such as "Engine xyz on one thread scored 50% against GMs averaging 2600 FIDE" for example? The question I'd like to answer is: How much would we have to add to CCRL blitz ratings to estimate the FIDE blitz rating of a human GM who would score 50% against it at 3' + 2"?
I am currently working on this via similarity approch.

1. Collect games from TWIC 2017 to 2019

2. Save games played by players with rating 2500 to 2700 both white and black.

3. Read these games and save position for similarity testing with following criteria
a. Evaluate the move in the game by running stockfish at multipv 2 at 1s per pos, single thread, save bs1 and bs2 (bestscore1 from multipv1, and bestscore2 from multipv2 respectively).
b. bs1 should not be too bad and not too good either, bs1 >= -100cp and bs1 <= 100cp
c. Don't save position if there is a clear best move, bs1 - bs2 > 100cp
d. Don't save position if side to move is in check
e. Save position in a game randomly at a maximum of 5 pos in every game. It can be 0 depending on the generated move numbers.

Code: Select all

skip this pos, move num 12 is not in pre-generated move random numbers for this game [97, 31, 26, 81, 84]
4. Once saved position reaches 6000 stop the position generation.

5. Get some uci engines that support go movetime command as priority if not those that support go infinite.

6. Let the engines analyze those saved gm positions and use sim3 to generate similarity info. Those engines that have a high similarity will be the candidates for selection.
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: What engine breaks even with GMs in blitz?

Post by lkaufman »

Laskos wrote: Mon Apr 08, 2019 9:56 pm
lkaufman wrote: Mon Apr 08, 2019 9:04 pm
Laskos wrote: Mon Apr 08, 2019 8:06 am
lkaufman wrote: Mon Apr 08, 2019 6:43 am I ran a twenty game blitz (3' + 2") knight odds match (two ply book for variety, both b1 and g1 knights) between Lc0 11248 (on my 2080) vs. Giraffe (best version, on my 5 GHz i7 laptop), with Giraffe as a proxy for Magnus Carlsen, a good one since it "knows" to simplify when up a piece while some similarly rated A/B engines may not know or appreciate this. "Magnus" won by 12 to 8 (no draws!). So perhaps it's not yet time to bet against the champ if such a match took place, but we're getting close it seems.
I am not sure at 3' + 2'' blitz, Lc0 11248 on 2080 is probably close at Knight odds to Carlsen, but my claim here:
http://www.talkchess.com/forum3/viewtop ... =2&t=69956

was that at longer 45' + 15'' a top GM might not win all 10 games out of 10 being a Knight up, more probably 7-9, 1-3 being drawn or even lost by the human. It would be fun to watch such a match, as the top human will be happy having an upper hand most of the times, while enjoying some 1-3 setbacks at Knight odds in 10 games! The prize could be proportional to (Wins - Draws - Losses) or even (Wins - Draws - 2*Losses), to give incentive to the human to not lose any points. Lc0 should be left with some temperature (say 0.5) for the first 4-5 moves, for diversity and for not playing into prepared openings. Or by providing Lc0 with a small, prepared book.
Since we already know that this same LCO network defeated GM Naroditsky by a wide margin in blitz games averaging around knight odds, and he is probably within a class of Magnus in blitz strength, we already have good reason to think knight odds vs. Carlsen would be close, so this test is mostly just to confirm that we can make meaningful predictions of human results by these simulartions. As for 45' + 15", what CCRL 40/40 rating do you think would come closest to matching Magnus in strength at 45' + 15" ?
I am on the phone now, but that's not that hard, there were games against top humans in 2003-2004 and we can extrapolate tc and hardware. Take some Fritz 8 or Junior 9 on one core CCRL 40/40 level. My estimate it is some Kasparov or Magnus level at 45' + 15''.
I believe those were roughly the engines that tied matches with Kasparov at standard time back then, probably running on four threads, though I'm not sure about that. I'm not sure how many times faster the hardware is today; I suppose we have to clarify whether we are talking about playing on the old machines CCRL uses as a standard or on our current machines. Since you are running your matches on your new machine, not on an old AMD like the ones used as reference by CCRL, I suppose we should pick an engine that would be Carlsen level on that hardware, and if so I think you are picking too strong an engine; I imagine that those two engines on your current machine on 1 thread are as good as whatever they had around 2003 on four threads, so it should be 2800+ level even at 40/2 hours, and hence a big favorite at 45' + 15". But you are more knowledgeable about hardware and a much better mathematician than I am, so please correct me if I am wrong.
Komodo rules!
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: What engine breaks even with GMs in blitz?

Post by lkaufman »

Ferdy wrote: Tue Apr 09, 2019 3:44 am
lkaufman wrote: Sun Apr 07, 2019 6:32 pm In blitz (let's say 3' + 2" or as close to this as possible), the top engines today are far beyond human level. But how far down the list do we have to go to find engines (and specified hardware) that score evenly against GMs, preferably ones with known identities and ratings? I'm sure there is plenty of data to answer this question as countless games have been played online over the years, but does anyone actually have some data, such as "Engine xyz on one thread scored 50% against GMs averaging 2600 FIDE" for example? The question I'd like to answer is: How much would we have to add to CCRL blitz ratings to estimate the FIDE blitz rating of a human GM who would score 50% against it at 3' + 2"?
I am currently working on this via similarity approch.

1. Collect games from TWIC 2017 to 2019

2. Save games played by players with rating 2500 to 2700 both white and black.

3. Read these games and save position for similarity testing with following criteria
a. Evaluate the move in the game by running stockfish at multipv 2 at 1s per pos, single thread, save bs1 and bs2 (bestscore1 from multipv1, and bestscore2 from multipv2 respectively).
b. bs1 should not be too bad and not too good either, bs1 >= -100cp and bs1 <= 100cp
c. Don't save position if there is a clear best move, bs1 - bs2 > 100cp
d. Don't save position if side to move is in check
e. Save position in a game randomly at a maximum of 5 pos in every game. It can be 0 depending on the generated move numbers.

Code: Select all

skip this pos, move num 12 is not in pre-generated move random numbers for this game [97, 31, 26, 81, 84]
4. Once saved position reaches 6000 stop the position generation.

5. Get some uci engines that support go movetime command as priority if not those that support go infinite.

6. Let the engines analyze those saved gm positions and use sim3 to generate similarity info. Those engines that have a high similarity will be the candidates for selection.
Although this does sound interesting, I don't think that similarity in move choice between human and computer is a reasonable way to predict which computer would be of the same strength as the human. The difference between engine and human play is too large for this to work. It's not particularly obvious whether the engine that played most like a 2600 would win or lose to him, only that there is no reason to expect a close match. My guess is that the occasional human blunder would mean that the 2600 human would lose badly to the similar engine, but there could be factors working the other way too.
Komodo rules!
mwyoung
Posts: 2727
Joined: Wed May 12, 2010 10:00 pm

Re: What engine breaks even with GMs in blitz?

Post by mwyoung »

lkaufman wrote: Sun Apr 07, 2019 6:32 pm In blitz (let's say 3' + 2" or as close to this as possible), the top engines today are far beyond human level. But how far down the list do we have to go to find engines (and specified hardware) that score evenly against GMs, preferably ones with known identities and ratings? I'm sure there is plenty of data to answer this question as countless games have been played online over the years, but does anyone actually have some data, such as "Engine xyz on one thread scored 50% against GMs averaging 2600 FIDE" for example? The question I'd like to answer is: How much would we have to add to CCRL blitz ratings to estimate the FIDE blitz rating of a human GM who would score 50% against it at 3' + 2"?
Hello Larry,

It looks like you are going to have to go down to the bottom of the list.

Here is a news report from 1994. About how Fritz 2 won against all the worlds best in 1994 in 5m blitz games.

https://www.independent.co.uk/arts-ente ... 38085.html

In 1994 at the time of the news report. The best processor was the Pentium.

And the report says: "When Intel sponsored the World Chess Express Challenge in Munich last Friday, they could never have hoped for such a good advertisement for their high-speed Pentium processor. It turned a good computer - Fritz 2 - into a world beater.:"

March 1994:
Intel introduces and ships faster Pentium chips, based on 0.6 micron BiCMOS manufacturing. The processor now includes clock-doubling of 1.5 or 2 time the external clock rate, allowing processor speeds of up to 100 MHz on a 50-66 MHz system bus. The processor also includes power management capabilities to allow stopping and restarting the processor. Code-name during development was P54C. The 60/90 MHz Pentium 735 processor is rated at 149.8 MIPS, and is priced at US$849 in 1000 unit quantities. The 66/100 MHz Pentium 815 processor is rated at 166.3 MIPS, and is priced at US$995 in 1000 unit quantities. [205.98] [265] [62] [550.29] [551.168,259] [557.134] [584.43] [689.115] [276]
Last edited by mwyoung on Tue Apr 09, 2019 7:06 am, edited 1 time in total.
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
mwyoung
Posts: 2727
Joined: Wed May 12, 2010 10:00 pm

Re: What engine breaks even with GMs in blitz?

Post by mwyoung »

mwyoung wrote: Tue Apr 09, 2019 6:38 am
lkaufman wrote: Sun Apr 07, 2019 6:32 pm In blitz (let's say 3' + 2" or as close to this as possible), the top engines today are far beyond human level. But how far down the list do we have to go to find engines (and specified hardware) that score evenly against GMs, preferably ones with known identities and ratings? I'm sure there is plenty of data to answer this question as countless games have been played online over the years, but does anyone actually have some data, such as "Engine xyz on one thread scored 50% against GMs averaging 2600 FIDE" for example? The question I'd like to answer is: How much would we have to add to CCRL blitz ratings to estimate the FIDE blitz rating of a human GM who would score 50% against it at 3' + 2"?
Hello Larry,

It looks like you are going to have to go down to the bottom of the list.

Here is a news report from 1994. About how Fritz 2 won against all the worlds best in 1994 in 5m blitz games.

https://www.independent.co.uk/arts-ente ... 38085.html

In 1994 at the time of the news report. The best processor was the Pentium.

And the report says: "When Intel sponsored the World Chess Express Challenge in Munich last Friday, they could never have hoped for such a good advertisement for their high-speed Pentium processor. It turned a good computer - Fritz 2 - into a world beater.:"

March 1994:
Intel introduces and ships faster Pentium chips, based on 0.6 micron BiCMOS manufacturing. The processor now includes clock-doubling of 1.5 or 2 time the external clock rate, allowing processor speeds of up to 100 MHz on a 50-66 MHz system bus. The processor also includes power management capabilities to allow stopping and restarting the processor. Code-name during development was P54C. The 60/90 MHz Pentium 735 processor is rated at 149.8 MIPS, and is priced at US$849 in 1000 unit quantities. The 66/100 MHz Pentium 815 processor is rated at 166.3 MIPS, and is priced at US$995 in 1000 unit quantities. [205.98] [265] [62] [550.29] [551.168,259] [557.134] [584.43] [689.115] [276]
I found 2 games of Fritz 2 from 1992. Scoring 1-1 Playing GM Kasparov. Fritz could be playing on a 386 or 486 processor in 1992. My guess would be the 486.

[pgn] [Event "Koln (5')"] [Site "Koln (5')"] [Date "1992.??.??"] [EventDate "?"] [Round "?"] [Result "0-1"] [White "Fritz (Computer)"] [Black "Garry Kasparov"] [ECO "B30"] [WhiteElo "?"] [BlackElo "?"] [PlyCount "77"] 1.e4 c5 2.Nf3 Nc6 3.Nc3 g6 4.d4 cxd4 5.Nxd4 Bg7 6.Be3 Nf6 7.Nxc6 bxc6 8.e5 Ng8 9.f4 Nh6 10.Qd2 O-O 11.O-O-O d6 12.exd6 exd6 13.Qxd6 Qxd6 14.Rxd6 Nf5 15.Rd3 Ba6 16.Bc5 Bxd3 17.Bxf8 Bxf1 18.Bxg7 Bxg2+ 19.Rg1 Kxg7 20.Rxg2 Rb8 21.Re2 Rh8 22.b3 h5 23.Kb2 h4 24.h3 Rd8 25.Ne4 Ng3 26.Nxg3 hxg3 27.Rg2 Rd4 28.Rxg3 Rxf4 29.Rc3 Rh4 30.Rxc6 Rxh3 31.Ra6 g5 32.Rxa7 g4 33.a4 g3 34.Ra5 Rh6 35.Rg5+ Rg6 36.Rxg3 Rxg3 37.c4 f5 38.b4 f4 39.Kc2 0-1[/pgn]

[pgn][Event "Koln (5')"] [Site "Koln (5')"] [Date "1992.??.??"] [EventDate "?"] [Round "?"] [Result "1-0"] [White "Fritz (Computer)"] [Black "Garry Kasparov"] [ECO "A07"] [WhiteElo "?"] [BlackElo "?"] [PlyCount "87"] 1.e4 c5 2.d3 Nc6 3.g3 g6 4.Bg2 Bg7 5.Nf3 d6 6.O-O e5 7.Bg5 f6 8.Be3 Nge7 9.a3 O-O 10.Nc3 Kh8 11.b4 b6 12.Rb1 Be6 13.b5 Nd4 14.a4 f5 15.Ng5 Bg8 16.exf5 Nexf5 17.Bxa8 Qxa8 18.Bxd4 Nxd4 19.Ne2 Nf5 20.c3 d5 21.Re1 h6 22.Nf3 g5 23.d4 e4 24.Ne5 Qe8 25.Nc6 Nd6 26.Nc1 Qd7 27.Nb3 Qh3 28.Qe2 Be6 29.dxc5 bxc5 30.Nxc5 Bg4 31.Qf1 Qh5 32.Rb3 Nc4 33.Nxe4 dxe4 34.Qxc4 Rxf2 35.h4 gxh4 36.Kxf2 hxg3 37.Ke3 Qg5+ 38.Kxe4 Qf5+ 39.Ke3 Qf3+ 40.Kd2 g2 41.Re8+ Bf8 42.Qd4+ Kh7 43.Rb1 Bg7 44.Re7 1-0 [/pgn]
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: What engine breaks even with GMs in blitz?

Post by lkaufman »

mwyoung wrote: Tue Apr 09, 2019 6:59 am
mwyoung wrote: Tue Apr 09, 2019 6:38 am
lkaufman wrote: Sun Apr 07, 2019 6:32 pm In blitz (let's say 3' + 2" or as close to this as possible), the top engines today are far beyond human level. But how far down the list do we have to go to find engines (and specified hardware) that score evenly against GMs, preferably ones with known identities and ratings? I'm sure there is plenty of data to answer this question as countless games have been played online over the years, but does anyone actually have some data, such as "Engine xyz on one thread scored 50% against GMs averaging 2600 FIDE" for example? The question I'd like to answer is: How much would we have to add to CCRL blitz ratings to estimate the FIDE blitz rating of a human GM who would score 50% against it at 3' + 2"?
Hello Larry,

It looks like you are going to have to go down to the bottom of the list.

Here is a news report from 1994. About how Fritz 2 won against all the worlds best in 1994 in 5m blitz games.

https://www.independent.co.uk/arts-ente ... 38085.html

In 1994 at the time of the news report. The best processor was the Pentium.

And the report says: "When Intel sponsored the World Chess Express Challenge in Munich last Friday, they could never have hoped for such a good advertisement for their high-speed Pentium processor. It turned a good computer - Fritz 2 - into a world beater.:"

March 1994:
Intel introduces and ships faster Pentium chips, based on 0.6 micron BiCMOS manufacturing. The processor now includes clock-doubling of 1.5 or 2 time the external clock rate, allowing processor speeds of up to 100 MHz on a 50-66 MHz system bus. The processor also includes power management capabilities to allow stopping and restarting the processor. Code-name during development was P54C. The 60/90 MHz Pentium 735 processor is rated at 149.8 MIPS, and is priced at US$849 in 1000 unit quantities. The 66/100 MHz Pentium 815 processor is rated at 166.3 MIPS, and is priced at US$995 in 1000 unit quantities. [205.98] [265] [62] [550.29] [551.168,259] [557.134] [584.43] [689.115] [276]
I found 2 games of Fritz 2 from 1992. Scoring 1-1 Playing GM Kasparov. Fritz could be playing on a 386 or 486 processor in 1992. My guess would be the 486.

[pgn] [Event "Koln (5')"] [Site "Koln (5')"] [Date "1992.??.??"] [EventDate "?"] [Round "?"] [Result "0-1"] [White "Fritz (Computer)"] [Black "Garry Kasparov"] [ECO "B30"] [WhiteElo "?"] [BlackElo "?"] [PlyCount "77"] 1.e4 c5 2.Nf3 Nc6 3.Nc3 g6 4.d4 cxd4 5.Nxd4 Bg7 6.Be3 Nf6 7.Nxc6 bxc6 8.e5 Ng8 9.f4 Nh6 10.Qd2 O-O 11.O-O-O d6 12.exd6 exd6 13.Qxd6 Qxd6 14.Rxd6 Nf5 15.Rd3 Ba6 16.Bc5 Bxd3 17.Bxf8 Bxf1 18.Bxg7 Bxg2+ 19.Rg1 Kxg7 20.Rxg2 Rb8 21.Re2 Rh8 22.b3 h5 23.Kb2 h4 24.h3 Rd8 25.Ne4 Ng3 26.Nxg3 hxg3 27.Rg2 Rd4 28.Rxg3 Rxf4 29.Rc3 Rh4 30.Rxc6 Rxh3 31.Ra6 g5 32.Rxa7 g4 33.a4 g3 34.Ra5 Rh6 35.Rg5+ Rg6 36.Rxg3 Rxg3 37.c4 f5 38.b4 f4 39.Kc2 0-1[/pgn]

[pgn][Event "Koln (5')"] [Site "Koln (5')"] [Date "1992.??.??"] [EventDate "?"] [Round "?"] [Result "1-0"] [White "Fritz (Computer)"] [Black "Garry Kasparov"] [ECO "A07"] [WhiteElo "?"] [BlackElo "?"] [PlyCount "87"] 1.e4 c5 2.d3 Nc6 3.g3 g6 4.Bg2 Bg7 5.Nf3 d6 6.O-O e5 7.Bg5 f6 8.Be3 Nge7 9.a3 O-O 10.Nc3 Kh8 11.b4 b6 12.Rb1 Be6 13.b5 Nd4 14.a4 f5 15.Ng5 Bg8 16.exf5 Nexf5 17.Bxa8 Qxa8 18.Bxd4 Nxd4 19.Ne2 Nf5 20.c3 d5 21.Re1 h6 22.Nf3 g5 23.d4 e4 24.Ne5 Qe8 25.Nc6 Nd6 26.Nc1 Qd7 27.Nb3 Qh3 28.Qe2 Be6 29.dxc5 bxc5 30.Nxc5 Bg4 31.Qf1 Qh5 32.Rb3 Nc4 33.Nxe4 dxe4 34.Qxc4 Rxf2 35.h4 gxh4 36.Kxf2 hxg3 37.Ke3 Qg5+ 38.Kxe4 Qf5+ 39.Ke3 Qf3+ 40.Kd2 g2 41.Re8+ Bf8 42.Qd4+ Kh7 43.Rb1 Bg7 44.Re7 1-0 [/pgn]
I didn't remember these two events, but I suppose it makes sense that overall Fritz 2 would perform maybe 2750 or so at blitz overall mostly on a Pentium, because Rexchess performed in the 2500s around 1990 on a 486, and Fritz 2 was later and stronger. Considering the hardware avancement since the Pentium, I suppose that the estimates of raising CCRL blitz ratings by 500 for FIDE blitz rating equivalence should be revised upward quite a bit. I don't know if I even have any engine weak enough to play the same level on my 5 Ghz I7 as Fritz 2 did on a Pentium! Well, there's always the handicapped levels of Komodo, one of them must be suitable. But I'd have to have a weak enough engine to run it against to determine which level that would be! Any suggestions of engines of that level that are easy to download and problem-free?
Komodo rules!
mwyoung
Posts: 2727
Joined: Wed May 12, 2010 10:00 pm

Re: What engine breaks even with GMs in blitz?

Post by mwyoung »

lkaufman wrote: Tue Apr 09, 2019 7:21 am
mwyoung wrote: Tue Apr 09, 2019 6:59 am
mwyoung wrote: Tue Apr 09, 2019 6:38 am
lkaufman wrote: Sun Apr 07, 2019 6:32 pm In blitz (let's say 3' + 2" or as close to this as possible), the top engines today are far beyond human level. But how far down the list do we have to go to find engines (and specified hardware) that score evenly against GMs, preferably ones with known identities and ratings? I'm sure there is plenty of data to answer this question as countless games have been played online over the years, but does anyone actually have some data, such as "Engine xyz on one thread scored 50% against GMs averaging 2600 FIDE" for example? The question I'd like to answer is: How much would we have to add to CCRL blitz ratings to estimate the FIDE blitz rating of a human GM who would score 50% against it at 3' + 2"?
Hello Larry,

It looks like you are going to have to go down to the bottom of the list.

Here is a news report from 1994. About how Fritz 2 won against all the worlds best in 1994 in 5m blitz games.

https://www.independent.co.uk/arts-ente ... 38085.html

In 1994 at the time of the news report. The best processor was the Pentium.

And the report says: "When Intel sponsored the World Chess Express Challenge in Munich last Friday, they could never have hoped for such a good advertisement for their high-speed Pentium processor. It turned a good computer - Fritz 2 - into a world beater.:"

March 1994:
Intel introduces and ships faster Pentium chips, based on 0.6 micron BiCMOS manufacturing. The processor now includes clock-doubling of 1.5 or 2 time the external clock rate, allowing processor speeds of up to 100 MHz on a 50-66 MHz system bus. The processor also includes power management capabilities to allow stopping and restarting the processor. Code-name during development was P54C. The 60/90 MHz Pentium 735 processor is rated at 149.8 MIPS, and is priced at US$849 in 1000 unit quantities. The 66/100 MHz Pentium 815 processor is rated at 166.3 MIPS, and is priced at US$995 in 1000 unit quantities. [205.98] [265] [62] [550.29] [551.168,259] [557.134] [584.43] [689.115] [276]
I found 2 games of Fritz 2 from 1992. Scoring 1-1 Playing GM Kasparov. Fritz could be playing on a 386 or 486 processor in 1992. My guess would be the 486.

[pgn] [Event "Koln (5')"] [Site "Koln (5')"] [Date "1992.??.??"] [EventDate "?"] [Round "?"] [Result "0-1"] [White "Fritz (Computer)"] [Black "Garry Kasparov"] [ECO "B30"] [WhiteElo "?"] [BlackElo "?"] [PlyCount "77"] 1.e4 c5 2.Nf3 Nc6 3.Nc3 g6 4.d4 cxd4 5.Nxd4 Bg7 6.Be3 Nf6 7.Nxc6 bxc6 8.e5 Ng8 9.f4 Nh6 10.Qd2 O-O 11.O-O-O d6 12.exd6 exd6 13.Qxd6 Qxd6 14.Rxd6 Nf5 15.Rd3 Ba6 16.Bc5 Bxd3 17.Bxf8 Bxf1 18.Bxg7 Bxg2+ 19.Rg1 Kxg7 20.Rxg2 Rb8 21.Re2 Rh8 22.b3 h5 23.Kb2 h4 24.h3 Rd8 25.Ne4 Ng3 26.Nxg3 hxg3 27.Rg2 Rd4 28.Rxg3 Rxf4 29.Rc3 Rh4 30.Rxc6 Rxh3 31.Ra6 g5 32.Rxa7 g4 33.a4 g3 34.Ra5 Rh6 35.Rg5+ Rg6 36.Rxg3 Rxg3 37.c4 f5 38.b4 f4 39.Kc2 0-1[/pgn]

[pgn][Event "Koln (5')"] [Site "Koln (5')"] [Date "1992.??.??"] [EventDate "?"] [Round "?"] [Result "1-0"] [White "Fritz (Computer)"] [Black "Garry Kasparov"] [ECO "A07"] [WhiteElo "?"] [BlackElo "?"] [PlyCount "87"] 1.e4 c5 2.d3 Nc6 3.g3 g6 4.Bg2 Bg7 5.Nf3 d6 6.O-O e5 7.Bg5 f6 8.Be3 Nge7 9.a3 O-O 10.Nc3 Kh8 11.b4 b6 12.Rb1 Be6 13.b5 Nd4 14.a4 f5 15.Ng5 Bg8 16.exf5 Nexf5 17.Bxa8 Qxa8 18.Bxd4 Nxd4 19.Ne2 Nf5 20.c3 d5 21.Re1 h6 22.Nf3 g5 23.d4 e4 24.Ne5 Qe8 25.Nc6 Nd6 26.Nc1 Qd7 27.Nb3 Qh3 28.Qe2 Be6 29.dxc5 bxc5 30.Nxc5 Bg4 31.Qf1 Qh5 32.Rb3 Nc4 33.Nxe4 dxe4 34.Qxc4 Rxf2 35.h4 gxh4 36.Kxf2 hxg3 37.Ke3 Qg5+ 38.Kxe4 Qf5+ 39.Ke3 Qf3+ 40.Kd2 g2 41.Re8+ Bf8 42.Qd4+ Kh7 43.Rb1 Bg7 44.Re7 1-0 [/pgn]
I didn't remember these two events, but I suppose it makes sense that overall Fritz 2 would perform maybe 2750 or so at blitz overall mostly on a Pentium, because Rexchess performed in the 2500s around 1990 on a 486, and Fritz 2 was later and stronger. Considering the hardware avancement since the Pentium, I suppose that the estimates of raising CCRL blitz ratings by 500 for FIDE blitz rating equivalence should be revised upward quite a bit. I don't know if I even have any engine weak enough to play the same level on my 5 Ghz I7 as Fritz 2 did on a Pentium! Well, there's always the handicapped levels of Komodo, one of them must be suitable. But I'd have to have a weak enough engine to run it against to determine which level that would be! Any suggestions of engines of that level that are easy to download and problem-free?
I have tested all of these in the past, and they have worked. And they are the right vintage...
Put your laptop on power saving mode, and or use less time.

http://rebel13.nl/windows/rebel's%20with%20uci.html

MGP 1993.jpg
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: What engine breaks even with GMs in blitz?

Post by Laskos »

lkaufman wrote: Tue Apr 09, 2019 5:41 am
Laskos wrote: Mon Apr 08, 2019 9:56 pm
lkaufman wrote: Mon Apr 08, 2019 9:04 pm
Laskos wrote: Mon Apr 08, 2019 8:06 am
lkaufman wrote: Mon Apr 08, 2019 6:43 am I ran a twenty game blitz (3' + 2") knight odds match (two ply book for variety, both b1 and g1 knights) between Lc0 11248 (on my 2080) vs. Giraffe (best version, on my 5 GHz i7 laptop), with Giraffe as a proxy for Magnus Carlsen, a good one since it "knows" to simplify when up a piece while some similarly rated A/B engines may not know or appreciate this. "Magnus" won by 12 to 8 (no draws!). So perhaps it's not yet time to bet against the champ if such a match took place, but we're getting close it seems.
I am not sure at 3' + 2'' blitz, Lc0 11248 on 2080 is probably close at Knight odds to Carlsen, but my claim here:
http://www.talkchess.com/forum3/viewtop ... =2&t=69956

was that at longer 45' + 15'' a top GM might not win all 10 games out of 10 being a Knight up, more probably 7-9, 1-3 being drawn or even lost by the human. It would be fun to watch such a match, as the top human will be happy having an upper hand most of the times, while enjoying some 1-3 setbacks at Knight odds in 10 games! The prize could be proportional to (Wins - Draws - Losses) or even (Wins - Draws - 2*Losses), to give incentive to the human to not lose any points. Lc0 should be left with some temperature (say 0.5) for the first 4-5 moves, for diversity and for not playing into prepared openings. Or by providing Lc0 with a small, prepared book.
Since we already know that this same LCO network defeated GM Naroditsky by a wide margin in blitz games averaging around knight odds, and he is probably within a class of Magnus in blitz strength, we already have good reason to think knight odds vs. Carlsen would be close, so this test is mostly just to confirm that we can make meaningful predictions of human results by these simulartions. As for 45' + 15", what CCRL 40/40 rating do you think would come closest to matching Magnus in strength at 45' + 15" ?
I am on the phone now, but that's not that hard, there were games against top humans in 2003-2004 and we can extrapolate tc and hardware. Take some Fritz 8 or Junior 9 on one core CCRL 40/40 level. My estimate it is some Kasparov or Magnus level at 45' + 15''.
I believe those were roughly the engines that tied matches with Kasparov at standard time back then, probably running on four threads, though I'm not sure about that. I'm not sure how many times faster the hardware is today; I suppose we have to clarify whether we are talking about playing on the old machines CCRL uses as a standard or on our current machines. Since you are running your matches on your new machine, not on an old AMD like the ones used as reference by CCRL, I suppose we should pick an engine that would be Carlsen level on that hardware, and if so I think you are picking too strong an engine; I imagine that those two engines on your current machine on 1 thread are as good as whatever they had around 2003 on four threads, so it should be 2800+ level even at 40/2 hours, and hence a big favorite at 45' + 15". But you are more knowledgeable about hardware and a much better mathematician than I am, so please correct me if I am wrong.
I got fairly confused this morning, bad sleep probably :).

First baseline, which I remembered correctly:

From Wiki:
X3D Fritz was a version of the Fritz chess program, which in November 2003 played a four-game Human–computer chess match against world number one Grandmaster Garry Kasparov. The match was tied 2–2. Fritz ran on four Intel Pentium 4 Xeon CPUs at 2.8 GHz.

X3D Fritz is something in-between Fritz 8 and Fritz 8 Bilbao, and close in strength to them. Fritz 8 Bilbao itself played some 12 games against top humans (weaker than Kasparov) and won. So, all the data corroborated (similar data on some Junior 8 and 9 matches against top humans), it's reasonable to say that Fritz 8 Bilbao and Junior 9 on "four Intel Pentium 4 Xeon CPUs at 2.8 GHz" are some Kasparov/Kramnik level of 2003-4 at 40/2 hours.

One my i7 core is close to being 2.5 faster than one of those cores of Xeon. Their effective speed-up on 4 cores in those times was not that good, maybe around 2.8-3.0, so basically these Fritz and Junior engines on one my i7 core are level with Kasparov/Kramnik at 40/2 hours.

No need to extrapolate up to now.

There are two scaling issues needed for extrapolation:

1/ Scaling of human versus machine with TC
2/ Scaling of Knight odds with TC

And here I got stuck with this Leela thing. I am playing Lc0 11248 as the ODDS TAKER (handicapped engine by one Knight) and the conventional Arasan 14.3 as taking the KNIGHT UP chances. It's quite the opposite of handicapped very strong classical AB engine versus human, as we used to see all the handicaps and scaling behaviors. Here I am mimicking basically a super-human Lc0 being a Knight down against a pure classical engine Fritz 8.

Going to 1/ and 2/ --- what is "machine" in that scaling issue? The scaling of Knight odds is clear as a slope, it increases with TC, but I am not sure of magnitude, again, depending on what is "machine".

I understand your reasoning that Fritz 8, Junior 9 (both some Arasan 14.3 level) seem too strong at 45' + 15'', but my doubts about the validity of my and your usual reasoning are validated by the crazy result I got:

At 15' + 5'' in 10 games Lc0 11248 on 2070 (first 4 moves using temperature of 0.5) scored 4 Wins and 6 Losses against Arasan 14.3 on one strong i7 core at Knight Odds. I do not know what to make of this result and what it means. In our usual reasoning, that would mean that Lc0 11248 can score several wins against Magnus at Knight Odds in 10 games at 45' + 15'', but let's not say stupid things out of "usual reasoning".

I clearly need to change the opponent of Lc0 11248 from Arasan 14.3 to another Lc0 to mimic a human opponent. By the way, Arasan seems quite dumb in converting the advantage, and I need a good Leela net ID which knows how to convert large advantages. Do you know a net ID to play well being a Knight up? I will adjust its TC (or nodes) to mimic a 2800 FIDE level human, and I will explain how I did it.

And to add: CCRL ratings don't help me much here, they actually confused me more :). Humans are not obeying them, Leela is not obeying them, all this mess.