And yes testing conditions matter in testing. That is why testing sites use different rating list for different time controls.AndrewGrant wrote: ↑Wed Dec 30, 2020 12:33 pmIndeed. But, I warn them of doing that, for this has greater implications, which they do not yet understand ....
CCC has serious hardware update!
Moderators: hgm, Rebel, chrisw
-
- Posts: 2727
- Joined: Wed May 12, 2010 10:00 pm
Re: CCC has serious hardware update!
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
But my words like silent raindrops fell. And echoed in the wells of silence.
-
- Posts: 253
- Joined: Mon Nov 16, 2020 12:13 pm
- Full name: Manuel Rivera
Re: CCC has serious hardware update!
why talking about x cores vs n cores ? Just test x nodes vs n nodes with and without NNUE to see the difference ?
Raspberry Pi4 bot : https://lichess.org/@/BetterAnalyze
-
- Posts: 708
- Joined: Mon Jan 16, 2012 6:34 am
Re: CCC has serious hardware update!
That testing condition is only useful on fixed number of cores only.
The quality of search coming from less number of cores is much much better than same nodes from multi-cores. ( e.g 1 core vs 8 cores)
-
- Posts: 3293
- Joined: Wed Mar 08, 2006 8:15 pm
Re: CCC has serious hardware update!
BTW why has Dragon much higher NPS than Stockfish now? Dragon 250 Mnps and SF 150 Mnps.
Jouni
-
- Posts: 2727
- Joined: Wed May 12, 2010 10:00 pm
Re: CCC has serious hardware update!
I posted data that cleared ccrl. Under ccrl flawed testing thread. And as predicted you are trashing others for their data. I have more data that can be post from other testing sites. Showing the same lack of scaling. I can post that so you can trash them also.AndrewGrant wrote: ↑Wed Dec 30, 2020 8:27 amThere were many threads addressing the issues with CCRLs methods and how it compresses elo and distorts values due to the opponent pool. If you have data that contradicts the typical scaling, then you should post it. You need thousands of games in your sample. Share the conditions, the opening book, and make sure to understand cores vs threads.mwyoung wrote: ↑Wed Dec 30, 2020 3:25 am You can trash me, CCRL, and others. Because you do not like the facts. But the facts are not going away with how NNUE scales.
Lets look not at my data, but CCRL blitz data. And lets see how NNUE behaves to a typical A/B engine like Ethereal.
1. Ethereal 12.75 64-bit 8CPU 3537 +20 −20 39.9% +61.4 57.2% 683
2. Ethereal 12.75 64-bit 3377 +22 −22 64.8% −86.2 56.5% 600
CCRL rating difference for 1 to 8 cores. 160 Elo
1. Stockfish 12 64-bit 8CPU 3692 +15 −14 71.5% −147.4 54.5% 1532
2. Stockfish 12 64-bit 3639 +15 −14 79.0% −226.6 36.1% 2054
CCRL rating difference from 1 to 8 cores. 53 Elo
"So you start your initial core doubling at +70 per doubling"
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
But my words like silent raindrops fell. And echoed in the wells of silence.
-
- Posts: 10312
- Joined: Thu Mar 09, 2006 12:37 am
- Location: Tel-Aviv Israel
Re: CCC has serious hardware update!
The only proof to show lack of scaling of nnue engines is if some non nnue engine can beat them at longer time control.mwyoung wrote: ↑Wed Dec 30, 2020 4:33 pmI posted data that cleared ccrl. Under ccrl flawed testing thread. And as predicted you are trashing others for their data. I have more data that can be post from other testing sites. Showing the same lack of scaling. I can post that so you can trash them also.AndrewGrant wrote: ↑Wed Dec 30, 2020 8:27 amThere were many threads addressing the issues with CCRLs methods and how it compresses elo and distorts values due to the opponent pool. If you have data that contradicts the typical scaling, then you should post it. You need thousands of games in your sample. Share the conditions, the opening book, and make sure to understand cores vs threads.mwyoung wrote: ↑Wed Dec 30, 2020 3:25 am You can trash me, CCRL, and others. Because you do not like the facts. But the facts are not going away with how NNUE scales.
Lets look not at my data, but CCRL blitz data. And lets see how NNUE behaves to a typical A/B engine like Ethereal.
1. Ethereal 12.75 64-bit 8CPU 3537 +20 −20 39.9% +61.4 57.2% 683
2. Ethereal 12.75 64-bit 3377 +22 −22 64.8% −86.2 56.5% 600
CCRL rating difference for 1 to 8 cores. 160 Elo
1. Stockfish 12 64-bit 8CPU 3692 +15 −14 71.5% −147.4 54.5% 1532
2. Stockfish 12 64-bit 3639 +15 −14 79.0% −226.6 36.1% 2054
CCRL rating difference from 1 to 8 cores. 53 Elo
"So you start your initial core doubling at +70 per doubling"
draws prove nothing because it is possible that you do not earn rating above some elo by doubling the time control.
-
- Posts: 2727
- Joined: Wed May 12, 2010 10:00 pm
Re: CCC has serious hardware update!
That is clearly not correct. And that is not the only way.Uri Blass wrote: ↑Wed Dec 30, 2020 5:19 pmThe only proof to show lack of scaling of nnue engines is if some non nnue engine can beat them at longer time control.mwyoung wrote: ↑Wed Dec 30, 2020 4:33 pmI posted data that cleared ccrl. Under ccrl flawed testing thread. And as predicted you are trashing others for their data. I have more data that can be post from other testing sites. Showing the same lack of scaling. I can post that so you can trash them also.AndrewGrant wrote: ↑Wed Dec 30, 2020 8:27 amThere were many threads addressing the issues with CCRLs methods and how it compresses elo and distorts values due to the opponent pool. If you have data that contradicts the typical scaling, then you should post it. You need thousands of games in your sample. Share the conditions, the opening book, and make sure to understand cores vs threads.mwyoung wrote: ↑Wed Dec 30, 2020 3:25 am You can trash me, CCRL, and others. Because you do not like the facts. But the facts are not going away with how NNUE scales.
Lets look not at my data, but CCRL blitz data. And lets see how NNUE behaves to a typical A/B engine like Ethereal.
1. Ethereal 12.75 64-bit 8CPU 3537 +20 −20 39.9% +61.4 57.2% 683
2. Ethereal 12.75 64-bit 3377 +22 −22 64.8% −86.2 56.5% 600
CCRL rating difference for 1 to 8 cores. 160 Elo
1. Stockfish 12 64-bit 8CPU 3692 +15 −14 71.5% −147.4 54.5% 1532
2. Stockfish 12 64-bit 3639 +15 −14 79.0% −226.6 36.1% 2054
CCRL rating difference from 1 to 8 cores. 53 Elo
"So you start your initial core doubling at +70 per doubling"
draws prove nothing because it is possible that you do not earn rating above some elo by doubling the time control.
We have lots of data from myself and other testers showing the lack of scaling with more cores. Showing many doublings in cpu, but minimal increase in Elo. Not even 70 Elo.
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
But my words like silent raindrops fell. And echoed in the wells of silence.
-
- Posts: 550
- Joined: Tue Nov 19, 2019 8:48 pm
- Full name: Alayan Feh
Re: CCC has serious hardware update!
The obvious question is why SF classical isn't performing much closer to SF NNUE on high core count machine, then.
-
- Posts: 2727
- Joined: Wed May 12, 2010 10:00 pm
Re: CCC has serious hardware update!
I guess because it is not a classical engine. SF classical is just getting crushed, by NNUE with 2 cores vs 16 for Stockfish classical. . NNUE scales like crap, but it is so strong it does not matter.
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
But my words like silent raindrops fell. And echoed in the wells of silence.
-
- Posts: 536
- Joined: Thu Mar 09, 2006 3:01 pm
Re: CCC has serious hardware update!
SF-NNUE scaled just fine going from 1 to 2, 4, and 8 threads (on 12 core box).
I did not test beyond 8, and the big tourney rigs have way more than 8 of course...
Results posted in SF-NNUE Discord several months ago, IIRC.
I did not test beyond 8, and the big tourney rigs have way more than 8 of course...
Results posted in SF-NNUE Discord several months ago, IIRC.