For a long time most of us wondered what magic was in Rybka that enabled it to play so strongly. Now we know that there is no magic. We have the sources of Ippolit and Stockfish, which demonstrate that nothing revolutionary has been achieved. However, they do demonstrate that substantial improvements to existing themes have led to substantial gains in playing strength. Looking at the Stockfish sources, I can see what was done, and I could then implement everything in an engine of my own. But that does not mean that I would understand why these specifics are stronger, or ever understand the reasoning of its authors. I want to know more than what is apparent from reading by rote source code that is void of the philosophy behind it. I want to know whether eliminating mistakes from the eval was less important or more important than any new stuff that was added. Do you have to improve the eval in order for some changes to the search to actually be effective? How much of the Elo gain was from the search, the eval, or the synergy between the changes to both? Just where does the improved strength come from, and why?
Crafty is such a carefully crafted program, by an author who understands 'alpha-beta theory' as well as or better than anyone else. So even though Crafty is also improving, why are the specifics in Stockfish so much stronger than the specifics in Crafty? I can understand a 50 Elo difference between the two, but not the gap that does exist.
Can we be enlightened, please?
Questions for the Stockfish team
-
- Posts: 3196
- Joined: Fri May 26, 2006 3:00 am
- Location: WY, USA
- Full name: Michael Sherwin
Questions for the Stockfish team
If you are on a sidewalk and the covid goes beep beep
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
-
- Posts: 5960
- Joined: Sun Jan 10, 2010 6:15 am
- Location: Maryland USA
Re: Questions for the Stockfish team
Move-ordering and singular extension probably account for a hundred Elo of the gap between Stockfish and Crafty. But as for the rest I too am puzzled.
-
- Posts: 613
- Joined: Sun Jan 18, 2009 7:03 am
Re: Questions for the Stockfish team
I don't like making comparisons with any specific program (like Crafty), but I can list some major points why I think Stockfish is stronger than most other modern programs:
* Relaxed singular extension
* Logarithmic LMR
* Complicated king safety evaluation
* Fine tuned tables and constants in evaluation
* Speed-optimized code
* Aggressive late move pruning at low depths
Joona Kiiski
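For readers who have not met two of the search items in the list above, here is a minimal sketch of what "logarithmic LMR" and "late move pruning at low depths" usually mean. It assumes a reduction of the form log(depth) * log(move number) / constant and a move-count threshold that grows with depth. The function names, the divisor 2.0, the depth limit 4 and the threshold 3 + depth^2 are all hypothetical values chosen for illustration; they are not taken from the Stockfish sources.

```python
import math


def lmr_reduction(depth: int, move_number: int, divisor: float = 2.0) -> int:
    """Plies to reduce a late quiet move by (hypothetical formula)."""
    # Assumption: no reduction at shallow depth or for the first few moves.
    if depth < 3 or move_number < 4:
        return 0
    # The reduction grows slowly (logarithmically) in both arguments.
    r = math.log(depth) * math.log(move_number) / divisor
    # Cap the reduction so the reduced search keeps at least one ply.
    return min(int(r), depth - 2)


def lmp_threshold(depth: int) -> int:
    """Number of moves searched before pruning starts (hypothetical)."""
    return 3 + depth * depth


def should_prune_late_move(depth: int, move_number: int,
                           is_quiet: bool, in_check: bool,
                           max_depth: int = 4) -> bool:
    """Late move pruning: skip quiet moves far down the move list."""
    # Assumption: only prune near the leaves, never when in check,
    # and never captures or other tactical moves.
    if depth > max_depth or in_check or not is_quiet:
        return False
    return move_number > lmp_threshold(depth)
```

With these made-up constants, the 8th move at depth 8 is reduced by 2 plies and the 32nd move at depth 16 by 4, while at depth 2 any quiet move after the 7th is skipped outright. A real engine tunes every one of these numbers by testing.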
-
- Posts: 2684
- Joined: Sat Jun 14, 2008 9:17 pm
Re: Questions for the Stockfish team
zamar wrote:I don't like making comparisons with any specific program (like Crafty), but I can list some major points why I think Stockfish is stronger than most other modern programs:
* Relaxed singular extension
* Logarithmic LMR
* Complicated king safety evaluation
* Fine tuned tables and constants in evaluation
* Speed-optimized code
* Aggressive late move pruning at low depths
I think that someone (read Bob) will have to say something on the above points.
I don't like this kind of discussion because it is very prone to handwaving (which is where I predict this thread is heading, BTW...)
Anyhow, one should first try to understand where the 80 Elo points of difference _still_ existing (after more than 2 years) with Glaurung 2.2 are hidden, and then, once that point is understood (hint: it won't be understood), put the remaining 260 points on the table.
-
- Posts: 3196
- Joined: Fri May 26, 2006 3:00 am
- Location: WY, USA
- Full name: Michael Sherwin
Re: Questions for the Stockfish team
mcostalba wrote:
zamar wrote:I don't like making comparisons with any specific program (like Crafty), but I can list some major points why I think Stockfish is stronger than most other modern programs:
* Relaxed singular extension
* Logarithmic LMR
* Complicated king safety evaluation
* Fine tuned tables and constants in evaluation
* Speed-optimized code
* Aggressive late move pruning at low depths
I think that someone (read Bob) will have to say something on the above points.
I don't like this kind of discussion because it is very prone to handwaving (which is where I predict this thread is heading, BTW...)
Anyhow, one should first try to understand where the 80 Elo points of difference _still_ existing (after more than 2 years) with Glaurung 2.2 are hidden, and then, once that point is understood (hint: it won't be understood), put the remaining 260 points on the table.
Here is all I know, as I have not done much of an examination of g2.2.
...
It uses Late Move Pruning if the move number is beyond a threshold value
There is some type of feedback from the search to the eval
The move ordering routine is quite 'slick' and far more sophisticated than, say, Crafty's
Tord was a student of Phalanx in which a certain type of extension is done
---
There is more, I would suspect, but I know that these things are not in Crafty. Am I on the right track?
However, instead of playing games with us you could just be nice and tell us! My mom has Alzheimer's and I have so far sacrificed two and a half years of my life to taking care of her day and night. When do you think that I have had the time to find out for myself the secrets of g2.2 or Stockfish?
-
- Posts: 20943
- Joined: Mon Feb 27, 2006 7:30 pm
- Location: Birmingham, AL
Re: Questions for the Stockfish team
mcostalba wrote:
zamar wrote:I don't like making comparisons with any specific program (like Crafty), but I can list some major points why I think Stockfish is stronger than most other modern programs:
* Relaxed singular extension
* Logarithmic LMR
* Complicated king safety evaluation
* Fine tuned tables and constants in evaluation
* Speed-optimized code
* Aggressive late move pruning at low depths
I think that someone (read Bob) will have to say something on the above points.
I don't like this kind of discussion because it is very prone to handwaving (which is where I predict this thread is heading, BTW...)
Anyhow, one should first try to understand where the 80 Elo points of difference _still_ existing (after more than 2 years) with Glaurung 2.2 are hidden, and then, once that point is understood (hint: it won't be understood), put the remaining 260 points on the table.
Don't follow your comments. G2.2 is not beating Crafty in any of my cluster testing; it has been an opponent for a good while, as has Toga2.
As far as the other stuff goes, you might be overstating the "exponential LMR" benefit. Just a small hint. Won't say how I _know_ this to be true, however.
(hint: it didn't come from "handwaving")
-
- Posts: 2041
- Joined: Wed Mar 08, 2006 8:30 pm
Re: Questions for the Stockfish team
bob wrote:G2.2 is not beating Crafty in any of my cluster testing,
Well Bob, then how about the latest Stockfishes?
-
- Posts: 2684
- Joined: Sat Jun 14, 2008 9:17 pm
Re: Questions for the Stockfish team
bob wrote: Don't follow your comments.
I have used official CEGT lists for single core CPU as reference for the numbers I have given.
-
- Posts: 20943
- Joined: Mon Feb 27, 2006 7:30 pm
- Location: Birmingham, AL
Re: Questions for the Stockfish team
ernest wrote:
bob wrote:G2.2 is not beating Crafty in any of my cluster testing,
Well Bob, then how about the latest Stockfishes?
Already posted some of those. +180 to +200. A gap. But not an insurmountable gap.
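To put a gap of that size in perspective, the standard logistic Elo model maps a rating difference to an expected score. The sketch below is only that textbook formula; the function name `expected_score` is made up for illustration and nothing here comes from any tester's actual tooling.

```python
def expected_score(elo_diff: float) -> float:
    """Expected score (win = 1, draw = 0.5) for the stronger side,
    given its rating advantage, under the logistic Elo model."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


if __name__ == "__main__":
    for diff in (50, 180, 200):
        print(f"+{diff} Elo -> expected score {expected_score(diff):.3f}")
```

By this formula, +180 to +200 Elo works out to an expected score of roughly 74% to 76% for the stronger engine, against about 57% for a 50 Elo edge.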
-
- Posts: 20943
- Joined: Mon Feb 27, 2006 7:30 pm
- Location: Birmingham, AL
Re: Questions for the Stockfish team
mcostalba wrote:
bob wrote: Don't follow your comments.
I have used official CEGT lists for single core CPU as reference for the numbers I have given.
I use my cluster test results exclusively. No book issues or anything else, just plain and simple "engine vs engine." Already found one serious timing bug that will influence rating lists that use "repeating" time controls, which I never use or test with (for example, 40/60 repeating: 40 moves in 60 minutes, then repeat). Gross error in time usage in that.
I also prefer Fischer-clock games, to avoid time scrambles.