Correcting Evaluation with the hash table

mjlef · Post by **mjlef** » Fri Feb 05, 2010 10:05 pm

OK, here is the idea. Programs get scores from both board evaluations, and from searches stored in the hash table (often the hash table just has limits). I noticed that the score limit from the hash could often be used to correct the score estimate from the evaluation function..For example, if the evaluation returned say -10, but the hash table entry for this position is an upper limit and is say -20, then we can correct the current evaluation and make it -20, closer to the "truth".

How could this be useful? Well, in the qsearch, we assume (unless in check) that the side to move can stand pat and accept the current score. If this score is too high from the evaluation, then the search misses the fact that if it was forced to move, it would lose something. This should make the qsearch more immune to zugzwang situations. Also, programs often use the current evaluationin pruning decisions. Correcting the eval using the hash values should make these decisions more accurate, since they are returned fomr an actual search.

So, what do you think? Worth testing?

Mark

Edsel Apostol · Post by **Edsel Apostol** » Fri Feb 05, 2010 11:39 pm

mjlef wrote:OK, here is the idea. Programs get scores from both board evaluations, and from searches stored in the hash table (often the hash table just has limits). I noticed that the score limit from the hash could often be used to correct the score estimate from the evaluation function..For example, if the evaluation returned say -10, but the hash table entry for this position is an upper limit and is say -20, then we can correct the current evaluation and make it -20, closer to the "truth".

How could this be useful? Well, in the qsearch, we assume (unless in check) that the side to move can stand pat and accept the current score. If this score is too high from the evaluation, then the search misses the fact that if it was forced to move, it would lose something. This should make the qsearch more immune to zugzwang situations. Also, programs often use the current evaluationin pruning decisions. Correcting the eval using the hash values should make these decisions more accurate, since they are returned fomr an actual search.

So, what do you think? Worth testing?

Mark

I'm currently doing this on the latest public TL. It seems to help but not much though. I have not tested this idea much so it would be great to have another opinion from other programmers. I also think that it might make the pruning more conservative. Please do let me know the results if you managed to test it yourself.

rvida · Post by **rvida** » Fri Feb 05, 2010 11:52 pm

I'm doing this in Critter. But I'm not quite happy with it. It helps a little bit, but it causes some pretty bad instabilities in search.
A typical example: A move fails high on zero window search, but the widened window re-search does not see why it failed high, because the offending hash entry was in the meantime overwritten.

rvida · Post by **rvida** » Fri Feb 05, 2010 11:58 pm

Maybe this is not a problem for other engines, but Critter uses fail-soft with aspiration windows. It is easy to overshot the AW, and the search with widened bounds can take forever to finish - only to discover that the exact score was well insiede the original AW.

BubbaTough · Post by **BubbaTough** » Sat Feb 06, 2010 2:12 am

rvida wrote:I'm doing this in Critter. But I'm not quite happy with it. It helps a little bit, but it causes some pretty bad instabilities in search.
A typical example: A move fails high on zero window search, but the widened window re-search does not see why it failed high, because the offending hash entry was in the meantime overwritten.

This kind of thing can also happen with lazy eval too. All part of the fun.

-Sam

mcostalba · Post by **mcostalba** » Sat Feb 06, 2010 11:12 am

mjlef wrote: So, what do you think? Worth testing?

We are currently experimenting with this. Sometime it works, sometime doesn't. With futility it seems it does not work....but we are still investigating on it.

diep · Post by **diep** » Mon Feb 08, 2010 2:40 pm

mjlef wrote:OK, here is the idea. Programs get scores from both board evaluations, and from searches stored in the hash table (often the hash table just has limits). I noticed that the score limit from the hash could often be used to correct the score estimate from the evaluation function..For example, if the evaluation returned say -10, but the hash table entry for this position is an upper limit and is say -20, then we can correct the current evaluation and make it -20, closer to the "truth".

How could this be useful? Well, in the qsearch, we assume (unless in check) that the side to move can stand pat and accept the current score. If this score is too high from the evaluation, then the search misses the fact that if it was forced to move, it would lose something. This should make the qsearch more immune to zugzwang situations. Also, programs often use the current evaluationin pruning decisions. Correcting the eval using the hash values should make these decisions more accurate, since they are returned fomr an actual search.

So, what do you think? Worth testing?

Mark

If i have information from hashtable it usually gives a cutoff in qsearch.

Mind sharing rough pseudo code with us if you're not describing the usual hashtable cutoff?

If we write down things in vague conceptual words, of course things can get interpreted in a wide manner. In Amsterdam we have a lot of cafe's where there is people who are very professional in saying things in a vague manner, meanwhile smoking very BIG cigarettes.

With pseudo code that is tougher.

Thanks,
Vincent

metax · Post by **metax** » Mon Feb 08, 2010 3:27 pm

diep wrote:Mind sharing rough pseudo code with us if you're not describing the usual hashtable cutoff?

If I have understood this correctly:

Code: Select all

int CorrectEvalScore&#40;int eval, int ttValue, int ttBound&#41;
&#123;
   if &#40;bound == LOWERBOUND&#41;
   &#123;
      return max&#40;eval, ttValue&#41;;
   &#125;
   if &#40;bound == UPPERBOUND&#41;
   &#123;
      return min&#40;eval, ttValue&#41;;
   &#125;
   return eval;
&#125;

And in search, after handling usual TT cut-offs etc.:

Code: Select all

// handle TT cut-offs
eval = CorrectEvalScore&#40;Evaluate&#40;), ttValue, ttBound&#41;;

rvida · Post by **rvida** » Mon Feb 08, 2010 3:43 pm

diep wrote:
If i have information from hashtable it usually gives a cutoff in qsearch.

Mind sharing rough pseudo code with us if you're not describing the usual hashtable cutoff?

He is talking about something like this:

Code: Select all

if &#40;hash_entry->bound == EXACT&#41; 
  stand_pat = hash_entry->score;
else &#123;
  stand_pat = position->eval&#40;);
  if &#40;hash_entry->bound == LOWER_BOUND&#41;
    stand_pat = max&#40;stand_pat, hash_entry->score&#41;;
  else
    if &#40;hash_entry->bound == UPPER_BOUND&#41;
      stand_pat = min&#40;stand_pat, hash_entry->score&#41;;
&#125;

if &#40;stand_pat >= beta&#41; 
  return stand_pat;

this "imporved" stand pat score can be later useful in futility (delta) pruning decision too.

hgm · Post by **hgm** » Tue Feb 09, 2010 10:03 am

diep wrote:If i have information from hashtable it usually gives a cutoff in qsearch.

I guess the proposed scheme would only resort any effect in the QS at the end of the PV branch. There the hash bound would not produce a cutoff as the window is still open, but the current eval could be in contradiction with the score bound from a deeper search.

Rather than using the bound in that case, I would be inclined to force an extension. If it happens in so few leaf nodes, this can hardy be costly.

Correcting Evaluation with the hash table

Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table

Re: Correcting Evaluation with the hash table