Post
by hgm » Tue Mar 18, 2014 8:44 pm
I think 'evaluation discontinuity' is not really the main reason why you would want 'tapered evaluation'. The idea is really a very natural one. The prospects of winning the game, for which evaluation is supposed to give a heuristic estimate, depends not purely on the presence of an advantage (say that Knight that you are ahead), but also on how much material there is is still on the board ('game phase'). It is often the relative advantage that counts, K+N vs K (with equal Pawns) is usually a trivial win, but add two Rooks and a Queen for each side and everything is still possible. In other cases (e.g. King Safety), the advantage becomes less important if there is less material on the board. Having your King safely tucked away in a corner behind Pawns becomes a bit pointless when the opponent only has a Knight left.
But these advantages do not change abruptly, but graduately. There isn't a point in the game where you can say "before trading this Knight, King safety is all important, but now they are traded, it has no importance at all". Sometimes the presence of a Queen is taken as the defining characteristic of an end-game, but taking a King stroll through the center is still completely suicidal if the opponent has no Queen, but still two Rooks, Bishops and Knights. While if he only has a Queen, you can, and often must chance it.
So common sense dictates that the advantage of a better King Safety does not disappear all at once, but gradually as material is traded away. Obviously trading away Queens does more to making the board safer without Pawn shield than trading away minors. But trading away all minors does help a great deal as well. To reflect that in the evaluation you need to 'taper' the King-safety bonus gradually from large to zero when (opponent) material decreases from maximum to bare King. That this makes the evaluation more continuous is just an additional advantage thrown in for free. However, if the search had been such that discontinuous jumps where the best thing in the world for it, it would still have been very wrong to use it. Because it does not reflect the reality of the game.
Interpolating with some game-phase variable based on some weighted counting of the present material does provide an incentive to the program to advance the game phase when it possesses an advantage that is more important in the end-game. E.g. when you are a Knight ahead, and the end-game value of a Knight is higher than the opening value, it can increase its score by trading equal material, because that advances the game-phase towards the end-game. You want that to hold for any material, not just for trading Queens. Or even when you weight in other pieces, you don't want the bonus for trading to only occur when the material drops from above a certain value to below it, and not care about further trading afterwards. Any equal trade will help to express that extra Knight better.
But trading Pawns just makes the game more drawish when you are a minor ahead. Therefore it is a bad idea to weight Pawns into the game phase. This is a good thing, because Pawns also don't provide very much danger to a King, so that you can use the same game-phase measure to interpolate King-Safety and piece values.