1)I see no proof that rybka had no superior evaluation relative to weaker programs.Daniel Shawul wrote:I won't buy in to his 'appeal to humility' because I do not think being bad in move ordering indicates higher elo. It is absurd to try and justify negative correlation between move ordering and higher strength just because a strong engine performed bad. Strength is influenced by way too many factors to make any conclusions. Every one belived Rybka had super evaluation until it is proved not to be so. Therefore we need to make a proper test of each move ordering component for the test to be un-flawed by 'percieved elo of engine', otherwise it is no better than the Rybka PR.
Note that more complex does not mean better.
2)I think that the test proves nothing about quality of move ordering
and I suspect that komodo is going to get "better" move order if Don change komodo's evaluation to only material evaluation.
I do not see a way to compare move ordering between different programs.
statistics about cutoffs show nothing here.
Even when we talk about the same program statistics may be misleading when we do not change the order of moves.
For example one of the changes in latest stockfish
is changing
const int GrainSize = 8;
to
const int GrainSize = 4;
GrainSize=4 means that the evaluation is more accurate and I think that more accurate evaluation means more fail high not in the first move.
Note that Stockfish has the following code in the evaluation:
return Value((result + GrainSize / 2) & ~(GrainSize - 1));