Adam Hair wrote:Ajedrecista wrote:
Adam said that, taking his regression as the best estimator, then Kai's method is better than mine... at least I tried it! I am happy because I started this subtopic inside the main topic and some people came with their work... I do not know if Kai already used D/[µ*(1 - µ)] before the start of this thread, but I am sure that Adam thought about his regression model in these few days! Congratulations to both of you.
Regards from Spain.
Ajedrecista.
We can credit
Kirill Kryukov for applying regression methods to determining the draw rate characteristics of engines. What I have done is simply a continuation of his ideas.
It is not certain that your method does not work better for other data. I do suspect some modification of your formula or Kai's formula is needed.
Thanks for starting this, Jesús (and Don too).
To both Jesus and Adam:
One thing bothers me in the regression is that it takes into account that the strength correlates with draw rate. For round-robin it simply means that it correlates with score (µ). I am not sure we have to account for this, or this is a property of engines, and it simply means that stronger engines are less draw averse.
I adapted my recipe D/(µ(1-µ)) to take account of this correlation (though I am not sure it's needed) to D/((1-µ)*µ^1.10). I will call this "Kai modified", for taking into account the strength of the engines.
The IPON table now looks:
Code: Select all
Name Score D D_max k k*u*(1-u) D/(u*(1-u)) Draw dev. D/(u^1.10*(1-u))
Deep Junior 13.3 39.00% 34.00% 78.00% 0.4359 0.1037 1.429 -5.21% 1.57
Houdini 3 STD 82.00% 24.00% 36.00% 0.6667 0.0984 1.626 -2.05% 1.66
Gull 1.2 45.00% 39.00% 90.00% 0.4333 0.1073 1.576 -1.93% 1.71
Quazar 0.4 36.00% 37.00% 72.00% 0.5139 0.1184 1.606 -1.07% 1.78
HIARCS 14 WCSC 32b 48.00% 40.00% 96.00% 0.4167 0.1040 1.603 -1.06% 1.73
Komodo 5 73.00% 34.00% 54.00% 0.6296 0.1241 1.725 -0.19% 1.78
Protector 1.4.0 39.00% 39.00% 78.00% 0.5 0.1190 1.639 -0.16% 1.80
Deep Shredder 12 45.00% 40.00% 90.00% 0.4444 0.1100 1.616 -0.08% 1.75
Deep Fritz 13 32b 51.00% 40.00% 98.00% 0.4082 0.1020 1.601 -0.07% 1.71
Hannibal 1.2 45.00% 40.00% 90.00% 0.4444 0.1100 1.616 -0.04% 1.75
spark-1.0 41.00% 39.00% 82.00% 0.4756 0.1151 1.612 -0.03% 1.76
Zappa Mexico II 32.00% 35.00% 64.00% 0.5469 0.1190 1.608 0.00% 1.80
Critter 1.4a 71.00% 37.00% 58.00% 0.6379 0.1314 1.797 0.10% 1.86
Spike 1.4 32b 42.00% 40.00% 84.00% 0.4762 0.1160 1.642 0.24% 1.79
Deep Sjeng c't 2010 32b 43.00% 41.00% 86.00% 0.4767 0.1169 1.673 0.45% 1.82
Naum 4.2 50.00% 42.00% 100.00% 0.42 0.1050 1.680 0.63% 1.80
MinkoChess 1.3 31.00% 36.00% 62.00% 0.5806 0.1242 1.683 0.69% 1.89
Chiron 1.5 52.00% 42.00% 96.00% 0.4375 0.1092 1.683 0.87% 1.80
Stockfish 2.2.2 JA 69.00% 40.00% 62.00% 0.6452 0.1380 1.870 1.68% 1.94
Deep Rybka 4.1 68.00% 40.00% 64.00% 0.625 0.1360 1.838 2.41% 1.91
Now I give correlations:
Correlation[Adam, Jesus] = 0.61
Correlation[Adam, Kai] = 0.81
Correlation[Adam, Kai modified] = 0.89
Correlation[Jesus, Kai] = 0.79
So, my "strength adjusted" correlates well with the regression (0.89), but again, I don't know if we have to account for absolute strength, and not just for relative scores.
Kai