
Post subject: Re: EloStat, Bayeselo and Ordo    Posted: Sun Jun 24, 2012 9:46 pm

 hgm wrote: There are two separate issues here: the correctness of the model (logistic, Gaussian, linear) and the correctness of the analysis once the model is given (to determine the parameters).

Yes, but if I am not wrong, given the Bayeselo results and the true percentages, the result here can be put on Edmund's plot "White Score against Elo-delta". Your question about compression came from that plot.
 Quote: For a Logistic model with (small) draw margin m, the probability for a draw,
     L(x+m) - L(x-m) ~ d/dx L(x),
 is proportional to the probability for one loss + one win, L(x) * (1 - L(x)). So one observed draw has the same effect on the likelihood of x (the rating difference) as one win + one loss.
 With N wins and M losses the likelihood of x is
     L(x)^N * (1-L(x))^M,
 which is maximum when
     (N*L(x)^(N-1) * (1-L(x))^M - M*L(x)^N * (1-L(x))^(M-1)) * dL/dx(x) = 0
 or
     N*(1-L(x)) = M*L(x)
     N = (N+M) * L(x)
     L(x) = N/(N+M)
 i.e. the expected formula based on the fraction of wins.
 But as draws count for win + loss, a 15-5 result based on 15 wins and 5 losses has L(x) = 0.75, while one based on 10 wins plus 10 draws would give the same as for 20 wins plus 10 losses, i.e. L(x) = 0.66.
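
The quoted argument can be checked numerically. The following is only a sketch, assuming the logistic curve in natural units, L(x) = 1/(1 + exp(-x)), for which dL/dx = L(x) * (1 - L(x)) holds exactly; the function names and the test values of x and the margin are made up for illustration. It compares the draw probability with its first-order form 2*m*L*(1-L) and evaluates the closed-form maximum N/(N+M) for the two 15-5 results mentioned in the quote.

```python
import math

def logistic(x):
    """Logistic curve in natural units: L(x) = 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))

def draw_prob(x, margin):
    """Draw probability for a draw margin: L(x + margin) - L(x - margin)."""
    return logistic(x + margin) - logistic(x - margin)

def ml_win_prob(wins, draws, losses):
    """Maximiser of L^N * (1-L)^M when each draw counts as one win plus one loss."""
    n_eff = wins + draws    # effective number of wins, N
    m_eff = losses + draws  # effective number of losses, M
    return n_eff / (n_eff + m_eff)

# Arbitrary illustrative values (assumptions, not taken from the thread).
x, margin = 0.8, 0.01

# Ratio of the draw probability to 2*margin*L*(1-L); close to 1 for a small margin.
first_order = 2.0 * margin * logistic(x) * (1.0 - logistic(x))
ratio = draw_prob(x, margin) / first_order
print(ratio)

print(ml_win_prob(15, 0, 5))   # 15 wins, 5 losses   -> 0.75
print(ml_win_prob(10, 10, 0))  # 10 wins, 10 draws   -> 20/30
```

The constant factor 2*m does not depend on x, so it does not move the maximum of the likelihood; this holds only to first order in m, so the equivalence is an approximation once the margin is not small.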

That's fine, I have already seen something similar. But is a draw merely proportional to one win plus one loss, or exactly equal? Second, I don't think draws are equal to any win-loss combination in the sense of statistical weight (even for the maximum likelihood method); one has to use some summed-up trinomial distribution giving the same percentage, with varying N, M and draws.
My problem is a bit different: if you are right, then I don't know what "rating" is supposed to mean. In the absence of any other information, what is the rating difference between two engines scoring +60 =30 -10 against each other? Is it hard, or can I do it by hand?
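
For what it is worth, the by-hand calculation for +60 =30 -10 could be sketched as follows. It assumes the conventional logistic Elo scale, delta = 400 * log10(p / (1 - p)), and compares two readings of the result: the quoted rule that each draw counts as one win plus one loss, and the plain score percentage. The function names are hypothetical, and this is not a claim about what EloStat, Bayeselo or Ordo actually compute.

```python
import math

def elo_from_prob(p):
    """Rating difference on the 400*log10 logistic Elo scale (an assumed convention)."""
    return 400.0 * math.log10(p / (1.0 - p))

def elo_draw_as_win_plus_loss(wins, draws, losses):
    """Each draw counted as one win plus one loss, as in the quoted argument."""
    n_eff = wins + draws
    m_eff = losses + draws
    return elo_from_prob(n_eff / (n_eff + m_eff))

def elo_from_score(wins, draws, losses):
    """Each draw counted as half a point, i.e. the plain score percentage."""
    games = wins + draws + losses
    return elo_from_prob((wins + 0.5 * draws) / games)

# +60 =30 -10: effective 90 wins and 40 losses versus a 75% score.
print(round(elo_draw_as_win_plus_loss(60, 30, 10), 1))  # 140.9
print(round(elo_from_score(60, 30, 10), 1))             # 190.8
```

So the same match gives roughly 141 or 191 Elo depending on how the draws are weighted. Note that the first figure relies on the small-draw-margin approximation, which is doubtful with 30% draws.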

Kai
