Name for elo without draws?
Moderators: bob, hgm, Harvey Williamson
Name for elo without draws?
If we have N games with results W, D and L (with W+D+L=N), the elo difference is calculated from P = (0*L + 0.5*D + 1*W) / N.
We can throw away the draws and compute elo' from P' = W / (N - D) using the same elo formula.
What you then get I now call 'pseudoelo', but I'm wondering if there is a standard name for that quantity already.
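The two quantities can be compared side by side with a minimal sketch. The thread does not spell out "the elo formula", so the usual logistic form (elo = -400 * log10(1/P - 1)) is assumed here; the function names and the sample counts W=40, D=40, L=20 are made up for illustration.

```python
import math


def elo_from_score(p):
    """Elo difference implied by a score fraction p, with 0 < p < 1.

    Assumption: the standard logistic Elo curve.
    """
    return -400.0 * math.log10(1.0 / p - 1.0)


def elo_and_pseudoelo(w, d, l):
    """Return (elo, pseudoelo) for w wins, d draws and l losses."""
    n = w + d + l
    p = (0.0 * l + 0.5 * d + 1.0 * w) / n  # draws count as half a point
    p_prime = w / (n - d)                  # draws thrown away: W / (W + L)
    return elo_from_score(p), elo_from_score(p_prime)


# Hypothetical match result, not taken from the thread.
elo, pseudo = elo_and_pseudoelo(w=40, d=40, l=20)
print(f"elo = {elo:+.1f}, pseudoelo = {pseudo:+.1f}")
```

Note that when one side has no losses at all, P' is exactly 1 and the logistic formula has no finite value, so this sketch would raise an error rather than return a pseudoelo.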
[Account deleted]

 Posts: 10000
 Joined: Wed Mar 08, 2006 7:57 pm
 Location: Redmond, WA USA
 Contact:
Re: Name for elo without draws?
mvk wrote:If we have N games with results W, D and L (with W+D+L=N), the elo difference is calculated from P = (0*L + 0.5*D + 1*W) / N.
We can throw away the draws and compute elo' from P' = W / (N - D) using the same elo formula.
What you then get I now call 'pseudoelo', but I'm wondering if there is a standard name for that quantity already.

I would call it wrong Elo.
Imagine two computer players A and B, very evenly matched.
They play one million and two games.
Players A and B draw one million times.
B wins the other two games.
Is B dominantly better than A?
No.
B has EXACTLY the SAME strength as A.
To calculate otherwise is simply incorrect.
Re: Name for elo without draws?
Thank you. As this "pseudoelo", as I call it now, emerges from my evaluation's draw model, I think "wrong elo" is a rather poor name. I'm just looking if the quantity is already named elsewhere or not. I'm not looking for a value judgement.
[Account deleted]

 Posts: 685
 Joined: Tue May 22, 2007 9:13 am
Re: Name for elo without draws?
Dann Corbit wrote:
mvk wrote:If we have N games with results W, D and L (with W+D+L=N), the elo difference is calculated from P = (0*L + 0.5*D + 1*W) / N.
We can throw away the draws and compute elo' from P' = W / (N - D) using the same elo formula.
What you then get I now call 'pseudoelo', but I'm wondering if there is a standard name for that quantity already.

I would call it wrong Elo.
Imagine two computer players A and B, very evenly matched.
They play one million and two games.
Players A and B draw one million times.
B wins the other two games.
Is B dominantly better than A?
No.
B has EXACTLY the SAME strength as A.
To calculate otherwise is simply incorrect.

Even though the ELO difference for this example is tiny, the Likelihood of Superiority of B over A is 7/8. In the words of Remi Coulom: A draw will at the same time make estimated Elo ratings closer to each other, and reduce the width of confidence intervals. It does this in such a way that the LOS does not change.
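The 7/8 figure can be reproduced under one common assumption, which the post itself does not state: put a uniform prior on q, the chance that a decisive game goes to B, so that after W wins and L losses q follows a Beta(W+1, L+1) distribution and LOS = P(q > 1/2). For integer counts this reduces to a binomial sum. The function name `los` is hypothetical, and draws never appear in it.

```python
from fractions import Fraction
from math import comb


def los(wins, losses):
    """Likelihood of superiority under a uniform prior on q.

    Assumption: q ~ Beta(wins + 1, losses + 1) and LOS = P(q > 1/2).
    For integer counts this equals P(Binomial(n, 1/2) <= wins)
    with n = wins + losses + 1. Draws are not an input.
    """
    n = wins + losses + 1
    favourable = sum(comb(n, k) for k in range(wins + 1))
    return Fraction(favourable, 2 ** n)


print(los(2, 0))  # 7/8
```

Because the number of draws is not an argument, two wins in two games and two wins in a million and two games give the same value under this model.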
Re: Name for elo without draws?
mvk wrote:If we have N games with results W, D and L (with W+D+L=N), the elo difference is calculated from P = (0*L + 0.5*D + 1*W) / N.
We can throw away the draws and compute elo' from P' = W / (N - D) using the same elo formula.
What you then get I now call 'pseudoelo', but I'm wondering if there is a standard name for that quantity already.

I will use Wilo in the next version of Ordo (I may even change the name of the program not to confuse things) with a different model as I mentioned before.
http://www.talkchess.com/forum/viewtopi ... ilo#593262
The scales will be different, so differences in "wilos" will have a different meaning, but in my opinion it is _the_ way to measure strength, based on some theoretical and experimental considerations. For instance, the scale of speed doublings is very linear with respect to wilos, but it curves a lot with elo. At the higher end, we are actually underestimating the improvements of SF and Komodo. They are actually much stronger than they look.
Miguel
PS: The new ordowilo will not throw away draws, but will incorporate them into a new draw model where draw rates vary with strength.
Re: Name for elo without draws?
mvk wrote:Thank you. As this "pseudoelo", as I call it now, emerges from my evaluation's draw model, I think "wrong elo" is a rather poor name. I'm just looking if the quantity is already named elsewhere or not. I'm not looking for a value judgement.

This would be much closer to a sort of LOS measure, which only estimates which one is better (and the likelihood of 2 wins and 1M draws does suggest the program with two wins is slightly better), while Elo tries to estimate the difference between the two and provide a predictive estimate of how future games will go.
You might pose this to Remi. When we were talking about this very issue here years ago he mentioned the business about draws being irrelevant if all you care about is which program is better. I don't recall that he had a specific name, but he's a good one to ask.

 Posts: 685
 Joined: Tue May 22, 2007 9:13 am
Re: Name for elo without draws?
bob wrote:
mvk wrote:Thank you. As this "pseudoelo", as I call it now, emerges from my evaluation's draw model, I think "wrong elo" is a rather poor name. I'm just looking if the quantity is already named elsewhere or not. I'm not looking for a value judgement.

This would be much closer to a sort of LOS measure, which only estimates which one is better (and the likelihood of 2 wins and 1M draws does suggest the program with two wins is slightly better), while Elo tries to estimate the difference between the two and provide a predictive estimate of how future games will go.
You might pose this to Remi. When we were talking about this very issue here years ago he mentioned the business about draws being irrelevant if all you care about is which program is better. I don't recall that he had a specific name, but he's a good one to ask.

See the post of his I quoted in my earlier post.

 Posts: 10000
 Joined: Wed Mar 08, 2006 7:57 pm
 Location: Redmond, WA USA
 Contact:
Re: Name for elo without draws?
Rein Halbersma wrote:
Dann Corbit wrote:
mvk wrote:If we have N games with results W, D and L (with W+D+L=N), the elo difference is calculated from P = (0*L + 0.5*D + 1*W) / N.
We can throw away the draws and compute elo' from P' = W / (N - D) using the same elo formula.
What you then get I now call 'pseudoelo', but I'm wondering if there is a standard name for that quantity already.

I would call it wrong Elo.
Imagine two computer players A and B, very evenly matched.
They play one million and two games.
Players A and B draw one million times.
B wins the other two games.
Is B dominantly better than A?
No.
B has EXACTLY the SAME strength as A.
To calculate otherwise is simply incorrect.

Even though the ELO difference for this example is tiny, the Likelihood of Superiority of B over A is 7/8. In the words of Remi Coulom: A draw will at the same time make estimated Elo ratings closer to each other, and reduce the width of confidence intervals. It does this in such a way that the LOS does not change.

LOS of 7/8 is obvious horse crap.
The two wins are random noise in a million and two games.
 hgm
 Posts: 23630
 Joined: Fri Mar 10, 2006 9:06 am
 Location: Amsterdam
 Full name: H G Muller
 Contact:
Re: Name for elo without draws?
Dann Corbit wrote:LOS of 7/8 is obvious horse crap.
The two wins are random noise in a million and two games.

No!
The situation is the same as having 2 wins out of 2 games. That is not always just random noise; in many cases (actually most cases) it would be because the winning player is significantly stronger. The only thing proven by the long match is that the draw probability is stupendously large. But that does not imply anything at all about the ratio of the win vs loss probability. It could very well be that P(draw) = 0.999998, P(win) = 1.999999e-6 and P(loss) = 1e-12. One player could be perfect, and cannot lose at all, while the other is only nearly perfect, and makes a losing error about once every million games. The situation in top draughts is somewhat like that (except not for millions of games but for dozens of games).
If you only know that A beat B 2 times out of 2, the odds that B would beat A in the next game that is not a draw are rather poor. It would be very unwise to bet on that when the payout is not at least 4 times the investment.
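As a rough check of these numbers, the sketch below takes P(win) = 1.999999e-6 and P(loss) = 1e-12 and confirms that the three probabilities sum to 1 and that wins dominate the decisive games. The post does not say how the factor of 4 was derived; one assumption that reproduces it is Laplace's rule of succession, (W+1)/(W+L+2), applied to decisive games only. The helper name is hypothetical.

```python
from fractions import Fraction

# The hypothetical probabilities from the post.
p_draw, p_win, p_loss = 0.999998, 1.999999e-6, 1e-12
total = p_draw + p_win + p_loss
win_share = p_win / (p_win + p_loss)  # share of decisive games that are wins
print(f"total = {total:.12f}, win share of decisive games = {win_share:.7f}")


def next_decisive_win(wins, losses):
    """Chance that the next decisive game is a win.

    Assumption: Laplace's rule of succession, i.e. a uniform prior
    on the decisive-game win probability.
    """
    return Fraction(wins + 1, wins + losses + 2)


upset = 1 - next_decisive_win(2, 0)  # chance the 0-2 side wins the next one
fair_payout = 1 / upset              # break-even payout per unit staked
print(upset, fair_payout)
```

Under that assumption a bet on the side that lost both decisive games breaks even only at a payout of 4 times the stake, which matches the figure in the post.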

 Posts: 10000
 Joined: Wed Mar 08, 2006 7:57 pm
 Location: Redmond, WA USA
 Contact:
Re: Name for elo without draws?
This is clearly wrong.
Ignoring a million draws is lunacy.
If those players play a game, the outcome will be a draw.
It would be utterly unsurprising if, after another million and two games, the one who lost this time won three games.
When the math says something utterly stupid, then the math is wrong.