Yes, the overwhelming number of games played by the first 10 opponents keeps the uncertainty in their Elo estimates low. However, Miguel's point is not without merit.Daniel Shawul wrote:You said it would increase _signifcantly_ which it did n't. The error margin is 5 vs 6 after you add a few more games from the second pool. You simply forgot that we still have the oldgames when adding the new pool. Ofcourse I wouldn't expect error margin to be the same with different sets of players but it will not change the error margins significantly as you claimed. Here is a direct transaltion of your frist example. With or without the second pool error margin still remains 1...Now compare this result to what you said:Code: Select all
1 Player9 164 1 1 900100 50.00% 164 20.00% 2 Player8 164 1 1 900100 50.00% 164 20.00% 3 Player7 164 1 1 900100 50.00% 164 20.00% 4 Player6 164 1 1 900100 50.00% 164 20.00% 5 Player5 164 1 1 900100 50.00% 164 20.00% 6 Player4 164 1 1 900100 50.00% 164 20.00% 7 Player3 164 1 1 900100 50.00% 164 20.00% 8 Player2 164 1 1 900100 50.00% 164 20.00% 9 Player1 164 1 1 900100 50.00% 164 20.00% 10 Player0 164 1 1 900100 50.00% 164 20.00% 11 Player10 -163 52 52 190 28.90% 9 20.00% 12 Player11 -163 52 52 190 28.90% 9 20.00% 13 Player12 -163 52 52 190 28.90% 9 20.00% 14 Player13 -163 52 52 190 28.90% 9 20.00% 15 Player14 -163 52 52 190 28.90% 9 20.00% 16 Player15 -163 52 52 190 28.90% 9 20.00% 17 Player16 -163 52 52 190 28.90% 9 20.00% 18 Player17 -163 52 52 190 28.90% 9 20.00% 19 Player18 -163 52 52 190 28.90% 9 20.00% 20 Player19 -163 52 52 190 28.90% 9 20.00%
Obviously it didn't increase at all even if the average elo of the pool is decreased by 163. You forgot that we still have those 100000 games between themselves, otherwise you wouldn't talk about distance examples you gave , which I fail to see its relevance here at all.All the errors will increase tremendously, because now the values against the average of the pool is uncertain.
For completeness here are results ignoring one of the pools. You can see there isn't much of a difference for any of the pools even though they have tremendously different number of games and elos as welll..
First pool's error is same:Second pool's error bar is 52 vs 53Code: Select all
1 Player0 0 1 1 900000 50.00% 0 20.00% 2 Player1 0 1 1 900000 50.00% 0 20.00% 3 Player2 0 1 1 900000 50.00% 0 20.00% 4 Player3 0 1 1 900000 50.00% 0 20.00% 5 Player4 0 1 1 900000 50.00% 0 20.00% 6 Player5 0 1 1 900000 50.00% 0 20.00% 7 Player6 0 1 1 900000 50.00% 0 20.00% 8 Player7 0 1 1 900000 50.00% 0 20.00% 9 Player8 0 1 1 900000 50.00% 0 20.00% 10 Player9 0 1 1 900000 50.00% 0 20.00%
DanielCode: Select all
1 Player0 187 82 82 100 90.00% -186 20.00% 2 Player1 187 82 82 100 90.00% -186 20.00% 3 Player2 187 82 82 100 90.00% -186 20.00% 4 Player3 187 82 82 100 90.00% -186 20.00% 5 Player4 187 82 82 100 90.00% -186 20.00% 6 Player5 187 82 82 100 90.00% -186 20.00% 7 Player6 187 82 82 100 90.00% -186 20.00% 8 Player7 187 82 82 100 90.00% -186 20.00% 9 Player8 187 82 82 100 90.00% -186 20.00% 10 Player9 187 82 82 100 90.00% -186 20.00% 11 Player10 -186 53 53 190 28.90% 10 20.00% 12 Player11 -186 53 53 190 28.90% 10 20.00% 13 Player12 -186 53 53 190 28.90% 10 20.00% 14 Player13 -186 53 53 190 28.90% 10 20.00% 15 Player14 -186 53 53 190 28.90% 10 20.00% 16 Player15 -186 53 53 190 28.90% 10 20.00% 17 Player16 -186 53 53 190 28.90% 10 20.00% 18 Player17 -186 53 53 190 28.90% 10 20.00% 19 Player18 -186 53 53 190 28.90% 10 20.00% 20 Player19 -186 53 53 190 28.90% 10 20.00%
Let's look at a more realistic example:
Code: Select all
Rank Name Elo + - games score oppo. draws
1 Engine_B 0 14 14 504 50% 0 25%
2 Engine_A 0 14 14 504 50% 0 25%
Code: Select all
Rank Name Elo + - games score oppo. draws
1 Engine_B 0 61 61 516 50% 0 25%
2 Engine_A 0 61 61 516 50% 0 25%
3 Engine_C 0 102 102 18 50% 0 33%
4 Engine_D 0 102 102 18 50% 0 33%