tpoppins wrote: How do you get +/-11 error bars with just 126 games? It's double that with Bayeselo and higher still with Elostat.
Well, it depends on the settings, right?
This is what I chose to use, just out of habit; YMMV:
Code:
Mac-Pro:~/cluster.mfb] michaelbyrne% bay
version 0058, Copyright (C) 1997-2016 Remi Coulom and updated by Michael Byrne.
compiled Jul 24 2016 00:03:35.
This program comes with ABSOLUTELY NO WARRANTY.
This is free software, and you are welcome to redistribute it
under the terms and conditions of the GNU General Public License.
See http://www.gnu.org/copyleft/gpl.html for details.
ResultSet>rp /Users/michaelbyrne/cluster.mfb/10212029.pgn
126 game(s) loaded
ResultSet>elo
ResultSet-EloRating>mm 1 1
Iteration 100: 0.00135696
00:00:00,00
ResultSet-EloRating>covariance
ResultSet-EloRating>r
Rank Name Rating Δ + - # Σ Σ% W L D W% =% OppR
---------------------------------------------------------------------------------------------------------
1 SF-McBrain v3.0 TCEC-T2 3106 0.0 11 11 126 66.0 52.4 20 14 92 15.9 73.0 3094
2 Stockfish 151017 64 POPC 3094 12.2 11 11 126 60.0 47.6 14 20 92 11.1 73.0 3106
---------------------------------------------------------------------------------------------------------
Δ = delta from the next higher rated opponent
# = number of games played
Σ = total score, 1 point for win, 1/2 point for draw
ResultSet-EloRating>los
SF St
SF-McBrain v3.0 TCEC-T2 86
Stockfish 151017 64 POPC 13
ResultSet-EloRating>
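As a rough cross-check on the LOS figures above, here is a minimal Python sketch of the common normal-approximation formula that uses only the decisive games. This is not Bayeselo's own calculation (the function name `los` is made up for illustration), so it need not agree exactly with the table.

```python
import math


def los(wins: int, losses: int) -> float:
    """Approximate likelihood of superiority from decisive games only.

    Draws are ignored; this is a normal approximation, not Bayeselo's method.
    """
    decisive = wins + losses
    if decisive == 0:
        # No decisive games: nothing separates the two engines.
        return 0.5
    return 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * decisive)))


# W/L figures taken from the rating table above (20 wins, 14 losses).
print(f"LOS for the 20-14 side: {los(20, 14):.3f}")
print(f"LOS for the 14-20 side: {los(14, 20):.3f}")
```

With 20 wins against 14 losses this comes out at roughly 0.85, in the same ballpark as the los output above.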
Thinking about it, since both sides play with white and black, I should be using "mm 0 1", and the results would look like this:
Code:
ResultSet>rp /Users/michaelbyrne/cluster.mfb/10212029.pgn
126 game(s) loaded
ResultSet>elo
ResultSet-EloRating>mm 0 1
00:00:00,00
ResultSet-EloRating>covariance
ResultSet-EloRating>r
Rank Name Rating Δ + - # Σ Σ% W L D W% =% OppR
---------------------------------------------------------------------------------------------------------
1 SF-McBrain v3.0 TCEC-T2 3108 0.0 15 15 126 66.0 52.4 20 14 92 15.9 73.0 3092
2 Stockfish 151017 64 POPC 3092 16.1 15 15 126 60.0 47.6 14 20 92 11.1 73.0 3108
---------------------------------------------------------------------------------------------------------
Δ = delta from the next higher rated opponent
# = number of games played
Σ = total score, 1 point for win, 1/2 point for draw
ResultSet-EloRating>los
SF St
SF-McBrain v3.0 TCEC-T2 85
Stockfish 151017 64 POPC 14
ResultSet-EloRating>
https://github.com/MichaelB7/bayeselo
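
For anyone who wants a back-of-the-envelope check on the error bars without any rating tool, here is a minimal Python sketch that converts the match score to an Elo difference and puts a naive 95% interval around it. It assumes the standard logistic Elo formula and a plain normal approximation on the per-game score; it is not what Bayeselo or Elostat compute, and the function names are made up for illustration. Note that it gives an interval for the difference between the two engines, whereas the + and - columns above are per-player figures, so the numbers are not directly comparable.

```python
import math


def elo_from_score(score: float) -> float:
    """Elo difference implied by an expected score (logistic model)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def naive_elo_interval(wins: int, losses: int, draws: int, z: float = 1.96):
    """Return (low, mid, high) Elo difference using a normal approximation.

    z = 1.96 corresponds to a 95% interval; this is an assumption of the
    sketch, not a setting taken from Bayeselo.
    """
    n = wins + losses + draws
    score = (wins + 0.5 * draws) / n
    # Per-game variance of the result (win = 1, draw = 0.5, loss = 0).
    variance = (
        wins * (1.0 - score) ** 2
        + losses * (0.0 - score) ** 2
        + draws * (0.5 - score) ** 2
    ) / n
    margin = z * math.sqrt(variance / n)
    return (
        elo_from_score(score - margin),
        elo_from_score(score),
        elo_from_score(score + margin),
    )


# W/L/D figures taken from the rating tables above.
low, mid, high = naive_elo_interval(wins=20, losses=14, draws=92)
print(f"Elo difference: {mid:.1f} (95% interval {low:.1f} to {high:.1f})")
```

The high draw rate (92 of 126 games) is what keeps the per-game variance small here; with fewer draws the same number of games would give a noticeably wider interval.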