Martin Thoresen wrote:Dann Corbit wrote:
Now, running 30 games is a great idea. But then don't imagine that you have decided who is strongest yet.
Dann, this is presicely what I am trying to explain to mr. Taylor.
My matches at tournament time control are
not in any way intended to give a final answer to whether engine A is stronger than engine B.
To reach such conclusions at the very slow time control I use, a large number of computers would have to be used in order to get the game numbers high enough within a humane time-frame.
Even with 1500 games the error margins are quite high.
My matches and upcoming tournaments are intended to provide the viewers with some quality chess entertainment, nothing else.
Best Regards,
Martin
I don't even agree that all games are absolutely useless and say absolutely nothing about how an engine or a human plays, before a few thousand games have been played.
If this is the case, there can hardly be much quality chess entertainment.
When a machine moves, you are also interested in how much you think it ought to know why it is moving there, especially if you don't understand it.
It's hardly entertaining when you keep being sure the machine has a mistake, and you break your head trying to see what was behind it, only to realize that OF COURSE it was a stupid move.
You can't have much entertainment when it is nothing but disappointment over and over again. One wants to see things being proven over the board. It's not enough fun just to imagine and imagine no end, and mental masturbation etc. Then in the end you don't any conclusions about the positions and why this or that happened, as it is all just like random nonsense.
You want to see and apreaciate quality, and to know if and when there was or was not quality.
And you don't only want to know who won and how many, but you want to know what it looked like.
When you have all that, you only need to watch a few games to get a better idea how strong the programs are, than simply hearing the results of a few hundred games.
If you have no other interest or feeling or understanding, then of course, just play 10,000 games. But 20 games with quality (and long TC) show more, in almost every way.
Didn't Kasparov say that if a human beats a computer even one game, it shows that the human is superior in understanding.
It's not too much different if you test a small number, if it's done well, and you can see how the games went.
At no point did houdini or Stocfish get a plus score in (i don't think), it was only Rybka inching up and up, steadily.
There is so much to argue about this.
The only thing the statistics do is to get an exact rating. But the likelihood of this being being wildly different to after 20 quality games, is very low.
OK?