Ajedrecista wrote:
Thank you very much for your interest! I stay tuned for the results. Honestly, I will be surprised if these numbers make sense.
It doesn't make sense for me to test this by increase or decreasing the strength of Houdini. In either case it only increases the number of decisive games for Houdini. It either starts winning more games if you increase it, or it starts losing more games but in both cases you decisive game rate is lowest only when your wins and losses are balanced out.
So instead I decided to see if I could make Stockfish look like the most decisive program by weakening it. This makes more sense since it already consuming the most testing resources to match Komodo and Houdini.
I only have a couple of hundred games so I need a lot more, but this appears to drop the Stockfish draw rate enough to be below Houdini's. And your formula is not "fooled" either, it still see's Houdini as the most draw fearing program. I'll give a full report when I have a few thousand games.
If this works, we may be able to run any programs without messing with time adjustments, just as you set out to do. It probably wouldn't hurt to at least make a rough estimated adjustment anyway because I have a feeling there are still second order errors in your way - but I admit I don't understand it completely. For example maybe it works better if you get all programs within 100 ELO of each other.
Don
Capital punishment would be more effective as a preventive measure if it were administered prior to the crime.