Houdini 3 running for the IPON

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
Houdini
Posts: 1471
Joined: Tue Mar 16, 2010 12:00 am

Re: Houdini 3 running for the IPON

Post by Houdini »

Laskos wrote:+42 now, after 1,000 games. Don't forget that the final result is calculated with Bayeselo and could be off the value against the average (as it is estimated now). And H3 perfoms ~80 points above K5 in the direct match. If these numbers will stay, I don't see Komodo "catching in a few months".
Currently 3083 (+46) after 1088 games.
I would be disappointed with less than 40 Elo improvement for Houdini 3 in IPON.
Either way, we're very close to the 50 Elo gain I "officially" announced, you cannot expect any more precision from any rating list nor from my own development testing gauntlets (which measured 50 to 55 Elo).

Robert
User avatar
Leto
Posts: 2071
Joined: Thu May 04, 2006 3:40 am
Location: Dune

Re: Houdini 3 running for the IPON

Post by Leto »

From IPON:

Houdini 3 STD - Komodo 5 (3012) 41.5 - 27.5 60.14% Perf=3083
Houdini 3 STD - Critter 1.4a (2990) 43.5 - 26.5 62.14% Perf=3076
Houdini 3 STD - Stockfish 2.2.2 JA (2972) 48.5 - 19.5 71.32% Perf=3130
Houdini 3 STD - Deep Rybka 4.1 (2965) 46.5 - 22.5 67.39% Perf=3091
Houdini 3 STD - Naum 4.2 (2840) 58.0 - 10.0 85.29% Perf=3145
Houdini 3 STD - HIARCS 14 WCSC 32b (2824) 56.0 - 12.0 82.35% Perf=3091
Houdini 3 STD - Gull 1.2 (2805) 58.5 - 8.5 87.31% Perf=3140
Houdini 3 STD - Hannibal 1.2 (2801) 58.5 - 10.5 84.78% Perf=3099
Houdini 3 STD - Deep Shredder 12 (2800) 56.0 - 13.0 81.16% Perf=3053
Houdini 3 STD - Deep Sjeng c't 2010 32b (2795) 60.0 - 9.0 86.96% Perf=3124
Houdini 3 STD - Spike 1.4 32b (2784) 61.0 - 8.0 88.41% Perf=3136
Houdini 3 STD - spark-1.0 (2770) 58.5 - 8.5 87.31% Perf=3105
Houdini 3 STD - Protector 1.4.0 (2761) 57.5 - 10.5 84.56% Perf=3056
Houdini 3 STD - Deep Junior 13.3 (2754) 57.0 - 11.0 83.82% Perf=3039
Houdini 3 STD - Quazar 0.4 (2740) 62.5 - 5.5 91.91% Perf=3162
Houdini 3 STD - Zappa Mexico II (2709) 60.5 - 6.5 90.30% Perf=3096
Houdini 3 STD - MinkoChess 1.3 (2696) 64.5 - 3.5 94.85% Perf=3202
948.5 - 212.5 81.70% Perf=3084


1161 out of 2550 games played
User avatar
Dr.Wael Deeb
Posts: 9773
Joined: Wed Mar 08, 2006 8:44 pm
Location: Amman,Jordan

Re: Houdini 3 running for the IPON

Post by Dr.Wael Deeb »

Houdini wrote:
Laskos wrote:+42 now, after 1,000 games. Don't forget that the final result is calculated with Bayeselo and could be off the value against the average (as it is estimated now). And H3 perfoms ~80 points above K5 in the direct match. If these numbers will stay, I don't see Komodo "catching in a few months".
Currently 3083 (+46) after 1088 games.
I would be disappointed with less than 40 Elo improvement for Houdini 3 in IPON.
Either way, we're very close to the 50 Elo gain I "officially" announced, you cannot expect any more precision from any rating list nor from my own development testing gauntlets (which measured 50 to 55 Elo).

Robert
Many moons ago I predicted +40 Elo at best and it seems that my prediction is transforming to reality.........
Dr.D
_No one can hit as hard as life.But it ain’t about how hard you can hit.It’s about how hard you can get hit and keep moving forward.How much you can take and keep moving forward….
beram
Posts: 1187
Joined: Wed Jan 06, 2010 3:11 pm

Re: Houdini 3 running for the IPON

Post by beram »

Dr.Wael Deeb wrote:
Houdini wrote:
Laskos wrote:+42 now, after 1,000 games. Don't forget that the final result is calculated with Bayeselo and could be off the value against the average (as it is estimated now). And H3 perfoms ~80 points above K5 in the direct match. If these numbers will stay, I don't see Komodo "catching in a few months".
Currently 3083 (+46) after 1088 games.
I would be disappointed with less than 40 Elo improvement for Houdini 3 in IPON.
Either way, we're very close to the 50 Elo gain I "officially" announced, you cannot expect any more precision from any rating list nor from my own development testing gauntlets (which measured 50 to 55 Elo).

Robert
Many moons ago I predicted +40 Elo at best and it seems that my prediction is transforming to reality.........
Dr.D
No, in contrary, it should have been +40 at least
MM
Posts: 766
Joined: Sun Oct 16, 2011 11:25 am

Re: Houdini 3 running for the IPON

Post by MM »

Laskos wrote:
lkaufman wrote:At this writing it is +35 over Houdini 2, a respectible gain for a year though less than predicted. Of course this could still change quite a bit. It's good enough for me to buy a copy myself, I often want two opinions (Komodo and Houdini) of a position. If the gain stays around this level, I think Komodo will catch it in a few months, as we are now on a par with Houdini 2.
+42 now, after 1,000 games. Don't forget that the final result is calculated with Bayeselo and could be off the value against the average (as it is estimated now). And H3 perfoms ~80 points above K5 in the direct match. If these numbers will stay, I don't see Komodo "catching in a few months".
Kai, please, can you tell me if the final performance of H3 will be higher or lower than that one actually showed in the page of ipon chess? I'm not expert at all of the elo calculation.

Thank you in advance

Best Regards
MM
Vinvin
Posts: 5298
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: Houdini 3 running for the IPON

Post by Vinvin »

IWB wrote:As usual you find the running tourney here: http://www.inwoba.de

Have fun
Ingo
Ingo, please after this match would you mind to play a shorter match with H3 tactical vs non-top engines (let say 100 games against : Deep Rybka 4.1, Deep Fritz 13 32b, Naum 4.2, Chiron 1.1a, HIARCS 14 WCSC 32b and Gull 1.2) ?

I didn't see any rating for H3tactical in games yet ...

Thanks,
Vincent
User avatar
Dr.Wael Deeb
Posts: 9773
Joined: Wed Mar 08, 2006 8:44 pm
Location: Amman,Jordan

Re: Houdini 3 running for the IPON

Post by Dr.Wael Deeb »

beram wrote:
Dr.Wael Deeb wrote:
Houdini wrote:
Laskos wrote:+42 now, after 1,000 games. Don't forget that the final result is calculated with Bayeselo and could be off the value against the average (as it is estimated now). And H3 perfoms ~80 points above K5 in the direct match. If these numbers will stay, I don't see Komodo "catching in a few months".
Currently 3083 (+46) after 1088 games.
I would be disappointed with less than 40 Elo improvement for Houdini 3 in IPON.
Either way, we're very close to the 50 Elo gain I "officially" announced, you cannot expect any more precision from any rating list nor from my own development testing gauntlets (which measured 50 to 55 Elo).

Robert
Many moons ago I predicted +40 Elo at best and it seems that my prediction is transforming to reality.........
Dr.D
No, in contrary, it should have been +40 at least
We'll see....early times still......
Dr.D
_No one can hit as hard as life.But it ain’t about how hard you can hit.It’s about how hard you can get hit and keep moving forward.How much you can take and keep moving forward….
Maharadja
Posts: 78
Joined: Thu Dec 24, 2009 1:22 pm

Re: Houdini 3 running for the IPON

Post by Maharadja »

Houdini 3 STD - Komodo 5 (3012) 71.5 - 43.5 62.17% Perf=3098
Houdini 3 STD - Critter 1.4a (2990) 69.0 - 46.0 60.00% Perf=3060
Houdini 3 STD - Stockfish 2.2.2 JA (2972) 81.5 - 33.5 70.87% Perf=3126
Houdini 3 STD - Deep Rybka 4.1 (2965) 76.5 - 37.5 67.11% Perf=3088
Houdini 3 STD - Naum 4.2 (2840) 95.0 - 18.0 84.07% Perf=3128
Houdini 3 STD - HIARCS 14 WCSC 32b (2824) 93.0 - 21.0 81.58% Perf=3082
Houdini 3 STD - Gull 1.2 (2805) 97.5 - 14.5 87.05% Perf=3136
Houdini 3 STD - Hannibal 1.2 (2801) 96.5 - 17.5 84.65% Perf=3097
Houdini 3 STD - Deep Shredder 12 (2800) 92.0 - 22.0 80.70% Perf=3048
Houdini 3 STD - Deep Sjeng c't 2010 32b (2795) 101.5 - 12.5 89.04% Perf=3158
Houdini 3 STD - Spike 1.4 32b (2784) 103.0 - 12.0 89.57% Perf=3157
Houdini 3 STD - spark-1.0 (2770) 98.5 - 15.5 86.40% Perf=3091
Houdini 3 STD - Protector 1.4.0 (2761) 96.5 - 17.5 84.65% Perf=3057
Houdini 3 STD - Deep Junior 13.3 (2754) 98.5 - 15.5 86.40% Perf=3075
Houdini 3 STD - Quazar 0.4 (2740) 101.5 - 11.5 89.82% Perf=3118
Houdini 3 STD - Zappa Mexico II (2709) 103.0 - 11.0 90.35% Perf=3097
Houdini 3 STD - MinkoChess 1.3 (2696) 106.5 - 7.5 93.42% Perf=3156
1581.5 - 356.5 81.60% Perf=3082

1938 out of 2550 games played

Hi,

I don't understand how this average rating is calculated.
why cant we sum up the ratings per engine and divide it by 17?
thus: 52772/17=3104,235294
Vinvin
Posts: 5298
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: Houdini 3 running for the IPON

Post by Vinvin »

Maharadja wrote:...
Hi,

I don't understand how this average rating is calculated.
why cant we sum up the ratings per engine and divide it by 17?
thus: 52772/17=3104,235294
Because 100 games with perf=2500 + 100 games with perf=2600 is NOT equal to a perf=2550 ...
Uri Blass
Posts: 10892
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: Houdini 3 running for the IPON

Post by Uri Blass »

Maharadja wrote:Houdini 3 STD - Komodo 5 (3012) 71.5 - 43.5 62.17% Perf=3098
Houdini 3 STD - Critter 1.4a (2990) 69.0 - 46.0 60.00% Perf=3060
Houdini 3 STD - Stockfish 2.2.2 JA (2972) 81.5 - 33.5 70.87% Perf=3126
Houdini 3 STD - Deep Rybka 4.1 (2965) 76.5 - 37.5 67.11% Perf=3088
Houdini 3 STD - Naum 4.2 (2840) 95.0 - 18.0 84.07% Perf=3128
Houdini 3 STD - HIARCS 14 WCSC 32b (2824) 93.0 - 21.0 81.58% Perf=3082
Houdini 3 STD - Gull 1.2 (2805) 97.5 - 14.5 87.05% Perf=3136
Houdini 3 STD - Hannibal 1.2 (2801) 96.5 - 17.5 84.65% Perf=3097
Houdini 3 STD - Deep Shredder 12 (2800) 92.0 - 22.0 80.70% Perf=3048
Houdini 3 STD - Deep Sjeng c't 2010 32b (2795) 101.5 - 12.5 89.04% Perf=3158
Houdini 3 STD - Spike 1.4 32b (2784) 103.0 - 12.0 89.57% Perf=3157
Houdini 3 STD - spark-1.0 (2770) 98.5 - 15.5 86.40% Perf=3091
Houdini 3 STD - Protector 1.4.0 (2761) 96.5 - 17.5 84.65% Perf=3057
Houdini 3 STD - Deep Junior 13.3 (2754) 98.5 - 15.5 86.40% Perf=3075
Houdini 3 STD - Quazar 0.4 (2740) 101.5 - 11.5 89.82% Perf=3118
Houdini 3 STD - Zappa Mexico II (2709) 103.0 - 11.0 90.35% Perf=3097
Houdini 3 STD - MinkoChess 1.3 (2696) 106.5 - 7.5 93.42% Perf=3156
1581.5 - 356.5 81.60% Perf=3082

1938 out of 2550 games played

Hi,

I don't understand how this average rating is calculated.
why cant we sum up the ratings per engine and divide it by 17?
thus: 52772/17=3104,235294
because if you calculate in that way then it means that 99-1 and 51-49 against 2 opponents is not the same rating as 100-0 and 50-50

100-0 gives infinite performace so the average is infinite in the second case when it is finite in the first case.