a Telltale position

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

Gian-Carlo Pascutto
Posts: 1243
Joined: Sat Dec 13, 2008 7:00 pm

Re: a Telltale position

Post by Gian-Carlo Pascutto »

Gerd Isenberg wrote: Hmm, what you quote funny (Qxd4 Nxd4) looks very reasonable to me, and I would expect every engine reporting each fail high at the root that way, since Qxd4 is the only capture and tried first in the first iteration of an IID framework. What is the point?
There aren't many engines reporting each fail high at the root that way, and especially not for the entire first iteration.

Just try it. I guess most people who developed their own engines got annoyed by this sort of output.

Then there is the fact that you must do move ordering in a specific way in order for that move to be ordered first... (it is never first for me, for example)
Last edited by Gian-Carlo Pascutto on Sat Feb 12, 2011 9:23 pm, edited 2 times in total.
Christopher Conkie
Posts: 6073
Joined: Sat Apr 01, 2006 9:34 pm
Location: Scotland

Re: a Telltale position

Post by Christopher Conkie »

Damir wrote:same evaluation, so what? No prove that engines are are clones or of same strength just because they show exact fail high at the root...
what's your point ?
If you cannot see the point of my first post in this thread I doubt you ever will.

Maybe you are colour blind to red.

Chris
Gerd Isenberg
Posts: 2250
Joined: Wed Mar 08, 2006 8:47 pm
Location: Hattingen, Germany

Re: a Telltale position

Post by Gerd Isenberg »

Christopher Conkie wrote:
Gerd Isenberg wrote:
Christopher Conkie wrote: [d]rn1q1r2/p1p1ppkp/1p4p1/8/2BP4/2N2N2/PP3PPP/R1B1K2R b KQ - 0 11

Houdini 1.0 x32:
1/3 00:00 30 0 -8.70 Qxd4 Nxd4 (this bit is very funny indeed)


:lol:

Chris
Hmm, what you quote funny (Qxd4 Nxd4) looks very reasonable to me, and I would expect every engine reporting each fail high at the root that way, since Qxd4 is the only capture and tried first in the first iteration of an IID framework. What is the point?

Gerd
I would ask you to look again at the comparison Gerd.

Chris
Sorry, I don't understand it. You must be more patient and explicit.

You mean same scores after Qxd4 Nxd4? Coincidence that a weighted sum of different features has the same possibly rounded value?

Gerd
Richard Allbert
Posts: 792
Joined: Wed Jul 19, 2006 9:58 am

Re: a Telltale position

Post by Richard Allbert »

Hi Gerd!

I think if you feed this to a lot of engines, there aren't many that produce a score -900 < val < -800 on the first pv string.

See you Belgium

Richard
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: a Telltale position

Post by lkaufman »

Cubeman wrote:How do the games from that position end, it would be interesting for some test games between the so called Ippo clones and the traditional other strong engines.A wrong evaluation would show up in game results.Sometimes I think Human evaluations are not necessary the absolute truth.I also imagine that there could be many engines out there even before Rybka beta that would evaluate similar scores as Houdini and Critter.
I ran off a quick 100 games using the Monte Carlo feature of Rybka 4 at five ply (which is really 8 ply). White won by 78 to 22 confirming the human GM assessment. I imagine that really ancient engines might score this around zero, but this should have no relevance to how current engines evaluate.
beram
Posts: 1187
Joined: Wed Jan 06, 2010 3:11 pm

Re: a Telltale position

Post by beram »

Well I will take a closer look
Last edited by beram on Sat Feb 12, 2011 9:40 pm, edited 3 times in total.
Christopher Conkie
Posts: 6073
Joined: Sat Apr 01, 2006 9:34 pm
Location: Scotland

Re: a Telltale position

Post by Christopher Conkie »

Gerd Isenberg wrote:
Christopher Conkie wrote:
Gerd Isenberg wrote:
Christopher Conkie wrote: [d]rn1q1r2/p1p1ppkp/1p4p1/8/2BP4/2N2N2/PP3PPP/R1B1K2R b KQ - 0 11

Houdini 1.0 x32:
1/3 00:00 30 0 -8.70 Qxd4 Nxd4 (this bit is very funny indeed)


:lol:

Chris
Hmm, what you quote funny (Qxd4 Nxd4) looks very reasonable to me, and I would expect every engine reporting each fail high at the root that way, since Qxd4 is the only capture and tried first in the first iteration of an IID framework. What is the point?

Gerd
I would ask you to look again at the comparison Gerd.

Chris
Sorry, I don't understand it. You must be more patient and explicit.

You mean same scores after Qxd4 Nxd4? Coincidence that a weighted sum of different features has the same possibly rounded value?

Gerd
I think Gian-Carlo has answered part of what I am showing above. However, I would also point out that the moves are exactly the same through depth 1-9 in my original post and......

......that the draw score is also obtained at the same depth.

This is a major feature of all Ippolit derivatives (I mean the first iteration).

Upon examination the Robbolito you see was/is the closest thing (well its the same thing really) to Houdini 1.0.

It is why we say the first Houdini was a direct copy of Robbolito. When SMP was added, that was taken from Ivanhoe.

I can but tell you what I/we see when we test engines. I hope this explains it a bit more clearly.

:)

Chris
Gian-Carlo Pascutto
Posts: 1243
Joined: Sat Dec 13, 2008 7:00 pm

Re: a Telltale position

Post by Gian-Carlo Pascutto »

By the way, an underlying issue that causes the funny output is that Ippolit is using MVV/LVA scoring to order the moves at the root (in fact, they are passed from a function higher up), whereas they use (proper) SEE in the search tree, and correctly order Qxd4 backwards there.

This is why you won't find much engines with such output. You must have almost the same bug.
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: a Telltale position

Post by lkaufman »

Christopher Conkie wrote:
Gerd Isenberg wrote:
Christopher Conkie wrote: [d]rn1q1r2/p1p1ppkp/1p4p1/8/2BP4/2N2N2/PP3PPP/R1B1K2R b KQ - 0 11

Houdini 1.0 x32:
1/3 00:00 30 0 -8.70 Qxd4 Nxd4 (this bit is very funny indeed)


:lol:

Chris
Hmm, what you quote funny (Qxd4 Nxd4) looks very reasonable to me, and I would expect every engine reporting each fail high at the root that way, since Qxd4 is the only capture and tried first in the first iteration of an IID framework. What is the point?

Gerd
I would ask you to look again at the comparison Gerd.

Chris
His point is that starting with the queen blunder is normal, while your point is that the evals are identical. The analysis you quote does seem to prove beyond any reasonable doubt that Houdini 1.0 and Robbolito are nearly identical in eval and perhaps also in low-depth search. The chance that two independent programs would produce the same moves at the same depths and with scores that are in the same ratio (except for the identical 8.7 scores) is about the same as the chance of being hit by a meteor. Even the eval diffs are fully accounted for by the score transformation formula posted here some time ago, which reduces the scores when the eval is modest but not when it is large.
User avatar
michiguel
Posts: 6401
Joined: Thu Mar 09, 2006 8:30 pm
Location: Chicago, Illinois, USA

Re: a Telltale position

Post by michiguel »

Gian-Carlo Pascutto wrote:By the way, an underlying issue that causes the funny output is that Ippolit is using MVV/LVA scoring to order the moves at the root (in fact, they are passed from a function higher up), whereas they use (proper) SEE in the search tree, and correctly order Qxd4 backwards there.

This is why you won't find much engines with such output. You must have almost the same bug.

If I enable showing the first iteration (disabled in the ini file by default)
this is Gaviota's output

I get Chris point about similarities, but I Qxd4 in iteration 1 is perfectly normal. A close to 0.00 evaluation does not mean much. Gaviota does it and it is a weaker engine. In addition, I disagree with the implications of similarities to R2. Robbolito is stylistically similar to R3 not r2. In fact, Naum is VERY similar to R2 and not R3 and in this position gets a +1 eval.

I do not believe eval means anything here. Chris may have a point with the similarity in PVs + eval, but I disagree with Larry in the original post.

Miguel

Code: Select all

cores 2
clearbrain
analyze
iterative deepening --> start, thread=0
set timer to infinite
         5   1       0.0   -11.65  Qxd4 7.Nxd4
         8   1       0.0    -0.29  f5
        18   1       0.0    -0.05  h5
        21   1       0.0    +0.07  f6
        33   1       0.0    +0.42  Nc6
        46   1&#58;      0.0    +0.42  Nc6
       159   2       0.0    +0.17  Nc6 7.Bd5
       232   2&#58;      0.0    +0.17  Nc6 7.Bd5
       672   3       0.0    +0.26  Nc6 7.Bd5 Qd6
      1470   3       0.0    +0.27  Qd6 7.Bd5 c6
      1521   3&#58;      0.0    +0.27  Qd6 7.Bd5 c6
      2701   4       0.0    +0.04  Qd6 7.O-O Nc6 8.Bd5
      4609   4&#58;      0.1    +0.04  Qd6 7.O-O Nc6 8.Bd5
     11881   5       0.1    +0.25  Qd6 7.O-O Nc6 8.Ne4 Qd7
     15934   5&#58;      0.1    +0.25  Qd6 7.O-O Nc6 8.Ne4 Qd7
     29859   6       0.2    +0.05  Qd6 7.O-O Nc6 8.Bd5 Rab8 9.Bg5
     42991   6&#58;      0.3    +0.05  Qd6 7.O-O Nc6 8.Bd5 Rab8 9.Bg5
     81188   7       0.4    +0.00  Qd6 7.O-O Nd7 8.Nd5 Qc6 9.b3 e6
    169784   7       0.7    +0.04  Nd7 7.Bd5 Rb8 8.Ne5 Nf6 9.Nc6 Qd6
                                   10.Nxb8 Nxd5 11.Nxd5 Qxd5
    186800   7&#58;      0.7    +0.04  Nd7 7.Bd5 Rb8 8.Ne5 Nf6 9.Nc6 Qd6
                                   10.Nxb8 Nxd5 11.Nxd5 Qxd5
    243521   8       0.9    +0.02  Nd7 7.Bd5 Rb8 8.Bc6 Nf6 9.Ne4 Nd5
                                   10.O-O
    363115   8       1.3    +0.03  Nc6 7.Bf4 Nb4 8.O-O Qd7 9.a3 Nc2 10.Bb5
    453843   8&#58;      1.6    +0.03  Nc6 7.Bf4 Nb4 8.O-O Qd7 9.a3 Nc2 10.Bb5
    782518   9       2.6    +0.04  Nc6 7.Bf4 e6 8.O-O Ne7 9.Nb5 c6 10.Nd6
                                   b5
    935186   9&#58;      3.2    +0.04  Nc6 7.Bf4 e6 8.O-O Ne7 9.Nb5 c6 10.Nd6
                                   b5
   1474411  10       5.2    +0.01  Nc6 7.Bf4 Na5 8.Ba6 h6 9.h4 f6 10.Kf1
                                   Qd7 11.Re1
   3733320  10&#58;     10.8    +0.01  Nc6 7.Bf4 Na5 8.Ba6 h6 9.h4 f6 10.Kf1
                                   Qd7 11.Re1
   5845574  11      17.1    +0.02  Nc6 7.Bf4 e6 8.O-O h6 9.Nb5 Rc8 10.Ne5
                                   Nxd4 11.Ng4 h5 12.Bh6+ Kg8 13.Bxf8 Kxf8
   7880434  11&#58;     22.1    +0.02  Nc6 7.Bf4 e6 8.O-O h6 9.Nb5 Rc8 10.Ne5
                                   Nxd4 11.Ng4 h5 12.Bh6+ Kg8 13.Bxf8 Kxf8
  11105777  12      30.7    -0.02  Nc6 7.Bf4 e6 8.O-O a6 9.Rfd1 Qe7 10.Ne4
                                   Kh8 11.d5 Qb4 12.b3
  14693002  12      39.3    -0.00  Nd7 7.Bd5 Rb8 8.Bc6 Nf6 9.Ne4 Nd5
                                   10.Bd2 f6 11.O-O g5 12.Kh1
  19068203  12&#58;     49.9    -0.00  Nd7 7.Bd5 Rb8 8.Bc6 Nf6 9.Ne4 Nd5
                                   10.Bd2 f6 11.O-O g5 12.Kh1
  23419141  13      60.6    -0.08  Nd7 7.Bd5 Rb8 8.Bc6 Nf6 9.Ne4 Nd5
                                   10.Bd2 f6 11.O-O e6 12.Kh1 Ne7
  65240544  13     162.0    -0.06  Qd6 7.O-O Nd7 8.Nd5 Qc6 9.Bb3 e6 10.Nc3
                                   Qd6 11.Bg5 Kh8 12.Nb5 Qc6
  74701794  13&#58;    183.8    -0.06  Qd6 7.O-O Nd7 8.Nd5 Qc6 9.Bb3 e6 10.Nc3
                                   Qd6 11.Bg5 Kh8 12.Nb5 Qc6
  95288908  14     231.1    -0.07  Qd6 7.O-O Nd7 8.Nd5 Qc6 9.Bb3 e6 10.Nc3
                                   Qd6 11.Ne4 Qc6 12.Re1 &#91;>&#93;
 135828119  14     321.9    +0.07  Nd7 7.Bd5 Rb8 8.Bc6 Nf6 9.Ne4 Nd5
                                   10.Bd2 f6 11.O-O e6 12.Rac1 Qe7 13.Kh1
 148170004  14&#58;    349.3    +0.07  Nd7 7.Bd5 Rb8 8.Bc6 Nf6 9.Ne4 Nd5
                                   10.Bd2 f6 11.O-O e6 12.Rac1 Qe7 13.Kh1
 181696490  15     423.4    -0.04  Nd7 7.Bd5 Rb8 8.Bc6 Nf6 9.Bf4 Nh5
                                   10.Bd2 Qd6 11.d5 Qb4 12.Nb5 Qc4 13.Nxa7
                                   Nf6