I think you are ducking a REAL match. Trying to make a 1 ply match happen is way down on my list of priorities and I would not allow myself to be distracted by such a thing. If you want to play a REAL match, Diep vs Komodo then I'm all for it.diep wrote:You are trying to talk your way out of the 1 ply match?lkaufman wrote:This is the problem. Knowledge about pins is generally considered tactical, not evaluation, even if you put it in the eval function. So probably Diep would look great on a one ply test due to this pin knowledge, but this has no bearing on which program has the better evaluation. There is no limit to how much tactical knowledge can be put into an eval function, but whether it justifies the slowdown in search is the question.diep wrote:Great idea Ed. We need an independant tester who also verifies no cheating occurs. Do you volunteer?Rebel wrote:I see a new form of (fun!) competition arising at the horizon, who has the best eval?Don wrote: I personally believe that Komodo has the best evaluation function of any chess program in the world.
Its basic framework:
1. Root search (1-ply) only with standard QS.
2. QS needs to be defined by mutual agreement.
3. No extensions allowed.
25,000 - 50,000 games (or so) to weed out most of the noise because the lack of search.
Details to be worked out of course.
With some luck we'll see then how strong mobility and coordinated piece evaluation plays.
Oh i remember - diep also knows everything about pins, and has extensive kingsafety that will directly attack the opponent king with all pieces, probably with the usual computer bug not using many pawns to do so. Will be giving spectacular attacking games!
Regarding your request for a Komodo 5 version without PST, Richard Vida posted a patch to Komodo 5 making all eval terms configurable. Since we don't condone this I won't post the link here, but if you can find his patch all you need do is set the "xtm" terms ("pawn table multiplier" etc.), to zero and you'll have what you want.
kingsafety is also tactical, mobility is also tactical, evaluating attacks which diep is doing massively that's also tactical?
Yet evaluating the material suddenly is the most important 'positional term' of an evaluation?
Oh comeon we can call everything tactical.
I want a 1 ply match
Make some noise!
I can understand why YOU might want a 1 ply match however because you would have a realistic chance of winning such a match. But you know that you would have no chance of winning a real match with your program vs Komodo.
I would not be making an issue of this but you have had the audacity to demean Komodo, calling it a beancounter and hurling other insults without any evidence whatsoever, so I suggest that you should be willing to put your reputation on the line and allow a real match, not some meaningless 1 ply match with ambiguous rules which would not prove anything about the actual strength of the program.
I know that your program is strong - but strong is relative. I think Larry's estimate is probably reasonable and perhaps even generous, but there must be a reason why you are unwilling to allow it to be tested.