Dirt wrote: This still looks linear to me, just with a slightly steeper slope. Where does ph come from? Just incrementing it by one each ply would be simplest, but I don't think that could be right.
Greg's point is valid: fitting a linear regression to the data you've given still yields a straight line, just a steeper one. That means either Glaurung is already non-linear, or the new code is not really non-linear. Even so, the Elo increase can be real.
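The linearity question can be checked directly: fit a least-squares line to the (depth, Elo) data and inspect the residuals. This is only a sketch; the data points below are made-up placeholders, not Glaurung's actual numbers.

```python
# Sketch: test whether Elo-vs-depth data is linear by fitting a
# least-squares line and examining the residuals.

def fit_line(xs, ys):
    """Ordinary least-squares fit y = a*x + b; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = my - a * mx
    return a, b

# Hypothetical measurements (illustrative only).
depth = [6, 7, 8, 9, 10, 11]
elo   = [2200, 2270, 2340, 2400, 2455, 2505]

a, b = fit_line(depth, elo)
residuals = [y - (a * x + b) for x, y in zip(depth, elo)]
print(f"slope = {a:.1f} Elo/ply, intercept = {b:.1f}")
print("residuals:", [round(r, 1) for r in residuals])
# Systematically curved residuals (e.g. positive at both ends and
# negative in the middle) would indicate a genuinely non-linear
# relationship; residuals that look like noise support linearity.
```

A straight-line fit always produces a slope, steeper or shallower; it is the residual pattern, not the slope, that says whether the underlying relationship is linear.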
bob wrote:
Take any program, make a change to it, and play the old version against the new. If the change is good, the results will be far better than expected; if the change is bad, far worse. Since the only difference between the two programs is the change you made, it tends to influence games more than expected.
I've run millions of games testing A vs A', and the results are unreliable. It is far better to run A and A' against a common set of opponents and see which turns out better.
The question you need to answer in order to decide whether to accept a change is whether A is better than A', not by how much.
If testing A against A' amplifies the effect, then it is a good test, because you need fewer games to know which version is stronger.
The only possible problem would be if you often got cases where A' beats A but A does better than A' against other opponents, and I see no data suggesting that this happens often.
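Uri's "fewer games" argument can be made concrete with a rough back-of-the-envelope calculation. This is only a sketch under a simplified draw-free, two-outcome model (draws would only reduce the variance further); the numbers are illustrative.

```python
import math

def games_needed(elo_diff, z=1.96):
    """Rough number of games needed to detect a positive Elo difference
    of elo_diff (must be > 0) at roughly 95% confidence, ignoring draws.
    Per-game score variance is at most 0.25, so the standard error of the
    mean score after n games is at most 0.5 / sqrt(n)."""
    p = 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))  # expected score of the stronger side
    # Require |p - 0.5| > z * 0.5 / sqrt(n), i.e. n > (0.5 * z / (p - 0.5))^2
    return math.ceil((0.5 * z / (p - 0.5)) ** 2)

for d in (5, 10, 20, 50):
    print(f"{d:3d} Elo difference: ~{games_needed(d)} games")
```

The practical attraction Uri describes falls out of the formula: if A-vs-A' testing roughly doubles the apparent Elo gap of a change, the number of games needed drops by roughly a factor of four.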
I have seen many cases where A' beats A but then does worse against other programs. I have had three or four of those just this week while making the new changes to Crafty's eval.
Uri Blass wrote: If testing A against A' amplifies the effect, then it is a good test, because you need fewer games to know which version is stronger.
The only possible problem would be if you often got cases where A' beats A but A does better than A' against other opponents, and I see no data suggesting that this happens often.
Are you sure? It seems entirely plausible to me that A' might be weaker in a way that A (being almost the same program) cannot exploit, but that other programs can.
Testing A' only against A is a way of optimizing your engine to play well against itself, which sounds like the wrong local maximum to optimize for. Since what you actually want is for it to play well against a variety of other opponents, it will probably be more reliable to test and measure changes against a variety of other opponents.
Even then, the many past threads on this topic suggest that proving a change conclusively better through testing and measurement is difficult in any case.
The question is practical, not theoretical.
Testing A' against A can be a practical way to get a faster answer to the question of whether A' is better than A.
The results may in theory be wrong, but if that does not happen often, then testing only A' against A may yield a bigger improvement than testing both against B, C, and D, because there is limited time to test changes.
bob wrote:
I have seen many cases where A' beats A but then does worse against other programs. I have had three or four of those just this week while making the new changes to Crafty's eval.
Note that "does worse" is not enough; we need statistically significant results to be sure it is not just statistical noise, especially when the difference against the other opponents is very small but in the same direction.
If you have specific data, it would be interesting to see it.
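Uri's noise concern can be quantified with the usual likelihood-of-superiority (LOS) calculation on match results. This is a sketch using the standard normal approximation on decisive games (draws cancel out of the numerator and only affect the error bar); the example counts are hypothetical.

```python
import math

def los(wins, losses):
    """Likelihood of superiority: approximate probability that the first
    engine is genuinely stronger, given decisive-game counts, under a
    normal approximation to the win/loss difference."""
    if wins + losses == 0:
        return 0.5
    return 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * (wins + losses))))

# Hypothetical match result: 180 wins, 150 losses (plus any number of draws).
print(f"LOS = {los(180, 150):.3f}")
```

A small score difference over few decisive games gives an LOS near 0.5, i.e. no real evidence either way, which is exactly the "same direction but very small" situation Uri warns about.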