Parameter tuning

wgarvin · Post by **wgarvin** » Wed Mar 23, 2011 3:22 pm

FlavusSnow wrote:Does anyone know of a competitive engine whose parameters are loaded from an ini file? I'm a real fan of these types of tuning algorithms and I'd like to be able to just adjust an ini file instead of recompiling a program in order to tune it.

A related idea is to wrap the tweakable parameter constants in some kind of "Hot value" macro. That way you can easily have a "tweakable" build target, which might be slightly slower but allows you to tune without re-compiling, and then when you're done you just do one normal PGO build or whatever.

Kempelen · Post by **Kempelen** » Wed Mar 23, 2011 5:30 pm

I dont know much about parameter tunning, but in order to do so, I suppose eval must be compared with a big population of positions where result of the game is known (WIN, DRAW, LOSS).

What do you think is better to do engine parameter tunning, to compare agains a big population of engine-engine games or gm-games?

Rémi Coulom · Post by **Rémi Coulom** » Wed Mar 23, 2011 7:06 pm

Hi,

I did a lot of research on black-box parameter optimization last year. That is to say optimization from the win/loss/draw outcome of games. I suppose many people do it manually with tools such as bayeselo, choosing parameters values by hand. Clever automatic procedures can probably be more efficient than manually choosing parameter values.

My intuition is that regression-based approaches are the most efficient, but it is very difficult to come with a good stable algorithm. I am still working on mine.

I recommend the cross-entropy method. It is very simple and efficient:
http://www.personeel.unimaas.nl/m-winan ... rossmc.pdf

I found that the algorithm used by Guillaume can be improved by tricks described in that paper:
http://www.informs-sim.org/wsc09papers/044.pdf

This algorithm is not optimal, but it is certainly better than any genetic algorithm, or simulated annealing. Simulated annealing is not very efficient for noisy measurements of performance. It works for noiseless measurements, with tricky local optima, which is not the situation at all when optimizing a chess program. The cross-entropy method can be viewed as a kind of genetic algorithm, except that it is based on more solid theoretical principles.

Before trying with a real program, you should test algorithms on artificial functions. It is much more efficient.

Rémi

Jan Brouwer · Post by **Jan Brouwer** » Wed Mar 23, 2011 9:57 pm

Rémi Coulom wrote: Before trying with a real program, you should test algorithms on artificial functions. It is much more efficient.

I have been thinking about a chess program simulator to test tuning algorithms.
The simulator would simulate a set of chess programs, each with a certain ELO rating (unknown to the tuning algorithm).
When two programs play a game, the outcome (win/draw/loss) is determined by the the ELO rating difference between the two programs and a random component.

A first challenge is being able to pick a good program from a set of programs with a constant performance with a limited number of games
(if I am not mistaken this is referred to as the multi-armed bandit problem).

Next, we can assign a set of parameters to each program, which influences its playing strength.
Depending on the outcome of games, the tuning algorithm can modify the parameters.
The question then is, how to map the parameters of a program to an ELO rating such that it is a realistic model of a real chess program.
Probably best to start with a something simple like the ELO rating is inversely proportional to the distance to a single optimal set of parameter values.

And, as you mentioned, a simulator would be much faster faster than playing actual games, and it is easy to measure the performance of the tuning algorithm.

Jan

Parameter tuning

Re: Parameter tuning

Re: Parameter tuning

Re: Parameter tuning

Re: Parameter tuning