Issue with Texel Tuning

Discussion of chess software programming and technical issues.

Moderator: Ras

ciorap0
Posts: 23
Joined: Sun Feb 12, 2023 11:10 am
Full name: Vlad Ciocoiu

Issue with Texel Tuning

Post by ciorap0 »

Hello TalkChess!

This is my first post here, I'm developing an engine you can find here: https://github.com/vladciocoiu/ciorap-bot

I am currently working on tuning the evaluation parameters, which is how I found the Texel tuning method.

This is the code that I use for tuning (please ask me if anything is unclear):

Code: Select all

    const int nParams = initialGuess.size();
    
    // E(params) = sum((sigmoid(eval(params)) - game_result)^2) for all positions
    // where sigmoid(x) = 1 / (1 + exp(-x / 400))
    
    double bestE = E(initialGuess);
    vector<int> bestParValues = initialGuess;
    bool improved = true;
    int iteration = 0;
    while ( improved ) {
        cout << "Iteration " << iteration << " started.\n";
        improved = false;
        for (int pi = 0; pi < nParams; pi++) {
            bool improvedParam;
            do {
                improvedParam = false;
                vector<int> newParValues = bestParValues;
                newParValues[pi] += 1;
                double newE = E(newParValues);

                if (newE < bestE) {
                    improvedParam = true;
                    cout << "Found better value at parameter " << pi << ": " << bestParValues[pi] << " -> " << newParValues[pi] << ", mse=" << newE << '\n';
                    bestE = newE;
                    bestParValues = newParValues;
                    improved = true;
                } else {
                    newParValues[pi] -= 2;
                    newE = E(newParValues);
                    if (newE < bestE) {
                        improvedParam = true;
                        cout << "Found better value at parameter " << pi << ": " << bestParValues[pi] << " -> " << newParValues[pi] << ", mse=" << newE << '\n';
                        bestE = newE;
                        bestParValues = newParValues;
                        improved = true;
                    }
                }
            } while(improvedParam);
        }
        iteration++;
    }
   return bestParValues;
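For readers following along, here is a minimal sketch of an error function consistent with the comments above. The `Example` struct, the precomputed scores, and the scaling constant K = 400 are assumptions for illustration; in the real tuner, E(params) would re-evaluate every position with the candidate parameter values rather than take precomputed scores:

```cpp
#include <cmath>
#include <vector>

// Hypothetical training example. In the actual tuner each position would be
// re-scored with the candidate parameters; here the score is precomputed
// purely to keep the sketch self-contained.
struct Example {
    double score;   // static eval of the position, in centipawns
    double result;  // game result from the side to move's view: 1.0, 0.5, or 0.0
};

double sigmoid(double x, double k = 400.0) {
    return 1.0 / (1.0 + std::exp(-x / k));
}

// E = mean squared error between sigmoid(eval) and the game result.
double E(const std::vector<Example>& data) {
    double sum = 0.0;
    for (const Example& ex : data) {
        double diff = sigmoid(ex.score) - ex.result;
        sum += diff * diff;
    }
    return sum / data.size();
}
```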
I got the training positions from 9000 matches against a stronger engine (+ 300 elo), and filtered out the positions where the quiescence score != evaluation score.

I have to mention that the parameters I'm trying to tune are the piece-square tables, king safety table, passed pawn bonuses, mobility bonuses, and other smaller features such as trapped minor piece penalties, tempo bonus, rook on open file bonus etc.

The problem is that every time I try to train these parameters, some of them become inexplicably large or small.

For example, the value for the queen on g3 in the piece-square table becomes -555 centipawns, and the tempo bonus becomes -107.

Has anyone else experienced this, or does anyone know what could cause it?

I would appreciate your help a lot. Thanks!
AndrewGrant
Posts: 1955
Joined: Tue Apr 19, 2016 6:08 am
Location: U.S.A
Full name: Andrew Grant

Re: Issue with Texel Tuning

Post by AndrewGrant »

Sounds like a classic case of overfitting due to a lack of data.
I would also suggest you work away from Texel tuning, and do something GD based.
Texel is very very dated.
JoAnnP38
Posts: 253
Joined: Mon Aug 26, 2019 4:34 pm
Location: Clearwater, Florida USA
Full name: JoAnn Peeler

Re: Issue with Texel Tuning

Post by JoAnnP38 »

AndrewGrant wrote: Mon Feb 13, 2023 5:09 am Sounds like a classic case of overfitting due to a lack of data.
I would also suggest you work away from Texel tuning, and do something GD based.
Texel is very very dated.
GD == "Gradient Descent" ???
AndrewGrant
Posts: 1955
Joined: Tue Apr 19, 2016 6:08 am
Location: U.S.A
Full name: Andrew Grant

Re: Issue with Texel Tuning

Post by AndrewGrant »

JoAnnP38 wrote: Mon Feb 13, 2023 7:34 am
AndrewGrant wrote: Mon Feb 13, 2023 5:09 am Sounds like a classic case of overfitting due to a lack of data.
I would also suggest you work away from Texel tuning, and do something GD based.
Texel is very very dated.
GD == "Gradient Descent" ???
Yes. Anything that approximates what you would do if you were training an NN.
Which, imo, a hand-crafted evaluation is.
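A gradient-descent version of the same idea can be sketched as follows. This assumes the evaluation is linear in its parameters (a dot product of a parameter vector with per-position feature counts), which is what makes the gradient of the Texel MSE loss cheap to compute; the `Sample` struct, K = 400, and the learning rate are illustrative assumptions:

```cpp
#include <cmath>
#include <vector>

// Hypothetical setup: each position is reduced to a feature vector so that
// eval(params, features) = dot(params, features), giving a closed-form gradient.
struct Sample {
    std::vector<double> features; // e.g. piece/square occurrence counts
    double result;                // game result: 1.0, 0.5, or 0.0
};

const double K = 400.0;

double sigmoid(double x) { return 1.0 / (1.0 + std::exp(-x / K)); }

double dot(const std::vector<double>& a, const std::vector<double>& b) {
    double s = 0.0;
    for (size_t i = 0; i < a.size(); i++) s += a[i] * b[i];
    return s;
}

// One full-batch gradient step on the Texel MSE loss.
void gradientStep(std::vector<double>& params,
                  const std::vector<Sample>& data, double lr) {
    std::vector<double> grad(params.size(), 0.0);
    for (const Sample& ex : data) {
        double s = sigmoid(dot(params, ex.features));
        // d/dparam_j of (s - result)^2 = 2*(s - result) * s*(1-s)/K * feature_j
        double coef = 2.0 * (s - ex.result) * s * (1.0 - s) / K;
        for (size_t j = 0; j < params.size(); j++)
            grad[j] += coef * ex.features[j];
    }
    for (size_t j = 0; j < params.size(); j++)
        params[j] -= lr * grad[j] / data.size();
}
```

Unlike the +1/-1 local search, one pass over the data updates every parameter at once, which is what makes this approach scale to large parameter sets.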
ciorap0
Posts: 23
Joined: Sun Feb 12, 2023 11:10 am
Full name: Vlad Ciocoiu

Re: Issue with Texel Tuning

Post by ciorap0 »

AndrewGrant wrote: Mon Feb 13, 2023 5:09 am Sounds like a classic case of overfitting due to a lack of data.
I would also suggest you work away from Texel tuning, and do something GD based.
Texel is very very dated.
So basically 670k positions aren't enough? I've heard of engines that improved a lot with far less, but maybe they had less complex evaluation functions to begin with.

Thanks for the advice! I'm gonna try and come up with a gradient descent algorithm, but do you think the training data that I currently have would be enough for that?
User avatar
lithander
Posts: 915
Joined: Sun Dec 27, 2020 2:40 am
Location: Bremen, Germany
Full name: Thomas Jahn

Re: Issue with Texel Tuning

Post by lithander »

ciorap0 wrote: Mon Feb 13, 2023 10:37 am So basically 670k positions aren't enough? I've heard of engines that improved a lot with far less, but maybe they had less complex evaluation functions to begin with.

Thanks for the advice! I'm gonna try and come up with a gradient descent algorithm, but do you think the training data that I currently have would be enough for that?
Depends on the quality of the data, not only the quantity. Your 9000 games against a stronger engine will only cover a subset of all possible positions, especially if no randomization is involved, which means that for some rare piece placements you won't have much data. Let's say you had only a dozen positions with a queen on g3, but all of them came from games the queen's side lost. Well, then your tuner will learn that putting a queen on g3 is a terrible idea!

You can verify the quality of your tuner with a proven dataset. And you can verify the quality of your dataset once the tuner is proven to work correctly. I wouldn't go two steps in one.
Minimal Chess (simple, open source, C#) - Youtube & Github
Leorik (competitive, in active development, C#) - Github & Lichess
ciorap0
Posts: 23
Joined: Sun Feb 12, 2023 11:10 am
Full name: Vlad Ciocoiu

Re: Issue with Texel Tuning

Post by ciorap0 »

lithander wrote: Mon Feb 13, 2023 11:06 am You can verify the quality of your tuner with a proven dataset. And you can verify the quality of your dataset once the tuner is proven to work correctly. I wouldn't go two steps in one.
I found the Zurichess dataset. Hope I will get better results with it, and with the GD algorithm :)
Whiskers
Posts: 243
Joined: Tue Jan 31, 2023 4:34 pm
Full name: Adam Kulju

Re: Issue with Texel Tuning

Post by Whiskers »

A good way to test your tuning algorithm is to let it change, say, only the value of a pawn and see whether the result is reasonable, or whether it manages to converge from a starting value that is very small or very large.
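That single-parameter sanity check might be sketched like this. The synthetic data, the one-parameter "pawn count" eval, and the "true" pawn value of 100 cp are all assumptions made so the test has a known answer:

```cpp
#include <cmath>
#include <vector>

// Synthetic check: positions where one side is up `pawnDiff` pawns, labelled
// with results generated from a "true" pawn value of 100 cp. A correct tuner
// started from an extreme value should walk back to 100.
struct Pos { int pawnDiff; double result; };

double sigmoid(double x) { return 1.0 / (1.0 + std::exp(-x / 400.0)); }

double E(int pawnValue, const std::vector<Pos>& data) {
    double sum = 0.0;
    for (const Pos& p : data) {
        double d = sigmoid(pawnValue * p.pawnDiff) - p.result;
        sum += d * d;
    }
    return sum / data.size();
}

// Same +1/-1 local search as the original tuner, but for one parameter only.
int tunePawn(int start, const std::vector<Pos>& data) {
    int best = start;
    double bestE = E(best, data);
    bool improved = true;
    while (improved) {
        improved = false;
        for (int step : {+1, -1}) {
            double e = E(best + step, data);
            if (e < bestE) { bestE = e; best += step; improved = true; break; }
        }
    }
    return best;
}
```

If the tuner can't recover a known value from clean synthetic data like this, there is no point debugging it against real game data.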

As for gradient descent, I found this forum extremely helpful for implementing it: http://www.talkchess.com/forum3/viewtop ... 24150faeca
JVMerlino
Posts: 1396
Joined: Wed Mar 08, 2006 10:15 pm
Location: San Francisco, California

Re: Issue with Texel Tuning

Post by JVMerlino »

Sigh - it took me years to wrap my brain around Texel tuning before I could feel good about trying to implement it. Now I'm back to "square zero" with GD - I don't understand it at all! :oops: :D
Fulvio
Posts: 396
Joined: Fri Aug 12, 2016 8:43 pm

Re: Issue with Texel Tuning

Post by Fulvio »

JVMerlino wrote: Tue Feb 14, 2023 6:58 pm Sigh - it took me years to wrap my brain around Texel tuning before I could feel good about trying to implement it. Now I'm back to "square zero" with GD - I don't understand it at all! :oops: :D
This is a great explanation imho: