Texel tuning method question

sandermvdb · Post by **sandermvdb** » Mon Jun 05, 2017 8:13 pm

I just started to tune my evaluation parameters using the Texel tuning method. I am using the q-search score but the problem with this is that calculating the error for my testset (4 million positions) takes about 20 seconds.
The reason that this takes so long is that about 16 million positions are actually evaluated (q-search searches for attacks and check evasions). I skip bad captures in the q-search, but calculating the SEE-score of course also takes time.

Some people use a quiet set for this tuning method. Is this the reason why?
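For readers new to the method, the error being measured here can be sketched as follows. This is a minimal illustration, not sandermvdb's code: `score_fn` is a hypothetical callable standing in for either the q-search or the static eval, results are assumed to be 1.0, 0.5 or 0.0 from White's point of view, and the value of `K` is a placeholder that would normally be fitted to the data set once.

```python
K = 1.13  # scaling constant; assumed placeholder, normally fitted to the data


def expected_score(score_cp, k=K):
    """Map a centipawn score (White's point of view) to an expected result in [0, 1]."""
    return 1.0 / (1.0 + 10.0 ** (-k * score_cp / 400.0))


def mean_squared_error(samples, score_fn, k=K):
    """Average squared difference between game results and predicted results.

    samples  -- iterable of (position, result) pairs, result in {0.0, 0.5, 1.0}
    score_fn -- hypothetical callable returning a centipawn score for a position
                (a q-search score or a plain static evaluation)
    """
    total = 0.0
    count = 0
    for position, result in samples:
        total += (result - expected_score(score_fn(position), k)) ** 2
        count += 1
    return total / count
```

Every call to `mean_squared_error` runs `score_fn` once per position, so the cost of a single error measurement is dominated by whatever `score_fn` does. That is where the 16 million evaluations for 4 million positions come from when `score_fn` is a q-search.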

ZirconiumX · Post by **ZirconiumX** » Mon Jun 05, 2017 8:16 pm

sandermvdb wrote:I just started to tune my evaluation parameters using the Texel tuning method. I am using the q-search score but the problem with this is that calculating the error for my testset (4 million positions) takes about 20 seconds.
The reason that this takes so long is that about 16 million positions are actually evaluated (q-search searches for attacks and check evasions). I skip bad captures in the q-search, but calculating the SEE-score of course also takes time.

Some people use a quiet set for this tuning method. Is this the reason why?

I tried the Gaviota method of using the straight eval score rather than QS. It was much faster. Whether it was better is hard to say, because Texel tuning was a loss for me.

Ferdy · Post by **Ferdy** » Mon Jun 05, 2017 11:43 pm

sandermvdb wrote:I just started to tune my evaluation parameters using the Texel tuning method. I am using the q-search score but the problem with this is that calculating the error for my testset (4 million positions) takes about 20 seconds.
The reason that this takes so long is that about 16 million positions are actually evaluated (q-search searches for attacks and check evasions). I skip bad captures in the q-search, but calculating the SEE-score of course also takes time.

Some people use a quiet set for this tuning method. Is this the reason why?

Texel tuning is not a contest of doing it fast. 4 million in 20s is fine. You'd better worry about the diversity of the training positions that you use.

jdart · Post by **jdart** » Tue Jun 06, 2017 2:51 am

Right. 20 seconds is fast. Takes me maybe 10 minutes (on a big 24-core machine) but I am using a 2-ply search. I calculate the PV, and then do gradient descent based on the end-of-PV evals. Then periodically I re-calculate the PV as the parameters are tuned.

--Jon

sandermvdb · Post by **sandermvdb** » Tue Jun 06, 2017 8:00 am

So I guess using the local optimization method that is described on the CPW (Chess Programming Wiki) is not the way to go. Tuning ~400 parameters would take years!
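To make the cost concrete, here is a rough sketch of a coordinate-wise local search of the kind being referred to. It is an illustration under assumptions, not the wiki's exact code: `error_fn` is a hypothetical callable that returns the error for a full parameter list, and the step size and pass limit are arbitrary.

```python
def local_optimize(params, error_fn, step=1, max_passes=1000):
    """Nudge one parameter at a time by +/- step, keeping any change that lowers the error."""
    best = list(params)
    best_err = error_fn(best)
    for _ in range(max_passes):
        improved = False
        for i in range(len(best)):
            for delta in (step, -step):
                trial = list(best)
                trial[i] += delta
                err = error_fn(trial)  # one full pass over the test set
                if err < best_err:
                    best, best_err = trial, err
                    improved = True
                    break  # keep the improvement, move on to the next parameter
        if not improved:
            break  # no single-step change helps any more: local optimum
    return best, best_err
```

Each pass calls `error_fn` up to twice per parameter. With about 400 parameters and 20 seconds per call, that is up to 800 calls, or roughly 4.4 hours per pass, and many passes are usually needed before the search stops improving.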

PK · Post by **PK** » Tue Jun 06, 2017 10:03 am

Then be selective. Tune all the material values and see if it helps. After a couple of runs I got the following. The trend was to increase pawn and rook value in the endgame.

    values[P_MID] = 95;   // 95
    values[N_MID] = 310;  // 310
    values[B_MID] = 322;  // 320
    values[R_MID] = 514;  // 515
    values[Q_MID] = 1000;

    values[P_END] = 110;  // 106
    values[N_END] = 305;  // 305
    values[B_END] = 320;  // 320
    values[R_END] = 527;  // 520
    values[Q_END] = 1012; // 1010

    // Material adjustments

    values[B_PAIR]  = 51;
    values[N_PAIR]  = -9;
    values[R_PAIR]  = -9;
    values[ELEPH]  = 4;  // queen loses that much with each enemy minor on the board
    values[A_EXC]  = 29; // exchange advantage additional bonus
    values[A_MIN] = 53;  // additional bonus for minor piece advantage
    values[A_MAJ] = 60;  // additional bonus for major piece advantage
    values[A_TWO] = 44;  // additional bonus for two minors for a rook
    values[A_ALL] = 80;  // additional bonus for advantage in both majors and minors
    values[N_CL]  = 7;   // knight gains this much with each own pawn present on the board
    values[R_OP]  = 3;   // rook loses that much with each own pawn present on the board

sasachess · Post by **sasachess** » Tue Jun 06, 2017 10:56 am

I divided the tuning procedure into two steps:
1. Discarding unhelpful positions (initial position, fewer than 7 pieces, king in check, Eval != Quiesce, mate score)
2. Parameter tuning

The first step is executed only once: given an input.epd file with EPD positions and final results, it produces selected.epd with the selected positions and skipped.csv with the skipped ones.

The second step starts with selected.epd and produces tuned.csv with the tuned parameters.
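The selection step could look roughly like the sketch below. It is not sasachess's code: the record layout, the `MATE_THRESHOLD` value, and the choice to count kings and pawns as pieces are all assumptions, and the check flag and both scores are assumed to come from the engine beforehand. File reading and writing are left out.

```python
# Board, side to move, castling rights and en-passant field of the start position.
START_EPD_PREFIX = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq -"
MATE_THRESHOLD = 30000  # assumed: scores at or beyond this magnitude are mate scores


def piece_count(epd):
    """Count all men on the board, kings and pawns included (an assumption)."""
    board = epd.split()[0]
    return sum(ch.isalpha() for ch in board)


def skip_reason(epd, in_check, static_eval, qsearch_score):
    """Return why a position is skipped, or None if it should be kept."""
    if epd.startswith(START_EPD_PREFIX):
        return "initial position"
    if piece_count(epd) < 7:
        return "fewer than 7 pieces"
    if in_check:
        return "king in check"
    if abs(qsearch_score) >= MATE_THRESHOLD:
        return "mate score"
    if static_eval != qsearch_score:
        return "eval != quiesce"
    return None


def split_positions(records):
    """records: iterable of (epd, in_check, static_eval, qsearch_score) tuples."""
    selected, skipped = [], []
    for epd, in_check, static_eval, qsearch_score in records:
        reason = skip_reason(epd, in_check, static_eval, qsearch_score)
        if reason is None:
            selected.append(epd)
        else:
            skipped.append((epd, reason))
    return selected, skipped
```

Because only positions where the static eval already equals the q-search score survive, the tuning step can call the static eval directly on them, which sidesteps the q-search cost raised in the first post.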

Ferdy · Post by **Ferdy** » Tue Jun 06, 2017 6:03 pm

jdart wrote:Right. 20 seconds is fast. Takes me maybe 10 minutes (on a big 24-core machine) but I am using a 2-ply search. I calculate the PV, and then do gradient descent based on the end-of-PV evals. Then periodically I re-calculate the PV as the parameters are tuned.

--Jon

Interesting. By "end-of-PV evals", do you mean you used your static evaluation function on the position at the end of the PV?

Desperado · Post by **Desperado** » Tue Jun 06, 2017 7:43 pm

Ferdy wrote:
jdart wrote:Right. 20 seconds is fast. Takes me maybe 10 minutes (on a big 24-core machine) but I am using a 2-ply search. I calculate the PV, and then do gradient descent based on the end-of-PV evals. Then periodically I re-calculate the PV as the parameters are tuned.

--Jon
Interesting. By "end-of-PV evals", do you mean you used your static evaluation function on the position at the end of the PV?

Maybe I should think about it twice, but the PV eval should be passed to the root as the search result. So at first glance I don't see how the "eval at the end of the PV" differs from the search result score.

And isn't the score of any line (including the PV, of course) computed by the static evaluation at its final node?!

So, what am I missing?

AlvaroBegue · Post by **AlvaroBegue** » Tue Jun 06, 2017 8:18 pm

Desperado wrote:
Ferdy wrote:
jdart wrote:Right. 20 seconds is fast. Takes me maybe 10 minutes (on a big 24-core machine) but I am using a 2-ply search. I calculate the PV, and then do gradient descent based on the end-of-PV evals. Then periodically I re-calculate the PV as the parameters are tuned.

--Jon
Interesting. By "end-of-PV evals", do you mean you used your static evaluation function on the position at the end of the PV?
Maybe I should think about it twice, but the PV eval should be passed to the root as the search result. So at first glance I don't see how the "eval at the end of the PV" differs from the search result score.

And isn't the score of any line (including the PV, of course) computed by the static evaluation at its final node?!

So, what am I missing?

The trick is doing the gradient descent. While it would be possible to do it on the search function itself, it would be hard to make that efficient. So instead, you need to recover what position gave the eval that was propagated to the root, and then compute the gradient of the evaluation function at that node.
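A minimal sketch of that idea, under the simplifying assumption that the evaluation is linear in its parameters (score = weights · features of the leaf position). The feature vectors, the learning rate and the value of `K` are illustrative placeholders, and this is not taken from any engine mentioned in the thread.

```python
import math

K = 1.13  # scaling constant; assumed placeholder


def sigmoid(score_cp, k=K):
    """Map a centipawn score to an expected result in [0, 1]."""
    return 1.0 / (1.0 + 10.0 ** (-k * score_cp / 400.0))


def gradient_step(weights, leaves, learning_rate, k=K):
    """One gradient-descent step on the mean squared error.

    leaves -- list of (features, result) pairs, where features is the feature
              vector of the PV leaf whose eval was propagated to the root,
              and result is the game result (1.0, 0.5 or 0.0)
    """
    grad = [0.0] * len(weights)
    for features, result in leaves:
        score = sum(w * f for w, f in zip(weights, features))
        p = sigmoid(score, k)
        # Derivative of the sigmoid with respect to the score.
        dp_dscore = p * (1.0 - p) * k * math.log(10.0) / 400.0
        # Chain rule for (result - p)^2; d(score)/d(weight_i) is just feature_i.
        common = -2.0 * (result - p) * dp_dscore
        for i, f in enumerate(features):
            grad[i] += common * f
    n = len(leaves)
    return [w - learning_rate * g / n for w, g in zip(weights, grad)]
```

The leaf features stay fixed between steps, so each step costs only one pass over stored vectors instead of a search per position. Since changing the weights can change which line is the PV, the leaves would have to be recomputed from time to time, as jdart describes.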
