Ferdy wrote:
can you elaborate what change is this?
my last change was to give a little penalty for have no pawns
Well, this is a idea I have but maybe not a good one. I thought that in a endgame, if I have not pawns, the winning could be more difficult, so try to hold the pawns until unavoidable could give more chances to my engine. Of course is a so subtle eval term that I thought it is less than a 5 ELO gain (if it seems good in a ton of games test).
Ferdy wrote:
How do you define a small changes? Is it because of the simple code or is it because of the elo that you got after running a lot of games?
It is because of the elo that you got after running a lot of games is less or equal than 5.
Ferdy wrote:
In my case if the idea is reasonable, generally I will accept the small elo improvement provided I will reach at least 10k test games, for example king safety eval changes. One idea to minimize number of test games and yet your change will be tested thoroughly is to use appropriate test suites. For example if your change is all about passed pawns, then try to select test positions where there is a big chance that passed pawns will arise on the game.
To have a very good chance to get a big elo boost, combine your changes on your eval and search, For example if you change something on search, implement also an eval change that could be related to what you change in your search, if you change your eval and add queen and knight combination attack to opp king, then in your search, try to avoid reducing knight moves (maneuvering moves) that try to attack opp king provided that the queen was already close to opp king, or in another case the last move was a queen that brings closer to opp king and now you have a knight that moves closer to opp king you may not reduce the knight move in this case.
Very good observations I will take into account. It looks natural to you and me that a change must be tested and look if it is reasonable. 10k games for you sounds like if you are trying only to see if there is somethink broke or not, because 10k games could be not enought.
What I am starting to do if samething similar to you, to check with many games (me around 5k, you 10k) and see if the change is reasonable, the do it.
Taking about king safety, this is a eval term I am thinking in making a complete rewrite. It seems to me a very deep one and testing in is more complex that I thought, because as you point, the search must be tunning in accordance. I have not accomplished yet, but it sounds I will need a lot of work to get something better than what I have now.
Thanks.