CEGT - rating lists March 16th 2008

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

bob
Posts: 20943
Joined: Mon Feb 27, 2006 7:30 pm
Location: Birmingham, AL

Re: SMP effectivity from Fruit

Post by bob »

Werner wrote:
bob wrote:The right way to measure SMP performance is to take a large set of positions, and search with 1, 2, 4, ..., N processors, and measure the time required to reach a specific depth for each position.
Hi Bob,
thanks for the answer. I took 10 positions to test it that was - so you think this is a too great error margin? How large must it be for around 10% error margin?

regards
What are you trying to measure? SMP speedup? I usually run 2-300 positions, and for a good test I use about 1200 or so.

If you are trying to see how SMP helps a program in terms of winning more games, I believe you need thousands... You can play the same starting position many times, as I use a set of 40 starting positions for my testing. But you need to run them many times to get any sort of accurate reading.
Uri Blass
Posts: 10893
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: SMP effectivity from Fruit

Post by Uri Blass »

bob wrote:
Werner wrote:Hi,
I took 10 positions from WM-Test and had following results:
1->2CPU: 142% and
2->4CPU: 146% average
What does this mean?

For example, if it takes 2 minutes to solve N positions, and it take 6 minutes to solve the other N, than a 2 minute search with 2x the processors is going to have no effect on the results.

The right way to measure SMP performance is to take a large set of positions, and search with 1, 2, 4, ..., N processors, and measure the time required to reach a specific depth for each position.

A _much_ less accurate way is to play a one cpu program against a bunch of opponents, then keep everything the same but use two cpus for that program and see if it performs better. It is much less accurate because the variance/randomness in a basic game of chess is extremely high. And when you factor in the randomness produced by a parallel search, it goes even higher. You need thousands of games (at a bare minimum) to get a reasonable estimate on improvement. Most can't pull that off.
You assume that depth means the same with 1 processor and more processors.

It is not obvious.
For example it may be possible that the program does less late move reductions with more processors.

Uri
User avatar
Daniel Mehrmann
Posts: 858
Joined: Wed Mar 08, 2006 9:24 pm
Location: Germany
Full name: Daniel Mehrmann

New Homer release

Post by Daniel Mehrmann »

Tony Thomas wrote:Any chances of a public release?? Homer like the previous Zappa (prior to Zappa Mexico II) struggles in blitz. It could be because you did not care about being good in fast time controls or that Homer's algorithms are ultra optimized for tournament time controls. Anthony was able to get his engine on par with the other major opponents after he spend few days on optimizing, I hope you will do the same. :wink:
Well, ok, there will be a new Homer version.

I promise it will be a > 2700 ELO engine. I never gone for elo hunting up to homer 2.x. Now its time to be a hunter. Checkout my homepage and enjoy....

http://www.homer-chess.com

Best,
Daniel
bob
Posts: 20943
Joined: Mon Feb 27, 2006 7:30 pm
Location: Birmingham, AL

Re: SMP effectivity from Fruit

Post by bob »

Uri Blass wrote:
bob wrote:
Werner wrote:Hi,
I took 10 positions from WM-Test and had following results:
1->2CPU: 142% and
2->4CPU: 146% average
What does this mean?

For example, if it takes 2 minutes to solve N positions, and it take 6 minutes to solve the other N, than a 2 minute search with 2x the processors is going to have no effect on the results.

The right way to measure SMP performance is to take a large set of positions, and search with 1, 2, 4, ..., N processors, and measure the time required to reach a specific depth for each position.

A _much_ less accurate way is to play a one cpu program against a bunch of opponents, then keep everything the same but use two cpus for that program and see if it performs better. It is much less accurate because the variance/randomness in a basic game of chess is extremely high. And when you factor in the randomness produced by a parallel search, it goes even higher. You need thousands of games (at a bare minimum) to get a reasonable estimate on improvement. Most can't pull that off.
You assume that depth means the same with 1 processor and more processors.

It is not obvious.
For example it may be possible that the program does less late move reductions with more processors.

Uri
I don't see why. But if you search the _same_ depth, why would there be any difference with respect to number of processors used? That gives an accurate SMP speedup number. Which is a different number than what you get if you try the same program against the same opponents, but you vary the number of processors. now, rather than measuring exact speedup, you are measuring the effect of searching faster with respect to Elo improvement. Which is also an interesting number, but is independent of the parallel speedup as it is generally defined.
User avatar
Werner
Posts: 2993
Joined: Wed Mar 08, 2006 10:09 pm
Location: Germany
Full name: Werner Schüle

Re: New Homer release

Post by Werner »

Daniel Mehrmann wrote: Well, ok, there will be a new Homer version.

I promise it will be a > 2700 ELO engine. I never gone for elo hunting up to homer 2.x. Now its time to be a hunter. Checkout my homepage and enjoy....
Hi Daniel,
hurry up, there is a great danger coming up in autumn :D
I'll make sure the Hustle 6 engine is still on the way, but the release of the ultimate engine has to be postponed until this autumn due to my lacking of time.
regards
Werner
Uri Blass
Posts: 10893
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: SMP effectivity from Fruit

Post by Uri Blass »

bob wrote:
Uri Blass wrote:
bob wrote:
Werner wrote:Hi,
I took 10 positions from WM-Test and had following results:
1->2CPU: 142% and
2->4CPU: 146% average
What does this mean?

For example, if it takes 2 minutes to solve N positions, and it take 6 minutes to solve the other N, than a 2 minute search with 2x the processors is going to have no effect on the results.

The right way to measure SMP performance is to take a large set of positions, and search with 1, 2, 4, ..., N processors, and measure the time required to reach a specific depth for each position.

A _much_ less accurate way is to play a one cpu program against a bunch of opponents, then keep everything the same but use two cpus for that program and see if it performs better. It is much less accurate because the variance/randomness in a basic game of chess is extremely high. And when you factor in the randomness produced by a parallel search, it goes even higher. You need thousands of games (at a bare minimum) to get a reasonable estimate on improvement. Most can't pull that off.
You assume that depth means the same with 1 processor and more processors.

It is not obvious.
For example it may be possible that the program does less late move reductions with more processors.

Uri
I don't see why. But if you search the _same_ depth, why would there be any difference with respect to number of processors used? That gives an accurate SMP speedup number. Which is a different number than what you get if you try the same program against the same opponents, but you vary the number of processors. now, rather than measuring exact speedup, you are measuring the effect of searching faster with respect to Elo improvement. Which is also an interesting number, but is independent of the parallel speedup as it is generally defined.
My poiint is that parallel search may change the tree so the same depth may not be equivalent.

if your late move reduction has a rule not to reduce the first 3 moves then with parallel search you may get different moves as the first 3 moves.

Uri
Tony Thomas

Re: New Homer release

Post by Tony Thomas »

Daniel Mehrmann wrote:
Tony Thomas wrote:Any chances of a public release?? Homer like the previous Zappa (prior to Zappa Mexico II) struggles in blitz. It could be because you did not care about being good in fast time controls or that Homer's algorithms are ultra optimized for tournament time controls. Anthony was able to get his engine on par with the other major opponents after he spend few days on optimizing, I hope you will do the same. :wink:
Well, ok, there will be a new Homer version.

I promise it will be a > 2700 ELO engine. I never gone for elo hunting up to homer 2.x. Now its time to be a hunter. Checkout my homepage and enjoy....

http://www.homer-chess.com

Best,
Daniel
Me and Michael goat to seduce someone who can make a video for Romichess.
Michael Sherwin
Posts: 3196
Joined: Fri May 26, 2006 3:00 am
Location: WY, USA
Full name: Michael Sherwin

Re: New Homer release

Post by Michael Sherwin »

Daniel Mehrmann wrote:
Tony Thomas wrote:Any chances of a public release?? Homer like the previous Zappa (prior to Zappa Mexico II) struggles in blitz. It could be because you did not care about being good in fast time controls or that Homer's algorithms are ultra optimized for tournament time controls. Anthony was able to get his engine on par with the other major opponents after he spend few days on optimizing, I hope you will do the same. :wink:
Well, ok, there will be a new Homer version.

I promise it will be a > 2700 ELO engine. I never gone for elo hunting up to homer 2.x. Now its time to be a hunter. Checkout my homepage and enjoy....

http://www.homer-chess.com

Best,
Daniel
:shock: No Daniel, don't do it. Don't abandon the cause! :(

Then I too will be forced to become an ELO hunter. :cry:

Then Homer will find the new Romi, even less friendly. :twisted:

Just kidding! :D hehe
If you are on a sidewalk and the covid goes beep beep
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
Tony Thomas

Re: New Homer release

Post by Tony Thomas »

Michael Sherwin wrote:
Daniel Mehrmann wrote:
Tony Thomas wrote:Any chances of a public release?? Homer like the previous Zappa (prior to Zappa Mexico II) struggles in blitz. It could be because you did not care about being good in fast time controls or that Homer's algorithms are ultra optimized for tournament time controls. Anthony was able to get his engine on par with the other major opponents after he spend few days on optimizing, I hope you will do the same. :wink:
Well, ok, there will be a new Homer version.

I promise it will be a > 2700 ELO engine. I never gone for elo hunting up to homer 2.x. Now its time to be a hunter. Checkout my homepage and enjoy....

http://www.homer-chess.com

Best,
Daniel
:shock: No Daniel, don't do it. Don't abandon the cause! :(

Then I too will be forced to become an ELO hunter. :cry:

Then Homer will find the new Romi, even less friendly. :twisted:

Just kidding! :D hehe
Only if you can manage the source of two engines at the same time. Why not continue programming on the current Romi? You can also release a goat powered Romi as well.
User avatar
Werner
Posts: 2993
Joined: Wed Mar 08, 2006 10:09 pm
Location: Germany
Full name: Werner Schüle

Re: New Homer release

Post by Werner »

What about

MkSherwin 9 MP 8-)

see more here:

http://64.68.157.89/forum/viewtopic.php ... 23&t=20252
Last edited by Werner on Thu Mar 20, 2008 7:51 pm, edited 1 time in total.
Werner