Absolute ELO scale

Guenther · Post by **Guenther** » Mon Dec 19, 2016 3:43 pm

hgm wrote:I don't think you can do that. There are plenty positions where repeating is the only way to save the draw. E.g. in KPK:

Say the above position results from a capture (Kxe3). Black's only chance to draw is take and keep opposition

1... Ke5! 2. Kf3 Kf5! 3. Ke3

This is not yet a repeat, because the previous time the white King was on e3 the black one was on e6, not on f5. But if black is now not allowed to play Ke5, because that will be a repetition, he has to play something else, after which white plays Ke4 and wins.

Well, but we are talking about the _worst_ player version.
It does not want to draw it always wants to lose.
Therefore it should avoid all repetitions, they just blow up the game length
against a random mover.

Vinvin · Post by **Vinvin** » Mon Dec 19, 2016 5:03 pm

nionita wrote:Is there any theoretical problem if we define ELO 0 = play strenght of an engine which plays in every (legal) position one of the legal moves following a random uniform probability distribution?

The problem is that a lof of engines, even very weak, would score >99.999% against this random engine.

hgm · Post by **hgm** » Mon Dec 19, 2016 7:35 pm

Guenther wrote:Well, but we are talking about the _worst_ player version.

Ah, sorry, I missed that. But when it selects moves with a low score, would't that make it automatically avoid moves that draw? I assume these would get a zero score.

Guenther · Post by **Guenther** » Mon Dec 19, 2016 8:06 pm

hgm wrote:
Guenther wrote:Well, but we are talking about the _worst_ player version.
Ah, sorry, I missed that. But when it selects moves with a low score, would't that make it automatically avoid moves that draw? I assume these would get a zero score.

Yes. I thought so too, but in the example game a few posts ago there are dozens of two time repetitions. It is possible though that it was due to
being a game between two 'worst players' versions (and not worst vs. random as I thought first) and there was no worse move except repeating once?
May be those two time repetitions won't happen against a random mover, but I haven't checked it yet...

No idea how much time Daniel wants to spend on this ;)

cdani · Post by **cdani** » Mon Dec 19, 2016 10:43 pm

Here they are:

Randscacs:
www.andscacs.com/randscacs089025.zip
Able to play arbitrarily long games, and also can play depth 1 random moves. Is updated with the current development version of Andscacs.

Andworst:
www.andscacs.com/andworst-0.3.zip
Able to play arbitrarily long games.

Laskos · Post by **Laskos** » Tue Dec 20, 2016 6:23 am

cdani wrote:Here they are:

Randscacs:
www.andscacs.com/randscacs089025.zip
Able to play arbitrarily long games, and also can play depth 1 random moves. Is updated with the current development version of Andscacs.

Andworst:
www.andscacs.com/andworst-0.3.zip
Able to play arbitrarily long games.

Great, Daniel! Works fine, I saw no disconnects. The longest game I saw was 1100+ moves in Cutechess-Cli.

Here are distributions of the game lengths:

And a reliable match result between A-worst and A-random:

Code: Select all

Score of A-Worst vs A-Random&#58; 0 - 812 - 188  &#91;0.094&#93; 1000
ELO difference&#58; -393.60 +/- 24.79
Finished match

The result is WLD

In A-Worst vs A-Worst most of the games (93%) were draws. So, we assist at draw death at both ends of the strength scale!

Laskos · Post by **Laskos** » Tue Dec 20, 2016 10:21 am

There seems to be no significant benefit from search beyond depth=1 for A-worst against A-random:

Code: Select all

Score of A-Worst depth=1 vs A-Random depth=1&#58; 1 - 3162 - 837  &#91;0.105&#93; 4000
ELO difference&#58; -372.48 +/- 11.69
Finished match

Score of A-Worst depth=7 vs A-Random depth=1&#58; 0 - 3218 - 782  &#91;0.098&#93; 4000
ELO difference&#58; -386.08 +/- 12.10
Finished match

Interesting, probably only a small benefit from depth=7. It's probably very hard to make use of search against random mover.

cdani · Post by **cdani** » Tue Dec 20, 2016 11:07 am

Laskos wrote:There seems to be no significant benefit from search beyond depth=1 for A-worst against A-random:
Code: Select all
Score of A-Worst depth=1 vs A-Random depth=1&#58; 1 - 3162 - 837  &#91;0.105&#93; 4000
ELO difference&#58; -372.48 +/- 11.69
Finished match

Score of A-Worst depth=7 vs A-Random depth=1&#58; 0 - 3218 - 782  &#91;0.098&#93; 4000
ELO difference&#58; -386.08 +/- 12.10
Finished match
Interesting, probably only a small benefit from depth=7. It's probably very hard to make use of search against random mover.

The algorithm of A-Worst sure can be improved. I have done just very basic changes over the main Andscacs.

Uri Blass · Post by **Uri Blass** » Tue Dec 20, 2016 11:08 pm

Laskos wrote:
cdani wrote:Here they are:

Randscacs:
www.andscacs.com/randscacs089025.zip
Able to play arbitrarily long games, and also can play depth 1 random moves. Is updated with the current development version of Andscacs.

Andworst:
www.andscacs.com/andworst-0.3.zip
Able to play arbitrarily long games.
Great, Daniel! Works fine, I saw no disconnects. The longest game I saw was 1100+ moves in Cutechess-Cli.

Here are distributions of the game lengths:

And a reliable match result between A-worst and A-random:
Code: Select all
Score of A-Worst vs A-Random&#58; 0 - 812 - 188  &#91;0.094&#93; 1000
ELO difference&#58; -393.60 +/- 24.79
Finished match
The result is WLD

In A-Worst vs A-Worst most of the games (93%) were draws. So, we assist at draw death at both ends of the strength scale!

If you want to lose the game the correct way is first to play well and later you have enough material to force the opponent to make checkmate against you.

I think that it should be easy to lose against A-worst by this tactics.

Guenther · Post by **Guenther** » Tue Dec 20, 2016 11:49 pm

Uri Blass wrote:
If you want to lose the game the correct way is first to play well and later you have enough material to force the opponent to make checkmate against you.

I think that it should be easy to lose against A-worst by this tactics.

This is right for losers chess and I also thought about this before, but I think in chess (different rules - no recapturing is required!) it's not that easy as it sounds.
I am tempted to try to lose against 'Andscacs worst' tomorrow, may be you can also try an example?

Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale

Re: Absolute ELO scale