ELO inflation ha ha ha

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

Henk
Posts: 7216
Joined: Mon May 27, 2013 10:31 am

ELO inflation ha ha ha

Post by Henk »

TCEC rapid: Stockfish-Delphil 1/2-1/2 but ELO difference 900 points.

I can't imagine I would play a draw against a 800 player if my ELO rating would be 1700.
Henk
Posts: 7216
Joined: Mon May 27, 2013 10:31 am

Re: ELO inflation ha ha ha

Post by Henk »

Sorry Stockfish played with black.
JJJ
Posts: 1346
Joined: Sat Apr 19, 2014 1:47 pm

Re: ELO inflation ha ha ha

Post by JJJ »

Henk wrote:TCEC rapid: Stockfish-Delphil 1/2-1/2 but ELO difference 900 points.

I can't imagine I would play a draw against a 800 player if my ELO rating would be 1700.
1700 is more than double of 800
2300 is 72% of 3200

So it's still unlikely to see a draw, but it can happens on very rare occasion.
Henk
Posts: 7216
Joined: Mon May 27, 2013 10:31 am

Re: ELO inflation ha ha ha

Post by Henk »

Might be that Stockfish is already 'seeing' too much and that it assumes that it's opponent 'sees' that too. For instance it doesn't attack because it assumes it's opponent sees the right defense.
APassionForCriminalJustic
Posts: 417
Joined: Sat May 24, 2014 9:16 am

Re: ELO inflation ha ha ha

Post by APassionForCriminalJustic »

Henk wrote:TCEC rapid: Stockfish-Delphil 1/2-1/2 but ELO difference 900 points.

I can't imagine I would play a draw against a 800 player if my ELO rating would be 1700.
That's exactly what is wrong with your point. Delphi is not really "weak". An 800 rated player couldn't beat a piece of lettuce. Delphi can still play some pretty decent chess. It was a complete fluke. Stockfish would have been better served with massive contempt. Delphi would miss most of what Stockfish sees anyway.
APassionForCriminalJustic
Posts: 417
Joined: Sat May 24, 2014 9:16 am

Re: ELO inflation ha ha ha

Post by APassionForCriminalJustic »

Henk wrote:Might be that Stockfish is already 'seeing' too much and that it assumes that it's opponent 'sees' that too. For instance it doesn't attack because it assumes it's opponent sees the right defense.
Exactly.
User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: ELO inflation ha ha ha

Post by Laskos »

Henk wrote:TCEC rapid: Stockfish-Delphil 1/2-1/2 but ELO difference 900 points.

I can't imagine I would play a draw against a 800 player if my ELO rating would be 1700.
Difference is probably around 700 ELO points in those conditions (800 on CCRL, but hardware in TCEC is stronger, ELO difference compresses a bit). Translated it means some 3% to make a draw. Not negligible.
Henk
Posts: 7216
Joined: Mon May 27, 2013 10:31 am

Re: ELO inflation ha ha ha

Post by Henk »

Isn't the further away from average the less impact an ELO difference has. I guess ELO 1600 is average strength of a player or is it more like 1200.
User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: ELO inflation ha ha ha

Post by Laskos »

Henk wrote:Isn't the further away from average the less impact an ELO difference has. I guess ELO 1600 is average strength of a player or is it more like 1200.
I don't understand the question. With ELO, only the ELO difference counts, and the same ELO difference has the same winning percentage. So, if the ELO model is correct, then the score of 3800 against 3000 ELO is the same as 1800 versus 1000 ELO.
Henk
Posts: 7216
Joined: Mon May 27, 2013 10:31 am

Re: ELO inflation ha ha ha

Post by Henk »

Laskos wrote:
Henk wrote:Isn't the further away from average the less impact an ELO difference has. I guess ELO 1600 is average strength of a player or is it more like 1200.
I don't understand the question. With ELO, only the ELO difference counts, and the same ELO difference has the same winning percentage. So, if the ELO model is correct, then the score of 3800 against 3000 ELO is the same as 1800 versus 1000 ELO.
Someone told me that the chance of a draw from 2200 against 2600 player would be higher than a draw from 1800 against 2200 player. So that is nonsense.

By the way how do you convert ELO rating back into chance.