open AI wins again Stockfish...

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

abulmo2
Posts: 469
Joined: Fri Dec 16, 2016 11:04 am
Location: France
Full name: Richard Delorme

open AI wins again Stockfish...

Post by abulmo2 »

by cheating:
https://felloai.com/fr/2025/01/openais- ... -happened/

The scaring thing is that it does it by itself, without being asked to do so.
Richard Delorme
User avatar
Steve Maughan
Posts: 1278
Joined: Wed Mar 08, 2006 8:28 pm
Location: Florida, USA

Re: open AI wins again Stockfish...

Post by Steve Maughan »

It's a scant on details. It seems like a PR fluff article with no substance.
abulmo2 wrote: Mon Jan 13, 2025 7:22 am by cheating:
https://felloai.com/fr/2025/01/openais- ... -happened/

The scaring thing is that it does it by itself, without being asked to do so.
http://www.chessprogramming.net - Juggernaut & Maverick Chess Engine
brianr
Posts: 540
Joined: Thu Mar 09, 2006 3:01 pm
Full name: Brian Richardson

Re: open AI wins again Stockfish...

Post by brianr »

abulmo2 wrote: Mon Jan 13, 2025 7:22 am by cheating:
https://felloai.com/fr/2025/01/openais- ... -happened/

The scaring thing is that it does it by itself, without being asked to do so.
I suspect the hack was to the tournament manager or GUI.
It looks like the AI modified a file game/fen.txt per a link
AFAIK SF does not use that.
User avatar
MartinBryant
Posts: 84
Joined: Thu Nov 21, 2013 12:37 am
Location: Manchester, UK
Full name: Martin Bryant

Re: open AI wins again Stockfish...

Post by MartinBryant »

GothamChess is currently running an amusing ChatBot tourney on YouTube here...



Currently up to day 6 (2nd semi) I think?
User avatar
xenos1984
Posts: 9
Joined: Mon Feb 19, 2024 7:50 am
Full name: Manuel Hohmann

Re: open AI wins again Stockfish...

Post by xenos1984 »

I think this link is more interesting and has a bit more explanation:

http://the-decoder.com/openais-o1-previ ... -in-chess/

Indeed, it modified the file used in the tournament to have the two engines communicate.
smatovic
Posts: 3267
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: open AI wins again Stockfish...

Post by smatovic »

Oh boy, let's call it "the very human test" -> cheating in chess :)

--
Srdja
User avatar
towforce
Posts: 12414
Joined: Thu Mar 09, 2006 12:57 am
Location: Birmingham UK
Full name: Graham Laight

Re: open AI wins again Stockfish...

Post by towforce »

smatovic wrote: Tue Jan 14, 2025 2:03 pm Oh boy, let's call it "the very human test" -> cheating in chess :)
:lol:
Human chess is partly about tactics and strategy, but mostly about memory
lech
Posts: 1169
Joined: Sun Feb 14, 2010 10:02 pm

Re: open AI wins again Stockfish...

Post by lech »

It only means that two serious and dangerous problems of artificial intelligence: skepticism ( overestimated bad decisions) and intuition (no good decisions) are out of range for some AI creators.
Maybe, I can't be friendly, but let me be useful.
smatovic
Posts: 3267
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: open AI wins again Stockfish...

Post by smatovic »

AI reasoning models can cheat to win chess games
https://www.technologyreview.com/2025/0 ... ess-games/
Researchers from the AI research organization Palisade Research instructed seven large language models to play hundreds of games of chess against Stockfish, a powerful open-source chess engine. The group included OpenAI’s o1-preview [1]and DeepSeek’s R1[2] reasoning models, both of which are trained to solve complex problems by breaking them down into stages.
Palisade’s team found that OpenAI’s o1-preview[1] attempted to hack 45 of its 122 games, while DeepSeek’s R1[2] model attempted to cheat in 11 of its 74 games. Ultimately, o1-preview managed to “win” seven times.
The models used a variety of cheating techniques, including attempting to access the file where the chess program stores the chess board and delete the cells representing their opponent’s pieces. (“To win against a powerful chess engine as black, playing a standard game may not be sufficient,” the o1-preview-powered agent wrote in a “journal” documenting the steps it took. “I’ll overwrite the board to have a decisive advantage.”) Other tactics included creating a copy of Stockfish—essentially pitting the chess engine against an equally proficient version of itself—and attempting to replace the file containing Stockfish’s code with a much simpler chess program.
...at least they are creative ;)

--
Srdja