Starting This Weekend- Another High-Powered Match!

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Rebel, chrisw

User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

Re: Komodo 5.1r2 vs. Stockfish 290613- Slight Lead Opened Up

Post by geots »

PaulieD wrote:
geots wrote:
PaulieD wrote:
geots wrote:5 more games finished and we are at the 39 game mark. Komodo 5.1r2 has managed to stretch his lead to 3 games, but with 61 games still to play I am not going to over-emphasize the meaning of it. Tho any lead is better than no lead at all. The million dollar question, with this dev. version of Stockfish being between 35 and 50 elo stronger than the last official Stockfish release- is just exactly how high can they go with elo increases. Well, enough talk and now to the update:




Alienware AURORA_R4
Intel i7 w/6 True Cores
Factory overclocked

Fritz 11 gui
6 Cores each/64bit
256MB hash
Bases=NONE
Ponder_Learning=OFF
Perfect 2012b.ctg w/12-move limit
40/19 Repeating- benchmarked to adapt to 40/40
Match=100 games




Code: Select all

Komodo 5.1r2 64-bit          +7/-4/=28  
Stockfish 290613 64 SSE4.2   +4/-7/=28

(39 games)


See you after some zzs-
It looks like the July versions are even stronger than the June versions...

500 games from recent Stockfish developmental versions

i3 Dual Core @2.53 GHz - 6.0GB RAM
W7 Home Premium X.64 SP-1
GUI: Little Blitzer 2.74
HASH: 64
TABLEBASES: None
BOOK: Nunn 2 Test Suite Reversing Colors
PONDER: OFF
Code:
Games Completed = 500 of 500 (Avg game length = 24.964 sec)
Settings = RR/64MB/17000ms in 200 moves/M 1000cp for 6 moves, D 150 moves/EPD:C:\Users\Paul\Documents\ChessBase\EPD Test Suites\Nunn2 20 position.epd(20)
Time = 13081 sec elapsed, 0 sec remaining
1. Stockfish 062313 122.0/251 50-57-144 (L: m=0 t=0 i=0 a=57) (D: r=120 i=17 f=2 s=3 a=2) (tpm=216.2 d=14.14 nps=1383321)
2. Stockfish 062913 121.5/251 45-53-153 (L: m=4 t=0 i=0 a=49) (D: r=121 i=19 f=8 s=2 a=3) (tpm=203.9 d=14.19 nps=1379037)
3. Stockfish 071113 130.5/250 65-54-131 (L: m=1 t=0 i=0 a=53) (D: r=109 i=14 f=6 s=2 a=0) (tpm=204.8 d=14.25 nps=1394732)
4. Stockfish 071313 126.0/248 52-48-148 (L: m=0 t=0 i=0 a=48) (D: r=114 i=22 f=8 s=1 a=3) (tpm=212.0 d=14.16 nps=1342525)





Paul, some nice work you have done. I only have one issue with it. Time controls such as the one you used here are fine, and they tell a programmer a lot about his engine during the beta testing process. He needs to get as many games as he possibly can. The problem is, there are a handful of engines that play much, much better at longer controls like 40/40 repeating or even the longer controls that FIDE uses. And Stockfish and Komodo are the 2 leaders in that respect. I know no engines that increase in strength with the longer controls anywhere close to the way Komodo and Stockfish do. The other that would come to mind that might be close to them is Deep Rybka 4.1. It too benefits greatly from the longer controls.

Again, there is not one thing wrong with your controls and your work. It just so happens that you are honing in on Stockfish in your tests- and for the reasons I gave- if you would use longer controls you would get a much better idea of the quality of play that Stockfish possesses. Just a thought you might consider.



All the best to you,

gts
George,
I understand about longer time controls, but in this case it is all the same engine (Stockfish) so ultra fast is fine to test the versions against each other to see which is better for selecting the version to put into a longer time control match like yours.
I do know that both Komodo and Stockfish are not as good at blitz or better when competing against lets say Houdini or other very strong engines.





I am not saying everyone agrees with me, but for me, I put no validity whatsoever in the results between engine versions from the same family. That is why I never run them against each other. Not even once. I am just not comfortable with the results.



Best,
User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

UPDATE @ 67 Games- Komodo 5.1r2 v Stockfish 290613!

Post by geots »

We Have 67 games behind us, leaving 33 more to be played. Komodo has not increased his lead, but Stockfish has not been able to make a dent in it yet. And he may be beginning to run out of time. Stockfish was not "lucky" to beat Houdini, IMO. He has a style of play that Houdini cannot figure out how to handle. Komodo seems to do better with it. Another issue- this Stockfish dev. version was created on the 29th of June. If there are any stronger out there now, I wonder which ones and by how much. At any rate.......................





Alienware AURORA_R4
Intel i7 w/6 True Cores
Factory overclocked

Fritz 11 gui
6 Cores each/64bit
256MB hash
Bases=NONE
Ponder_Learning=OFF
Perfect 2012b.ctg w/12-move limit
40/19 Repeating- benchmarked to adapt to 40/40
Match=100 games





Code: Select all

Komodo 5.1r2 64-bit          +13/-9/=45   
Stockfish 290613 64 SSE4.2   +9/-13/=45

More to follow-
PaulieD
Posts: 213
Joined: Tue Jun 25, 2013 8:19 pm

Re: UPDATE @ 67 Games- Komodo 5.1r2 v Stockfish 290613!

Post by PaulieD »

geots wrote: Another issue- this Stockfish dev. version was created on the 29th of June. If there are any stronger out there now, I wonder which ones and by how much. At any rate.......................
]
This was answered in my post.
User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

Re: UPDATE @ 67 Games- Komodo 5.1r2 v Stockfish 290613!

Post by geots »

PaulieD wrote:
geots wrote: Another issue- this Stockfish dev. version was created on the 29th of June. If there are any stronger out there now, I wonder which ones and by how much. At any rate.......................
]
This was answered in my post.



No offense meant, Paul. But it was not answered in your post, as you will see if you reread mine.


Best,
PaulieD
Posts: 213
Joined: Tue Jun 25, 2013 8:19 pm

Re: UPDATE @ 67 Games- Komodo 5.1r2 v Stockfish 290613!

Post by PaulieD »

The tests are valid George especially for version testing.
Enjoy your version of testing!
User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

UPDATE @ the 90 Game Mark- Day Late & A Dollar Short!

Post by geots »

I am afraid Stockfish has run out of time. At game 51- Komodo had a 4 game lead. At game 90- Komodo has a 4 game lead. Nothing has changed. Komodo cannot run away from him, but Stockfish cannot close the gap. They have played dead even for the last 40 games- but unfortunately that doesn't help Stockfish. To make up 4 games in the last 10, well..................................





Alienware AURORA_R4
Intel i7 w/6 True Cores
Factory overclocked

Fritz 11 gui
6 Cores each/64bit
256MB hash
Bases=NONE
Ponder_Learning=OFF
Perfect 2012b.ctg w/12-move limit
40/19 Repeating- benchmarked to adapt to 40/40
Match=100 games



Code: Select all

Komodo 5.1r2 64-bit          +19/-15/=56  
Stockfish 290613 64 SSE4.2   +15/-19/=56


At the very least- it should be an interesting last 10 games!
User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

UPDATE- Komodo 5.1r2 vs. Stockfish 290613-Completed!

Post by geots »

This match is now completed and we definitely have a winner- no drawn match. As soon as the info can be coded and filed, the FINAL results will be posted right here. We are probably looking at the next hour or so- give or take a little.


gts
User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

Komodo 5.1r2 v Stockfish 290613- FINAL Match Results!

Post by geots »

The match is completed and once Komodo got a 3 game lead, he never let Stockfish close the gap. He carried a 4 game lead thru most of the 2nd half of the match, and in the last stages managed to increase that a slight bit.

I think we really have a perfect scenario here, with a match between Houdini 3 and Komodo 5.1r2. Houdini wins, then he is back at No.1 where most think he belongs anyway. OTOH, if Komodo can beat Houdini 3, that will help cement his place as No.1. Which would certainly be more credible than slapping a No.1 sticker on Hiarcs or Deep Junior. At any rate, to the games................




Alienware AURORA_R4
Intel i7 w/6 True Cores
Factory overclocked
Fritz 11 gui
6 Cores each/64bit
256MB hash
Bases=NONE
Ponder_Learning=OFF
Perfect 2012b.ctg w/12-move limit
40/19 Repeating- benchmarked to adapt to 40/40
Match=100 games




Code: Select all

SP8-i7, 19'/40+19'/40+19'/40  7/13/2013  

                                
Komodo 5.1r2 64-bit          +24    +23/-16/=61   53.50%   53.5/100
Stockfish 290613 64 SSE4.2   -24    +16/-23/=61   46.50%   46.5/100

(100 games)


Before closing and getting ready for the Komodo v Houdini match congratulations must be given to Don and Larry for a job well done.



Back soon-
carldaman
Posts: 2283
Joined: Sat Jun 02, 2012 2:13 am

Re: Komodo 5.1r2 v Stockfish 290613- FINAL Match Results!

Post by carldaman »

Thanks for running such long slow time-control matches, George. Would it be possible to post the pgn as well if you get the chance? I'm sure some mighty strong chess was played. Much appreciated! :)

Thanks,
Carl
User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

Re: Komodo 5.1r2 v Stockfish 290613- FINAL Match Results!

Post by geots »

carldaman wrote:Thanks for running such long slow time-control matches, George. Would it be possible to post the pgn as well if you get the chance? I'm sure some mighty strong chess was played. Much appreciated! :)

Thanks,
Carl



Thanks for your interest Carl. Happy to make them available to you. 2 things to be aware of. I have never understood, but when you stop an engine-engine match in a chessbase gui, in the database the games still read 1-100. But in the PGNs, every time you stop and restart, it goes back to 1. I stopped maybe 3 times I believe- so in the PGNs it will show Round 1, 3 times and go from there. Other than that- it really doesn't matter, because there are a total of 100 games. When you restart, you just have to be sure you have the correct engine playing the white pieces. And also, I will admit I have not taken the time yet to check every game. I checked over half of them- but if you see anything wrong please alert me to it. The link:

https://dl.dropboxusercontent.com/u/115 ... SSE4.2.rar



Thanks again,

gts