Movei testing

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

Tony Thomas

Movei testing

Post by Tony Thomas »

I tested the new version of Movei, it didnt perform as well as expected. However, it managed to score better than any other version. I am going to xxxx out double engines, so you can get a more accurate rank. It looks like Movei is ranked top 25 out of all engines that I have, unless I have another double. Movei's rating at my condition is 2624.

Code: Select all

10/13/2007 11:08:50 PM :

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 HiarcsX50UCI                   : 2859   52  51   171    71.9 %   2695   18.7 %
  2 XXXX                     : 2844   51  50   171    70.2 %   2696   19.9 %
  3 Fruit 2.3                      : 2839   55  54   130    69.6 %   2695   25.4 %
  4 TogaII 1.2 beta 2a KS/EHP      : 2838   47  46   180    69.4 %   2695   25.6 %
  5 Rybka v1.0 Beta.w32            : 2830   47  46   184    68.5 %   2695   22.8 %
  6 Shredder10UCI Balmung          : 2792   49  48   167    63.5 %   2696   21.6 %
  7 XXXXX                  : 2791   64  63   100    64.0 %   2691   20.0 %
  8 Ktulu 8.0                      : 2781   47  47   178    61.5 %   2700   19.7 %
  9 Chess Tiger 2007               : 2776   52  52   132    61.7 %   2693   26.5 %
 10 XXXX                     : 2772   56  55   114    61.4 %   2691   28.1 %
 11 Naum 2.2                       : 2768   51  51   130    60.0 %   2697   29.2 %
 12 xxxxxxxxxxxxxxx           : 2766   49  49   144    60.1 %   2695   28.5 %
 13 xxxxxxxx           : 2752   55  55   118    57.6 %   2698   25.4 %
 14 Spike 1.2 Turin                : 2750   45  45   179    57.3 %   2699   24.0 %
 15 xxxxxx                     : 2749   49  49   144    57.3 %   2698   27.1 %
 16 Smarthink 1.00                 : 2738   44  44   178    55.3 %   2701   26.4 %
 17 Glaurung 2 Epsilon/4           : 2721   58  58   108    53.7 %   2696   24.1 %
 18 xxxxxx          : 2699   53  53   130    50.0 %   2699   23.1 %
 19 Gandalf 6.01                   : 2695   46  46   170    49.4 %   2699   22.4 %
 20 List 5.12                      : 2694   53  53   118    49.2 %   2700   28.8 %
 21 Delfi 5.1                      : 2693   48  49   151    48.7 %   2702   24.5 %
 22 DeepSjeng27                    : 2693   52  52   126    49.2 %   2698   27.0 %
 23 Zappa_mexico                   : 2693   54  54   126    49.2 %   2698   22.2 %
 24 Scorpio 1.84 JA                : 2684   47  48   159    47.2 %   2704   23.9 %
 25 Pharaon 3.5.1                  : 2683   45  45   178    47.2 %   2702   24.7 %
 26 xxxxx                 : 2674   54  54   119    47.1 %   2694   26.9 %
 27 xxxxxx                    : 2669   50  51   143    45.8 %   2698   23.1 %
 28 xxxxxxx                      : 2667   76  77    66    43.2 %   2715   19.7 %
 29 xxxxxxxx                 : 2665   50  50   143    44.8 %   2702   23.8 %
 30 Ruffian 1.0.5                  : 2659   49  49   170    44.1 %   2700   15.3 %
 31 Prodeo 1.2                     : 2655   44  44   170    43.5 %   2700   29.4 %
 32 WildCat 7.0                    : 2653   49  49   170    43.2 %   2700   14.7 %
 33 CM10th D2Alos                  : 2653   46  46   170    43.2 %   2700   25.3 %
 34 SlowChess Blitz WV 2.1         : 2644   44  45   178    41.6 %   2703   25.8 %
 35 xxxxxxxxxxxxxx           : 2637   61  62   103    41.7 %   2695   19.4 %
 36 Thinker 4.7a                   : 2626   47  47   169    39.3 %   2701   23.1 %
 37 Movei00_8_438                  : 2624   55  56   120    38.8 %   2704   24.2 %
 38 xxxxxxxxxx                   : 2603   49  49   163    35.9 %   2704   21.5 %
 39 Aristarch 4.50                 : 2600   49  50   169    35.8 %   2702   18.3 %
 40 SOS 5.1                        : 2596   47  48   169    35.2 %   2702   24.3 %
 41 Trace 1.37a                    : 2591   47  48   169    34.6 %   2702   23.1 %
 42 Jonny 2.83                     : 2569   49  49   169    31.7 %   2702   21.9 %
 43 Frenzee 3.0                    : 2559   51  52   169    30.5 %   2703   17.2 %
 44 Scorpio 1.9                    : 2451   80  84    84    19.0 %   2702   16.7 %
pichy
Posts: 2564
Joined: Thu Mar 09, 2006 3:04 am

Re: Movei testing ( You probably need a lot of games........

Post by pichy »

SlowChess Blitz WV 2.1 scored better than Movei00_8_438 on your match , but Movei438 scored much better than SlowChess when I match them head to head. You probably need a lot of games instead of just a few games versus each opponent . Here SlowChess started strong against Movei438 but ended very weak :roll:


Engine Score Mo Sl S-B
1: Movei00_8_438 26.5/40 ········································ 001=11111=101101101=1=111100=10==1=10=11 357.75
2: Slow 13.5/40 110=00000=010010010=0=000011=01==0=01=00 ········································ 357.75

40 games played / Tournament finished
Name of the tournament: Arena tournament
Site/ Country: Jorge, United States
Level: Blitz 5/1
Hardware: AMD Athlon(tm) Processor 1202 MHz with 512MB Memory
Operating system: Microsoft Windows XP Professional Service Pack 2 (Build 2600)
PGN-File: C:\Program Files\Arena\Arena.pgn
Website:
E-Mail Address:

-------------------------------------------------------------------------------------
Tony Thomas wrote:I tested the new version of Movei, it didnt perform as well as expected. However, it managed to score better than any other version. I am going to xxxx out double engines, so you can get a more accurate rank. It looks like Movei is ranked top 25 out of all engines that I have, unless I have another double. Movei's rating at my condition is 2624.

Code: Select all

10/13/2007 11:08:50 PM :


    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 HiarcsX50UCI                   : 2859   52  51   171    71.9 %   2695   18.7 %
  2 XXXX                     : 2844   51  50   171    70.2 %   2696   19.9 %
  3 Fruit 2.3                      : 2839   55  54   130    69.6 %   2695   25.4 %
  4 TogaII 1.2 beta 2a KS/EHP      : 2838   47  46   180    69.4 %   2695   25.6 %
  5 Rybka v1.0 Beta.w32            : 2830   47  46   184    68.5 %   2695   22.8 %
  6 Shredder10UCI Balmung          : 2792   49  48   167    63.5 %   2696   21.6 %
  7 XXXXX                  : 2791   64  63   100    64.0 %   2691   20.0 %
  8 Ktulu 8.0                      : 2781   47  47   178    61.5 %   2700   19.7 %
  9 Chess Tiger 2007               : 2776   52  52   132    61.7 %   2693   26.5 %
 10 XXXX                     : 2772   56  55   114    61.4 %   2691   28.1 %
 11 Naum 2.2                       : 2768   51  51   130    60.0 %   2697   29.2 %
 12 xxxxxxxxxxxxxxx           : 2766   49  49   144    60.1 %   2695   28.5 %
 13 xxxxxxxx           : 2752   55  55   118    57.6 %   2698   25.4 %
 14 Spike 1.2 Turin                : 2750   45  45   179    57.3 %   2699   24.0 %
 15 xxxxxx                     : 2749   49  49   144    57.3 %   2698   27.1 %
 16 Smarthink 1.00                 : 2738   44  44   178    55.3 %   2701   26.4 %
 17 Glaurung 2 Epsilon/4           : 2721   58  58   108    53.7 %   2696   24.1 %
 18 xxxxxx          : 2699   53  53   130    50.0 %   2699   23.1 %
 19 Gandalf 6.01                   : 2695   46  46   170    49.4 %   2699   22.4 %
 20 List 5.12                      : 2694   53  53   118    49.2 %   2700   28.8 %
 21 Delfi 5.1                      : 2693   48  49   151    48.7 %   2702   24.5 %
 22 DeepSjeng27                    : 2693   52  52   126    49.2 %   2698   27.0 %
 23 Zappa_mexico                   : 2693   54  54   126    49.2 %   2698   22.2 %
 24 Scorpio 1.84 JA                : 2684   47  48   159    47.2 %   2704   23.9 %
 25 Pharaon 3.5.1                  : 2683   45  45   178    47.2 %   2702   24.7 %
 26 xxxxx                 : 2674   54  54   119    47.1 %   2694   26.9 %
 27 xxxxxx                    : 2669   50  51   143    45.8 %   2698   23.1 %
 28 xxxxxxx                      : 2667   76  77    66    43.2 %   2715   19.7 %
 29 xxxxxxxx                 : 2665   50  50   143    44.8 %   2702   23.8 %
 30 Ruffian 1.0.5                  : 2659   49  49   170    44.1 %   2700   15.3 %
 31 Prodeo 1.2                     : 2655   44  44   170    43.5 %   2700   29.4 %
 32 WildCat 7.0                    : 2653   49  49   170    43.2 %   2700   14.7 %
 33 CM10th D2Alos                  : 2653   46  46   170    43.2 %   2700   25.3 %
 34 SlowChess Blitz WV 2.1         : 2644   44  45   178    41.6 %   2703   25.8 %
 35 xxxxxxxxxxxxxx           : 2637   61  62   103    41.7 %   2695   19.4 %
 36 Thinker 4.7a                   : 2626   47  47   169    39.3 %   2701   23.1 %
 37 Movei00_8_438                  : 2624   55  56   120    38.8 %   2704   24.2 %
 38 xxxxxxxxxx                   : 2603   49  49   163    35.9 %   2704   21.5 %
 39 Aristarch 4.50                 : 2600   49  50   169    35.8 %   2702   18.3 %
 40 SOS 5.1                        : 2596   47  48   169    35.2 %   2702   24.3 %
 41 Trace 1.37a                    : 2591   47  48   169    34.6 %   2702   23.1 %
 42 Jonny 2.83                     : 2569   49  49   169    31.7 %   2702   21.9 %
 43 Frenzee 3.0                    : 2559   51  52   169    30.5 %   2703   17.2 %
 44 Scorpio 1.9                    : 2451   80  84    84    19.0 %   2702   16.7 %
Tony Thomas

Re: Movei testing

Post by Tony Thomas »

I think Slowchess and and movei are around the same strength. According to my rating list there is only a 20 points difference between them and it is well within the error bars. I am not a movei tester per se, I will leave the head to head matches to you.
GS

Re: Movei testing ( You probably need a lot of games........

Post by GS »

pichy wrote:Here SlowChess started strong against Movei438 but ended very weak :roll:


Engine Score Mo Sl S-B
1: Movei00_8_438 26.5/40 ········································ 001=11111=101101101=1=111100=10==1=10=11 357.75
2: Slow 13.5/40 110=00000=010010010=0=000011=01==0=01=00 ········································ 357.75
Interesting - the match you show says the opposite of your sentence above ;-)
Slow started bad and finished better actually.( It started with only 3/11! but
finsished with 4/11 ...)

Guenther
Spock

Re: Movei testing

Post by Spock »

Tony Thomas wrote:I think Slowchess and and movei are around the same strength. According to my rating list there is only a 20 points difference between them and it is well within the error bars. I am not a movei tester per se, I will leave the head to head matches to you.
At 40/40 we have Movei +5 ELO over Slowchess, so yes I agree they are about the same strength

Code: Select all

 Rank                Engine                 ELO   +    -   Score  AvOp  Games
    1 Movei 0.08.438                       2746  +22  -22  53.3%  -21.1   668
    2 Slow Chess Blitz WV2.1               2741  +15  -15  49.3%   +4.0  1492
pichy
Posts: 2564
Joined: Thu Mar 09, 2006 3:04 am

Re: Movei testing ( You probably need a lot of games........

Post by pichy »

GS wrote:
pichy wrote:Here SlowChess started strong against Movei438 but ended very weak :roll:


Engine Score Mo Sl S-B
1: Movei00_8_438 26.5/40 ········································ 001=11111=101101101=1=111100=10==1=10=11 357.75
2: Slow 13.5/40 110=00000=010010010=0=000011=01==0=01=00 ········································ 357.75
Interesting - the match you show says the opposite of your sentence above ;-)
Slow started bad and finished better actually.( It started with only 3/11! but
finsished with 4/11 ...)

Guenther

You are correct if you take the first 11 games, but I took the first 2 games and the last 2 :wink: Now, where do you think that Movei438 will end on this division :?:

http://wbec-ridderkerk.nl/html/1stdiv.htm

My guess is behind Cherss Tiger 2007 :wink:
Tony Thomas

Re: Movei testing ( You probably need a lot of games........

Post by Tony Thomas »

pichy wrote:
GS wrote:
pichy wrote:Here SlowChess started strong against Movei438 but ended very weak :roll:


Engine Score Mo Sl S-B
1: Movei00_8_438 26.5/40 ········································ 001=11111=101101101=1=111100=10==1=10=11 357.75
2: Slow 13.5/40 110=00000=010010010=0=000011=01==0=01=00 ········································ 357.75
Interesting - the match you show says the opposite of your sentence above ;-)
Slow started bad and finished better actually.( It started with only 3/11! but
finsished with 4/11 ...)

Guenther

You are correct if you take the first 11 games, but I took the first 2 games and the last 2 :wink: Now, where do you think that Movei438 will end on this division :?:

http://wbec-ridderkerk.nl/html/1stdiv.htm

My guess is behind Cherss Tiger 2007 :wink:
I hope so too. It is time again for Uri to make it to the premier edition, he would face serious fight from Bright, Delfi, Colossus and the new version of Prodeo. Most of the others didnt have any updates, but still we are talking about chess.
Tony Thomas

Re: Movei testing

Post by Tony Thomas »

Spock wrote:
Tony Thomas wrote:I think Slowchess and and movei are around the same strength. According to my rating list there is only a 20 points difference between them and it is well within the error bars. I am not a movei tester per se, I will leave the head to head matches to you.
At 40/40 we have Movei +5 ELO over Slowchess, so yes I agree they are about the same strength

Code: Select all

 Rank                Engine                 ELO   +    -   Score  AvOp  Games
    1 Movei 0.08.438                       2746  +22  -22  53.3%  -21.1   668
    2 Slow Chess Blitz WV2.1               2741  +15  -15  49.3%   +4.0  1492
Also, movei usually uses its own book in my testing. This time around, I couldnt make movei use own book, so I used Rybka.abk instead. It is possible that Movei is a nemesis opponent to slowchess. I would look at the CCRL and CGET results when ever I get a chance.