Vintage .... Rating List Winboard from June 1999 (16:42)

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

User avatar
Ovyron
Posts: 4556
Joined: Tue Jul 03, 2007 4:30 am

Re: Vintage .... Rating List Winboard from June 1999 (16:42)

Post by Ovyron »

Laskos wrote: Wed Aug 05, 2020 9:03 am They are veterans of 2000-2700 Elo ratings of top engines compared to humans, and the lower part of the table should be fairly accurate with respect to human ratings.
The problem is that it's possible all the things engine beyond 2900 level are doing are only good against each other. Exploiting their tactical and evaluation flaws, engine's exclusive flaws unrelated to humans. So a human could find more difficult to play a 3000 elo engine than a 3500 elo engine (those 500 elo that exploit weaker engines are irrelevant against humans), we don't know, but style could be more important to avoid any human draw.

If this is the case an engine like Benjamin could be capable of performing better against humans than Stockfish NNUE (what if no human can ever draw Benjamin? is that an infinite elo superiority?), and the latter's 1000 elo advantage would not matter.

To answer these questions we need to stop the handicap matches and instead get a GM to play an engine until they get a draw. The incentive can be that if they get a draw in the first 10 games they get big bucks, but every 10 games their prize is cut in half, until they quit because they'd not win much.
Alayan
Posts: 550
Joined: Tue Nov 19, 2019 8:48 pm
Full name: Alayan Feh

Re: Vintage .... Rating List Winboard from June 1999 (16:42)

Post by Alayan »

I'm of the opinion that chess959 GM vs engine tests would be the best way to ascertain raw chess skill (chess959 is chess960 without the standard start position).

The heavy memorization associated with the standard start position create move quality aberrations, with a massive drop in move strength once the player is out of book. While this memorization is part of a human chess player's strength in normal chess, it's creating severe distortions when attempting to compare ratings.

Obviously, GM would score much worse in chess959 than in regular chess, but the resulting strength estimate would be a much fairer assessment on their abilities in general positions they aren't already familiar with. It's also more logical when having them play against a bookless engine, because otherwise why not fit the engine with a very strong/tricky book.
Frank Quisinsky
Posts: 6808
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: Vintage .... Rating List Winboard from June 1999 (16:42)

Post by Frank Quisinsky »

Hi Kai,

no, no ...
Not a restart of FCP-Rating-List.

But indeed, I am thinking a longer time about it (restart FCP-Rating-List) because I have two i9-10900k.
But I need the other one most of time for other things.

I have interest to do:

1. One time in the year a FCP Tourney with 41.000 games (need around 5 months). For that reason it's important that the site for the first of that tourneys will be perfect. So it's very easy to use the work for the next tourneys.

2. On the other 7 months of the year I have interest to test Wasp for John. Not sure how long John will working on Wasp. But at the moment John have a lot of fun with his engine and I am thinking John will do that a longer time. With others words, if John is working on Wasp, I have fun in testing Wasp.

No interest to use two PCs for computer chess.
Energy is to expensive.
For looking on my TV ... ten still running matches on Intel i9-10900k is more as enough for me.

Best
Frank

PS: Much more interest to work with the FCP tourney database on Excel statistics with Klaus Wlotzka (Excel expert). Really an event to working with Klaus. FCP Tourney is important because for testing Wasp vs. the others I have my own ratings. Better as to use the ratings from the others I think.
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: Vintage .... Rating List Winboard from June 1999 (16:42)

Post by lkaufman »

Laskos wrote: Wed Aug 05, 2020 9:03 am
lkaufman wrote: Tue Aug 04, 2020 8:29 pm
Current SSDF rating list looks interesting, and I think they use something similar to ELOStat.
https://ssdf.bosjo.net/list.htm

They are veterans of 2000-2700 Elo ratings of top engines compared to humans, and the lower part of the table should be fairly accurate with respect to human ratings.

Here are some 165 engines listed using their database and ELOStat.

Code: Select all


    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Stockfish 11 x64 1800X         : 3447   27  25   330    76.4 %   3243   47.3 %
  2 Stockfish 10 x64 1800X         : 3415   18  17   640    72.8 %   3243   52.8 %
  3 Stockfish 8 MP 1800X           : 3372   20  20   780    81.7 %   3113   35.9 %
  4 Stockfish 9 x64 1800X          : 3367   16  15   802    70.6 %   3215   55.0 %
  5 Komodo 13.1 x64 1800X          : 3358   24  23   280    67.1 %   3234   62.1 %
  6 Komodo 11.01 MP 1800X          : 3333   18  18   814    77.4 %   3119   42.8 %
  7 Stockfish 9 x64 Q6600          : 3330   19  18   440    57.3 %   3279   66.4 %
  8 Stockfish 8 x64 1800X          : 3323   18  18   500    60.8 %   3247   64.0 %
  9 Komodo 12.3 x64 Q6600          : 3323   24  23   320    62.3 %   3235   60.9 %
 10 Komodo 12.3 x64 1800X          : 3322   16  16   680    67.1 %   3198   58.7 %
 11 Deep Shredder 13 1800X         : 3311   18  18   680    72.0 %   3148   49.0 %
 12 Komodo 9.1 MP Q6600            : 3297   16  16  1018    74.1 %   3114   43.9 %
 13 Komodo 11.01 x64 1800X         : 3294   19  19   500    55.8 %   3253   60.4 %
 14 Stockfish 8 x64 Q6600          : 3291   24  24   440    70.2 %   3142   45.9 %
 15 Komodo 13.02 MCTS x64 1800X    : 3285   22  21   360    61.5 %   3203   62.5 %
 16 Stockfish 6 MP Q6600           : 3266   16  16  1016    72.3 %   3099   44.7 %
 17 Komodo 11.01 MP Q6600          : 3263   21  21   442    63.1 %   3170   56.6 %
 18 Booot 6.3.1 x64 1800X          : 3236   13  13   920    49.8 %   3237   64.2 %
 19 Deep Shredder 13 Q6600         : 3230   17  17   804    65.2 %   3121   50.5 %
 20 Komodo 7 MP Q6600              : 3223   17  17   692    59.3 %   3158   55.6 %
 21 Vajolet2 2.8 x64 1800X         : 3185   19  19   650    38.1 %   3270   50.6 %
 22 Komodo 5.1 MP Q6600            : 3178   16  15   894    60.2 %   3106   53.1 %
 23 Booot 6.3.1 x64 Q6600          : 3175   29  29   320    58.0 %   3119   42.2 %
 24 Arasan 21.2 x64 1800X          : 3158   19  19   600    38.2 %   3242   51.8 %
 25 Vajolet2 2.8 x64 Q6600         : 3157   30  29   320    73.4 %   2981   41.2 %
 26 Deep Hiarcs 14 1800X           : 3147   17  17   720    42.3 %   3201   52.9 %
 27 Stockfish 3 MP Q6600           : 3139   14  14  1147    56.6 %   3093   48.5 %
 28 Deep Rybka 4 Q6600             : 3134   15  15   974    58.7 %   3073   52.7 %
 29 Deep Hiarcs 14 Q6600           : 3122   13  13  1434    58.6 %   3061   49.9 %
 30 Chiron 3.01 MP Q6600           : 3116   18  19   616    47.1 %   3136   54.5 %
 31 Deep Rybka 3 Q6600             : 3110   16  16  1056    71.3 %   2952   42.6 %
 32 Wasp 3.5 x64 1800X             : 3105   20  20   600    32.2 %   3234   49.2 %
 33 Wasp 2.01 MP 1800X             : 3100   20  20   646    36.5 %   3196   46.6 %
 34 Wasp 3 x64 1800X               : 3096   17  17   762    40.9 %   3160   54.1 %
 35 Naum 4.2 MP Q6600              : 3071   14  14  1071    58.5 %   3011   53.0 %
 36 Wasp 3.5 x64 Q6600             : 3068   26  26   320    43.6 %   3113   53.4 %
 37 Deep Junior Yokohama Q6600     : 3056   18  18   848    37.3 %   3146   42.8 %
 38 Deep Junior 13.3 Q6600         : 3048   13  13  1317    48.6 %   3058   50.6 %
 39 Spike 1.4 MP Q6600             : 3037   13  13  1565    50.9 %   3031   46.8 %
 40 Naum 4 MP Q6600                : 3035   14  14  1267    58.8 %   2973   45.1 %
 41 Deep Shredder 12 Q6600         : 3032   16  16   832    51.1 %   3024   52.6 %
 42 Deep Fritz 13 Q6600            : 3032   18  18   645    47.1 %   3052   56.4 %
 43 Hiarcs 13.1 MP Q6600           : 3024   16  16   770    50.7 %   3019   55.7 %
 44 Deep Hiarcs 13.2 Q6600         : 3024   18  18   776    46.3 %   3050   45.9 %
 45 Wasp 2.01 x64 1800X            : 3023   25  26   400    25.9 %   3206   44.8 %
 46 Hiarcs 14 A1200                : 3021   23  23   520    57.9 %   2966   41.5 %
 47 Deep Fritz 12 Q6600            : 3014   14  14  1078    48.3 %   3026   55.4 %
 48 Wasp 2.01 MP Q6600             : 3010   26  26   482    22.1 %   3229   35.9 %
 49 Deep Junior 12 Q6600           : 2995   16  16   940    52.1 %   2980   50.4 %
 50 Zappa Mexico II Q6600          : 2987   17  17   938    52.2 %   2972   44.3 %
 51 Naum 3.1 MP Q6600              : 2981   18  18   912    38.5 %   3062   39.5 %
 52 Deep Fritz 11 Q6600            : 2972   13  13  1418    60.8 %   2896   45.5 %
 53 Rybka 3 A1200                  : 2969   29  29   282    46.8 %   2991   49.6 %
 54 The Baron 3.43 x64 1800X       : 2962   22  23   680    25.7 %   3146   32.9 %
 55 Crafty 25.0 MP Q6600           : 2960   19  20   804    35.1 %   3066   36.8 %
 56 The Baron 3.43 x64 Q6600       : 2952   29  29   400    49.0 %   2959   27.0 %
 57 Deep Hiarcs 12 Q6600           : 2942   17  17   922    45.3 %   2975   44.1 %
 58 Deep Shredder 11 Q6600         : 2930   16  16  1004    47.8 %   2946   42.5 %
 59 Naum 4 A1200                   : 2922   25  25   440    42.0 %   2978   42.3 %
 60 Arasan 17.2 MP Q6600           : 2918   20  20   685    45.2 %   2952   42.5 %
 61 Hiarcs 11.2 MP Q6600           : 2917   16  16   980    50.9 %   2910   48.3 %
 62 Arasan 16 MP Q6600             : 2914   21  21   604    38.9 %   2993   43.7 %
 63 Shredder 12 A1200              : 2910   24  24   520    36.5 %   3006   38.1 %
 64 Fritz 13 A1200                 : 2897   32  32   280    65.2 %   2788   39.6 %
 65 Glaurung 2.2 MP Q6600          : 2897   17  17  1002    51.3 %   2888   34.5 %
 66 Deep Junior 10.1 Q6600         : 2885   19  19   846    46.8 %   2908   37.4 %
 67 Wasp 2.01 A1200                : 2872   23  23   560    53.8 %   2845   38.8 %
 68 Fritz 12 A1200                 : 2854   18  18   860    61.7 %   2771   41.7 %
 69 Rybka 2.3.1 A1200              : 2840   21  21   612    49.7 %   2842   39.5 %
 70 Jonny 4 MP Q6600               : 2821   20  20   860    29.7 %   2970   31.3 %
 71 Rybka 1.2 A1200                : 2818   24  24   535    63.7 %   2720   37.4 %
 72 Deep Fritz 8 Q6600             : 2809   19  19   786    40.1 %   2879   37.9 %
 73 Shredder 8 MP Q6600            : 2807   19  19   824    38.9 %   2886   36.8 %
 74 Hiarcs 11.1 UCI A1200          : 2782   25  25   362    56.5 %   2737   50.6 %
 75 CM King 3.5 MP Q6600           : 2779   19  19   932    29.6 %   2929   32.8 %
 76 Deep Junior 8 Q6600            : 2775   25  25   526    33.1 %   2897   32.3 %
 77 Hiarcs 11.1 A1200              : 2771   30  31   325    25.7 %   2956   39.1 %
 78 Junior 10.1 A1200              : 2750   22  22   679    50.6 %   2745   27.8 %
 79 Junior 10 A1200                : 2736   26  26   497    49.4 %   2740   29.6 %
 80 Zap!Chess Zanzibar A1200       : 2729   17  17  1038    46.9 %   2751   32.5 %
 81 Hiarcs 10 HypMod A1200         : 2728   18  18  1016    65.9 %   2614   34.6 %
 82 Shredder 10 UCI A1200          : 2722   19  19   867    55.8 %   2682   34.0 %
 83 Pro Deo 2.1 YAT A1200          : 2721   27  26   400    59.4 %   2655   40.2 %
 84 Shredder 8 A1200               : 2711   21  21   743    59.6 %   2643   31.2 %
 85 Fritz 9 A1200                  : 2710   19  19   835    51.3 %   2701   35.8 %
 86 Fruit 2.2.1 A1200              : 2709   18  18   940    59.8 %   2640   35.1 %
 87 Spike 1.2 A1200                : 2705   29  29   352    45.7 %   2735   38.1 %
 88 Shredder 9 UCI A1200           : 2704   16  16  1238    65.3 %   2595   32.1 %
 89 Pro Deo 2.0 A1200              : 2698   23  23   520    46.0 %   2726   39.6 %
 90 Shredder 7.04 UCI A1200        : 2692   22  22   635    64.6 %   2588   35.1 %
 91 Deep Fritz 8 A1200             : 2686   24  24   532    44.5 %   2724   32.3 %
 92 Junior 8 A1200                 : 2683   27  27   438    54.7 %   2650   30.8 %
 93 Junior 9 A1200                 : 2678   23  23   565    57.5 %   2625   34.3 %
 94 Chess Tiger 2007 A1200         : 2672   25  26   450    36.8 %   2766   38.9 %
 95 Deep Junior 8 A1200            : 2663   30  30   385    61.0 %   2585   29.6 %
 96 Shredder 7 A1200               : 2663   29  29   407    65.5 %   2551   29.2 %
 97 Pro Deo 1.86 A1200             : 2660   23  23   600    36.6 %   2755   32.8 %
 98 Spike 1.1 A1200                : 2659   27  27   400    50.9 %   2653   35.8 %
 99 Deep Fritz 7 A1200             : 2652   24  24   542    63.8 %   2554   36.5 %
100 Pro Deo 1.82 A1200             : 2650   25  25   440    44.3 %   2690   40.5 %
101 Fritz 8 A1200                  : 2642   20  20   825    51.8 %   2630   32.2 %
102 Fritz 7 A1200                  : 2631   26  26   400    52.5 %   2613   40.5 %
103 Gambit Tiger 2 A1200           : 2630   21  21   644    48.7 %   2639   38.7 %
104 Hiarcs 9 A1200                 : 2619   30  30   430    29.4 %   2771   26.7 %
105 Gandalf 6 A1200                : 2613   22  22   627    46.7 %   2636   36.5 %
106 Shredder 6 Pad UCI A1200       : 2610   23  23   569    56.4 %   2565   34.8 %
107 Shredder 6 A1200               : 2606   32  32   280    52.1 %   2591   37.1 %
108 Chess Tiger 15 A1200           : 2605   17  17   870    49.8 %   2606   45.1 %
109 Chess Tiger 2004 A1200         : 2603   19  19   712    55.5 %   2565   43.0 %
110 Pro Deo 1.1 A1200              : 2603   21  21   716    54.5 %   2572   34.4 %
111 Junior 7 A1200                 : 2602   21  21   701    49.5 %   2606   35.1 %
112 Deep Fritz A1200               : 2599   21  21   684    47.2 %   2619   36.8 %
113 Chess Tiger 14 CB A1200        : 2599   22  22   579    54.1 %   2570   40.4 %
114 Rebel 12 A1200                 : 2591   30  30   335    42.7 %   2643   35.2 %
115 Ruffian 1.0.1 A1200            : 2577   20  20   729    44.9 %   2612   35.0 %
116 Rebel Century 4 A1200          : 2567   26  26   448    58.1 %   2510   35.9 %
117 Hiarcs 8 A1200                 : 2563   24  24   529    46.6 %   2587   33.8 %
118 Deep Sjeng 1.5a A1200          : 2562   32  32   301    44.9 %   2598   35.2 %
119 Pocket Shredder Ipaq 114       : 2559   31  31   280    54.3 %   2529   41.4 %
120 Deep Fritz K6-2 450            : 2553   24  24   570    58.7 %   2492   29.3 %
121 Deep Fritz 7 K6-2 450          : 2553   32  32   282    41.1 %   2615   39.7 %
122 Shredder 5.32 A1200            : 2547   19  19   828    44.0 %   2589   36.5 %
123 Gandalf 4.32h A1200            : 2545   29  29   358    45.3 %   2578   36.9 %
124 Gandalf 5 A1200                : 2532   30  30   284    40.3 %   2600   44.7 %
125 Gambit Tiger 2 K6-2 450        : 2526   33  33   280    38.4 %   2608   35.4 %
126 Fritz 6 K6-2 450               : 2524   22  22   673    63.2 %   2430   34.6 %
127 Crafty 18.12 CB A1200          : 2504   25  25   468    42.4 %   2557   36.1 %
128 Gandalf 5.1 A1200              : 2497   28  28   376    45.7 %   2527   37.8 %
129 Shredder 5.32 K6-2 450         : 2488   30  30   366    38.7 %   2569   32.0 %
130 Junior 6 K6-2 450              : 2487   20  20   821    59.4 %   2421   32.8 %
131 Nimzo 7.32 K6-2 450            : 2461   25  25   482    56.3 %   2417   34.6 %
132 Fritz 5.32 K6-2 450            : 2453   31  31   296    51.9 %   2440   37.5 %
133 Junior 5 K6-2 450              : 2442   25  25   519    52.2 %   2427   32.8 %
134 Crafty 19.17 A1200             : 2435   33  33   282    29.3 %   2589   37.2 %
135 Hiarcs 7.32 K6-2 450           : 2430   27  27   426    49.4 %   2434   32.2 %
136 Nimzo 8 K6-2 450               : 2414   30  30   438    27.5 %   2582   26.3 %
137 Gandalf 4.32f K6-2 450         : 2404   29  29   358    46.2 %   2430   33.8 %
138 SOS K6-2 450                   : 2403   16  16  1538    26.4 %   2580   24.1 %
139 Goliath Light K6-2 450         : 2402   17  17  1403    26.3 %   2580   25.7 %
140 Fritz 5.32 P200 MMX            : 2395   26  26   504    34.9 %   2503   29.8 %
141 Crafty 17.07 CB K6-2 450       : 2385   24  24   558    38.1 %   2469   31.7 %
142 MChess Pro 8 K6-2 450          : 2369   27  27   459    34.3 %   2482   32.0 %
143 Fritz 5 P200 MMX               : 2362   25  25   535    69.4 %   2219   34.2 %
144 Hiarcs 7 P200 MMX              : 2350   33  33   291    56.4 %   2305   31.6 %
145 Crafty 18.12 CB K6-2 450       : 2346   33  34   514    14.1 %   2660   19.6 %
146 Junior 5 P200 MMX              : 2343   28  28   386    55.1 %   2308   33.9 %
147 Nimzo 99 P200 MMX              : 2331   26  26   500    45.4 %   2363   30.0 %
148 Nimzo 98 P200 MMX              : 2309   28  28   399    46.7 %   2332   31.3 %
149 Shredder 2 P200 MMX            : 2309   24  24   565    43.3 %   2356   29.2 %
150 Rebel 9 P200 MMX               : 2290   32  32   281    58.4 %   2232   37.7 %
151 Hiarcs 9.5a/9.6 Palm Tung E    : 2290   30  30   380    46.8 %   2312   28.4 %
152 CEBoard Crafty 2004 HP RX4240  : 2239   36  36   260    43.7 %   2284   29.6 %
153 Rebel 9 P90                    : 2224   23  23   596    45.0 %   2259   35.2 %
154 Rebel 8 P90                    : 2216   19  19   901    53.4 %   2193   29.5 %
155 MChess Pro 6 P90               : 2213   19  19   905    55.1 %   2177   31.9 %
156 Hiarcs 6 P90                   : 2211   21  21   709    49.0 %   2218   33.4 %
157 Genius 5 P90                   : 2211   19  19   871    53.7 %   2185   33.3 %
158 Nimzo 3 P90                    : 2177   31  31   330    57.4 %   2125   32.4 %
159 Nimzo 3.5 P90                  : 2172   22  22   636    47.8 %   2188   34.0 %
160 Fritz 3 P90                    : 2143   27  28   452    40.5 %   2210   28.3 %
161 Junior 3.3-3.5 P90             : 2132   31  31   363    47.0 %   2153   25.1 %
162 Palm Tiger 2009 Tung C         : 2106   37  37   260    41.5 %   2165   26.2 %
163 Mephisto London 68030 33 MHz   : 2095   31  31   359    42.5 %   2148   27.6 %
164 Rebel 7 486/66 MHz             : 2072   35  36   270    34.8 %   2181   31.1 %
165 Comet 32 P90                   : 1968   30  31   538    19.0 %   2220   20.8 %
How can you tell from the list how many threads were used? Some say "MP", some don't, does this mean that if "MP" isn't stated it is single thread? If it is "MP", does that always mean 4 thread? Do the processor numbers give a clue?
Komodo rules!
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: Vintage .... Rating List Winboard from June 1999 (16:42)

Post by lkaufman »

Ovyron wrote: Wed Aug 05, 2020 1:29 pm
Laskos wrote: Wed Aug 05, 2020 9:03 am They are veterans of 2000-2700 Elo ratings of top engines compared to humans, and the lower part of the table should be fairly accurate with respect to human ratings.
The problem is that it's possible all the things engine beyond 2900 level are doing are only good against each other. Exploiting their tactical and evaluation flaws, engine's exclusive flaws unrelated to humans. So a human could find more difficult to play a 3000 elo engine than a 3500 elo engine (those 500 elo that exploit weaker engines are irrelevant against humans), we don't know, but style could be more important to avoid any human draw.

If this is the case an engine like Benjamin could be capable of performing better against humans than Stockfish NNUE (what if no human can ever draw Benjamin? is that an infinite elo superiority?), and the latter's 1000 elo advantage would not matter.

To answer these questions we need to stop the handicap matches and instead get a GM to play an engine until they get a draw. The incentive can be that if they get a draw in the first 10 games they get big bucks, but every 10 games their prize is cut in half, until they quit because they'd not win much.
Measuring elo differences of more than around 200 by direct matches isn't very accurate, for multiple reasons. Beyond 191 elo, even winning every game with White and drawing every game with Black will lose elo. So the stronger player has to play bad openings as Black to minimize the risk of White reaching an easy draw just by memory. For engines, it means that special opening books to do this are needed, and then we're rating the book, not the engine. Ideally chess competition and ratings should change to reflect this problem; all parings could be two game matches with only the winner of the match counting, or all games could be replayed until someone wins (at faster time controls, or with some total time and increment for the match). The point is that White should never benefit from a draw, it is contrary to the logic of chess. This is not a big issue when the players are reasonably close in strength, as the top humans are, but if we have engines playing humans without a handicap it becomes a huge issue.
Komodo rules!
User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: Vintage .... Rating List Winboard from June 1999 (16:42)

Post by Laskos »

lkaufman wrote: Wed Aug 05, 2020 5:19 pm
Laskos wrote: Wed Aug 05, 2020 9:03 am
lkaufman wrote: Tue Aug 04, 2020 8:29 pm
Current SSDF rating list looks interesting, and I think they use something similar to ELOStat.
https://ssdf.bosjo.net/list.htm

They are veterans of 2000-2700 Elo ratings of top engines compared to humans, and the lower part of the table should be fairly accurate with respect to human ratings.

Here are some 165 engines listed using their database and ELOStat.

Code: Select all


    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Stockfish 11 x64 1800X         : 3447   27  25   330    76.4 %   3243   47.3 %
  2 Stockfish 10 x64 1800X         : 3415   18  17   640    72.8 %   3243   52.8 %
  3 Stockfish 8 MP 1800X           : 3372   20  20   780    81.7 %   3113   35.9 %
  4 Stockfish 9 x64 1800X          : 3367   16  15   802    70.6 %   3215   55.0 %
  5 Komodo 13.1 x64 1800X          : 3358   24  23   280    67.1 %   3234   62.1 %
  6 Komodo 11.01 MP 1800X          : 3333   18  18   814    77.4 %   3119   42.8 %
  7 Stockfish 9 x64 Q6600          : 3330   19  18   440    57.3 %   3279   66.4 %
  8 Stockfish 8 x64 1800X          : 3323   18  18   500    60.8 %   3247   64.0 %
  9 Komodo 12.3 x64 Q6600          : 3323   24  23   320    62.3 %   3235   60.9 %
 10 Komodo 12.3 x64 1800X          : 3322   16  16   680    67.1 %   3198   58.7 %
 11 Deep Shredder 13 1800X         : 3311   18  18   680    72.0 %   3148   49.0 %
 12 Komodo 9.1 MP Q6600            : 3297   16  16  1018    74.1 %   3114   43.9 %
 13 Komodo 11.01 x64 1800X         : 3294   19  19   500    55.8 %   3253   60.4 %
 14 Stockfish 8 x64 Q6600          : 3291   24  24   440    70.2 %   3142   45.9 %
 15 Komodo 13.02 MCTS x64 1800X    : 3285   22  21   360    61.5 %   3203   62.5 %
 16 Stockfish 6 MP Q6600           : 3266   16  16  1016    72.3 %   3099   44.7 %
 17 Komodo 11.01 MP Q6600          : 3263   21  21   442    63.1 %   3170   56.6 %
 18 Booot 6.3.1 x64 1800X          : 3236   13  13   920    49.8 %   3237   64.2 %
 19 Deep Shredder 13 Q6600         : 3230   17  17   804    65.2 %   3121   50.5 %
 20 Komodo 7 MP Q6600              : 3223   17  17   692    59.3 %   3158   55.6 %
 21 Vajolet2 2.8 x64 1800X         : 3185   19  19   650    38.1 %   3270   50.6 %
 22 Komodo 5.1 MP Q6600            : 3178   16  15   894    60.2 %   3106   53.1 %
 23 Booot 6.3.1 x64 Q6600          : 3175   29  29   320    58.0 %   3119   42.2 %
 24 Arasan 21.2 x64 1800X          : 3158   19  19   600    38.2 %   3242   51.8 %
 25 Vajolet2 2.8 x64 Q6600         : 3157   30  29   320    73.4 %   2981   41.2 %
 26 Deep Hiarcs 14 1800X           : 3147   17  17   720    42.3 %   3201   52.9 %
 27 Stockfish 3 MP Q6600           : 3139   14  14  1147    56.6 %   3093   48.5 %
 28 Deep Rybka 4 Q6600             : 3134   15  15   974    58.7 %   3073   52.7 %
 29 Deep Hiarcs 14 Q6600           : 3122   13  13  1434    58.6 %   3061   49.9 %
 30 Chiron 3.01 MP Q6600           : 3116   18  19   616    47.1 %   3136   54.5 %
 31 Deep Rybka 3 Q6600             : 3110   16  16  1056    71.3 %   2952   42.6 %
 32 Wasp 3.5 x64 1800X             : 3105   20  20   600    32.2 %   3234   49.2 %
 33 Wasp 2.01 MP 1800X             : 3100   20  20   646    36.5 %   3196   46.6 %
 34 Wasp 3 x64 1800X               : 3096   17  17   762    40.9 %   3160   54.1 %
 35 Naum 4.2 MP Q6600              : 3071   14  14  1071    58.5 %   3011   53.0 %
 36 Wasp 3.5 x64 Q6600             : 3068   26  26   320    43.6 %   3113   53.4 %
 37 Deep Junior Yokohama Q6600     : 3056   18  18   848    37.3 %   3146   42.8 %
 38 Deep Junior 13.3 Q6600         : 3048   13  13  1317    48.6 %   3058   50.6 %
 39 Spike 1.4 MP Q6600             : 3037   13  13  1565    50.9 %   3031   46.8 %
 40 Naum 4 MP Q6600                : 3035   14  14  1267    58.8 %   2973   45.1 %
 41 Deep Shredder 12 Q6600         : 3032   16  16   832    51.1 %   3024   52.6 %
 42 Deep Fritz 13 Q6600            : 3032   18  18   645    47.1 %   3052   56.4 %
 43 Hiarcs 13.1 MP Q6600           : 3024   16  16   770    50.7 %   3019   55.7 %
 44 Deep Hiarcs 13.2 Q6600         : 3024   18  18   776    46.3 %   3050   45.9 %
 45 Wasp 2.01 x64 1800X            : 3023   25  26   400    25.9 %   3206   44.8 %
 46 Hiarcs 14 A1200                : 3021   23  23   520    57.9 %   2966   41.5 %
 47 Deep Fritz 12 Q6600            : 3014   14  14  1078    48.3 %   3026   55.4 %
 48 Wasp 2.01 MP Q6600             : 3010   26  26   482    22.1 %   3229   35.9 %
 49 Deep Junior 12 Q6600           : 2995   16  16   940    52.1 %   2980   50.4 %
 50 Zappa Mexico II Q6600          : 2987   17  17   938    52.2 %   2972   44.3 %
 51 Naum 3.1 MP Q6600              : 2981   18  18   912    38.5 %   3062   39.5 %
 52 Deep Fritz 11 Q6600            : 2972   13  13  1418    60.8 %   2896   45.5 %
 53 Rybka 3 A1200                  : 2969   29  29   282    46.8 %   2991   49.6 %
 54 The Baron 3.43 x64 1800X       : 2962   22  23   680    25.7 %   3146   32.9 %
 55 Crafty 25.0 MP Q6600           : 2960   19  20   804    35.1 %   3066   36.8 %
 56 The Baron 3.43 x64 Q6600       : 2952   29  29   400    49.0 %   2959   27.0 %
 57 Deep Hiarcs 12 Q6600           : 2942   17  17   922    45.3 %   2975   44.1 %
 58 Deep Shredder 11 Q6600         : 2930   16  16  1004    47.8 %   2946   42.5 %
 59 Naum 4 A1200                   : 2922   25  25   440    42.0 %   2978   42.3 %
 60 Arasan 17.2 MP Q6600           : 2918   20  20   685    45.2 %   2952   42.5 %
 61 Hiarcs 11.2 MP Q6600           : 2917   16  16   980    50.9 %   2910   48.3 %
 62 Arasan 16 MP Q6600             : 2914   21  21   604    38.9 %   2993   43.7 %
 63 Shredder 12 A1200              : 2910   24  24   520    36.5 %   3006   38.1 %
 64 Fritz 13 A1200                 : 2897   32  32   280    65.2 %   2788   39.6 %
 65 Glaurung 2.2 MP Q6600          : 2897   17  17  1002    51.3 %   2888   34.5 %
 66 Deep Junior 10.1 Q6600         : 2885   19  19   846    46.8 %   2908   37.4 %
 67 Wasp 2.01 A1200                : 2872   23  23   560    53.8 %   2845   38.8 %
 68 Fritz 12 A1200                 : 2854   18  18   860    61.7 %   2771   41.7 %
 69 Rybka 2.3.1 A1200              : 2840   21  21   612    49.7 %   2842   39.5 %
 70 Jonny 4 MP Q6600               : 2821   20  20   860    29.7 %   2970   31.3 %
 71 Rybka 1.2 A1200                : 2818   24  24   535    63.7 %   2720   37.4 %
 72 Deep Fritz 8 Q6600             : 2809   19  19   786    40.1 %   2879   37.9 %
 73 Shredder 8 MP Q6600            : 2807   19  19   824    38.9 %   2886   36.8 %
 74 Hiarcs 11.1 UCI A1200          : 2782   25  25   362    56.5 %   2737   50.6 %
 75 CM King 3.5 MP Q6600           : 2779   19  19   932    29.6 %   2929   32.8 %
 76 Deep Junior 8 Q6600            : 2775   25  25   526    33.1 %   2897   32.3 %
 77 Hiarcs 11.1 A1200              : 2771   30  31   325    25.7 %   2956   39.1 %
 78 Junior 10.1 A1200              : 2750   22  22   679    50.6 %   2745   27.8 %
 79 Junior 10 A1200                : 2736   26  26   497    49.4 %   2740   29.6 %
 80 Zap!Chess Zanzibar A1200       : 2729   17  17  1038    46.9 %   2751   32.5 %
 81 Hiarcs 10 HypMod A1200         : 2728   18  18  1016    65.9 %   2614   34.6 %
 82 Shredder 10 UCI A1200          : 2722   19  19   867    55.8 %   2682   34.0 %
 83 Pro Deo 2.1 YAT A1200          : 2721   27  26   400    59.4 %   2655   40.2 %
 84 Shredder 8 A1200               : 2711   21  21   743    59.6 %   2643   31.2 %
 85 Fritz 9 A1200                  : 2710   19  19   835    51.3 %   2701   35.8 %
 86 Fruit 2.2.1 A1200              : 2709   18  18   940    59.8 %   2640   35.1 %
 87 Spike 1.2 A1200                : 2705   29  29   352    45.7 %   2735   38.1 %
 88 Shredder 9 UCI A1200           : 2704   16  16  1238    65.3 %   2595   32.1 %
 89 Pro Deo 2.0 A1200              : 2698   23  23   520    46.0 %   2726   39.6 %
 90 Shredder 7.04 UCI A1200        : 2692   22  22   635    64.6 %   2588   35.1 %
 91 Deep Fritz 8 A1200             : 2686   24  24   532    44.5 %   2724   32.3 %
 92 Junior 8 A1200                 : 2683   27  27   438    54.7 %   2650   30.8 %
 93 Junior 9 A1200                 : 2678   23  23   565    57.5 %   2625   34.3 %
 94 Chess Tiger 2007 A1200         : 2672   25  26   450    36.8 %   2766   38.9 %
 95 Deep Junior 8 A1200            : 2663   30  30   385    61.0 %   2585   29.6 %
 96 Shredder 7 A1200               : 2663   29  29   407    65.5 %   2551   29.2 %
 97 Pro Deo 1.86 A1200             : 2660   23  23   600    36.6 %   2755   32.8 %
 98 Spike 1.1 A1200                : 2659   27  27   400    50.9 %   2653   35.8 %
 99 Deep Fritz 7 A1200             : 2652   24  24   542    63.8 %   2554   36.5 %
100 Pro Deo 1.82 A1200             : 2650   25  25   440    44.3 %   2690   40.5 %
101 Fritz 8 A1200                  : 2642   20  20   825    51.8 %   2630   32.2 %
102 Fritz 7 A1200                  : 2631   26  26   400    52.5 %   2613   40.5 %
103 Gambit Tiger 2 A1200           : 2630   21  21   644    48.7 %   2639   38.7 %
104 Hiarcs 9 A1200                 : 2619   30  30   430    29.4 %   2771   26.7 %
105 Gandalf 6 A1200                : 2613   22  22   627    46.7 %   2636   36.5 %
106 Shredder 6 Pad UCI A1200       : 2610   23  23   569    56.4 %   2565   34.8 %
107 Shredder 6 A1200               : 2606   32  32   280    52.1 %   2591   37.1 %
108 Chess Tiger 15 A1200           : 2605   17  17   870    49.8 %   2606   45.1 %
109 Chess Tiger 2004 A1200         : 2603   19  19   712    55.5 %   2565   43.0 %
110 Pro Deo 1.1 A1200              : 2603   21  21   716    54.5 %   2572   34.4 %
111 Junior 7 A1200                 : 2602   21  21   701    49.5 %   2606   35.1 %
112 Deep Fritz A1200               : 2599   21  21   684    47.2 %   2619   36.8 %
113 Chess Tiger 14 CB A1200        : 2599   22  22   579    54.1 %   2570   40.4 %
114 Rebel 12 A1200                 : 2591   30  30   335    42.7 %   2643   35.2 %
115 Ruffian 1.0.1 A1200            : 2577   20  20   729    44.9 %   2612   35.0 %
116 Rebel Century 4 A1200          : 2567   26  26   448    58.1 %   2510   35.9 %
117 Hiarcs 8 A1200                 : 2563   24  24   529    46.6 %   2587   33.8 %
118 Deep Sjeng 1.5a A1200          : 2562   32  32   301    44.9 %   2598   35.2 %
119 Pocket Shredder Ipaq 114       : 2559   31  31   280    54.3 %   2529   41.4 %
120 Deep Fritz K6-2 450            : 2553   24  24   570    58.7 %   2492   29.3 %
121 Deep Fritz 7 K6-2 450          : 2553   32  32   282    41.1 %   2615   39.7 %
122 Shredder 5.32 A1200            : 2547   19  19   828    44.0 %   2589   36.5 %
123 Gandalf 4.32h A1200            : 2545   29  29   358    45.3 %   2578   36.9 %
124 Gandalf 5 A1200                : 2532   30  30   284    40.3 %   2600   44.7 %
125 Gambit Tiger 2 K6-2 450        : 2526   33  33   280    38.4 %   2608   35.4 %
126 Fritz 6 K6-2 450               : 2524   22  22   673    63.2 %   2430   34.6 %
127 Crafty 18.12 CB A1200          : 2504   25  25   468    42.4 %   2557   36.1 %
128 Gandalf 5.1 A1200              : 2497   28  28   376    45.7 %   2527   37.8 %
129 Shredder 5.32 K6-2 450         : 2488   30  30   366    38.7 %   2569   32.0 %
130 Junior 6 K6-2 450              : 2487   20  20   821    59.4 %   2421   32.8 %
131 Nimzo 7.32 K6-2 450            : 2461   25  25   482    56.3 %   2417   34.6 %
132 Fritz 5.32 K6-2 450            : 2453   31  31   296    51.9 %   2440   37.5 %
133 Junior 5 K6-2 450              : 2442   25  25   519    52.2 %   2427   32.8 %
134 Crafty 19.17 A1200             : 2435   33  33   282    29.3 %   2589   37.2 %
135 Hiarcs 7.32 K6-2 450           : 2430   27  27   426    49.4 %   2434   32.2 %
136 Nimzo 8 K6-2 450               : 2414   30  30   438    27.5 %   2582   26.3 %
137 Gandalf 4.32f K6-2 450         : 2404   29  29   358    46.2 %   2430   33.8 %
138 SOS K6-2 450                   : 2403   16  16  1538    26.4 %   2580   24.1 %
139 Goliath Light K6-2 450         : 2402   17  17  1403    26.3 %   2580   25.7 %
140 Fritz 5.32 P200 MMX            : 2395   26  26   504    34.9 %   2503   29.8 %
141 Crafty 17.07 CB K6-2 450       : 2385   24  24   558    38.1 %   2469   31.7 %
142 MChess Pro 8 K6-2 450          : 2369   27  27   459    34.3 %   2482   32.0 %
143 Fritz 5 P200 MMX               : 2362   25  25   535    69.4 %   2219   34.2 %
144 Hiarcs 7 P200 MMX              : 2350   33  33   291    56.4 %   2305   31.6 %
145 Crafty 18.12 CB K6-2 450       : 2346   33  34   514    14.1 %   2660   19.6 %
146 Junior 5 P200 MMX              : 2343   28  28   386    55.1 %   2308   33.9 %
147 Nimzo 99 P200 MMX              : 2331   26  26   500    45.4 %   2363   30.0 %
148 Nimzo 98 P200 MMX              : 2309   28  28   399    46.7 %   2332   31.3 %
149 Shredder 2 P200 MMX            : 2309   24  24   565    43.3 %   2356   29.2 %
150 Rebel 9 P200 MMX               : 2290   32  32   281    58.4 %   2232   37.7 %
151 Hiarcs 9.5a/9.6 Palm Tung E    : 2290   30  30   380    46.8 %   2312   28.4 %
152 CEBoard Crafty 2004 HP RX4240  : 2239   36  36   260    43.7 %   2284   29.6 %
153 Rebel 9 P90                    : 2224   23  23   596    45.0 %   2259   35.2 %
154 Rebel 8 P90                    : 2216   19  19   901    53.4 %   2193   29.5 %
155 MChess Pro 6 P90               : 2213   19  19   905    55.1 %   2177   31.9 %
156 Hiarcs 6 P90                   : 2211   21  21   709    49.0 %   2218   33.4 %
157 Genius 5 P90                   : 2211   19  19   871    53.7 %   2185   33.3 %
158 Nimzo 3 P90                    : 2177   31  31   330    57.4 %   2125   32.4 %
159 Nimzo 3.5 P90                  : 2172   22  22   636    47.8 %   2188   34.0 %
160 Fritz 3 P90                    : 2143   27  28   452    40.5 %   2210   28.3 %
161 Junior 3.3-3.5 P90             : 2132   31  31   363    47.0 %   2153   25.1 %
162 Palm Tiger 2009 Tung C         : 2106   37  37   260    41.5 %   2165   26.2 %
163 Mephisto London 68030 33 MHz   : 2095   31  31   359    42.5 %   2148   27.6 %
164 Rebel 7 486/66 MHz             : 2072   35  36   270    34.8 %   2181   31.1 %
165 Comet 32 P90                   : 1968   30  31   538    19.0 %   2220   20.8 %
How can you tell from the list how many threads were used? Some say "MP", some don't, does this mean that if "MP" isn't stated it is single thread? If it is "MP", does that always mean 4 thread? Do the processor numbers give a clue?
I understood that all 1800X are on all 8 cores, all Q6600 are on all 4 cores, the rest one core. AFAIK, they are very conservative, using only 2 hours games and full CPU for an engine with Ponder=ON (playing on 2 PCs). In fact pretty remarkably so.
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: Vintage .... Rating List Winboard from June 1999 (16:42)

Post by lkaufman »

Alayan wrote: Wed Aug 05, 2020 3:13 pm I'm of the opinion that chess959 GM vs engine tests would be the best way to ascertain raw chess skill (chess959 is chess960 without the standard start position).

The heavy memorization associated with the standard start position create move quality aberrations, with a massive drop in move strength once the player is out of book. While this memorization is part of a human chess player's strength in normal chess, it's creating severe distortions when attempting to compare ratings.

Obviously, GM would score much worse in chess959 than in regular chess, but the resulting strength estimate would be a much fairer assessment on their abilities in general positions they aren't already familiar with. It's also more logical when having them play against a bookless engine, because otherwise why not fit the engine with a very strong/tricky book.
We played four games of chess959 in June with Komodo 14 (both regular and MCTS) giving knight odds to GM Alex Lenderman (2642 FIDE Rapid) at 15' + 10". Lenderman scored two wins, one loss, and one draw (one win was vs. regular K, other games vs. mcts). Knight odds is more than 1000 elo at this level, so this gives some idea of "raw chess skill". At least at this rapid tc, a rating above 3500 looks justified.
Komodo rules!