No, not at all.Modern Times wrote:
Does the bug have any effect on normal chess ?
I have sent you the test version.
Moderators: hgm, Rebel, chrisw
No, not at all.Modern Times wrote:
Does the bug have any effect on normal chess ?
very small changes, mostly to evaluation.Don wrote:What has changed? We noticed that there are significant differences in 2.2.2 and this version with respect to playing characteristic.
For example, on a 1 ply search version 23 is about 30 ELO stronger when playing head to head (only 8000 games) and is over twice as fast.
That implies some evaluation improvements and either quies search performance improvements or perhaps simply startup costs have been reduced. One version of Komodo was very slow on sub 5 ply searches because we spend a lot of time obsessing over the ply 1 move list.
However 2.3 seems to retains a big speed advantage although not the 2 to 1 I see in 1 ply searches. It searches at least 1/4 ply more.
Can you tell us what the most impactful changes are?
I run a few thousands 1 ply games and my tester tracks the total time spent each program spent thinking. I don't go by the time returned by the engine as that is not done by any GUI.mcostalba wrote:very small changes, mostly to evaluation.Don wrote:What has changed? We noticed that there are significant differences in 2.2.2 and this version with respect to playing characteristic.
For example, on a 1 ply search version 23 is about 30 ELO stronger when playing head to head (only 8000 games) and is over twice as fast.
That implies some evaluation improvements and either quies search performance improvements or perhaps simply startup costs have been reduced. One version of Komodo was very slow on sub 5 ply searches because we spend a lot of time obsessing over the ply 1 move list.
However 2.3 seems to retains a big speed advantage although not the 2 to 1 I see in 1 ply searches. It searches at least 1/4 ply more.
Can you tell us what the most impactful changes are?
I take this occasion to publicly thank Gary Linscott for his evaluation patches: if there is an increase in this version is due to him !
Sorry Gary for the delay in giving you the very well due credit.
Regrading your test of 1 ply search I don't have any experience because I have never done it, but I don't unerstand how your 1 ply search test result is bounded with speed increase: if it is a fixed depth search speed should not count, or am I missing something ?
Using Linux, I show a huge gain in the direct matchup as you ran, at 5" + .05, about sixty elo. However when run against Komodo the gain drops to about thirty, and when run against Komodo at 15" + .15 it halves again to about 15. So I'll predict that it will show around 10 elo on the blitz rating lists.Laskos wrote:Non SSE compiles, ultra-bullet 2.5s + 0.04s gamesmcostalba wrote:mcostalba wrote: If you send me a pm with an email address I can send to you directly a version compiled by me (not fast as Jim but enough to confirm bug is fixed).
P.S: For people testing in normal chess there is no need to wait, go with the fastest version and have fun
26 +/-7 Elo points improvement at this TC, probably 10-15 Elo points improvement at longer TC.Code: Select all
Program Score % Elo + - Draws 1 Stockfish 2.3 JA 64bit : 3227.5/6000 53.8 3213 7 7 42.9 % 2 Stockfish 2.2.2 JA : 2772.5/6000 46.2 3187 7 7 42.9 %
Kai
I'd say better than that. I tested Stockfish VE_09 64-bit here, Stockfish 2.3 can only be better (hopefully)lkaufman wrote: So I'll predict that it will show around 10 elo on the blitz rating lists.
Actually, looking at your results, you show a negative progression for 2.1.1 to 2.2.1 to 2.2.2. But I think it is pretty clear that SF did make steady but small progress between those versions, so your 2.2.2 results is probably on the low end of the error bar. If so, a gain of 10 or perhaps 12 looks like a good prediction.Modern Times wrote:I'd say better than that. I tested Stockfish VE_09 64-bit here, Stockfish 2.3 can only be better (hopefully)lkaufman wrote: So I'll predict that it will show around 10 elo on the blitz rating lists.
http://www.computerchess.org.uk/ccrl/40 ... _length=30
We shall see.lkaufman wrote: If so, a gain of 10 or perhaps 12 looks like a good prediction.
lkaufman wrote:
Can you tell us what is the percentage increase in nodes per second you get going from 2.2.2. to 2.3? I wonder if my 10% increase in Linux applies to the Windows version.
It complains about a missing DLL so I can't run itmcostalba wrote:No, not at all.Modern Times wrote:
Does the bug have any effect on normal chess ?
I have sent you the test version.
My current elo gain at 15" for 2.3 is 18, which implies around 12 or so at blitz levels. But this is with a speedup on Linux (about 8% now). So I'll be surprised if SF 2.3 shows even +10 on Windows if you show a decrease in NPS. As you say, we'll see. Frankly we were hoping for a bigger gain by SF, as currently only Ippo-related programs are close to Komodo at hyperbullet speeds, and we could use an unrelated opponent closer in strength for testing.Modern Times wrote:We shall see.lkaufman wrote: If so, a gain of 10 or perhaps 12 looks like a good prediction.
lkaufman wrote:
Can you tell us what is the percentage increase in nodes per second you get going from 2.2.2. to 2.3? I wonder if my 10% increase in Linux applies to the Windows version.
2.3 shows a decrease in kn/s for me vs 2.2.2
I guess it's the cygwin1 missing?Modern Times wrote:It complains about a missing DLL so I can't run itmcostalba wrote:No, not at all.Modern Times wrote:
Does the bug have any effect on normal chess ?
I have sent you the test version.