With a modified Sim (I changed the set of positions), run at 100 ms/position, or at 1000 ms/position where marked x10, I am getting the following. It is maybe not the nicest picture of how similarity has developed alongside the strength progress of engines. All NN-enabled engines cluster together, and the Leela NN branch sits together with the NNUE branch. Earlier engines show more diversity than the current strongest ones. NINU 0.3 is the Night Nurse 0.3 net.
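For readers who have not used a similarity tester, here is a minimal sketch of the kind of number behind such a picture. It assumes similarity is simply the fraction of test positions on which two engines choose the same move. This is not taken from Sim's code, and the engine names, moves, and function names below are made up for illustration.

```python
from itertools import combinations


def move_match_rate(moves_a, moves_b):
    """Fraction of positions on which two engines chose the same move."""
    if len(moves_a) != len(moves_b):
        raise ValueError("both engines must be tested on the same positions")
    if not moves_a:
        raise ValueError("need at least one position")
    same = sum(1 for a, b in zip(moves_a, moves_b) if a == b)
    return same / len(moves_a)


def similarity_table(choices):
    """Pairwise match rates for a dict {engine name: list of chosen moves}."""
    return {
        (x, y): move_match_rate(choices[x], choices[y])
        for x, y in combinations(sorted(choices), 2)
    }


# Hypothetical data: one chosen move per test position, in the same order
# for every engine. Real runs would use thousands of positions.
choices = {
    "engine_a": ["e2e4", "g1f3", "d2d4", "c2c4"],
    "engine_b": ["e2e4", "g1f3", "d2d4", "b1c3"],
    "engine_c": ["d2d4", "g1f3", "f2f4", "b1c3"],
}
table = similarity_table(choices)
for pair, rate in table.items():
    print(pair, f"{rate:.0%}")
```

A table like this could then be fed to any hierarchical clustering routine to draw the branches. With a small position set the rates are noisy, which is one reason the choice of positions matters.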
That's a really good point!!
I found that all the NN engines I am looking at have problems with the complicated A80, A81 and E97-E99 openings.
Engines lost understanding with NN.
Might it be a good idea to put the best A80 or E99 lines into your test?
The best lines can be found really easily with our FEOBOS database and the ranking system we developed.
The FEOBOS positions are sorted by that ranking system.
Thank you again for the graphic and your work!
It is interesting what you do all the time (I am still a reader).
Best
Frank
I think classical engines also have problems with the KID and the Dutch, but NNUE engines are probably even worse in this respect, as these openings are a bit peculiar. NNUE engines excel in mainstream openings, where they are almost at the level of Lc0 positionally. But both NNUE engines and Lc0 underperform in positions that deviate from the usual openings, and in chess variants. I have probably more than 1000 KID openings; maybe I will build a Sim run that includes them in the set.
I often test Dutch lines to get a first impression of an engine or engine version that is unknown to me!
Maestro is SlowChess here: great understanding in 92 of my 100 Dutch test positions!
Not important for your thread!
But all I would like to say is ...
What you found seems to be a main problem with NN ideas.
At the end of the day, most chess programs do the same thing or lose their own face.
Madeleine Birchfield wrote: ↑Sun Nov 15, 2020 8:03 am
What about Seer, Halogen 8, and Minic 3?
I seem to be unable to run Seer 1.1 and Halogen 8.1 with Sim, even after tinkering with the Sim.tcl file. I am not sure what the matter is; maybe they are not fully UCI compliant. I was curious about them, as they seem to be original NNUE implementations.
Unfortunate, and entirely unsurprising with regard to the NNUE similarities. Perhaps there is some hope, though. Even though Komodo and Stockfish are, it appears, trained with the same code base, the differences between their evals, and in how those evals are used in training, are enough to give at least some diversity. You don't get quite near the level of intra-engine play. So one can return to the argument that "it's unique if it's trained on different data, even if the trainer is the same", which was the failed mantra of DeusX but seemed to work for Leelenstein and Allie.
#WeAreAllDraude #JusticeForDraude #RememberDraude #LeptirBigUltra "Those who can't do, clone instead" - Eduard ( A real life friend, not this forum's Eduard )