For my table generator, my wild guess is that the 200+K pdep/pext approach is fastest. In my chess engine, shiftless magic bitboards with compact tables by Volker Annuss easily beats the "Hyperbola Quintessence" approach, but for my generator they are about equal (possibly Hyperbola is faster on AMD). So memory usage seems more of an issue for my generator.Gerd Isenberg wrote:Considering the huge L3, my wild guess is that a 800+K plain pext approach will be fastest in a real engine
Maybe pext for bishops, pdep/pext for rooks.
My plan is to wait for Haswell-E...