I am really sorry for perhaps asking some questions for which the answers from all you learned gentlemen might be far too obvious.  But perhaps it will help me understand what you are trying to achieve and enable me to try some other tests with dedicateds.  If someone would kindly provide some answers to a question below that I am asking based on your proposed test, I would appreciate this.
I have some interest in clone tests based on dedicated computer.  And here is a test I made a while back on a few, which is not too dissimilar to what you are now discussing.  But obviously very basic.
http://spacious-mind.com/html/gk_2100_clones_test.html
Now in my test I had several known clones and also two computers that are known to be totally different but of similar playing strength, and one previously suspected clone.
As you can see the real clones scored on average about 95%. playing back exactly the same moves. The related computer to the clones played back 78% percent of all the moves, the non related clones 65% and 60%.
Now here is my question with regards to your proposed tests.  If you assume that each moves was a board set-up move (game position) then for dedicateds the number 595 of your test would in my test suggest that all the computers are clones of each other.   Ponder was off in my test therefore each position was in reality a unique test position.
Therefore what I am not sure about is the number 595 and how in your test a good number could be provided because in my dedicated test that would actually mean that each computer in the test was in fact a clone?
Or did I understand your proposed test wrong?
Best regards
Nick