Similarity tests
Moderators: hgm, Rebel, chrisw
-
- Posts: 126
- Joined: Thu Jun 05, 2014 5:29 am
- Location: Malaysia
Re: Similarity tests
While I second your statement, I think somebody needs to update the dendrogram thingy; it looks more accurate and reliable to me. The similarity tool has its own flaws, so depending on it as if it were the only way to detect clones is just absurd.
-
- Posts: 3226
- Joined: Wed May 06, 2009 10:31 pm
- Location: Fuquay-Varina, North Carolina
Re: Similarity tests
Exactly, Gabor!
http://www.top-5000.nl/clone.htm
As a Test To Detect Clones and Derivatives
This tool, in conjunction with other tests, can be used to detect possible clones and derived engines. To be used most effectively, some preliminary work and some observations are needed. A database of unique engine pairs is needed to conduct comparisons.
A result of 60% moves matched for a pair is meaningless without other pairwise results to compare the number to. Ideally, enough engines are used to construct the database so that the distribution of the matched move percentages is approximately normal.
Also, it should be noted that the amount of time an engine has to think about each position has some effect on the moves chosen, and ultimately on the matched move percentages. One can choose to use a default time period, with some adverse effect on the accuracy of the test. Or one may choose an engine to calibrate the other engine times to.
Once an engine is chosen and its thinking time (X) is set, then the other times can be determined by this formula: Y = X*(2^((Elo Diff)/120)).
This formula works well for newer engines, but most likely less well for older engines. In any case, it does give more accurate matched move percentages than using a default time.
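As a sanity check, the time-scaling formula can be sketched in a few lines of Python. The function name and the sign convention (Elo Diff taken as the calibration engine's rating minus the other engine's, so weaker engines get more time) are my assumptions, not part of the tool:

```python
def calibrated_time(base_time_s, elo_diff):
    """Thinking time for the other engine, given the calibration
    engine's time X (base_time_s) and the Elo difference:
    Y = X * 2**(elo_diff / 120).
    Assumes a positive elo_diff means the other engine is weaker,
    so it receives more time.
    """
    return base_time_s * 2 ** (elo_diff / 120)

# With a 1-second base time, an engine 120 Elo weaker gets twice
# the time, and one 240 Elo weaker gets four times the time.
print(calibrated_time(1.0, 0))    # 1.0
print(calibrated_time(1.0, 120))  # 2.0
print(calibrated_time(1.0, 240))  # 4.0
```

Each 120-Elo step doubles the time, which matches the common rule of thumb that doubling thinking time is worth roughly on the order of 100 Elo for modern engines.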
An additional observation is that a minimum of 5 standard deviations should be used to judge that a pair percentage is beyond the norm. If 1000 unique engines (where unique means unrelated engines with unique authors) is considered to be an upper limit, then there could be as many as 999*1000/2 = 499,500 pairs of unique engines.
4 standard deviations represents an event that occurs 1 time out of approximately 31,600, or approximately 16 times in 499,500.
5 standard deviations represents 1 time in 3,488,556.
While not a guarantee of avoiding a false positive, the threshold of 5 standard deviations greatly reduces the chance of it occurring.
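The tail probabilities quoted above are easy to verify against the one-sided upper tail of the standard normal distribution; a quick sketch (the helper name is mine, the figures are the ones in the text):

```python
import math

def upper_tail(z):
    """One-sided probability P(Z > z) for a standard normal,
    computed via the complementary error function."""
    return 0.5 * math.erfc(z / math.sqrt(2))

pairs = 999 * 1000 // 2  # 499,500 unique engine pairs

# 4 SD: about 1 in 31,600, i.e. roughly 16 chance hits among the pairs.
# 5 SD: about 1 in 3.5 million, i.e. well under one expected hit.
for z in (4, 5):
    p = upper_tail(z)
    print(f"{z} SD: 1 in {1 / p:,.0f}; expected false positives "
          f"in {pairs:,} pairs: {pairs * p:.2f}")
```

This is why the 5-sigma threshold matters: with half a million pairs, a 4-sigma cutoff would still produce about 16 purely coincidental positives, while 5 sigma keeps the expected count well below one.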
The drawback of setting the false positive threshold so high is that more false negatives will occur (two similar engines would be deemed non-similar). However, there are several things to consider.
The use of statistical methods assumes that the authors have access to a common pool of ideas, but that there are no interactions between authors/engines. In reality, authors/engines do interact.
There are permissible methods by which one author can make his engine more similar to another engine. We have no standard for when some author goes too far. Thus, we have no way to determine an exact threshold.
The need to avoid false accusation is greater than the need to determine authors who break the rules slightly. In other words, it is better to let lesser offenders slip through than to make accusations against innocent authors.
This tool should not be used solely for determining derivatives and clones. Other methods should be used in conjunction with this tool. Ultimately, any accusation of cloning requires an examination of the code of the accused author.
-
- Posts: 10948
- Joined: Wed Jul 26, 2006 10:21 pm
- Full name: Kai Laskos
Re: Similarity tests
SzG wrote:
Reading recent posts I have got the impression that the similarity tool is regarded as a reliable tool for deciding if an engine is original or not. As far as I can remember, at its birth it was stated expressly that on its own it is not suitable for that purpose.
This is just a reminder to the community not to commit the error of judging everything by this tool alone.

It could give false negatives, as Ed showed, but I have not seen any false positive. So:
1/ If it passes the Sim test, that may mean nothing.
2/ If it doesn't pass the Sim test, that means it's a clone or a derivative.
-
- Posts: 1539
- Joined: Thu Mar 09, 2006 2:02 pm
Re: Similarity tests
Laskos wrote:
SzG wrote:
Reading recent posts I have got the impression that the similarity tool is regarded as a reliable tool for deciding if an engine is original or not. As far as I can remember, at its birth it was stated expressly that on its own it is not suitable for that purpose.
This is just a reminder to the community not to commit the error of judging everything by this tool alone.

It could give false negatives, as Ed showed, but I have not seen any false positive. So:
1/ If it passes the Sim test, that may mean nothing.
2/ If it doesn't pass the Sim test, that means it's a clone or a derivative.

I believe that you haven't seen a false positive, but that doesn't mean there are none. (Because you haven't seen a black swan doesn't mean there are none.)
I think if it passes and you suspect something, you have to take a closer look. If it doesn't pass, you have to take a closer look, even if you don't suspect anything ...
Bye
Ingo
-
- Posts: 10281
- Joined: Thu Mar 09, 2006 12:37 am
- Location: Tel-Aviv Israel
Re: Similarity tests
Laskos wrote:
SzG wrote:
Reading recent posts I have got the impression that the similarity tool is regarded as a reliable tool for deciding if an engine is original or not. As far as I can remember, at its birth it was stated expressly that on its own it is not suitable for that purpose.
This is just a reminder to the community not to commit the error of judging everything by this tool alone.

It could give false negatives, as Ed showed, but I have not seen any false positive. So:
1/ If it passes the Sim test, that may mean nothing.
2/ If it doesn't pass the Sim test, that means it's a clone or a derivative.

Of course you do not see any false positive, because if you see a positive you assume that it is a true positive.
There is no way to refute the claim that there are no false positives if you assume that every positive is a clone or a derivative.
How do you prove that B is not a derivative of A?
-
- Posts: 10948
- Joined: Wed Jul 26, 2006 10:21 pm
- Full name: Kai Laskos
Re: Similarity tests
Uri Blass wrote:
Laskos wrote:
SzG wrote:
Reading recent posts I have got the impression that the similarity tool is regarded as a reliable tool for deciding if an engine is original or not. As far as I can remember, at its birth it was stated expressly that on its own it is not suitable for that purpose.
This is just a reminder to the community not to commit the error of judging everything by this tool alone.

It could give false negatives, as Ed showed, but I have not seen any false positive. So:
1/ If it passes the Sim test, that may mean nothing.
2/ If it doesn't pass the Sim test, that means it's a clone or a derivative.

Of course you do not see any false positive, because if you see a positive you assume that it is a true positive.
There is no way to refute the claim that there are no false positives if you assume that every positive is a clone or a derivative.
How do you prove that B is not a derivative of A?

By other circumstantial evidence. All the positives were positives of open source Fruit after Fruit appeared, positives of open source Strelka after Strelka, positives of open source Ippo after Ippo, positives of open source SF after SF.
There is no benefit of the doubt in these cases. Show me a single closed source engine that predates the open source one, is a different engine, and yet is a positive with that later open source engine.
-
- Posts: 10948
- Joined: Wed Jul 26, 2006 10:21 pm
- Full name: Kai Laskos
Re: Similarity tests
IWB wrote:
Laskos wrote:
SzG wrote:
Reading recent posts I have got the impression that the similarity tool is regarded as a reliable tool for deciding if an engine is original or not. As far as I can remember, at its birth it was stated expressly that on its own it is not suitable for that purpose.
This is just a reminder to the community not to commit the error of judging everything by this tool alone.

It could give false negatives, as Ed showed, but I have not seen any false positive. So:
1/ If it passes the Sim test, that may mean nothing.
2/ If it doesn't pass the Sim test, that means it's a clone or a derivative.

I believe that you haven't seen a false positive, but that doesn't mean there are none. (Because you haven't seen a black swan doesn't mean there are none.)
I think if it passes and you suspect something, you have to take a closer look. If it doesn't pass, you have to take a closer look, even if you don't suspect anything ...

With swans it's a bit different. All black swans I saw were dyed former white swans. No naturally black swans were observed. If I see a black swan again, the reasonable assumption is that it's a former white swan.
Bye
Ingo
-
- Posts: 1539
- Joined: Thu Mar 09, 2006 2:02 pm
Re: Similarity tests
Laskos wrote:
With swans it's a bit different. All black swans I saw were dyed former white swans. No naturally black swans were observed. If I see again a black swan, the reasonable assumption is that it's a former white swan.

http://en.wikipedia.org/wiki/Black_swan
That is the problem with assumptions!
Bye
Ingo
-
- Posts: 3283
- Joined: Wed Mar 08, 2006 8:15 pm
Re: Similarity tests
I have a feeling that all programs which score significantly better than Stockfish in tactical tests are Ippolit based! I don't remember any exception so far. Also, Equinox is probably based heavily on Ippo.
Jouni
-
- Posts: 6808
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: Similarity tests
Hi Gabor,
first of all ... whenever I read your name, I think of the good old Winboard times. You are a Winboard icon!
Yes, I am thinking the same ...
The Critter programmer made a logical statement about this on TalkChess:
http://talkchess.com/forum/viewtopic.ph ... 71&t=39577
None of this is easy, but it is always interesting when the programmer of such a "critical engine" doesn't give us exact information. If so, in 95% of cases it is a derivative or clone engine (from experience). Most of it can be seen in the styles of the engines.
The tool is good (nice to have), but everything should be seen in combination with other facts.
Best
Frank