copy-pastesimilaritysimian

Tips to show similarities in files


In a project, I found some css files that "smell" like there are copy-pasted rules in them.

I wonder what are your strategies for detecting copy-paste stuff in files.

Just of curiosity i'd like to hear your tips and tricks for showing file similarities!


Solution

  • Try Simian Similarity Analyzer.

    It is used for copy-paste-detection in source code (Java, C#, C, C++, COBOL, Ruby, JSP, ASP, HTML, XML, Visual Basic, Groovy), but you can run this on plain text files too.