A plagiarism detection engine based on Stanford's MOSS (Measure of Software Similarity)
Moss (for a Measure Of Software Similarity) is an automatic system for determining the similarity of programs. To date, the main application of Moss has been in detecting plagiarism in programming classes. Since its development in 1994, Moss has been very effective in this role. The algorithm behind Moss is a significant improvement over other cheating detection algorithms (at least, over those known to us).
Moss can currently analyze code written in the following languages: C and Java.
Adding a new language is fairly easy. Check out the conf directory to learn how, then add your language. Don't forget to open a pull request so we can incorporate it.
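The ideas behind Moss are described in the paper "Winnowing: Local Algorithms for Document Fingerprinting" by Schleimer, Wilkerson, and Aiken (SIGMOD 2003).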
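
The fingerprinting idea published by the Moss authors is known as winnowing: hash every k-character substring of a document, slide a window over the hashes, and keep the minimum hash of each window as a fingerprint. The following is a minimal sketch of that idea, not this engine's actual code. The function names, the use of CRC32 as the hash, the default values of k and window, and the Jaccard score used to compare fingerprint sets are all assumptions made for illustration.

```python
import zlib


def kgram_hashes(text, k):
    """Hash every k-character substring of text.

    Whitespace is removed and case is folded first, so that
    reformatting alone does not change the hashes.
    """
    cleaned = "".join(text.split()).lower()
    return [
        zlib.crc32(cleaned[i:i + k].encode("utf-8"))
        for i in range(len(cleaned) - k + 1)
    ]


def winnow(hashes, window):
    """Return the set of (hash, position) fingerprints.

    For each window of consecutive hashes, the minimum is selected;
    on ties the rightmost occurrence wins. A position selected by
    several overlapping windows is recorded only once.
    """
    fingerprints = set()
    for start in range(len(hashes) - window + 1):
        chunk = hashes[start:start + window]
        smallest = min(chunk)
        # Index of the rightmost occurrence of the minimum in this window.
        offset = window - 1 - chunk[::-1].index(smallest)
        fingerprints.add((smallest, start + offset))
    return fingerprints


def similarity(a, b, k=5, window=4):
    """Jaccard overlap of the two fingerprint hash sets (an assumed metric)."""
    fa = {h for h, _ in winnow(kgram_hashes(a, k), window)}
    fb = {h for h, _ in winnow(kgram_hashes(b, k), window)}
    if not fa or not fb:
        # Input too short to fill a single window.
        return 0.0
    return len(fa & fb) / len(fa | fb)
```

Calling similarity(source_a, source_b) on two source strings yields a score between 0.0 and 1.0. Because whitespace and case are normalized before hashing, two copies of the same code that differ only in layout score 1.0. A real detector would also normalize identifiers and comments per language, which this sketch does not attempt.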