Duplicate Content Checker

I'm looking for a PHP script or a Java app that finds duplicate content in my large article directory, http://tinyurl.com/39vpw6w (720,000 articles). I also want to be able to adjust the similarity threshold anywhere from 40% to 100%. It has to show a list of each original plus its duplicates, with a delete button so I can delete them manually.
Who can build such a script, or already has one?
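
To make the threshold idea concrete, here is a minimal Java sketch of one possible approach, not a finished tool. It assumes "duplicate" means a high overlap of three-word sequences (shingles), measured with Jaccard similarity, and that the threshold is passed as a fraction (0.4 for 40%). The class name `DuplicateSketch`, its method names, and the shingle size are all hypothetical choices for illustration. It treats the earlier article in the list as the original.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch: class, method names, and shingle size are assumptions.
final class DuplicateSketch {

    // Collect every run of k consecutive lowercase words in the text.
    static Set<String> shingles(String text, int k) {
        List<String> words = new ArrayList<>();
        for (String w : text.toLowerCase().split("[^\\p{L}\\p{N}]+")) {
            if (!w.isEmpty()) {
                words.add(w); // skip empty tokens produced by leading punctuation
            }
        }
        Set<String> result = new HashSet<>();
        for (int i = 0; i + k <= words.size(); i++) {
            result.add(String.join(" ", words.subList(i, i + k)));
        }
        return result;
    }

    // Jaccard similarity: shared shingles divided by all distinct shingles.
    static double similarity(Set<String> a, Set<String> b) {
        if (a.isEmpty() || b.isEmpty()) {
            return 0.0; // texts too short to compare are never flagged
        }
        Set<String> shared = new HashSet<>(a);
        shared.retainAll(b);
        int union = a.size() + b.size() - shared.size();
        return (double) shared.size() / union;
    }

    // Return index pairs {original, duplicate} whose similarity >= threshold.
    static List<int[]> findDuplicates(List<String> articles, double threshold) {
        List<Set<String>> sets = new ArrayList<>();
        for (String article : articles) {
            sets.add(shingles(article, 3));
        }
        List<int[]> pairs = new ArrayList<>();
        for (int i = 0; i < sets.size(); i++) {
            for (int j = i + 1; j < sets.size(); j++) {
                if (similarity(sets.get(i), sets.get(j)) >= threshold) {
                    pairs.add(new int[] {i, j});
                }
            }
        }
        return pairs;
    }
}
```

Calling `DuplicateSketch.findDuplicates(articles, 0.4)` would give the pairs to show in the original-plus-duplicates list. Comparing every pair like this is only practical for small batches. At 720,000 articles it would mean roughly 259 billion comparisons, so a real tool would likely need a candidate-filtering step first, such as MinHash with locality-sensitive hashing.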
