I would be intrigued to know whether you intend to study plagiarism as a practice - or do you rather need this software for practical purposes? If the former, I would be inclined first of all to consider ethical issues, since accessing cases of plagiarism raises quite a few of these. This is not to say that it is overtly ambitious to studying this theme, but rather that this is a fascinating area with noteworthy challenges.