Plagiarism meets Paraphrasing: Insights for the Next Generation in Automatic Plagiarism Detection

Autores UPV
Año
Revista COMPUTATIONAL LINGUISTICS

Abstract

The presented experiments show that (i) more complex paraphrase phenomena and a high density of paraphrase mechanisms make plagiarism detection more difficult, (ii) lexical substitutions are the paraphrase mechanisms used the most when plagiarizing, and (iii) paraphrase mechanisms tend to shorten the plagiarized text. For the first time, the paraphrase mechanisms behind plagiarism have been analyzed, providing critical insights for the improvement of automatic plagiarism detection systems.