On the Application of Spell Correction to Improve Plagiarism Detection
In this paper we present the accuracy gains that spell corrector systems can provide to the plagiarism detection task when the appropriations contain spelling mistakes. These may have been introduced on purpose to avoid detection systems from finding the aforementioned appropriations, which could happen specially if such systems are based on lexical similarities. This document will detail the components that we have developed for both plagiarism detection and spell correction, and the significant gains that their combination produces.
KeywordsQuery Term Source Document Computational Linguistics Spell Corrector Plagiarism Detection
Unable to display preview. Download preview PDF.
- [Clough and Stevenson, 2011]Clough, P., Stevenson, M.: Developing a corpus of plagiarised short answers. Language Resources and Evaluation 45(1) (2011)Google Scholar
- [Gao et al., 2010]Gao, J., Li, X., Micol, D., Quirk, C., Sun, X.: Learning Phrase-Based Spelling Error Models from Clickthrough Data. In: Proceedings of the 23rd International Conference on Computational Linguistics, Beijing, China (August 2010)Google Scholar
- [Micol et al., 2010]Micol, D., Ferrández, Ó., Llopis, F., Muñoz, R.: A Lexical Similarity Approach for Efficient and Scalable External Plagiarism Detection. In: Proceedings of the SEPLN 2010 Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse, Padua, Italy (2010)Google Scholar
- [Philips, 1990]Philips, L.: Hanging on the metaphone. Computer Language Magazine 7(12), 38–44 (1990)Google Scholar
- [Sun et al., 2010]Sun, X., Gao, J., Micol, D., Quirk, C.: A Large scale Ranker-Based System for Search Query Spelling Correction. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, pp. 266–274 (2010)Google Scholar