On the Application of Spell Correction to Improve Plagiarism Detection

  • Daniel Micol
  • Óscar Ferrández
  • Rafael Muñoz
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7337)

Abstract

In this paper we present the accuracy gains that spell corrector systems can provide to the plagiarism detection task when the appropriations contain spelling mistakes. Such mistakes may have been introduced deliberately to prevent detection systems from finding the appropriations, which is especially likely to work when those systems are based on lexical similarities. We describe the components that we have developed for both plagiarism detection and spell correction, and the significant gains that their combination produces.

Keywords

Query Term; Source Document; Computational Linguistics; Spell Corrector; Plagiarism Detection
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Daniel Micol (1)
  • Óscar Ferrández (2)
  • Rafael Muñoz (1)
  1. Department of Software and Computing Systems, University of Alicante, San Vicente del Raspeig, Spain
  2. Department of Biomedical Informatics, University of Utah, Salt Lake City, United States of America
