Skip to main content

An Evolutionary Neural Network Approach to Intrinsic Plagiarism Detection

  • Conference paper
Book cover Artificial Intelligence and Cognitive Science (AICS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6206))

Included in the following conference series:

Abstract

Intrinsic Plagiarism Detection attempts to identify portions of a document which have been plagiarised without the use of reference collections. This is typically achieved by developing a classifier using support vector machines or hand-crafted neural networks. This paper presents an evolutionary neural network approach to the development of an intrinsic plagiarism detection classifier which is capable of evolving both the weights and structure of a neural network. The neural network is empirically tested on a corpus of documents and is shown to perform well.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Angeline, P.J., Saunders, G.M., Pollack, J.P.: An evolutionary algorithm that constructs recurrent neural networks. IEEE Transactions on Neural Networks 5(1), 54–65 (1994)

    Article  Google Scholar 

  2. Baker, B.S.: A program for identifying duplicated code. Computing Science and Statistics 24, 49–57 (1992)

    Google Scholar 

  3. Belew, R.K., McInerney, J., Schraudolph, N.N.: Evolving networks: Using the genetic algorithm with connectionist learning. In: Langton, C.G., Taylor, C., Doyne Farmer, J., Rasmussen, S. (eds.) Artificial Life II, pp. 511–547. Addison-Wesley, Redwood City (1992)

    Google Scholar 

  4. Branke, J.: Evolutionary algorithms for neural network design and training. Technical Report No. 322, University of Karlsruhe, Institute AIFB (1995)

    Google Scholar 

  5. Brin, S., Davis, J., Molina, H.G.: Copy detection mechanisms for digital documents. In: Proceedings of the ACM SIGMOD Annual Conference, pp. 398–409 (1995)

    Google Scholar 

  6. Chellapilla, K., Fogel, D.B.: Evolving neural networks to play checkers without relying on expert knowledge. IEEE Transactions on Neural Networks 10, 1382–1391 (1999)

    Article  Google Scholar 

  7. Dasarathy, B.V.: Nearest neighbor (NN) norms: NN pattern classification techniques. IEEE Computer Society Press, Los Alamitos (1990)

    Google Scholar 

  8. De Garis, H.: Genetic programming: building artificial nervous systems using genetically programmed neural network modules. In: Porter, B.W., Mooney, R.J. (eds.) Machine Learning: Proceedings of the Seventh International Conference, Austin, TX, June 21-23, pp. 132–139. Morgan Kaufmann, Palo Alto (1990)

    Google Scholar 

  9. Finkel, R.A., Zaslavsky, A., Monostori, K., Schmidt, H.: Signature extraction for overlap detection in documents. Aust. Comput. Sci. Commun. 24(1), 59–64 (2002)

    Google Scholar 

  10. Flesch, R.: A new readability yardstick. Journal of Applied Psychology 32, 221–233 (1948)

    Article  Google Scholar 

  11. Fradkin, D., Muchnik, I.: Support vector machines for classification. In: Discrete Methods in Epidemiology. DIMACS Series in Discrete Mathematics and Theoretical Computer Science, vol. 70, pp. 13–20 (2006)

    Google Scholar 

  12. Goldberg, D.E., Richardson, J.: Genetic algorithms with sharing for multimodal function optimization. In: Proceedings of the Second International Conference on Genetic Algorithms and their Application, pp. 41–49. Lawrence Erlbaum Associates, Inc., Mahwah (1987)

    Google Scholar 

  13. Gruau, F.: Neural Network Synthesis using Cellular Encoding and the Genetic Algorithm. PhD thesis, Centre d’etude nucleaire de Grenoble, Ecole Normale Superieure de Lyon, France (1994)

    Google Scholar 

  14. Gunning, R.: The technique of clear writing. McGraw-Hill, New York (1952)

    Google Scholar 

  15. Gusfield, D.: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge (January 1997)

    Book  MATH  Google Scholar 

  16. Haykin, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice-Hall, Englewood Cliffs (1999)

    MATH  Google Scholar 

  17. Hussain, T.S., Browse, R.A.: Genetic encoding of neural networks using attribute grammars. In: CITO Researcher Retreat, Hamilton, Ontario, Canada, May 12-14 (1998)

    Google Scholar 

  18. Jensen, F.V.: Bayesian artificial intelligence: Kevin b. korb, ann e. nicholson, Chapman & hall, 354 pages (2004); Pattern Anal. Appl. 7(2), 221–223 (2004)

    Google Scholar 

  19. Kolen, J.F., Pollack, J.B.: Back propagation is sensitive to initial conditions. Advances in Neural Information Processing Systems 3, 860–867 (1991)

    Google Scholar 

  20. Koza, J.R., Rice, J.P.: Genetic generation of both the weights and architecture for a neural network. In: International Joint Conference on Neural Networks, IJCNN 1991, Seattle, WA, July 8-12, vol. II, pp. 397–404. IEEE Computer Society Press, Los Alamitos (1991)

    Google Scholar 

  21. Maniezzo, V.: Searching among search spaces: Hastening the genetic evolution of feedforward neural networks. In: Albrecht, R.F., Reeves, C.R., Steele, N.C. (eds.) Artificial Neural Nets and Genetic Algorithms, pp. 635–643. Springer, Heidelberg (1993)

    Chapter  Google Scholar 

  22. Montana, D.J., Davis, L.: Training feedforward neural networks using genetic algorithms. In: Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, pp. 762–767. Morgan Kaufmann, San Mateo (1989)

    Google Scholar 

  23. Kincaid, J.P., Fishburne, R.P., Rogers, R.L., Chissom, B.S.: Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel. In: Research Branch Report 8-75, Naval Technical Training, Millington, TN, U. S. Naval Air Station, Memphis, TN (1975)

    Google Scholar 

  24. Richards, N., Moriarty, D., McQuesten, P., Miikkulainen, R.: Evolving neural networks to play Go. In: Proceedings of the 7th International Conference on Genetic Algorithms, East Lansing, MI (1997)

    Google Scholar 

  25. Sasaki, T., Tokoro, M.: Evolving learnable neural networks under changing environments with various rates of inheritance of acquired characters: Comparison between darwinian and lamarckian evolution. Artificial Life 5(3), 203–223 (1999)

    Article  Google Scholar 

  26. Si, A., Leong, H.V., Lau, R., Va, H., Rynson, L., Lau, W.H.: Check: A document plagiarism detection system. In: Proceedings of ACM Symposium for Applied Computing, pp. 70–77. ACM Press, New York (1997)

    Google Scholar 

  27. Stanley, K.O., Miikkulainen, R.: Evolving neural networks through augmenting topologies. Evolutionary Computation 10(2), 99–127 (2002)

    Article  Google Scholar 

  28. Stein, B.: Fuzzy-fingerprints for text-based information retrieval. In: I-KNOW 2005: Proceedings of the 5th International Conference on Knowledge Management, pp. 572–579 (2005)

    Google Scholar 

  29. Stein, B., Eissen, S.M.z.: Intrinsic plagiarism analysis with meta learning. In: Stein, B., Koppel, M., Stamatatos, E. (eds.) PAN. CEUR Workshop Proceedings, vol. 276, CEUR-WS.org (2007)

    Google Scholar 

  30. Sutton, R.S.: Two problems with backpropagation and other steepest-descent learning procedures for networks. In: Proc. of 8th Annual Conf. of the Cognitive Science Society, pp. 823–831 (1986)

    Google Scholar 

  31. White, D.W.: GANNet: A genetic algorithm for searching topology and weight spaces in neural network design. PhD thesis, University of Maryland College Park (1994)

    Google Scholar 

  32. Whitley, D., Starkweather, T., Bogart, C.: Genetic algorithms and neural networks - optimizing connections and connectivity. Parallel Computing 14(3), 347–361 (1990)

    Article  Google Scholar 

  33. Yao, X.: Evolving artificial neural networks. Proceedings of the IEEE, 1423–1447 (1999)

    Google Scholar 

  34. Zhang, B., Muhlenbein, H.: Evolving optimal neural networks using genetic algorithms with occam’s razor. Complex Systems 7(3), 199–220 (1993)

    Google Scholar 

  35. Eissen, S.M.z., Stein, B.: Intrinsic plagiarism detection. In: Lalmas, M., MacFarlane, A., Rüger, S.M., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds.) ECIR 2006. LNCS, vol. 3936, pp. 565–569. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Curran, D. (2010). An Evolutionary Neural Network Approach to Intrinsic Plagiarism Detection. In: Coyle, L., Freyne, J. (eds) Artificial Intelligence and Cognitive Science. AICS 2009. Lecture Notes in Computer Science(), vol 6206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17080-5_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17080-5_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17079-9

  • Online ISBN: 978-3-642-17080-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics