Abstract
Intrinsic Plagiarism Detection attempts to identify portions of a document which have been plagiarised without the use of reference collections. This is typically achieved by developing a classifier using support vector machines or hand-crafted neural networks. This paper presents an evolutionary neural network approach to the development of an intrinsic plagiarism detection classifier which is capable of evolving both the weights and structure of a neural network. The neural network is empirically tested on a corpus of documents and is shown to perform well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Angeline, P.J., Saunders, G.M., Pollack, J.P.: An evolutionary algorithm that constructs recurrent neural networks. IEEE Transactions on Neural Networks 5(1), 54–65 (1994)
Baker, B.S.: A program for identifying duplicated code. Computing Science and Statistics 24, 49–57 (1992)
Belew, R.K., McInerney, J., Schraudolph, N.N.: Evolving networks: Using the genetic algorithm with connectionist learning. In: Langton, C.G., Taylor, C., Doyne Farmer, J., Rasmussen, S. (eds.) Artificial Life II, pp. 511–547. Addison-Wesley, Redwood City (1992)
Branke, J.: Evolutionary algorithms for neural network design and training. Technical Report No. 322, University of Karlsruhe, Institute AIFB (1995)
Brin, S., Davis, J., Molina, H.G.: Copy detection mechanisms for digital documents. In: Proceedings of the ACM SIGMOD Annual Conference, pp. 398–409 (1995)
Chellapilla, K., Fogel, D.B.: Evolving neural networks to play checkers without relying on expert knowledge. IEEE Transactions on Neural Networks 10, 1382–1391 (1999)
Dasarathy, B.V.: Nearest neighbor (NN) norms: NN pattern classification techniques. IEEE Computer Society Press, Los Alamitos (1990)
De Garis, H.: Genetic programming: building artificial nervous systems using genetically programmed neural network modules. In: Porter, B.W., Mooney, R.J. (eds.) Machine Learning: Proceedings of the Seventh International Conference, Austin, TX, June 21-23, pp. 132–139. Morgan Kaufmann, Palo Alto (1990)
Finkel, R.A., Zaslavsky, A., Monostori, K., Schmidt, H.: Signature extraction for overlap detection in documents. Aust. Comput. Sci. Commun. 24(1), 59–64 (2002)
Flesch, R.: A new readability yardstick. Journal of Applied Psychology 32, 221–233 (1948)
Fradkin, D., Muchnik, I.: Support vector machines for classification. In: Discrete Methods in Epidemiology. DIMACS Series in Discrete Mathematics and Theoretical Computer Science, vol. 70, pp. 13–20 (2006)
Goldberg, D.E., Richardson, J.: Genetic algorithms with sharing for multimodal function optimization. In: Proceedings of the Second International Conference on Genetic Algorithms and their Application, pp. 41–49. Lawrence Erlbaum Associates, Inc., Mahwah (1987)
Gruau, F.: Neural Network Synthesis using Cellular Encoding and the Genetic Algorithm. PhD thesis, Centre d’etude nucleaire de Grenoble, Ecole Normale Superieure de Lyon, France (1994)
Gunning, R.: The technique of clear writing. McGraw-Hill, New York (1952)
Gusfield, D.: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge (January 1997)
Haykin, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice-Hall, Englewood Cliffs (1999)
Hussain, T.S., Browse, R.A.: Genetic encoding of neural networks using attribute grammars. In: CITO Researcher Retreat, Hamilton, Ontario, Canada, May 12-14 (1998)
Jensen, F.V.: Bayesian artificial intelligence: Kevin b. korb, ann e. nicholson, Chapman & hall, 354 pages (2004); Pattern Anal. Appl. 7(2), 221–223 (2004)
Kolen, J.F., Pollack, J.B.: Back propagation is sensitive to initial conditions. Advances in Neural Information Processing Systems 3, 860–867 (1991)
Koza, J.R., Rice, J.P.: Genetic generation of both the weights and architecture for a neural network. In: International Joint Conference on Neural Networks, IJCNN 1991, Seattle, WA, July 8-12, vol. II, pp. 397–404. IEEE Computer Society Press, Los Alamitos (1991)
Maniezzo, V.: Searching among search spaces: Hastening the genetic evolution of feedforward neural networks. In: Albrecht, R.F., Reeves, C.R., Steele, N.C. (eds.) Artificial Neural Nets and Genetic Algorithms, pp. 635–643. Springer, Heidelberg (1993)
Montana, D.J., Davis, L.: Training feedforward neural networks using genetic algorithms. In: Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, pp. 762–767. Morgan Kaufmann, San Mateo (1989)
Kincaid, J.P., Fishburne, R.P., Rogers, R.L., Chissom, B.S.: Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel. In: Research Branch Report 8-75, Naval Technical Training, Millington, TN, U. S. Naval Air Station, Memphis, TN (1975)
Richards, N., Moriarty, D., McQuesten, P., Miikkulainen, R.: Evolving neural networks to play Go. In: Proceedings of the 7th International Conference on Genetic Algorithms, East Lansing, MI (1997)
Sasaki, T., Tokoro, M.: Evolving learnable neural networks under changing environments with various rates of inheritance of acquired characters: Comparison between darwinian and lamarckian evolution. Artificial Life 5(3), 203–223 (1999)
Si, A., Leong, H.V., Lau, R., Va, H., Rynson, L., Lau, W.H.: Check: A document plagiarism detection system. In: Proceedings of ACM Symposium for Applied Computing, pp. 70–77. ACM Press, New York (1997)
Stanley, K.O., Miikkulainen, R.: Evolving neural networks through augmenting topologies. Evolutionary Computation 10(2), 99–127 (2002)
Stein, B.: Fuzzy-fingerprints for text-based information retrieval. In: I-KNOW 2005: Proceedings of the 5th International Conference on Knowledge Management, pp. 572–579 (2005)
Stein, B., Eissen, S.M.z.: Intrinsic plagiarism analysis with meta learning. In: Stein, B., Koppel, M., Stamatatos, E. (eds.) PAN. CEUR Workshop Proceedings, vol. 276, CEUR-WS.org (2007)
Sutton, R.S.: Two problems with backpropagation and other steepest-descent learning procedures for networks. In: Proc. of 8th Annual Conf. of the Cognitive Science Society, pp. 823–831 (1986)
White, D.W.: GANNet: A genetic algorithm for searching topology and weight spaces in neural network design. PhD thesis, University of Maryland College Park (1994)
Whitley, D., Starkweather, T., Bogart, C.: Genetic algorithms and neural networks - optimizing connections and connectivity. Parallel Computing 14(3), 347–361 (1990)
Yao, X.: Evolving artificial neural networks. Proceedings of the IEEE, 1423–1447 (1999)
Zhang, B., Muhlenbein, H.: Evolving optimal neural networks using genetic algorithms with occam’s razor. Complex Systems 7(3), 199–220 (1993)
Eissen, S.M.z., Stein, B.: Intrinsic plagiarism detection. In: Lalmas, M., MacFarlane, A., Rüger, S.M., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds.) ECIR 2006. LNCS, vol. 3936, pp. 565–569. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Curran, D. (2010). An Evolutionary Neural Network Approach to Intrinsic Plagiarism Detection. In: Coyle, L., Freyne, J. (eds) Artificial Intelligence and Cognitive Science. AICS 2009. Lecture Notes in Computer Science(), vol 6206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17080-5_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-17080-5_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17079-9
Online ISBN: 978-3-642-17080-5
eBook Packages: Computer ScienceComputer Science (R0)