Using Maximum Similarity Graphs to Edit Nearest Neighbor Classifiers

  • Milton García-Borroto
  • Yenny Villuendas-Rey
  • Jesús Ariel Carrasco-Ochoa
  • José Fco. Martínez-Trinidad
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5856)


The Nearest Neighbor classifier is a simple but powerful non-parametric technique for supervised classification. However, it is very sensitive to noise and outliers, which could decrease the classifier accuracy. To overcome this problem, we propose two new editing methods based on maximum similarity graphs. Numerical experiments in several databases show the high quality performance of our methods according to classifier accuracy.


nearest neighbor error-based editing prototype selection 


  1. 1.
    Cover, T., Hart, P.E.: Nearest Neighbor pattern classification. IEEE Trans. on Information Theory 13, 21–27 (1967)zbMATHCrossRefGoogle Scholar
  2. 2.
    Dasarathy, B.D.: Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques. IEEE Computer Society Press, Los Alamitos (1991)Google Scholar
  3. 3.
    Wilson, D.L.: Asymptotic properties of nearest neighbor rules using edited data. IEEE Transactions on Systems, Man and Cybernetics 2, 408–421 (1972)zbMATHCrossRefGoogle Scholar
  4. 4.
    Tomek, I.: An experiment with the Edited Nearest-Neighbor Rule. IEEE Transactions on Systems, Man and Cybernetics SMC-6, 448–452 (1976)zbMATHCrossRefMathSciNetGoogle Scholar
  5. 5.
    Devijver, P.A., Kittler, J.: On the edited neares neighbor rule. In: Press, I.C.S. (ed.) 5th International Conference on Pattern Recognition, Los Alamitos, California, pp. 72–80 (1980)Google Scholar
  6. 6.
    Kuncheva, L.I.: Combining pattern classifiers: methods and algorithms. Wiley-Interscience, Hoboken (2004)zbMATHCrossRefGoogle Scholar
  7. 7.
    Hattori, K., Takahashi, M.: A new edited k-nearest neighbor rule in the pattern classification problem. Pattern Recognition 33, 521–528 (2000)CrossRefGoogle Scholar
  8. 8.
    Toussaint, G.: Proximity Graphs for Nearest Neighbor Decision Rules: Recent Progress. In: 34 Symposium on Computing and Statistics INTERFACE-2002, Montreal, Canada, pp. 1–20 (2002)Google Scholar
  9. 9.
    Caballero, Y., Bello, R., Salgado, Y., García, M.M.: A method to edit training set based on rough sets. International Journal of Computational Intelligence Research 3, 219–229 (2007)CrossRefGoogle Scholar
  10. 10.
    Koplowitz, J.: On the relation of performance to editing in nearest neighbor rules. Pattern Recognit. 13, 251–255 (1981)CrossRefGoogle Scholar
  11. 11.
    Pons-Porrata, A., Berlanga-Llavori, R., Ruiz-Shulcloper, J.: Topic discovery based on text mining techniques. Information Processing & Management 43, 752–768 (2007)CrossRefGoogle Scholar
  12. 12.
    Merz, C.J., Murphy, P.M.: UCI Repository of Machine Learning Databases. University of California at Irvine, Department of Information and Computer Science, Irvine (1998)Google Scholar
  13. 13.
    Wilson, R.D., Martinez, T.R.: Improved Heterogeneous Distance Functions. Journal of Artificial Intelligence Research 6, 1–34 (1997)zbMATHMathSciNetGoogle Scholar
  14. 14.
    Dietterich, T.G.: Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms, vol. 10, pp. 1895–1923. MIT Press, Cambridge (1998)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Milton García-Borroto
    • 1
    • 3
  • Yenny Villuendas-Rey
    • 2
  • Jesús Ariel Carrasco-Ochoa
    • 3
  • José Fco. Martínez-Trinidad
    • 3
  1. 1.Bioplantas CenterUNICAC. de ÁvilaCuba
  2. 2.Ciego de Ávila University UNICAC. de ÁvilaCuba
  3. 3.National Institute of Astrophysics,Optics and ElectronicsPueblaMéxico

Personalised recommendations