SNN: A Supervised Clustering Algorithm

Aguilar, Jesús S.; Ruiz, Roberto; Riquelme, José C.; Giráldez, Raúl

doi:10.1007/3-540-45517-5_24


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2070)

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems


Abstract

In this paper, we present a new algorithm, based on the nearest neighbours method, for discovering groups and identifying interesting distributions in the underlying data of labelled databases. We introduce the theory of nearest neighbours sets as the basis for the algorithm S-NN (Similar Nearest Neighbours). Traditional clustering algorithms are very sensitive to user-defined parameters, and expert knowledge is required to choose their values. Frequently, these algorithms are fragile in the presence of outliers and adjust well only to spherical shapes. Experiments have shown that S-NN is accurate at discovering clusters of arbitrary shape and density, since it takes into account the internal features of each cluster and does not depend on a user-supplied static model. S-NN achieves this by collecting the nearest neighbours with the same label until an enemy (an example with a different label) is found. Its determinism and the results it offers to the researcher make it a valuable tool for representing the knowledge inherent in labelled databases.
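The collection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `neighbours_until_enemy`, the use of Euclidean distance, the tie-breaking by index order, and the toy data are all assumptions made for illustration, and the abstract does not say how the collected sets are combined into clusters.

```python
import math


def neighbours_until_enemy(points, labels, i):
    """Collect the nearest neighbours of points[i] that share its label,
    in order of increasing distance, stopping at the first enemy
    (an example with a different label).

    Hypothetical helper: Euclidean distance and index-order tie-breaking
    are assumptions, not details taken from the paper.
    """
    # Rank every other example by its distance to points[i].
    order = sorted(
        (j for j in range(len(points)) if j != i),
        key=lambda j: math.dist(points[i], points[j]),
    )
    collected = []
    for j in order:
        if labels[j] != labels[i]:
            break  # first enemy reached: stop collecting
        collected.append(j)
    return collected


# Toy labelled data on a line: three "a" examples, then two "b" examples.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (5.0, 0.0), (6.0, 0.0)]
labels = ["a", "a", "a", "b", "b"]

for i in range(len(points)):
    print(i, neighbours_until_enemy(points, labels, i))
```

No neighbourhood size or radius is passed in: each example's set ends wherever its first enemy lies, which is one way to read the claim that the method does not depend on user-supplied parameters.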



Author information

Authors and Affiliations

Department of Computer Science, University of Sevilla, Avda. Reina Mercedes s/n, 41011, Sevilla, Spain
Jesús S. Aguilar, Roberto Ruiz, José C. Riquelme & Raúl Giráldez


Editor information

Editors and Affiliations

Hungarian Academy of Sciences, Intelligent Manufacturing and Business Processes Computer and Automation Research Institute, Kende utca 13-17, 1111, Budapest, Hungary
László Monostori & József Váncza
Department of Computer Science, 601 University Drive, Southwest Texas State University, San Marcos, TX, 78666-4616, USA
Moonis Ali


About this paper

Cite this paper

Aguilar, J.S., Ruiz, R., Riquelme, J.C., Giráldez, R. (2001). SNN: A Supervised Clustering Algorithm. In: Monostori, L., Váncza, J., Ali, M. (eds) Engineering of Intelligent Systems. IEA/AIE 2001. Lecture Notes in Computer Science, vol 2070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45517-5_24


DOI: https://doi.org/10.1007/3-540-45517-5_24
Published: 18 June 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42219-8
Online ISBN: 978-3-540-45517-2
eBook Packages: Springer Book Archive
