Abstract
Typical clustering operations in data mining involve finding natural groupings of resources or users. Conventional clusters have crisp boundaries, i.e. each object belongs to only one cluster. The clusters and associations in data mining do not necessarily have crisp boundaries. An object may belong to more than one cluster. Researchers have studied the possibility of using fuzzy sets in data mining clustering applications. Recently, two different methodologies based on properties of rough sets were proposed for developing interval representations of clusters. One approach is based on Genetic Algorithms, and the other is an adaptation of K-means algorithm. Both the approaches have been successful in generating intervals of clusters. The efficiency of the clustering algorithm is an important issue when dealing with a large dataset. This paper provides comparison of the time complexity of the two rough clustering algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
J.H. Holland: Adaptation in Natural and Artificial Systems. University of Michigan Press Ann Arbor (1975)
A. Joshi and R. Krishnapuram: Robust Fuzzy Clustering Methods to Support Web Mining. Proceedings of the workshop on Data Mining and Knowledge Discovery, SIGMOD’ 98, Seattle (1998) 15/1–15/8.
P. Lingras: Unsupervised Rough Set Classification using GAs. Journal Of Intelligent Information Systems, Vol. 16:3 (2001) 215–228
P. Lingras: Rough set clustering for web mining. To appear in Proceedings of 2002 IEEE International Conference on Fuzzy Systems (2002)
P. Lingras and C. West: Interval Set Clustering of Web Users with Rough K-means. submitted to the Journal of Intelligent Information Systems
Z. Pawlak: Rough classification. International Journal of Man-Machine Studies, Vol. 20, (1984) 469–483
A. Skowron and J. Stepaniuk: Information granules in distributed environment. In Zhong, N. Skowron, A. Ohsuga, S. (eds.): New Directions in Rough Sets, Data Mining, and Granular-Soft Computing, Lecture notes in Artificial Intelligence, Vol. 1711. Springer-Verlag, Tokyo (1999) 357–365
Y.Y. Yao, X. Li, T.Y. Lin, and Q. Liu: Representation and Classification of Rough Set Models. Proceeding of Third International Workshop on Rough Sets and Soft Computing, San Jose, California (1994) 630–637
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lingras, P., Yao, Y.Y. (2002). Time Complexity of Rough Clustering: GAs versus K-Means. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds) Rough Sets and Current Trends in Computing. RSCTC 2002. Lecture Notes in Computer Science(), vol 2475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45813-1_34
Download citation
DOI: https://doi.org/10.1007/3-540-45813-1_34
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44274-5
Online ISBN: 978-3-540-45813-5
eBook Packages: Springer Book Archive