Abstract
Clustering partitions a data set into clusters whose members are similar to one another. The task demands a more robust methodology when the input data are very high-dimensional, because the dimensions (features) that are actually relevant to the clustering must also be selected. Nature-inspired algorithms such as the firefly algorithm give promising results in function optimization and clustering. This work presents a new combined feature selection and clustering algorithm, the iterative firefly k-means feature selection (FKM_FS) algorithm, which minimizes the intra-cluster distance, maximizes the inter-cluster distance, and maximizes the average relevance of the selected features to the clustering. To identify the relevant feature subset, we define a methodology based on the variance of the observations within a cluster relative to the global variance. Finally, the algorithm is run on both real and synthetic data sets.
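The abstract does not state the exact relevance formula, so the following is only a minimal sketch of one plausible reading of "variance of observation in a cluster with respect to global variance". It assumes that a feature's relevance is one minus the ratio of its size-weighted within-cluster variance to its global variance. The function name `feature_relevance` and this particular score are illustrative assumptions, not the FKM_FS definition.

```python
from statistics import pvariance


def feature_relevance(points, labels):
    """Score each feature by how much a clustering reduces its variance.

    Assumed score (not taken from the paper):
        1 - (size-weighted mean within-cluster variance) / (global variance)
    A score near 1 means the clusters are tight along that feature;
    a score near 0 means the feature does not separate the clusters.
    """
    n_features = len(points[0])

    # Group the observations by their cluster label.
    clusters = {}
    for row, label in zip(points, labels):
        clusters.setdefault(label, []).append(row)

    scores = []
    for j in range(n_features):
        global_var = pvariance([row[j] for row in points])
        if global_var == 0.0:
            # A constant feature carries no clustering information.
            scores.append(0.0)
            continue
        # Average the per-cluster variances, weighted by cluster size.
        within_var = sum(
            len(rows) * pvariance([r[j] for r in rows])
            for rows in clusters.values()
        ) / len(points)
        scores.append(1.0 - within_var / global_var)
    return scores


if __name__ == "__main__":
    # Feature 0 separates the two clusters; feature 1 does not.
    data = [[0.0, 5.0], [0.2, 1.0], [10.0, 5.0], [10.2, 1.0]]
    print(feature_relevance(data, [0, 0, 1, 1]))
```

Under this assumption, a feature subset could be chosen by keeping the features whose score exceeds a threshold, with the cluster labels supplied by whatever clustering step (for example, a firefly-initialized k-means) precedes the scoring.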
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mukherjee, S., Bhaumik, L. (2019). Simultaneous Clustering and Feature Selection Using Nature-Inspired Algorithm. In: Biswas, U., Banerjee, A., Pal, S., Biswas, A., Sarkar, D., Haldar, S. (eds) Advances in Computer, Communication and Control. Lecture Notes in Networks and Systems, vol 41. Springer, Singapore. https://doi.org/10.1007/978-981-13-3122-0_55
DOI: https://doi.org/10.1007/978-981-13-3122-0_55
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-3121-3
Online ISBN: 978-981-13-3122-0
eBook Packages: Engineering, Engineering (R0)