Skip to main content

Simultaneous Clustering and Feature Selection Using Nature-Inspired Algorithm

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 41))

Abstract

Clustering means partitioning of data set into individual clusters where the similarity among the data exists and the procedure requires more substantial methodology when the dimension of the input data set is very high as well as we have to select more relevant dimensions or features which are necessary enough for clustering. Nature-inspired algorithm like firefly gives a promising result in function optimization and clustering. The proposed work will represent a new feature selection cum clustering algorithm called iterative firefly k-means features selection (FKM_FS) algorithm by minimizing the inter-cluster distance as well as maximizing the intra-cluster distance and maximizing the average relevance of the particular feature to the clustering. We define a methodology based on variance of observation in a cluster with respect to global variance to identify relevant feature subset. Finally, the algorithm will run both on real and data set.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31, 264–323 (1999)

    Article  Google Scholar 

  2. Jacob, K., Charles, N., Marc, T.: Grouping Multidimensional Data Recent Advances in Clustering, pp. 25–72. Springer-Verlag, New York (2006)

    MATH  Google Scholar 

  3. Dy, J.G., Brodley, C.E.: Feature selection for unsupervised learning. J. Mach. Learn. Res. 5, 845–889 (2004)

    Google Scholar 

  4. Geem, Z., Kim, J., Loganathan, G.V.: A new heuristic optimization algorithm. Harmon. Search Simul. 76, 60–68 (2001)

    Google Scholar 

  5. Yang, X.S.: Firefly algorithm, stochastic test functions and design optimization. Int. J. Bio-Inspired Comput. 2, 78–84 (2010)

    Article  Google Scholar 

  6. Zeng, H., Cheung, Y.M.: A new feature selection method for Gaussian mixture clustering. Pattern Recogn. 42, 243–250 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sabyasachi Mukherjee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mukherjee, S., Bhaumik, L. (2019). Simultaneous Clustering and Feature Selection Using Nature-Inspired Algorithm. In: Biswas, U., Banerjee, A., Pal, S., Biswas, A., Sarkar, D., Haldar, S. (eds) Advances in Computer, Communication and Control. Lecture Notes in Networks and Systems, vol 41. Springer, Singapore. https://doi.org/10.1007/978-981-13-3122-0_55

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-3122-0_55

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-3121-3

  • Online ISBN: 978-981-13-3122-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics