Cluster Validity Using Modified Fuzzy Silhouette Index on Large Dynamic Data Set

  • Conference paper
Computational Intelligence in Data Mining

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 990)

Abstract

A cluster validity index evaluates clustering results. It can be based on different measures and computed at either the data-point level or the cluster-center level. In distance-based clustering, the silhouette is an effective point-to-point index that relates compactness (within-cluster distance) to separation (between-cluster distance). To validate fuzzy partitions, the fuzzy silhouette index is applied after a defuzzification step. One application of cluster validity is finding the optimal number of clusters in distance-based methods. As the data size grows, computing a point-wise index takes more execution time. We therefore propose approaches that reduce time complexity by modifying the fuzzy silhouette index to work at the center-to-center and center-to-mean levels. All of these methods are applied to find the right number of clusters, and they give the correct value in minimum execution time. All work is implemented in Matlab, and the proposed methods give effective results.
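To make the point-wise cost concrete, the following is a minimal Python sketch of the standard fuzzy silhouette: each point is defuzzified to its highest-membership cluster, its crisp silhouette is computed, and the values are averaged with weights given by the gap between the two largest memberships. This is an illustration under stated assumptions, not the authors' Matlab code, and it does not reproduce the proposed center-to-center or center-to-mean variants, whose definitions are not given in the abstract. The function names, the Euclidean distance, and the exponent `alpha` are assumptions made for the example.

```python
import math


def silhouette_values(points, labels):
    """Crisp silhouette of every point; needs O(n^2) distance evaluations."""
    clusters = sorted(set(labels))
    values = []
    for j, p in enumerate(points):
        # Mean distance from point j to the other members of each cluster.
        mean_dist = {}
        for c in clusters:
            members = [q for k, q in enumerate(points)
                       if labels[k] == c and k != j]
            if members:
                mean_dist[c] = sum(math.dist(p, q) for q in members) / len(members)
        own = labels[j]
        if own not in mean_dist or len(mean_dist) < 2:
            # Singleton cluster or a single cluster overall: use 0 by convention.
            values.append(0.0)
            continue
        a = mean_dist[own]                                       # compactness
        b = min(d for c, d in mean_dist.items() if c != own)     # separation
        denom = max(a, b)
        values.append((b - a) / denom if denom > 0 else 0.0)
    return values


def fuzzy_silhouette(points, memberships, alpha=1.0):
    """Weighted mean of crisp silhouettes (alpha is an assumed weighting exponent)."""
    # Defuzzification: assign each point to its highest-membership cluster.
    labels = [max(range(len(u)), key=u.__getitem__) for u in memberships]
    s = silhouette_values(points, labels)
    # Weight = (largest membership - second largest membership) ** alpha.
    weights = []
    for u in memberships:
        top = sorted(u, reverse=True)[:2]
        weights.append((top[0] - top[1]) ** alpha)
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(w * v for w, v in zip(weights, s)) / total


# Hypothetical toy data: two well-separated groups of two points each.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
good = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
bad = [[0.9, 0.1], [0.1, 0.9], [0.9, 0.1], [0.1, 0.9]]
fs_good = fuzzy_silhouette(points, good)
fs_bad = fuzzy_silhouette(points, bad)
```

In this toy case the partition that matches the two groups scores higher than the one that mixes them. Selecting the number of clusters would repeat the evaluation for each candidate count and keep the highest score, which is where the quadratic point-wise cost becomes the bottleneck on large data.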

Corresponding author

Correspondence to Chatti Subbalakshmi.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Subbalakshmi, C., Sayal, R., Saini, H.S. (2020). Cluster Validity Using Modified Fuzzy Silhouette Index on Large Dynamic Data Set. In: Behera, H., Nayak, J., Naik, B., Pelusi, D. (eds) Computational Intelligence in Data Mining. Advances in Intelligent Systems and Computing, vol 990. Springer, Singapore. https://doi.org/10.1007/978-981-13-8676-3_1
