Skip to main content

A Soft Subspace Clustering Algorithm Based on Multi-Objective Optimization and Reliability Measure

  • Conference paper
  • First Online:
Proceedings of The Eighth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA), 2013

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 212))

  • 1153 Accesses

Abstract

Subspace clustering finds clusters in subspaces of the data instead of the entire data space to deal with high-dimensional data. Most existing subspace clustering algorithms lean on just one single objective function. Single objective function is often biased. On the other hand, most existing subspace clustering algorithms are based on wrapper approach, which brings a negative effect on the quality of subspace clustering. This paper presents a soft subspace clustering algorithm based on multi-objective evolutionary algorithm and reliability measure, called R-MOSSC. Comparing with optimization of a scalar function combining multiple objectives, it does not need to determine weight hyperparameters, and offers a deep insight into the problem by obtaining a set of solutions. Further, reliability-based dimension weight matrix from filter approach is used to enhance the performance of subspace clustering. Simulation results show that R-MOSSC is better than existing algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kriegel HP, Kröger P, Zimek A (2009) Clustering high-dimensional data: a survey on subspace clustering, pattern-based clustering, and correlation clustering. TKDD 3:1–58

    Article  Google Scholar 

  2. Beyer KS, Goldstein J, Ramakrishnan R, Shaft U (1999) When is “nearest neighbor” meaningful? In: Beeri C, Buneman P (eds) ICDT ‘99. Springer, London, pp 217–235

    Google Scholar 

  3. Jing L, Ng MK, Huang JZ (2007) An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data. IEEE T Knowl Data Eng 19:1026–1041

    Article  Google Scholar 

  4. Deng Z, Choi KS, Chung FL, Wang S (2010) Enhanced soft subspace clustering integrating within-cluster and between-cluster information. Pattern Recogn 43:767–781

    Article  MATH  Google Scholar 

  5. MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Cam LML, Neyman J (eds) Proceedings of the fifth berkeley symposium on mathematical statistics and probability, vol 1. University of California Press, pp 281–297

    Google Scholar 

  6. Lu Y, Wang S, Li S, Zhou C (2011) Particle swarm optimizer for variable weighting in clustering high-dimensional data. Mach Learn 82:43–70

    Article  MathSciNet  Google Scholar 

  7. Boongoen T, Shang C, Iam-On N, Shen Q (2011) Extending data reliability measure to a filter approach for soft subspace clustering. IEEE Trans Syst Man Cybern B 41:1705–1714

    Article  Google Scholar 

  8. Tiwari S, Fadel G, Deb K (2011) AMGA2: improving the performance of the archive-based micro-genetic algorithm for multi-objective optimization. Eng Optim 43:377–401

    Article  Google Scholar 

  9. Zhou A, Qu BY, Li H, Zhao SZ, Suganthan PN, Zhang Q (2011) Multi-objective evolutionary algorithms: a survey of the state of the art. Swarm Evol Comput 1:32–49

    Article  Google Scholar 

  10. Strehl A, Ghosh J (2003) Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3:583–617

    MathSciNet  MATH  Google Scholar 

  11. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. SIGKDD Explor 11:10–18

    Article  Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (60805026, 61070076, 61272065), Natural Science Foundation of Guangdong Province (S2011020001182), Research Foundation of Science and Technology Plan Project in Guangdong Province and Guangzhou City (2010A040303004, 2011B040200007, 2011A091000026, 11A12050914, 11A31090341, 2011Y5-00004), Research Foundation of NSFC-Guangdong Key project (U0935002), and the Zhujiang New Star of Science and Technology in Guangzhou City (2011J2200093).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiahai Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bi, Z., Wang, J., Yin, J. (2013). A Soft Subspace Clustering Algorithm Based on Multi-Objective Optimization and Reliability Measure. In: Yin, Z., Pan, L., Fang, X. (eds) Proceedings of The Eighth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA), 2013. Advances in Intelligent Systems and Computing, vol 212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37502-6_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37502-6_30

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37501-9

  • Online ISBN: 978-3-642-37502-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics