Skip to main content

Degenerate Expectation-Maximization Algorithm for Local Dimension Reduction

  • Conference paper
Book cover Classification, Clustering, and Data Mining Applications

Abstract

Dimension reduction techniques based on principal component analysis (PCA) and factor analysis are commonly used in statistical data analysis. The effectiveness of these methods is limited by their global nature. Recent efforts have focused on relaxing global restrictions in order to identify subsets of data that are concentrated on lower dimensional subspaces. In this paper, we propose an adaptive local dimension reduction method, called the Degenerate Expectation-Maximization Algorithm (DEM). This method is based on the finite mixture model. We demonstrate that the DEM yields significantly better results than the local PCA (LPCA) and other related methods in a variety of synthetic and real datasets. The DEM algorithm can be used in various applications ranging from clustering to information retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Archer, C, and Leen, T. K. (1999). “Optimal Dimension Reduction and Transform Coding with Mixture Principal Components,” International Joint Conference on Neural Networks (IJCNN), IEEE.

    Google Scholar 

  2. Ciuperca, G., Ridolfi, A., and Idier, J. (2003). “Penalized Maximum Likelihood Estimator for Normal Mixtures,” Scandinavian Journal of Statistics, 30, 45–59.

    Article  MathSciNet  MATH  Google Scholar 

  3. Hastie, T., and Stuetzle, W. (1989). “Principal Curves,” Journal of the American Statistical Association, 84, 502–516.

    Article  MathSciNet  MATH  Google Scholar 

  4. Hathaway, R. J. (1985). “A Constrained Formulation of Maximum-Likelihood Estimation for Normal Mixture Distributions,” Annals of Statistics, 13, 795–800.

    Article  MathSciNet  MATH  Google Scholar 

  5. Hinton, G. E., and Ghahramani, Z. (1997). “Generative Models for Discovering Sparse Distributed Representations,” Philosophical Transactions Royal Society B, 352, 1177–1190.

    Article  Google Scholar 

  6. Kambhatla, N., and Leen, T. K. (1997). “Dimension Reduction by Local Principal Component Analysis,” Neural Computation, 9, 1793–1516.

    Article  Google Scholar 

  7. Kiefer, J., and Wolfowitz, J. (1956). “Consistency of the Maximum Likelihood Estimates in the Presence of Infinitely Many Incidental Parameters,” Annals of Mathematical Statistics, 27, 887–906.

    Article  MathSciNet  MATH  Google Scholar 

  8. Kohonen, T. (1989). Self-Organization and Associative Memory (3rd ed.), Springer-Verlag, Berlin.

    Book  Google Scholar 

  9. Kohonen, T. (1990). “The Self-Organizing Map,” in Proceedings of the IEEE, 78, pp. 1464–1479.

    Article  Google Scholar 

  10. Lin, X. (2003). Finite Mixture Models for Clustering, Dimension Reduction and Privacy Preserving Data Mining, Ph.D. Thesis, Purdue University.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lin, X., Zhu, Y. (2004). Degenerate Expectation-Maximization Algorithm for Local Dimension Reduction. In: Banks, D., McMorris, F.R., Arabie, P., Gaul, W. (eds) Classification, Clustering, and Data Mining Applications. Studies in Classification, Data Analysis, and Knowledge Organisation. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17103-1_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17103-1_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22014-5

  • Online ISBN: 978-3-642-17103-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics