Skip to main content

Adaptive Dimensionality Adjustment for Online “Principal Component Analysis”

  • Conference paper
  • First Online:
Intelligent Data Engineering and Automated Learning – IDEAL 2019 (IDEAL 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11871))

Abstract

Many applications in the Industrial Internet of Things and Industry 4.0 rely on large amounts of data which are continuously generated. The exponential growth in available data and the resulting storage requirements are often underestimated bottlenecks. Therefore, efficient dimensionality reduction gets more attention and becomes more relevant. One of the most widely used techniques for dimensionality reduction is “Principal Component Analysis” (PCA). A novel algorithm to determine the optimal number of meaningful principal components on a data stream is proposed. The basic idea of the proposed algorithm is to optimize the dimensionality adjustment process by taking advantage of several “natural” PCA features. In contrast to the commonly used approach to start with a maximal set of principal components and apply some sort of stopping rule, the proposed algorithm starts with a minimal set of principal components and uses a linear regression model in the natural logarithmic scale to approximate the remaining components. An experimental study is presented to demonstrate the successful application of the algorithm to noisy synthetic and real world data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Parameter set: \(m_0 = 2\), \(\varGamma = 100\), \(\gamma _0 = 10, N = 5000\).

References

  1. Careercon 2019 - help navigate robots. https://www.kaggle.com/c/career-con-2019/data. Accessed 20 May 2019

  2. Cardot, H., Degras, D.: Online principal component analysis in high dimension: which algorithm to choose? Int. Stat. Rev. 86, 29–50 (2015)

    Article  MathSciNet  Google Scholar 

  3. Gordon, A.D., Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. Biometrics 40(3), 874 (1984)

    Article  Google Scholar 

  4. Guttman, L.: Some necessary conditions for common-factor analysis. Psychometrika 19(2), 149–161 (1954)

    Article  MathSciNet  Google Scholar 

  5. Hancock, P., Baddeley, R., Smith, L.: The principal components of natural images. Netw. Comput. Neural Syst. 3, 61–70 (1970)

    Article  Google Scholar 

  6. Jolliffe, I.T., Cadima, J.: Principal component analysis: a review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 374(2065), 20150202 (2016)

    Article  MathSciNet  Google Scholar 

  7. Katal, A., Wazid, M., Goudar, R.H.: Big data: issues, challenges, tools and good practices. In: 2013 Sixth International Conference on Contemporary Computing (IC3), pp. 404–409 (2013)

    Google Scholar 

  8. Moeller, R.: Interlocking of learning and orthonormalization in RRLSA. Neurocomputing 49(1–4), 429–433 (2002)

    Article  Google Scholar 

  9. Peres-Neto, P.R., Jackson, D.A., Somers, K.M.: How many principal components? Stopping rules for determining the number of non-trivial axes revisited. Comput. Stat. Data Anal. 49(4), 974–997 (2005)

    Article  MathSciNet  Google Scholar 

  10. Preacher, K.J., MacCallum, R.C.: Repairing tom swifts electric factor analysis machine. Underst. Stat. 2(1), 13–43 (2003)

    Article  Google Scholar 

  11. Schenck, W.: Adaptive Internal Models for Motor Control and Visual Prediction. MPI Series in Biological Cybernetics, 1st edn. Logos Verlag Berlin, Berlin (2008)

    Google Scholar 

  12. Tharwat, A.: Principal component analysis - a tutorial. Int. J. Appl. Pattern Recogn. 3, 197–238 (2016)

    Article  Google Scholar 

  13. Zhang, T., Yang, B.: Big data dimension reduction using PCA. In: 2016 IEEE International Conference on Smart Cloud (SmartCloud), pp. 152–157 (2016)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the EFRE-NRW funding programme “Forschungsinfrastrukturen” (grant no. 34.EFRE-0300119).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nico Migenda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Migenda, N., Möller, R., Schenck, W. (2019). Adaptive Dimensionality Adjustment for Online “Principal Component Analysis”. In: Yin, H., Camacho, D., Tino, P., Tallón-Ballesteros, A., Menezes, R., Allmendinger, R. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2019. IDEAL 2019. Lecture Notes in Computer Science(), vol 11871. Springer, Cham. https://doi.org/10.1007/978-3-030-33607-3_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-33607-3_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-33606-6

  • Online ISBN: 978-3-030-33607-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics