Correlation

Chapter

Abstract

Correlation quantifies the relationship between features. Linear correlation methods are robust and computationally efficient but detect only linear dependencies. Nonlinear correlation methods are able to detect nonlinear dependencies but need to be carefully parametrized. As a popular example for nonlinear correlation we present the chi-square test for independence that can be applied to continuous features using histogram counts. Nonlinear correlation can also be quantified by the regression validation error. Correlation does not imply causality, so correlation analysis may reveal spurious correlations. If the underlying features are known, then spurios correlations may be compensated by partial correlation methods.

Keywords

Obesity Covariance Sammon 

References

  1. 1.
    D. Freedman, R. Pisani, and R. Purves. Statistics. W. W. Norton & Company, New York, 2007.Google Scholar
  2. 2.
    K. Pearson. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philosophical Magazine, 50(302):157–175, 1900.CrossRefMATHGoogle Scholar
  3. 3.
    T. A. Runkler. Fuzzy histograms and fuzzy chi–squared tests for independence. In IEEE International Conference on Fuzzy Systems, volume 3, pages 1361–1366, Budapest, July 2004.Google Scholar
  4. 4.
    H. A. Simon. Spurious correlation: A causal interpretation. Journal of the American Statistical Association, 49:467–479, 1954.MATHGoogle Scholar

Copyright information

© Springer Fachmedien Wiesbaden 2016

Authors and Affiliations

  1. 1.Siemens AGMünchenGermany

Personalised recommendations