Quantitative Geosciences: Data Analytics, Geostatistics, Reservoir Characterization and Modeling pp 77-102 | Cite as

# Correlation Analysis

## Abstract

Correlation is a fundamental tool for multivariate data analysis. Most multivariate statistical methods use correlation as a basis for data analytics. Machine learning methods are also impacted by correlations in data. With todays’ big data, the role of correlation becomes increasingly important. Although the basic concept of correlation is simple, it has many complexities in practice. Many may know the common saying “correlation is not causation”, but the statement “a causation does not necessarily lead to correlation” is much less known or even debatable. This chapter presents uses and pitfalls of correlation analysis for geoscience applications.

## References

- Aldrich, J. (1995). Correlation genuine and spurious in Pearson and Yule.
*Statistical Science, 10*(4), 364–376.MathSciNetCrossRefGoogle Scholar - Fletcher, S. (2017).
*Data assimilation for the geosciences: From theory to application*. Amsterdam: Elsevier.CrossRefGoogle Scholar - Galton, F. (1888). Co-relations and their measurement, chiefly from anthropometric data.
*Proceedings of the Royal Society of London, 45*, 135–145.Google Scholar - Langford, E., Schwertman, N., & Owens, M. (2001). Is the property of being positively correlated transitive?
*American Statistician, 55*, 322–325.MathSciNetCrossRefGoogle Scholar - Ma, Y. Z. (2009). Simpson’s paradox in natural resource evaluation.
*Mathematical Geosciences, 41*(2), 193–213. https://doi.org/10.1007/s11004-008-9187-z.CrossRefzbMATHGoogle Scholar - Ma, Y. Z. (2011). Pitfalls in predictions of rock properties using multivariate analysis and regression method.
*Journal of Applied Geophysics, 75*, 390–400.CrossRefGoogle Scholar - Ma, Y. Z. (2015). Simpson’s paradox in GDP and Per-capita GDP growth.
*Empirical Economics, 49*(4), 1301–1315.CrossRefGoogle Scholar - Ma, Y. Z., & Gomez, E. (2015). Uses and abuses in applying neural networks for predicting reservoir properties.
*Journal of Petroleum Science and Engineering, 133*, 66–75. https://doi.org/10.1016/j.petrol.2015.05.006.CrossRefGoogle Scholar - Ma, Y. Z., Wang, H., Sitchler, J., et al. (2014). Mixture decomposition and lithofacies clustering using wireline logs.
*Journal of Applied Geophysics, 102*, 10–20. https://doi.org/10.1016/j.jappgeo.2013.12.011.CrossRefGoogle Scholar - Martinelli, G., & Chugunov, N. (2014). Sensitivity analysis with correlated inputs for volumetric analysis of hydrocarbon prospects. In
*The proceeding of ECMOR XIV– 14th European conference on the mathematics of oil recovery*. https://doi.org/10.3997/2214-4609.20141870. - Mayer-Schonberger, V., & Cukier, K. (2013).
*Big data: A revolution that will transform how we live, work, and think*. Boston: Houghton Mifflin Harcourt.Google Scholar - Pearl, J. (2000).
*Causality: Models, reasoning and inference*. Cambridge: Cambridge University Press, 384p.zbMATHGoogle Scholar - Pearson, K., Lee, A., & Bramley-Moore, L. (1899). Mathematical contributions to the theory of evolution – VI. Genetic (reproductive) selection: Inheritance of fertility in man, and of fertility in thorough-bred racehorses.
*Philosophical Transactions of the Royal Society of London, Series A, 192*, 257–278.CrossRefGoogle Scholar - Rousseeuw, P. J., & Leroy, A. M. (1987).
*Robust regression and outlier detection*. Hoboken: Wiley.CrossRefGoogle Scholar - Yule, G. U., & Kendall, M. G. (1968).
*An introduction to the theory of statistics*(14th ed.). New York: Hafner Pub. Co. Revised and Enlarged, Fifth Impression.zbMATHGoogle Scholar - Zeisel, H. (1985).
*Say it with figures*(6th ed.). New York: Harper and Brothers.Google Scholar