Abstract
Regression analysis is used to model the relationship between a response variable and one or more explanatory variables (covariates). In the compositional case, the proper choice of logratio coordinates matters, both due to the interpretation of the regression parameters and because of the properties of the regression models. And again, orthonormal coordinates, particularly in their pivot version, are preferable. Moreover, in case of regression with compositional response and real covariates, ilr coordinates enable to decompose the multivariate regression model into single multiple regressions. The coordinate representation of compositions is essential also for statistical inference like hypotheses testing, which is frequently of interest in the regression context. In this chapter, all basic regression cases are contained: the mentioned regression with compositional response and real covariates, the case of real response and compositional explanatory variables, regression between two compositions, and finally also regression between the parts within one composition. A further important task is considered: variable selection of relevant covariates by forward and backward selection. Robustness issues are also of particular importance in the regression context—outliers in the response or in the covariates will have limited effect for robust regression estimates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
J. Aitchison, The Statistical Analysis of Compositional Data (Chapman & Hall, London, 1986); Reprinted in 2003 with additional material by The Blackburn Press)
H. Akaike, Information theory and an extension of the maximum likelihood principle, in Proceedings of the Second International Symposium on Information Theory, ed. by B. Petrov, F. Csaki (Akademiai Kiado, Budapest, 1973), pp. 267–281
F. Bruno, F. Greco, M. Ventrucci, Spatio-temporal regression on compositional covariates: modeling vegetation in a gypsum outcrop. Environ. Ecol. Stat. 22(3), 445–463 (2015)
P. Bühlmann, S. van de Geer, Statistics for High-Dimensional Data (Springer, Heidelberg, 2011)
J. Chen, X. Zhang, S. Li, Multiple linear regression with compositional response and covariates. J. Appl. Stat. 44(12), 2270–2285 (2017)
A. Davison, D. Hinkley, Bootstrap Methods and Their Application (Cambridge University Press, Cambridge, 1997)
J.J. Egozcue, J. Daunis-i-Estadella, V. Pawlowsky-Glahn, K. Hron, P. Filzmoser, Simplicial regression. The normal model. J. Appl. Probab. Stat. 6(1–2), 87–108 (2012)
M.A. Engle, M. Gallo, K.T. Schroeder, N.J. Geboy, J.W. Zupancic, Three-way compositional analysis of water quality monitoring data. Environ. Ecol. Stat. 21(3), 565–581 (2014)
E. Fišerová, K. Hron, Statistical inference in orthogonal regression for three-part compositional data using a linear model with type-II constraints. Commun. Stat. Theory Methods 41(13–14), 2367–2385 (2012)
E. Fišerová, L. Kubáček, P. Kunderová, Linear Regression Models: Reguarity and Singularities (Academia, Praha, 2007)
W.A. Fuller, Measurement Error Models (Wiley, New York, 1987)
M. Gallo, Tucker3 model for compositional data. Commun. Stat. Theory Methods 44(21), 4441–4453 (2015)
A. Gardlo, A. Smilde, K. Hron, M. Hrdá, R. Karlíková, T. Adam, Normalization techniques for PARAFAC modeling of urine metabolomics data. Metabolomics 12, 117 (2016)
S. Heritier, E. Cantoni, S. Copt, M.P. Victoria-Feser, Robust Methods in Biostatistics (Wiley, Chichester, 2009)
K. Hron, P. Filzmoser, K. Thompson, Linear regression with compositional explanatory variables. J. Appl. Stat. 39(5), 1115–1128 (2012)
K. Hrůzová, V. Todorov, K. Hron, P. Filzmoser, Classical and robust orthogonal regression between parts of compositional data. Stat. J. Theor. Appl. Stat. 50(6), 1261–1275 (2016)
P.J. Huber, Robust Statistics (Wiley, New York, 1981)
R.A. Johnson, D.W. Wichern, Applied Multivariate Statistical Analysis, 6th edn. (Prentice Hall, Upper Saddle River, 2007)
P.M. Kroonenberg, Applied Multiway Data Analysis (Wiley, Hoboken, 2008)
I. Markovsky, S. Van Huffel, Overview of total least-squares methods. Signal Processing 87(10), 2283–2302 (2007)
R. Maronna, D. Martin, V. Yohai, Robust Statistics: Theory and Methods (Wiley, Chichester, 2006)
C. Mert, P. Filzmoser, K. Hron, Error propagation in compositional data analysis: theoretical and practical considerations. Math. Geosci. 48(8), 941–961 (2016)
I. Müller, K. Hron, E. Fišerová, J. Šmahaj, P. Cakirpaloglu, J. Vančáková, Interpretation of compositional regression with application to time budget analysis. Austrian J. Stat. 47(2), 3–19 (2018)
C.R. Rao, S.K. Mitra, Generalized Inverse of Matrices and Its Applications (Wiley, New York, 1971)
P.J. Rousseeuw, Least median of squares regression. J. Am. Stat. Assoc. 79, 871–880 (1984)
P. Rousseeuw, M. Hubert, High-breakdown estimators of multivariate location and scatter, in Robustness and Complex Data Structures, ed. by C. Becker, R. Fried, S. Kuhnt (Springer, Heidelberg, 2013), pp. 49–66
M. Salibian-Barrera, S. Van Aelst, G. Willems, Principal component analysis based on multivariate MM-estimators with fast and robust bootstrap. J. Am. Stat. Assoc. 101(475), 1198–1211 (2006)
H. Scheffé, Experiments with mixtures. J. R. Stat. Soc. Ser. B Stat Methodol. 20(2), 344–360 (1958)
A. Smilde, R. Bro, P. Geladi, Multi-way Analysis: Applications in the Chemical Sciences (Wiley, Chichester, 2004)
R. Tolosana-Delgado, K.G. van den Boogaart, Linear models with compositions in R, in Compositional Data Analysis: Theory and Applications, ed. by V. Pawlowsky-Glahn, A. Buccianti (Wiley, Chichester, 2011), pp. 356–371
S. Van Aelst, G. Willems, Fast and robust bootstrap for multivariate inference: the R package FRB. J. Stat. Softw. 53(3), 1–32 (2013)
K.G. van den Boogaart, R. Tolosana-Delgado, Analyzing Compositional Data with R (Springer, Heidelberg, 2013)
K. Varmuza, P. Filzmoser, Introduction to Multivariate Statistical Analysis in Chemometrics (CRC Press, Boca Raton, 2009)
B. Walczak, P. Filzmoser, What can go wrong at the data normalization step for identification of biomarkers? J. Chromatogr. A 1362, 194–205 (2014)
H. Wold, M. Sjöström, L. Eriksson, PLS regression: a basic tool of chemometrics Chemom. Intell. Lab. Syst. 58, 109–130 (2001)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Filzmoser, P., Hron, K., Templ, M. (2018). Regression Analysis. In: Applied Compositional Data Analysis. Springer Series in Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-96422-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-96422-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96420-1
Online ISBN: 978-3-319-96422-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)