Evolving Regression Models

Baragona, Roberto; Battaglia, Francesco; Poli, Irene

doi:10.1007/978-3-642-16218-3_3

Roberto Baragona⁴,
Francesco Battaglia⁵ &
Irene Poli⁶

Part of the book series: Statistics and Computing ((SCO))

1675 Accesses

Abstract

Regression models are well established tools in statistical analysis which date back early to the eighteenth century. Nonetheless, problems involved in their implementation and application in a wide number of fields are still the object of active research. Preliminary to the regression model estimation there is an identification step which has to be performed for selecting the variables of interest, detecting the relationships of interest among them, distinguishing dependent and independent variables. On the other hand, generalized regression models often have nonlinear and non convex log-likelihood, therefore maximum likelihood estimation requires optimization of complicated functions. In this chapter evolutionary computation methods are presented that have been developed to either support or surrogate analytic tools if the problem size and complexity limit their efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Balcombe K (2005) Model selection using information criteria and genetic algorithms. Comput Econ 25:207–228
Article MATH Google Scholar
Baragona R, Battaglia F (2007) Outliers detection in multivariate time series by independent component analysis. Neural Comput 19:1962–1984
Article MATH Google Scholar
Bell AJ, Sejnowski TJ (1995) An information – maximization approach to blind separation and blind deconvolution. Neural Comput 7:1129–1159
Article Google Scholar
Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit 30:1145–1159
Article Google Scholar
Bremer RH, Langevin GJ (1993) The genetic algorithm for identifying the structure of a mixed model. In: ASA proceedings of the statistical computing section. American Statistical Association, Alexandria, pp 80–85
Google Scholar
Cardoso JF, Souloumiac A (1993) Blind beamforming for non Gaussian signals. IEE Proc F 140:362–370
Google Scholar
Chatterjee S, Laudato M, Lynch LA (1996) Genetic algorithms and their statistical applications: an introduction. Comput Stat Data Anal 22:633–651
Article MATH Google Scholar
Chiodi M (1986) Procedures for generating pseudo-random numbers from a normal distribution of order p (p > 1). Stat Appl 1:7–26
Google Scholar
Fitzenberger B, Winker P (1998) Threshold accepting to improve the computation of censored quantile regression. In: Paynem R, Green P (eds) COMPSTAT, proceedings in computational statistics. Physica-Verlag, Heidelberg, pp 311–316
Google Scholar
Friedman J (1987) Exploratory projection pursuit. J Am Stat Assoc 82:249–266
Article MATH Google Scholar
Galeano P, Peña D, Tsay RS (2006) Outlier detection in multivariate time series by projection pursuit. J Am Stat Assoc 101:654–669
Article MATH Google Scholar
Gorriz JM, Puntonet CG, Gomez AM, Pernia O (2005) Guided GA-ICA algorithms. In: Wang J, Liao X, Yi Z (eds) ISNN 2005, LNCS 3496. Springer, Berlin Heidelberg, pp 943–948
Google Scholar
Guo Q, Wu W, Massart DL, Boucon C, de Jong S (2002) Feature selection in principal component analysis of analytical data. Chemom Intell Lab Syst 61:123–132
Article Google Scholar
Hosmer D, Lemeshow S (1989) Applied logistic regression. Wiley, New York, NY
Google Scholar
Huber PJ (1985) Projection pursuit. Ann Stat 13:435–475
Article MATH MathSciNet Google Scholar
Hyvarinen A, Oja E (2000) Independent component analysis: algorithms and applications. Neural Netw 13:411–430
Article Google Scholar
Kapetanios G (2007) Variable selection in regression models using nonstandard optimisation of information criteria. Comput Stat Data Anal 52:4–15
Article MATH MathSciNet Google Scholar
Kemsley EK (1998) A genetic algorithm approach to the calculation of canonical variates. Trends Anal Chem 17:24–34
Article Google Scholar
Kemsley EK (2001) A hybrid classification method: discrete canonical variate analysis using a genetic algorithm. Chemom Intell Lab Syst 55:39–55
Article Google Scholar
Lauritzen SL (1996) Graphical models. Oxford University Press, Oxford
Google Scholar
McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman and Hall, London
MATH Google Scholar
Miller AJ (1990) Subset selection in regression. Chapman and Hall, London
MATH Google Scholar
Minerva T, Paterlini S (2002) Evolutionary approaches for statistical modelling. In: Fogel DB, El-Sharkam MA, Yao G, Greenwood H, Iba P, Marrow P, Shakleton M (eds) Evolutionary computation 2002. Proceedings of the 2002 congress on evolutionary computation. IEEE Press, Piscataway, NJ, vol 2, pp 2023–2028
Google Scholar
Mitchell M (1996) An Introduction to genetic algorithms. The MIT Press, Cambridge, MA
Google Scholar
Pasia JM, Hermosilla AY, Ombao H (2005) A useful tool for statistical estimation: genetic algorithms. J Statistical Comput Simul 75:237–251
Article MATH MathSciNet Google Scholar
Robles V, Bielza C, Larrañaga P, González S, Ohno-Machado L (2008) Optimizing logistic regression coefficients for discrimination and calibration using estimation of distribution algorithms. TOP 16:345–366
Article MATH MathSciNet Google Scholar
Roverato A, Poli I (1998) A genetic algorithm for graphical model selection. J Ital Stat Soc 7:197–208
Article Google Scholar
Sabatier R, Reynés C (2008) Extensions of simple component analysis and simple linear discriminant analysis using genetic algorithms. Comput Stat Data Anal 52:4779–4789
Article MATH Google Scholar
Sessions D, Stevans L (2006) Investigating omitted variable bias in regression parameter estimation: a genetic algorithm approach. Comput Stat Data Anal 50:2835–2854
Article MATH MathSciNet Google Scholar
Spears WM, De Jong KA (1991) An analysis of multi-point crossover. In: Rawlins GJE (ed) Foundations of genetic algorithms. Morgan Kaufmann, San Mateo, CA, pp 301–315
Google Scholar
Sun ZL, Huang DS, Zheng CH, Shang L (2006) Optimal selection of time lags for TDSEP based on genetic algorithm. Neurocomputing 69:884–887
Article Google Scholar
Tan Y, Wang J (2001) Nonlinear blind source separation using higher order statistics and a genetic algorithm. IEEE Trans Evol Comput 5:600–612
Article Google Scholar
Tolvi J (2004) Genetic algorithms for outlier detection and variable selection in linear regression models. Soft Comput 8:527–533
Article MATH Google Scholar
Vitrano S, Baragona R (2004) The genetic algorithm estimates for the parameters of order p normal distributions. In: Bock HH, Chiodi M, Mineo A (eds) Advances in multivariate data analysis. Springer, Berlin Heidelberg, pp 133–143
Google Scholar
Zhou X, Wang J (2005) A genetic method of LAD estimation for models with censored data. Comput Stat Data Anal 48:451–466
Article MATH Google Scholar
Ziehe A, Müller KR (1998) Tdsep – and efficient algorithm for blind separation using time structure. In: Proceedings of the international conference on ICANN, perspectives in neural computing. Springer, Berlin, pp 675–680
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Communication and Social Research, Sapienza University of Rome, Via Salaria 113, 00198, Rome, Italy
Prof. Roberto Baragona
Department of Statistical Sciences, Sapienza University of Rome, Piazzale Aldo Moro 5, 00100, Roma, Italy
Prof. Francesco Battaglia
Department of Statistics, Ca’ Foscari University of Venice, Cannaregio 873, 30121, Venice, Italy
Prof. Irene Poli

Authors

Prof. Roberto Baragona
View author publications
You can also search for this author in PubMed Google Scholar
Prof. Francesco Battaglia
View author publications
You can also search for this author in PubMed Google Scholar
Prof. Irene Poli
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Roberto Baragona .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Baragona, R., Battaglia, F., Poli, I. (2011). Evolving Regression Models. In: Evolutionary Statistical Procedures. Statistics and Computing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16218-3_3

Download citation

DOI: https://doi.org/10.1007/978-3-642-16218-3_3
Published: 08 November 2010
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16217-6
Online ISBN: 978-3-642-16218-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics