Abstract
Designing ensemble learners has been recognized as one of the significant trends in the field of data knowledge, especially, in data science competitions. Building models that are able to outperform all individual models in terms of bias, which is the error due to the difference in the average model predictions and actual values, and variance, which is the variability of model predictions, has been the main goal of the studies in this area. An optimization model has been proposed in this paper to design ensembles that try to minimize bias and variance of predictions. Focusing on service sciences, two well-known housing datasets have been selected as case studies: Boston housing and Ames housing. The results demonstrate that our designed ensembles can be very competitive in predicting the house prices in both Boston and Ames datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
World development indicators. World Bank (1978)
B. Hefley, W. Murphy, Service Science, Management and Engineering: Education for the 21st Century. Springer Science & Business Media (2008)
H. Katzan, Foundations of service science concepts and facilities. J. Serv. Sci. 1(1) (2008)
G. Xiong, Z. Liu, X. Liu, F. Zhu, D. Shen, Service Science, Management, and Engineering: Theory and Applications. Academic (2012)
L. Breiman, Statistical modeling: the two cultures (with comments and a rejoinder by the author). Stat. Sci. 16(3), 199–231 (2001)
B. Park, J.K. Bae, Using machine learning algorithms for housing price prediction: the case of Fairfax County, Virginia housing data. Expert Syst. Appl. 42(6), 2928–2934 (2015)
J. Gu, M. Zhu, L. Jiang, Housing price forecasting based on genetic algorithm and support vector machine. Expert Syst. Appl. 38(4), 3383–3386 (2011)
X. Wang, J. Wen, Y. Zhang, Y. Wang, Real estate price forecasting based on SVM optimized by PSO. Opt.-Int. J. Light Electron Opt. 125(3), 1439–1443 (2014)
H. Selim, Determinants of house prices in Turkey: hedonic regression versus artificial neural network. Expert Syst. Appl. 36(2), 2843–2852 (2009)
E.A. Antipov, E.B. Pokryshevskaya, Mass appraisal of residential apartments: an application of random forest for valuation and a CART-based approach for model diagnostics. Expert Syst. Appl. 39(2), 1772–1778 (2012)
P.-N. Tan, M. Steinbach, V. Kumar, Introduction to data mining, addison, ed. by M.A. Boston, USA: Wesley Longman, Publishing Co., Inc (2005)
D. Talia, P. Trunfio, F. Marozzo, Data analysis in the cloud: models, techniques and applications. Elsevier (2015)
M. Sugiyama, Introduction to statistical machine learning. Morgan Kaufmann (2015)
G. Wang, J. Hao, J. Ma, H. Jiang, A comparative assessment of ensemble learning for credit scoring. Expert Syst. Appl. 38(1), 223–230 (2011)
L. Nanni, A. Lumini, An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring. Expert Syst. Appl. 36(2), 3028–3033 (2009)
J. Friedman, T. Hastie, R. Tibshirani, The Elements of Statistical Learning (no. 10). Springer series in statistics New York (2001)
L. Breiman, Bias, variance, and arcing classifiers (1996)
H. Pham, S. Olafsson, Bagged ensembles with tunable parameters. Comput. Intell. 35(1), 184–203 (2019)
H. Pham, S. Olafsson, On Cesaro averages for weighted trees in the random forest. J. Classif. (2019)
S. Boyd, L. Vandenberghe, Convex Optimization. Cambridge university press, Cambridge (2004)
E. Jones, T. Oliphant, P. Peterson, others. SciPy: Open source scientific tools for Python (2001). http://www.scipy.org
D. Kraft, A software package for sequential quadratic programming, Forschungsbericht- Deutsche Forschungs- und Versuchsanstalt fur Luft- und Raumfahrt (1988)
A. Wendorff, E. Botero, J.J. Alonso, Comparing different off-the-shelf optimizers’ performance in conceptual aircraft design, in 17th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference (2016), p. 336
D. Harrison Jr., D.L. Rubinfeld, Hedonic housing prices and the demand for clean air. J. Environ. Econ. Manag. 5(1), 81–102 (1978)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Shahhosseini, M., Hu, G., Pham, H. (2020). Optimizing Ensemble Weights for Machine Learning Models: A Case Study for Housing Price Prediction. In: Yang, H., Qiu, R., Chen, W. (eds) Smart Service Systems, Operations Management, and Analytics. INFORMS-CSS 2019. Springer Proceedings in Business and Economics. Springer, Cham. https://doi.org/10.1007/978-3-030-30967-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-30967-1_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30966-4
Online ISBN: 978-3-030-30967-1
eBook Packages: Business and ManagementBusiness and Management (R0)