Abstract
With GLMs, mean responses are modeled as monotonic functions of linear scores. The assumed linearity of the score is not restrictive for categorical features coded by means of binary variables. However, this assumption becomes questionable for continuous features which may have a nonlinear effect on the score scale. This chapter is devoted to Generalized Additive Models (GAMs) which keep the additive decomposition of the score but allow the actuary to discover nonlinear effects of features like policyholder’s age or place of residence (geographic effect), for instance. Contrarily to the prior categorization of continuous features (banding into pre-defined classes), GAMs offer a flexible, data-driven procedure to identify the optimal transformation of continuous features for inclusion on the score scale. Precisely, continuous features enter the model in a semi-parametric additive predictor. Typical applications in insurance include the graduation of mortality and morbidity or the analysis of risk variation by age or geographic area, for instance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
It turns out that this phenomenon is not an isolated case and has led to a theory known as the Wisdom of Crowds.
References
Brouhns N, Denuit M, Vermunt JK (2002) A Poisson log-bilinear approach to the construction of projected life tables. Insur Math Econ 31:373–393
Chen Y, Jia Z, Mercola D, Xie X (2013) A gradient boosting algorithm for survival analysis via direct optimization of concordance index. Comput Math Methods Med 13, Article ID 873595
Denuit M, Lang S (2004) Nonlife ratemaking with Bayesian GAMs. Insur Math Econ 35:627–647
Denuit M, Legrand C (2018) Risk classification in life and health insurance: extension to continuous covariates. Eur Actuar J 8:245–255
Denuit M, Mesfioui M, Trufin J (2019a) Bounds on concordance-based validation statistics in regression models for binary responses. Methodol Comput Appl Probab 21:491–509
Denuit M, Mesfioui M, Trufin J (2019b) Concordance-based predictive measures in regression models for discrete responses. Scand Actuar J
Fenske N, Kneib T, Hothorn T (2011) Identifying risk factors for severe childhood malnutrition by boosting additive quantile regression. J Am Stat Assoc 106:494–510
Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat 19:1–67
Friedman JH, Stuetzle W (1981) Projection pursuit regression. J Am Stat Assoc 76:817–823
Groll A, Tutz G (2012) Regularization for generalized additive mixed models by likelihood-based boosting. Methods Inf Med 51:168–177
Gschlossl S, Schoenmaekers P, Denuit M (2011) Risk classification in life insurance: methodology and case study. Eur Actuar J 1:23–41
Hainaut D, Trufin J, Denuit M (2019) Effective statistical learning methods for actuaries–neural networks and unsupervised methods. Springer Actuarial Series
Hastie T, Tibshirani R (1990) Generalized additive models. Chapman and Hall, London
Hastie T, Tibshirani R, Friedman J (2016) The Elements of statistical learning: data mining, inference, and prediction. Springer Series in Statistics
Kraus A, Kuchenhoff H (2014) Credit scoring optimization using the area under the curve. J Risk Model Valid 8:31–67
Lingjarde OC, Liestol K (1998) Generalized projection pursuit regression. SIAM J Sci Comput 20:844–857
Loader C (1999) Local regression and likelihood. Springer, New York
Ma S, Huang J (2005) Regularized ROC method for disease classification and biomarker selection with microarray data. Bioinformatics 21:4356–4362
Mayr A, Binder H, Gefeller O, Schmid M (2014a) The evolution of boosting algorithms-from machine learning to statistical modelling. Methods Inf Med 53:419–427
Mayr A, Binder H, Gefeller O, Schmid M (2014b) Extending statistical boosting-an overview of recent methodological developments. Methods Inf Med 53:428–435
Mayr A, Schmid M (2014) Boosting the concordance index for survival data a unified framework to derive and evaluate biomarker combinations. PloS ONE 9:e84483
Renshaw AE, Haberman S (1996) Dual modelling and select mortality. Insur Math Econ 19:105–126
Renshaw AE, Haberman S, Hatzopoulos P (1997) On the duality of assumptions underpinning the construction of life tables. ASTIN Bull 27:5–22
Sobotka F, Kneib T (2012) Geoadditive expectile regression. Comput Stat Data Anal 56:755–767
Trufin J, Hainaut D, Denuit M (2019) Effective statistical learning methods for actuaries–tree-based methods. Springer Actuarial Series
Tutz G, Binder H (2006) Generalized additive modelling with implicit variable selection by likelihood based boosting. Biometrics 51:961–971
Tutz G, Binder H (2007) Boosting ridge regression. Comput Stat Data Anal 51:6044–6059
Wallis KF (2014) Revisiting Francis Galton’s forecasting competition. Stat Sci 29:420–424
Wang Z (2011) HingeBoost: ROC-based boost for classification and variable selection. Int J Biostat 7:1–30
Wood SN (2017) Generalized additive models: an introduction with R, 2nd edn. CRC Press
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Denuit, M., Hainaut, D., Trufin, J. (2019). Generalized Additive Models (GAMs). In: Effective Statistical Learning Methods for Actuaries I. Springer Actuarial(). Springer, Cham. https://doi.org/10.1007/978-3-030-25820-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-25820-7_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-25819-1
Online ISBN: 978-3-030-25820-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)