Skip to main content

Generalized Additive Models (GAMs)

  • Chapter
  • First Online:
Effective Statistical Learning Methods for Actuaries I

Part of the book series: Springer Actuarial ((SPACLN))

  • 1772 Accesses

Abstract

With GLMs, mean responses are modeled as monotonic functions of linear scores. The assumed linearity of the score is not restrictive for categorical features coded by means of binary variables. However, this assumption becomes questionable for continuous features which may have a nonlinear effect on the score scale. This chapter is devoted to Generalized Additive Models (GAMs) which keep the additive decomposition of the score but allow the actuary to discover nonlinear effects of features like policyholder’s age or place of residence (geographic effect), for instance. Contrarily to the prior categorization of continuous features (banding into pre-defined classes), GAMs offer a flexible, data-driven procedure to identify the optimal transformation of continuous features for inclusion on the score scale. Precisely, continuous features enter the model in a semi-parametric additive predictor. Typical applications in insurance include the graduation of mortality and morbidity or the analysis of risk variation by age or geographic area, for instance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    It turns out that this phenomenon is not an isolated case and has led to a theory known as the Wisdom of Crowds.

References

  • Brouhns N, Denuit M, Vermunt JK (2002) A Poisson log-bilinear approach to the construction of projected life tables. Insur Math Econ 31:373–393

    Article  Google Scholar 

  • Chen Y, Jia Z, Mercola D, Xie X (2013) A gradient boosting algorithm for survival analysis via direct optimization of concordance index. Comput Math Methods Med 13, Article ID 873595

    Google Scholar 

  • Denuit M, Lang S (2004) Nonlife ratemaking with Bayesian GAMs. Insur Math Econ 35:627–647

    Article  Google Scholar 

  • Denuit M, Legrand C (2018) Risk classification in life and health insurance: extension to continuous covariates. Eur Actuar J 8:245–255

    Article  MathSciNet  Google Scholar 

  • Denuit M, Mesfioui M, Trufin J (2019a) Bounds on concordance-based validation statistics in regression models for binary responses. Methodol Comput Appl Probab 21:491–509

    Article  MathSciNet  Google Scholar 

  • Denuit M, Mesfioui M, Trufin J (2019b) Concordance-based predictive measures in regression models for discrete responses. Scand Actuar J

    Google Scholar 

  • Fenske N, Kneib T, Hothorn T (2011) Identifying risk factors for severe childhood malnutrition by boosting additive quantile regression. J Am Stat Assoc 106:494–510

    Article  MathSciNet  Google Scholar 

  • Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat 19:1–67

    Article  MathSciNet  Google Scholar 

  • Friedman JH, Stuetzle W (1981) Projection pursuit regression. J Am Stat Assoc 76:817–823

    Article  MathSciNet  Google Scholar 

  • Groll A, Tutz G (2012) Regularization for generalized additive mixed models by likelihood-based boosting. Methods Inf Med 51:168–177

    Article  Google Scholar 

  • Gschlossl S, Schoenmaekers P, Denuit M (2011) Risk classification in life insurance: methodology and case study. Eur Actuar J 1:23–41

    Article  MathSciNet  Google Scholar 

  • Hainaut D, Trufin J, Denuit M (2019) Effective statistical learning methods for actuaries–neural networks and unsupervised methods. Springer Actuarial Series

    Google Scholar 

  • Hastie T, Tibshirani R (1990) Generalized additive models. Chapman and Hall, London

    MATH  Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2016) The Elements of statistical learning: data mining, inference, and prediction. Springer Series in Statistics

    Google Scholar 

  • Kraus A, Kuchenhoff H (2014) Credit scoring optimization using the area under the curve. J Risk Model Valid 8:31–67

    Article  Google Scholar 

  • Lingjarde OC, Liestol K (1998) Generalized projection pursuit regression. SIAM J Sci Comput 20:844–857

    Article  MathSciNet  Google Scholar 

  • Loader C (1999) Local regression and likelihood. Springer, New York

    MATH  Google Scholar 

  • Ma S, Huang J (2005) Regularized ROC method for disease classification and biomarker selection with microarray data. Bioinformatics 21:4356–4362

    Article  Google Scholar 

  • Mayr A, Binder H, Gefeller O, Schmid M (2014a) The evolution of boosting algorithms-from machine learning to statistical modelling. Methods Inf Med 53:419–427

    Article  Google Scholar 

  • Mayr A, Binder H, Gefeller O, Schmid M (2014b) Extending statistical boosting-an overview of recent methodological developments. Methods Inf Med 53:428–435

    Article  Google Scholar 

  • Mayr A, Schmid M (2014) Boosting the concordance index for survival data a unified framework to derive and evaluate biomarker combinations. PloS ONE 9:e84483

    Article  Google Scholar 

  • Renshaw AE, Haberman S (1996) Dual modelling and select mortality. Insur Math Econ 19:105–126

    Article  Google Scholar 

  • Renshaw AE, Haberman S, Hatzopoulos P (1997) On the duality of assumptions underpinning the construction of life tables. ASTIN Bull 27:5–22

    Article  Google Scholar 

  • Sobotka F, Kneib T (2012) Geoadditive expectile regression. Comput Stat Data Anal 56:755–767

    Article  MathSciNet  Google Scholar 

  • Trufin J, Hainaut D, Denuit M (2019) Effective statistical learning methods for actuaries–tree-based methods. Springer Actuarial Series

    Google Scholar 

  • Tutz G, Binder H (2006) Generalized additive modelling with implicit variable selection by likelihood based boosting. Biometrics 51:961–971

    Article  Google Scholar 

  • Tutz G, Binder H (2007) Boosting ridge regression. Comput Stat Data Anal 51:6044–6059

    Article  MathSciNet  Google Scholar 

  • Wallis KF (2014) Revisiting Francis Galton’s forecasting competition. Stat Sci 29:420–424

    Article  MathSciNet  Google Scholar 

  • Wang Z (2011) HingeBoost: ROC-based boost for classification and variable selection. Int J Biostat 7:1–30

    Article  MathSciNet  Google Scholar 

  • Wood SN (2017) Generalized additive models: an introduction with R, 2nd edn. CRC Press

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michel Denuit .

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Denuit, M., Hainaut, D., Trufin, J. (2019). Generalized Additive Models (GAMs). In: Effective Statistical Learning Methods for Actuaries I. Springer Actuarial(). Springer, Cham. https://doi.org/10.1007/978-3-030-25820-7_6

Download citation

Publish with us

Policies and ethics