Abstract
In net scoring as well as in gross scoring, the analyst has an impact on model development not only by choosing the modeling method. As part of data preparation or modeling itself, the analyst tries to adjust the available data in order to improve model results and/or stability and/or more discrimination power. In this chapter a closer look will be taken at two important methods for those adjustments. The first method deals with the possible transformation of raw data in order to improve performance of net scoring methods. The second method explains the so-called variable preselection, i.e., the process of reducing all available variables to a suitable subset which enter model building.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Strictly speaking, only one less variable is required because its value can be derived from the knowledge of all other dummy variables.
- 2.
Since for different numbers of levels between variables, the \(\chi ^2_{\mathrm {net},2}\) statistic has different approximate distributions and it would be incorrect to sort with respect to \(\chi ^2_{\mathrm {net},2}\).
References
W. Daniel. Biostatistics - A Foundation for Analysis in the Health Sciences, Eighth Edition. Wiley, 2005.
V. Devriendt, D. Moldovan, and W. Verbeke. A literature survey and experimental evaluation of the state-of-the-art in uplift modeling: A stepping stone toward the development of prescriptive analytics. Big Data, 6(1):13–41, 2018. https://doi.org/10.1089/big.2017.0104.
M. Falk, F. Marohn, and B. Tewes. Foundations of Statistical Analyses - Examples with SAS. Birkhäuser, Basel, 2003.
R. Johnson and G. Bhattacharyya. Statistics - Principles and Methods, 4th edition. Wiley, 2001.
K. Larsen. Net lift models. 2010. Presentation at the Analytics 2010 Conference, available at: http://www.sas.com/events/aconf/2010/pres/larsen.pdf.
R. Michel, I. Schnakenburg, and T. von Martens. Methods of variable pre-selection for netscore modeling. Journal of Research in Interactive Marketing, 7(4):257–268, 2013.
N.J. Radcliffe and P.D. Surry. Real-world uplift modeling with significance-based uplift trees. 2011. Technical Report, Stochastic Solutions.
N. Siddiqi. Credit Risk Scorecards: Developing and Implementing Intelligent Credit Scoring. John Wiley & Sons, Hoboken, 2005.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Michel, R., Schnakenburg, I., von Martens, T. (2019). Supplementary Methods for Variable Transformation and Selection. In: Targeting Uplift. Springer, Cham. https://doi.org/10.1007/978-3-030-22625-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-22625-1_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22624-4
Online ISBN: 978-3-030-22625-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)