Sample Selection

Finlay, Steven

doi:10.1057/9780230298989_3

Abstract

The databases available for model construction can be vast. Some customer databases contain tens of millions of records (observations) and thousands of predictor variables. Even with modern computing facilities it may not be practical to use all of the data available. There will also be cases that are not suitable for model construction, and these need to be identified and dealt with. The data used for model construction should also be as similar as possible to the data that will exist when the completed model is put into service. This usually means that the sample used to construct the model should be as recent as possible, to guard against changes in the patterns of behaviour that accumulate over time. For these reasons it is common for models to be constructed using a subset (a sample) of the available data, rather than the full population.
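The idea in the abstract can be illustrated with a minimal sketch: drop unsuitable cases, keep only recent records, then draw a simple random sample. This is not the chapter's own procedure. The record fields (`opened`, `test_account`), the cutoff date, the sample size and the helper name `select_sample` are all hypothetical, chosen only for illustration.

```python
import random
from datetime import date


def select_sample(records, cutoff, sample_size, is_excluded, seed=0):
    """Return a simple random sample of recent, eligible records.

    records     -- iterable of dicts (hypothetical schema, see below)
    cutoff      -- records opened before this date are treated as too old
    sample_size -- maximum number of records to return
    is_excluded -- function flagging cases unsuitable for modelling
    seed        -- fixed seed so the sample is reproducible
    """
    eligible = [
        r for r in records
        if r["opened"] >= cutoff and not is_excluded(r)
    ]
    rng = random.Random(seed)
    # Never ask for more records than are available.
    k = min(sample_size, len(eligible))
    return rng.sample(eligible, k)


# Toy population: 1,000 records spread over three years, with every
# tenth record flagged as a (hypothetical) test account.
population = [
    {"id": i, "opened": date(2008 + i % 3, 1, 1), "test_account": i % 10 == 0}
    for i in range(1000)
]

sample = select_sample(
    population,
    cutoff=date(2009, 1, 1),
    sample_size=200,
    is_excluded=lambda r: r["test_account"],
)
```

Filtering before sampling keeps the exclusion rules and the recency window explicit, so the same rules can be reapplied when the sample is refreshed. Simple random sampling is only one option and is assumed here for brevity.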

About this chapter

Cite this chapter

Finlay, S. (2010). Sample Selection. In: Credit Scoring, Response Modelling and Insurance Rating. Palgrave Macmillan, London. https://doi.org/10.1057/9780230298989_3

DOI: https://doi.org/10.1057/9780230298989_3
Publisher Name: Palgrave Macmillan, London
Print ISBN: 978-1-349-36689-7
Online ISBN: 978-0-230-29898-9
eBook Packages: Palgrave Economics & Finance Collection, Economics and Finance (R0)
