Abstract
A major challenge in prediction modeling is that we may have more candidate predictors available for the analysis than we would like to include for further analysis, in particular if our data set is relatively small. A small sample size leads to problems as discussed in Chap. 5, such as limited power to test effects of potential predictors, and too extreme predictions when predictions are based on the standard regression coefficients (overfitting). We discuss some procedures to increase the robustness and validity of a prediction model, including restriction of the number of candidate predictors based on subject knowledge, considering distributions of predictors, combining similar variables, and averaging the effects of similar variables. We provide a detailed description of a case study of modeling similar effects of aspects of family history for robust prediction of the presence of a genetic mutation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Steyerberg, E.W. (2019). Restrictions on Candidate Predictors. In: Clinical Prediction Models. Statistics for Biology and Health. Springer, Cham. https://doi.org/10.1007/978-3-030-16399-0_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-16399-0_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16398-3
Online ISBN: 978-3-030-16399-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)