Abstract
Missing data are a common problem in medical research. We may encounter missing values of predictor values (X) and for the outcome (y) that we want to predict. Traditional complete case analysis suffers from inefficiency, selection bias of subjects, and other limitations when developing a prediction model. We briefly review the theoretical background on mechanisms of missingness of predictor values and how these may affect prediction models. We further concentrate on imputation methods as a solution, where a completed data set is created by filling in missing values for the statistical analysis. Special attention is given to the specification of an imputation model, which is the essential step in imputation. Multiple imputation is a method is to generate completed data sets multiple times, while single imputation is more straightforward and may be sufficient for some prognostic research questions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Steyerberg, E.W. (2019). Missing Values. In: Clinical Prediction Models. Statistics for Biology and Health. Springer, Cham. https://doi.org/10.1007/978-3-030-16399-0_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-16399-0_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16398-3
Online ISBN: 978-3-030-16399-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)