Skip to main content

Introduction

  • Chapter
  • 392 Accesses

Part of the book series: Lecture Notes in Statistics ((LNS,volume 86))

Abstract

In many scientific areas a basic task is to assess the simultaneous influence of several factors on a quantity of interest. Regression models provide therefore a powerful framework, and estimation of the effects in such models is a well-established field of statistics. In general this estimation is based on measurements of the factors (covariates) and the quantity of interests (outcome variable) for a set of units. However, in practice often not all covariates can be measured for all units, i.e., some of the units show a missing value in one or several covariates. The reasons can be very different and depend mainly on the type of the measurement procedure for a single covariate and the type of the data collection procedure. Some examples should illustrate this:

  • If the covariate values are collected by a questionnaire or interview, non-response is a typical source for missing values. It may be due to a true lack of knowledge, if for example a person is asked for certain diseases during its childhood, or to an intentional refusal. The latter is especially to be expected for embarrassing questions like alcohol consumption, drug abuse, sexual activities, or income.

  • In retrospective studies covariate values are often collected on the basis of documents like hospital records. Incompleteness of the documents causes missing values.

  • In clinical trials biochemical parameters are often used as covariates. The measurement of these parameters often requires a certain amount of blood, urine or tissue, which may not be available.

  • In prospective clinical trials the recruitment of patients can last several years. Meanwhile scientific progress may discover new influential factors, which may cause the decision to add the measurement of the covariate to the data collection procedure. For patients recruited before this decision the value of this covariate is missing.

  • If the measurement of a covariate is very expensive, one may restrict the measurement to a subset of all units.

  • Even in a well planned and conducted study small accidents can happen. A test tube may break, a case report form may be lost on the mail, an examination may be forgotten, the inaccuracy of an instrument may be detected too late, etc. Each accident may cause a missing value.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag New York, Inc.

About this chapter

Cite this chapter

Vach, W. (1994). Introduction. In: Logistic Regression with Missing Values in the Covariates. Lecture Notes in Statistics, vol 86. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2650-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4612-2650-5_1

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-0-387-94263-6

  • Online ISBN: 978-1-4612-2650-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics