© 2001

Regression Modeling Strategies

With Applications to Linear Models, Logistic Regression, and Survival Analysis


Part of the Springer Series in Statistics book series (SSS)

Table of contents

  1. Front Matter
  2. Frank E. Harrell Jr.

About this book


Many texts are excellent sources of knowledge about individual statistical tools, but the art of data analysis is about choosing and using multiple tools. Instead of presenting isolated techniques, this text emphasizes problem-solving strategies that address the many issues arising when developing multivariable models using real data rather than standard textbook examples. It includes imputation methods for dealing with missing data effectively, methods for dealing with nonlinear relationships and for making the estimation of transformations a formal part of the modeling process, methods for dealing with "too many variables to analyze and not enough observations," and powerful model validation techniques based on the bootstrap. This text deals realistically with model uncertainty and its effects on inference to achieve "safe data mining".


Analysis, Excel, Fitting, Resampling, Survival analysis, best fit, data analysis, data mining, modeling

Authors and affiliations

  1. Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, USA



From the reviews:

"The book is an ambitious, and mostly successful, attempt to disseminate effective strategies for the use of regression techniques. Many of the examples are from the medical area, in which the author has worked for many years and has accumulated a wealth of experience. It is written in a clear and direct style…definitely a valuable reference for modern applications of commonly used regression techniques. Data analysis, particularly users of S-PLUS, with experience in the application of these tools will benefit the most from this book."


"This is a book that leaves one breathless. It demands a lot, but gives plenty in return.  ... The book has many sets of programming instructions and printouts, all delivered in a stacato fashion. Sets of data are large. Many different types of models and methods are discussed. There are many printouts and diagrams. Computer oriented readers will like this book immediately. Others may grow to like it. It is an essential reference for the library."


"This is the latest volume in the generally excellent Springer Series in Statistics, and it has to be one of the best. Professor Harrell has produced a book that offers many new and imaginative insights into multiple regression, logistic regression and survival analysis, topics that form the core of much of the statistical analysis carried out in a variety of disciplines, particularly in medicine. ... Regression Modelling Stategies is a book that many statisticians will enjoy and learn from. The problems given at the end of each chapter may also make it suitable for some postgrdauate courses, particularly those for medical students in which S-PLUS is a major component. Working through the case studies in the book will demonstrate what can be achieved with a little imagination, when modelling complex and challenging data sets. So here we have a truly excellent, informative and attractive text that is highly recommended."


"Over the past 7 years, I have probably read this book, on its preversion, a half-dozen times, and I refer to it routinely. If my work bookshelf held only one book, it would be this one. The book covers, very completely, the nuances of regression modeling with particular emphasis on binary and ordinal logistic regression and parametric and nonparametric survival analysis...Harrell very nicely walks the reader through numerous analyses, explaining and defining his model-building choices at each step in the process. It is refreshing to have an author present choices and actuallly defend an approach, and in this manner."

"This book emphasizes problem solving strategies that address the many issues arising when developing multivariable models … . The author has a very motivating style and includes opinions, remarks and summary … . The logical path chosen on how to present the material is excellent. … considering the fun I had reading the book, I think that the author’s aims are met and I highly recommend everybody to have a look at the book. Moreover, I recommend purchasing the book to any library." (Diego Kuonen, Statistical Methods in Medical Research, Vol. 13 (5), 2004)

"It is a book that tries to show us how many different tools may be used in combination for regression analysis. … The author gives us plenty of references (466!) to textbooks and papers where we may read more about individual topics; most chapters end with suggestions for further reading and problems. … Many tools are illustrated in five chapter-long case studies. … the author has written a very inspiring book which should be able to teach most of us something … ." (Søren Feodor Nielsen, Journal of Applied Statistics, Vol. 30 (1), 2003)

"This book could serve as a wonderful textbook for a graduate-level or upper undergraduate-level data-analysis class. There are plenty of hands-on exercises … . From a researcher’s perspective, there are enough interesting ideas to easily stimulate research on other fruitful avenues. From an applied statistician’s perspective, the book fills an important gap in the field and would serve as an ideal resource. … a well laid-out, enjoyable book. I wholeheartedly recommend it … to anyone interested in the strategies of intelligent data analysis." (Sunil J. Rao, Journal of the American Statistical Association, March, 2003)

"Regression Modeling Strategies is largely about prediction. … The book is incredibly well referenced, with a 466-item bibliography. … Harrell very nicely walks the reader through numerous analyses, explaining and defining his model-building choices at each step in the process. It is refreshing to have an author present choices and actually defend an approach … . I found his arguments very convincing. Certainly, if you are interested in developing or validating prediction models, you will likely find this book to be very valuable." (Mike Kattan, Medical Decision Making, March/April, 2003)

"Professor Harrell provides descriptions of statistical strategies intended for the analysis of data using linear, logistic and proportional hazard regression models. … Harrell combines statistical theory with a modest amount of mathematics, data in the form of case studies, implementation of regression models, graphics and interpretation making it attractive to Masters or PhD level graduate students as well as biomedical researchers. … this is an excellent book for serious researchers." (Max K. Bulsara, Lab News, August/September, 2002)