Skip to main content

Part of the book series: Synthese Library ((SYLI,volume 234))

  • 153 Accesses

Abstract

Assuming an adaptation of Suppes’s analysis of causality, we show that multiple regression methods are fundamentally incorrect procedures for identifying causes. This is because when regressors are correlated the existence of an unmeasured common cause of regressor X i and outcome variable Y may bias estimates of the influence of other regressors X k;, variables having no influence on Y whatsoever may thereby be given significant regression coefficients. The bias may be quite large. Simulation studies show that standard regression model specification procedures make the same error. The strategy of regressing on a larger set of variables and checking stability may compound rather than remedy the problem. A similar difficulty in the estimation of the influence of other regressors arises if some X i is an effect rather than a cause of Y. The problem appears endemic in uses of multiple regression on uncontrolled variables, and unless somehow corrected appears to invalidate many scientific uses of regression methods. We describe an implementation in the TETRAD II program of a model specification algorithm that avoids these and certain other errors in large samples. We illustrate the TETRAD II algorithm by applying it to a number of real and simulated data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Cooper, G. and Herskovits, E.: 1992, ‘A Bayesian Method for the Induction of Probabilistic Networks from Data’, Machine Learning (to appear).

    Google Scholar 

  • Fox, J.: 1984, Linear Statistical Models and Related Methods, Wiley, New York.

    Google Scholar 

  • Glymour, C., Spirtes P., and Scheines, R.: 1991b, ‘From Probability to Causality’, Philosophical Studies, 64(1), 1–36.

    Article  Google Scholar 

  • Linthurst, R. A.: 1979, ‘Aeration, Nitrogen, pH and Salinity as Factors Affecting Spartina Alterniflora Growth and Dieback’, Ph.D. thesis, North Carolina State University.

    Google Scholar 

  • Mosteller, F. and Tukey, J.: 1977, Data Analysis and Regression, A Second Course in Regression, Addison-Wesley, Massachusetts.

    Google Scholar 

  • Pratt, J. and Schlaifer, R.: 1988, ‘On the Interpretation and Observation of Laws’, Journal of Econometrics, 39, 23–52.

    Article  Google Scholar 

  • Rawlings, J.: 1988, Applied Regression Analysis, Wadsworth, Belmont, CA.

    Google Scholar 

  • Spirtes, P.: 1992, ‘Building Causal Graphs from Statistical Data in the Presence of Latent Variables’, forthcoming in: B. Skyrms (Ed.), Proceedings of the IX International Congress on Logic, Methodology, and the Philosophy of Science, Uppsala, Sweden, 1991.

    Google Scholar 

  • Spirtes, P., Glymour, C., Scheines, R., and Sorensen, S.: 1990, ‘TETRAD Studies of Data for Naval Air Traffic Controller Trainees’, Report to the Navy Personnel Research Development Center, San Diego, CA.

    Google Scholar 

  • Spirtes, P., Glymour, C., and Scheines, R.: 1990, ‘Causality from Probability’, in: J. Tiles et al. (Eds.), Evolving Knowledge in Natural Science and Artificial Intelligence, Pitman, London, pp. 181–199.

    Google Scholar 

  • Spirtes, P., Glymour, C., and Scheines, R.: 1991, ‘An Algorithm for Fast Recovery of Sparse Causal Graphs’, Social Science Computer Review, 9, 62–72.

    Article  Google Scholar 

  • Spirtes, P., Glymour, C., and Scheines, R.: 1993, Causation, Prediction and Search, Springer, New York.

    Book  Google Scholar 

  • Suppes, P.: 1970, A Probabilistic Theory of Causality, North-Holland, Amsterdam.

    Google Scholar 

  • Timberlake, M. and Williams, K.: 1984, ‘Dependence, Political Exclusion, and Government Repression: Some Cross-National Evidence’, American Sociological Review, 49, 141–146.

    Article  Google Scholar 

  • Verma, T. and Pearl, J.: 1990b, ‘Equivalence and Synthesis of Causal Models’, Proc. Sixth Conference on Uncertainty in AI, Association for Uncertainty in AI, Inc., Mountain View, CA, pp. 220–227.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Glymour, C., Spirtes, P., Scheines, R. (1994). In Place of Regression. In: Humphreys, P. (eds) Patrick Suppes: Scientific Philosopher. Synthese Library, vol 234. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-0774-7_13

Download citation

  • DOI: https://doi.org/10.1007/978-94-011-0774-7_13

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-010-4331-1

  • Online ISBN: 978-94-011-0774-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics