Bayesian Model Comparison of Structural Equation Models

  • Sik-Yum Lee
  • Xin-Yuan Song
Part of the Lecture Notes in Statistics book series (LNS, volume 192)


Structural equation modeling is a multivariate method for establishing meaningful models to investigate the relationships of some latent (causal) and manifest (control) variables with other variables. In the past quarter of a century, it has drawn a great deal of attention in psychometrics and sociometrics, both in terms of theoretical developments and practical applications (see Bentler and Wu, 2002; Bollen, 1989; Jöreskog and Sörbom, 1996; Lee, 2007). Although not to the extent that they have been used in behavioral, educational, and social sciences, structural equation models (SEMs) have been widely used in public health, biological, and medical research (see Bentler and Stein, 1992; Liu et al. 2005; Pugesek et al., 2003 and references therein). A review of the basic SEM with applicants to environmental epidemiology has been given by Sanchez et al. (2005).


Bayesian Information Criterion Path Sampling Deviance Information Criterion Manifest Variable Full Conditional Distribution 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



This research is fully supported by two grants (CUHK 404507 and 450607) from the Research Grant Council of the Hong Kong Special Administrative Region, and a direct grant from the Chinese University of Hong Kong (Project ID 2060278). The authors are indebted to Dr. John C. K. Lee, Faculty of Education, The Chinese University of Hong Kong, for providing the data in the application.


  1. Ansari, A. and Jedidi, K. (2000). Bayesian factor analysis for multilevel binary observations. Psychometrika, 65, 475–498MathSciNetCrossRefGoogle Scholar
  2. Austin, P. C. and Escobar, M. D. (2005). Bayesian modeling of missing data in clinical research. Computational Statistics and Data Analysis, 48, 821–836MathSciNetCrossRefGoogle Scholar
  3. Bentler, P. M. and Bonett, D. G. (1980). Significance tests and goodness of fit in the analysis of covariance structures. Psychological Bulletin, 88, 588–606CrossRefGoogle Scholar
  4. Bentler, P. M. and Stein, J. A. (1992). Structural equation models in medical research. Statistical Methods in Medical Research, 1, 159–181CrossRefGoogle Scholar
  5. Bentler, P. M. and Wu, E. J. C. (2002). EQS6 for Windows User Guide. Encino, CA: Multivariate Software, Inc.Google Scholar
  6. Bollen, K. A. (1989). Structural Equations with Latent Variables. New York: WileyMATHGoogle Scholar
  7. Chib, S. (1995). Marginal likelihood from the Gibbs output. Journal of the American Statistical Association, 90, 1313–1321MathSciNetMATHCrossRefGoogle Scholar
  8. Chib, S. and Jeliazkov, I. (2001). Marginal likelihood from the Metropolis-Hastings outputs. Journal of the American Statistical Association, 96, 270–281MathSciNetMATHCrossRefGoogle Scholar
  9. Congdon, P. (2005). Bayesian predictive model comparison via parallel sampling. Computational Statistics and Data Analysis, 48, 735–753MathSciNetMATHCrossRefGoogle Scholar
  10. Diciccio, T. J., Kass, R. E., Raftery, A. and Wasserman, L. (1997). Computing Bayes factors by combining simulation and asymptotic approximations. Journal of the American Statistical Association, 92, 903–915MathSciNetMATHCrossRefGoogle Scholar
  11. Dolan, C, van der Sluis, S. and Grasman, R. (2005). A note on normal theory power calculation in SEM with data missing completely at random. Structural Equation Modeling, 12, 245–62MathSciNetCrossRefGoogle Scholar
  12. Dunson, D. B. (2000). Bayesian latent variable models for clustered mixed outcomes. Journal of the Royal Statistical Society Series B, 62, 355–366MathSciNetCrossRefGoogle Scholar
  13. Dunson, D. B. (2005). Bayesian semiparametric isotonic regression for count data. Journal of the American Statistical Association, 100, 618–627MathSciNetMATHCrossRefGoogle Scholar
  14. Dunson, D. B. and Herring, A. H. (2005). Bayesian latent variable models for mixed discrete outcomes. Biostatistics, 6, 11–25MATHCrossRefGoogle Scholar
  15. Garcia-Donato, G. and Chan, M. H. (2005). Calibrating Bayes factor under prior predictive distributions. Statistica Sinica, 15, 359–380MathSciNetMATHGoogle Scholar
  16. Gelfand, A. E. and Dey D. K. (1994). Bayesian model choice: asymptotic and exact calculations. Journal of the Royal Statistical Society, Series B, 56, 501–514MathSciNetMATHGoogle Scholar
  17. Gelman, A. and Meng, X. L. (1998). Simulating normalizing constants: from importance sampling to bridge sampling to path sampling. Statistical Science, 13, 163–185MathSciNetMATHCrossRefGoogle Scholar
  18. Gelman, A., Meng, X. L. and Stern, H. (1996). Posterior predictive assessment of model fitness via realized discrepancies. Statistica Sinica, 6, 733–807MathSciNetMATHGoogle Scholar
  19. Geman, S. and Geman, D. (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6, 721–741MATHCrossRefGoogle Scholar
  20. Hastings, W. K. (1970). Monte Carlo sampling methods using Markov chains and their application.Biometrika, 57, 97–109MATHCrossRefGoogle Scholar
  21. Joreskog, K. G. and Sörbom, D. (1996). LISREL 8: Structural Equation Modeling with the SIM- PLIS Command Language. Scientific Software International: Hove and LondonGoogle Scholar
  22. Kass, R. E. and Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90, 773–795MATHCrossRefGoogle Scholar
  23. Kim, K. H. (2005). The relation among fit indices, power, and sample size in structural equation modeling. Structural Equation Modeling, 12, 368–390MathSciNetCrossRefGoogle Scholar
  24. Lee, S. Y. (2007). Structural Equations Modelling:ABayesian Approach. New York: WileyCrossRefGoogle Scholar
  25. Lee, S. Y. and Song, X. Y. (2003). Model comparison of a nonlinear structural equation model with fixed covariates. Psychometrika, 68, 27–47MathSciNetCrossRefGoogle Scholar
  26. Lee, S. Y. and Song, X. Y. (2004a). Evaluation of the Bayesian and maximum likelihood approaches in analyzing structural equation models with small sample sizes. Multivariate Behavioral Research, 39, 653–686CrossRefGoogle Scholar
  27. Lee, S. Y. and Song, X. Y. (2004b). Maximum likelihood analysis of a general latent variable model with hierarchically mixed data. Biometrics, 60, 624–636MathSciNetMATHCrossRefGoogle Scholar
  28. Lee, S. Y. and Xia, Y. M. (2006). Maximum likelihood methods in treating outliers and symmetrically heavy-tailed distributions for nonlinear structural equation models with missing data. Psychometrika, 71, 565–585MathSciNetCrossRefGoogle Scholar
  29. Levin, H. M. (1998). Accelerated schools: a decade of evolution. In A. Hargreavers et al. (Eds) International Handbook of Educational Change, Part Two(pp 809–810). New York: KluwerGoogle Scholar
  30. Liu, X., Wall, M. M. and Hodges, J. S. (2005). Generalized spatial structural equation models. Biostatistics, 6, 539–551MATHCrossRefGoogle Scholar
  31. Meng, X. L. and Wong, H. W. (1996). Simulating ratios of normalizing constants via a simple identity: a theoretical exploration. Statistica Sinica, 6, 831–860MathSciNetMATHGoogle Scholar
  32. Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H. and Teller, E. (1953). Equations of state calculations by fast computing machine. Journal of Chemical Physics, 21, 1087–1091CrossRefGoogle Scholar
  33. Ogata, Y. (1989). A Monte Carlo method for high dimensional integration. Numerische Mathe-matik, 55, 137–157MathSciNetMATHCrossRefGoogle Scholar
  34. Palomo, J., Dunson, D. B. and Bollen, K. (2007). Bayesian structural equation modeling. In S. Y. Lee (Ed) Handbook of Latent Variable and Related Models. Amsterdam: ElsevierGoogle Scholar
  35. Pugesek, B. H., Tomer, A. and von Eye, A. (2003). Structural Equation Modeling Applications in Ecological and Evolutionary Biology. New York: Cambridge University PressMATHCrossRefGoogle Scholar
  36. Raftery, A. E. (1993). Bayesian model selection in structural equation models. In K. A. Bollen and J. S. Long (Eds) Testing Structural Equation Models(pp 163–180). Thousand Oaks, CA: Sage PublicationsGoogle Scholar
  37. Raftery, A. E. (1996). Hypothesis testing and model selection. In W. R. Wilks, S. Richardson and D. J. Spieglhalter (Eds) Practical Markov Chain Monte Carlo(pp 163–188). London: Chapman and HallGoogle Scholar
  38. Raykov, T. and Marcoulides, G. A. (2006). On multilevel model reliability estimation from the perspective of structural equation modeling. Structural Equation Modeling, 13, 130–141MathSciNetCrossRefGoogle Scholar
  39. Richardson, S. and Green, P. J. (1997). On Bayesian analysis of mixture with an unknown number of components (with discussion). Journal of the Royal Statistical Society, Series B, 59, 731–792MathSciNetMATHCrossRefGoogle Scholar
  40. Sanchez, B. N., Budtz-Jorgenger, E., Ryan, L. M. and Hu, H. (2005). Structural equation models: a review with applications to environmental epidemiology. Journal of the American Statistical Association, 100, 1443–1455MathSciNetMATHCrossRefGoogle Scholar
  41. Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461–464MathSciNetMATHCrossRefGoogle Scholar
  42. Shi, J. Q. and Lee, S. Y. (2000). Latent variable models with mixed continuous and polytomous data. Journal of the Royal Statistical Society, Series B, 62, 77–87MathSciNetMATHCrossRefGoogle Scholar
  43. Song, X. Y. and Lee, S. Y. (2004). Bayesian analysis of two-level nonlinear structural equation models with continuous and polytomous data. British Journal of Mathematical and Statistical Psychology, 57, 29–52MathSciNetCrossRefGoogle Scholar
  44. Song, X. Y. and Lee, S. Y. (2005). A multivariate probit latent variable model for analyzing di-chotomous responses. Statistica Sinica, 15, 645–664MathSciNetMATHGoogle Scholar
  45. Song, X. Y. and Lee, S. Y. (2006a). Model comparison of generalized linear mixed models. Statistics in Medicine, 25, 1685–1698MathSciNetCrossRefGoogle Scholar
  46. Song, X. Y. and Lee, S. Y. (2006b). Bayesian analysis of latent variable models with non-ignorable missing outcomes from exponential family. Statistics in Medicine, 26, 681–693MathSciNetCrossRefGoogle Scholar
  47. Spiegelhalter, D. J., Best, N. G., Carlin, B. P. and van der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society, Series B, 64, 583–639MATHCrossRefGoogle Scholar
  48. Spiegelhalter, D. J., Thomas, A., Best, N. G. and Lunn, D. (2003). WinBugs User Manual. Version 1.4. Cambridge, England: MRC Biostatistics UnitGoogle Scholar
  49. Tanner, M. A. and Wong, W. H. (1987). The calculation of posterior distributions by data augmentation (with discussion). Journal of the American statistical Association, 82, 528–550MathSciNetMATHCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2008

Authors and Affiliations

  1. 1.Department of StatisticsChinese University of Hong KongShatinHong Kong

Personalised recommendations