Abstract
Many social scientists are interested in inferring causal relations between “latent” variables that they cannot directly measure. One strategy commonly used to make such inferences is to use the values of variables that can be measured directly that are thought to be “indicators” of the latent variables of interest, together with a hypothesized causal graph relating the latent variables to their indicators. To use the data on the indicators to draw inferences about the causal relations between the latent variables (known as the structural model), it is necessary to hypothesize causal relations between the indicators and the latents that they are intended to indirectly measure, (known as the measurement model). The problem addressed in this paper is how to reliably infer the measurement model given measurements of the indicators, without knowing anything about the structural model, which is ultimately the question of interest. In this paper, we develop the Find- TwoFactorClusters (FTFC) algorithm, a search algorithm that, when compared to existing algorithms based on vanishing tetrad constraints, also works for a more complex class of measurement models, and does not assume that the model describing the causal relations between the latent variables is linear or acyclic.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Bartholomew, D.J., Steele, F., Moustaki, I., Galbraith, J.I.: The Analysis and Interpretation of Multivariate Data for Social Scientists. Texts in Statistical Science Series. Chapman & Hall/CRC (2002)
Silva, R., Glymour, C., Scheines, R., Spirtes, P.: Learning the structure of latent linear structure models. Journal of Machine Learning Research 7, 191–246 (2006)
Bollen, K.A.: Structural Equations with Latent Variables. Wiley-Interscience (1989)
Spirtes, P., Glymour, C., Scheines, R.: Causation, Prediction, and Search, 2nd edn. Adaptive Computation and Machine Learning. The MIT Press (2001)
Sullivant, S., Talaska, K., Draisma, J.: Trek Separation for Gaussian Graphical Models. Ann. Stat. 38(3), 1665–1685 (2010)
Spirtes, P.: Calculation of Entailed Rank Constraints in Partially Non-Linear and Cyclic Models. In: Proceedings of the Twenty-Ninth Conference Annual Conference on Uncertainty in Artificial Intelligence (UAI 2013), pp. 606–615 (2013)
Bollen, K., Ting, K.: Confirmatory tetrad analysis. Sociological Methodology 23, 147–175 (1993)
Reise, S., Morizot, J., Hays, R.: The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research 16, 19–31 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kummerfeld, E., Ramsey, J., Yang, R., Spirtes, P., Scheines, R. (2014). Causal Clustering for 2-Factor Measurement Models. In: Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2014. Lecture Notes in Computer Science(), vol 8725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44851-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-662-44851-9_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44850-2
Online ISBN: 978-3-662-44851-9
eBook Packages: Computer ScienceComputer Science (R0)