Abstract
A focused information criterion is developed to estimate undirected graphical models where for each node in the graph a generalized linear model is put forward conditioned upon the other nodes in the graph. The proposed method selects a graph with a small estimated mean squared error for a user-specified focus, which is a function of the parameters in the generalized linear models, by selecting an appropriate model at each node. For situations where the number of nodes is large in comparison with the number of cases, the procedure performs penalized estimation with quadratic approximations to several popular penalties. To show the procedure’s applicability and usefulness we have applied it to two datasets involving voting behavior of U.S. senators and to a clinical dataset on psychopathology.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Banerjee, O., El Ghaoui, L., & d’Aspremont, A. (2008). Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data. Journal of Machine Learning Research, 9, 485–516.
Borsboom, D., Cramer, A. O. J., Schmittmann, V. D., Epskamp, S., & Waldorp, L. J. (2011). The small world of psychopathology. PLoS ONE, 6(11), e27407.
Claeskens, G., & Hjort, N. (2003). The focused information criterion. Journal of the American Statistical Association, 98, 900–916. With discussion and a rejoinder by the authors.
Claeskens, G., & Hjort, N. (2008). Minimising average risk in regression models. Econometric Theory, 24, 493–527.
Claeskens, G., & Hjort, N. (2008). Model selection and model averaging. Cambridge: Cambridge University Press.
Cox, D. R., & Wermuth, N. (1996). Multivariate dependencies: Models, analysis and interpretation. London: Chapman & Hall.
Dempster, A. (1972). Covariance selection. Biometrics, 28(1), 157–175.
Fan, J., & Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association, 96(456), 1348–1360.
Fiocco, M., & van Zwet, W. (2004). Maximum likelihood estimation for the contact process. Institute of Mathematical Statistics Lecture Notes - Monograph Series, 45, 309–318.
Frank, I. E., & Friedman, J. H. (1993). A statistical view of some chemometrics regression tools. Technometrics, 35(2), 109–135.
Jalali, A., Ravikumar, P., Vasuki, V., & Sanghavi, S. (2010). On learning discrete graphical models using group-sparse regularization. In Proceedings of the 13th international conference on artificial intelligence and statistics, Chia Laguna Resort, Sardinia, Italy.
Lauritzen, S. (1996). Graphical models. New York: Oxford University Press.
Lee, J., & Hastie, T. (2012). Learning mixed graphical models. Preprint arXiv:1205.5012v3.
Loh, P. L., & Wainwright, M. J. (2013). Structure estimation for discrete graphical models: Generalized covariance matrices and their inverses. The Annals of Statistics, 41(6), 3022–3049.
McCullagh, P., & Nelder, J. (1989). Generalized linear models. London: Chapman & Hall.
Meinshausen, N., & Bühlmann, P. (2006). High-dimensional graphs and variable selection with the lasso. The Annals of Statistics, 34(3), 1436–1462.
Pircalabelu, E., Claeskens, G., Jahfari, S., & Waldorp, L. (2013). Focused information criterion for graphical models. Large p, small n considerations (Technical report). KBI, Faculty of Economics and Business, KU Leuven.
Pircalabelu, E., Claeskens, G., & Waldorp, L. (2012). Structure learning using a focused information criterion in graphical models (Technical report). KBI, Faculty of Economics and Business, KU Leuven.
Schmidt, M., Niculescu-Mizil, A., & Murphy, K. (2007). Learning graphical model structure using ℓ 1-regularization paths. In Proceedings of the 22nd national conference on artificial intelligence, Vancouver, Canada (Vol. 2, pp. 1278–1283).
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society (SERIES B), 58, 267–288.
van Borkulo, C. D., Kamphuis, J. H., & Waldorp, L. J. (2013). Predicting behaviour of networks of mental disorders: The contact process as a model for dynamics of psychopathology (Technical report). University of Amsterdam.
van der Aart, P. J. M., & Smeenk-Enserink, N. (1975). Correlations between distributions of hunting spiders (Lycosidae, Ctenidae) and environmental characteristics in a dune area. Netherlands Journal of Zoology, 25, 1–45.
Wainwright, M. J., & Jordan, M. I. (2008). Graphical models, exponential families, and variational inference. Foundations and Trends in Machine Learning, 1(1–2), 1–305.
Wainwright, M. J., Ravikumar, P., & Lafferty, J. D. (2007). High-dimensional graphical model selection using ℓ 1-regularized logistic regression. In B. Schölkopf, J. Platt, & T. Hoffman (Eds.), Advances in neural information processing systems, 20 (NIPS 2007), (Vol. 19, pp. 1465–1472). Cambridge: MIT Press.
Watson, D., Clark, L. A., & Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: The panas scales. Journal of Personality and Social Psychology, 54(6), 1063–1070.
Yang, E., Ravikumar, P. K., Allen, G. I., & Liu, Z. (2012). Graphical models via generalized linear models. In P. Bartlett, F. Pereira, C. Burges, L. Bottou, & K. Weinberger (Eds.), Advances in neural information processing systems, Curran Associates, Inc. (Vol. 25, pp. 1367–1375).
Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101(476), 1418–1429.
Acknowledgements
The authors wish to thank Prof. J.-H. Kamphuis for the PANAS data. The authors acknowledge the support of the Fund for Scientific Research Flanders, KU Leuven grant GOA/12/14 and of the IAP Research Network P7/06 of the Belgian Science Policy.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Claeskens, G., Pircalabelu, E., Waldorp, L. (2015). Constructing Graphical Models via the Focused Information Criterion. In: Antoniadis, A., Poggi, JM., Brossat, X. (eds) Modeling and Stochastic Learning for Forecasting in High Dimensions. Lecture Notes in Statistics(), vol 217. Springer, Cham. https://doi.org/10.1007/978-3-319-18732-7_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-18732-7_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18731-0
Online ISBN: 978-3-319-18732-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)