Abstract
The second generation of a parallel algorithm for generalized latent variable models, including MIRT models and extensions, on the basis of the general diagnostic model (GDM) is presented. This new development further improves the performance of the parallel-E parallel-M algorithm presented in an earlier report by means of additional computational improvements that produce even larger gains in performance. The additional gain achieved by this second-generation parallel algorithm reaches factor 20 for several of the examples reported with a sixfold gain based on the first generation. The estimation of a multidimensional IRT model for large-scale data may show a larger reduction in runtime compared to a multiple-group model which has a structure that is more conducive to parallel processing of the E-step. Multiple population models can be arranged such that the parallelism directly exploits the ability to estimate multiple latent variable distributions separately in independent threads of the algorithm.
The original version of this chapter was revised. An erratum to this chapter can be found at https://doi.org/10.1007/978-3-319-56294-0_37
This work was partially completed while the author was at the Educational Testing Service.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
G.M. Amdahl, Validity of the single processor approach to achieving large-scale computing capabilities. AFIPS Conf. Proc. 30, 483–485. doi:10.1145/1465482.1465560
L. Cai, Metropolis–Hastings Robbins–Monro algorithm for confirmatory item factor analysis. J. Educ. Behav. Stat. 35, 307–335 (2010)
L. Cai, flexMIRT: A Numerical Engine for Flexible Multilevel Multidimensional Item Analysis and Test Scoring (Version 2.0) [Computer software] (Vector Psychometric Group, Chapel Hill, NC, 2013)
H. Cui, A. Tumanov, J. Wei, L. Xu, W. Dai, J. Haber-Kucharsky, E. Xing, in Proceedings of the ACM Symposium on Cloud Computing. Exploiting iterativeness for parallel ML computations (ACM, New York, NY, 2014), pp. 1–14
H. Cui, X. Wei, M. Dai, Parallel implementation of expectation-maximization for fast convergence (2010). Retrieved from http://users.ece.cmu.edu/~hengganc/archive/report/final.pdf
A.S. Das, M. Datar, A. Garg, S. Rajaram, in Proceedings of the 16th International Conference on World Wide Web, WWW 07. Google news personalization: scalable online collaborative filtering (ACM, New York, NY, 2007), pp. 271–280. doi:10.1145/1242572.1242610
A.P. Dempster, N.M. Laird, D.B. Rubin, Maximum likelihood estimation from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B 39, 1–38 (1977)
G. Gan, X. Wang, J. Manzano, G.R. Gao, in Evolving OpenMP in an Age of Extreme Parallelism: 5th International Workshop on OpenMP, IWOMP 2009 Dresden, Germany, June 3–5, 2009, ed. by M. S. Muller, B. R. de Supinski, B. M. Chapman. Tile reduction: the first step towards tile aware parallelization in OpenMP (Springer, Berlin, Heidelberg, 2009). doi:10.1007/978-3-642-02303-3_12
R.D. Gibbons, D. Hedeker, Full-information item bi-factor analysis. Psychometrika, 57, 423–436 (1992)
R. Henson, J. Templin, J. Willse, Defining a family of cognitive diagnosis models using log linear models with latent variables. Psychometrika, 74, 191–210 (2009)
P.E. Lopez de Teruel, J.M. Garcia, M. Acacio, O. Canovas, P-EDR: an algorithm for parallel implementation of Parzen density estimation from uncertain observations, in Proceedings of 13th International Parallel Processing symposium and 10th Symposium on Parallel and Distributed Processing (IPPS/SPDP) (1999). http://ditec.um.es/~jmgarcia/papers/P-EDR.pdf
G. McLachlan, T. Krishnan, The EM Algorithm and Its Extensions (Wiley, New York, 1997)
F. Rijmen, M. Jeon, S. Rabe-Hesketh, M. von Davier, A third order item response theory model for modeling the effects of domains and subdomains in large-scale educational assessment surveys. J. Educ. Behav. Stat. 38, 32–60 (2014)
M. von Davier, A General Diagnostic Model Applied to Language Testing Data (Research Report No. RR-05-16) (Educational Testing Service, Princeton, NJ, 2005). doi:10.1002/j.2333-8504.2005.tb01993.x
M. von Davier, A general diagnostic model applied to language testing data. Br. J. Math. Stat. Psychol. 61, 287–307 (2008)
M. von Davier, The Log-Linear Cognitive Diagnostic Model (LCDM) as a Special Case of the General Diagnostic Model (GDM) (Research Report No. RR-14-40) (Educational Testing Service, Princeton, NJ, 2014). doi:10.1002/ets2.12043
M. von Davier, High-Performance Psychometrics: The Parallel-E Parallel-M Algorithm for Generalized Latent Variable Models. ETS Research Report ETS-RR-16-34 (2016)
M. von Davier, J. Rost, Polytomous mixed rasch models, in Rasch Models: Foundations, Recent Developments and Applications, ed. by G.H. Fischer, I.W. Molenaar (Springer, New York, 1995), pp. 371–379
M. von Davier, J. Rost, in Handbook of Item Response Theory, 2nd edn., ed. by W. van der Linden. Logistic mixture-distribution response models, vol 1 (CRC Press, Boca Raton, FL, 2016), pp. 393–406
M. von Davier, K. Yamamoto, Partially observed mixtures of IRT models: an extension of the generalized partial credit model. Appl. Psychol. Meas. 28(6), 389–406 (2004)
M. von Davier, X. Xu, C.H. Carstensen, Measuring growth in a longitudinal large scale assessment with a general latent variable model. Psychometrika 76, 318 (2011). doi:10.1007/s11336-011-9202-z
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
von Davier, M. (2017). New Results on an Improved Parallel EM Algorithm for Estimating Generalized Latent Variable Models. In: van der Ark, L.A., Wiberg, M., Culpepper, S.A., Douglas, J.A., Wang, WC. (eds) Quantitative Psychology. IMPS 2016. Springer Proceedings in Mathematics & Statistics, vol 196. Springer, Cham. https://doi.org/10.1007/978-3-319-56294-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-56294-0_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-56293-3
Online ISBN: 978-3-319-56294-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)