Estimation of Item and Person Parameters

  • Mark D. Reckase
Part of the Statistics for Social and Behavioral Sciences book series (SSBS)


The MIRT models presented in this book are useful from a theoretical perspective because they provide a model for the interaction between persons and test items. The different kinds of models represent different theoretical perspectives. For example, the compensatory and partially compensatory models provide two different conceptions of how levels on hypothetical constructs combine when applied to items that require some level on the constructs to determine the correct response. Although the theoretical models are interesting in their own right, the practical applications of the models require a means of estimating the item and person parameters for the models. Without practical procedures for parameter estimation, the usefulness of the models is very limited.

This chapter describes the procedures that are currently used for estimating the item and person parameters for the models. These procedures are necessarily embedded within computer programs for carrying out the steps in the estimation methodology. It is difficult to separate the estimation procedures from the programs used to implement them. An excellent estimation methodology may perform poorly because it is improperly programmed or programmed in an inefficient way. A method with poorer theoretical properties may perform better overall because the programming of the method was done in a more creative way. Because of this close tie between estimation methodology and the computer programs used to implement them, they will be described together in the later sections of this chapter. Before getting into the details of estimation procedures, a general overview is provided. This overview is designed to give a conceptual framework for understanding the estimation procedures without getting into the technical details.

This chapter is reasonably up to date about programs and methods as of 2008. However, computer software becomes obsolete and computer systems change fairly rapidly. For that reason, this chapter does not go into the details of running specific programs. Those may change by the time this book is published. Rather, the basic estimation model is described for commonly used programs and differences in constraints used in estimation procedures are presented. These are the features of the estimation methods and programs that explain why different programs applied to the same item response data give different parameter estimates. Chapter 7 describes some methods for determining if the parameter estimates from different programs are simple transformations of each other.


Markov Chain Monte Carlo Test Item Item Score Item Parameter Quadrature Point 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Adams RJ, Wilson M, Wang W (1997) The multidimensional random coefficients multinomial logit model. Applied Psychological Measurement 21:1–24CrossRefGoogle Scholar
  2. Baker FB, Kim S-H (2004) Item response theory: Parameter estimation techniques (2nd edition, revised and expanded). Marcel Dekker, New YorkGoogle Scholar
  3. Bock RD, Gibbons R, Muraki E (1988) Full information item factor analysis. Applied Psychological Measurement 12:261–280CrossRefGoogle Scholar
  4. Bolt DM, Lall VF (2003) Estimation of compensatory and noncompensatory multidimensional item response models using Markov chain Monte Carlo. Applied Psychological Measurement 27:395–414CrossRefMathSciNetGoogle Scholar
  5. Chib S, Greenberg E (1995) Understanding the Metropolis-Hastings algorithm. The American Statistician 49:327–335CrossRefGoogle Scholar
  6. Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society Series B 39:1–38MATHMathSciNetGoogle Scholar
  7. Fraser C (1998) NOHARM: A Fortran program for fitting unidimensional and multidimensional normal ogive models in latent trait theory. The University of New England, Center for Behavioral Studies, Armidale, AustraliaGoogle Scholar
  8. Gamerman D, Lopes HF (2006) Markov chain Monte Carlo: Stochastic simulation for Bayesian inference (2nd edition). Chapman & Hall, Boca Raton, FLMATHGoogle Scholar
  9. Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence 6:721–741CrossRefMATHGoogle Scholar
  10. Gosz JK, Walker CM (2002) An empirical comparison of multidimensional item response theory data using TESTFACT and NOHARM. Paper presented at the annual meeting of the National Council for Measurement in Education, New OrleansGoogle Scholar
  11. Hastings WK (1970) Monte Carlo simulation methods using Markov chains and their applications. Biometrika 57:97–109CrossRefMATHGoogle Scholar
  12. Maydeu-Olivares A (2001) Multidimensional item response theory modeling of binary data: Large sample properties of NOHARM estimates. Journal of Educational and Behavioral Statistics 26:51–71CrossRefGoogle Scholar
  13. McDonald RP (1997) Normal-ogive multidimensional model. In WJ van der Linden and RK Hambleton (eds) Handbook of modern item response theory. Springer, New YorkGoogle Scholar
  14. Miller TR (1991) Empirical estimation of standard errors of compensatory MIRT model parameters obtained from the NOHARM estimation program (Research Report ONR 91-2). American College Testing Program, Iowa City, IAGoogle Scholar
  15. Moulton MH (2004) One use of a non-unidimensional scaling (NOUS) model: Transferring information across dimensions and subscales. Educational Data Systems, Morgan Hill, CAGoogle Scholar
  16. Muraki E, Englelhard G Jr (1985) Full-information item factor analysis: applications of EAP scores. Applied Psychological Measurement 9:417–430CrossRefGoogle Scholar
  17. Patz RJ, Junker BW (1999) A straightforward approach to Markov chain Monte Carlo methods for item response models. Journal of Educational and Behavioral Statistics 24:146–178Google Scholar
  18. Schilling S, Bock RD (2005) High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature. Psychometrika 70:533–555MathSciNetGoogle Scholar
  19. Stigler SM (1986) The history of statistics: The measurement of uncertainty before 1900. Belknap Press, Cambridge, MAMATHGoogle Scholar
  20. Stone CA, Yeh C-C (2006) Assessing the dimensionality and factor structure of multiple-choice exams: An empirical comparison of methods using the Multistate Bar Examination. Educational and Psychological Measurement 66:193–214CrossRefMathSciNetGoogle Scholar
  21. Volodin N, Adams RJ (2002) The estimation of polytomous item response models with many dimensions. Unpublished report, University of Melbourne, Victoria, AustraliaGoogle Scholar
  22. Wu ML, Adams RJ, Wilson MR (1997) ConQuest: Generalized item response modeling software. ACER, Victoria, AustraliaGoogle Scholar
  23. Yao L (2003) BMIRT: Bayesian multivariate item response theory. CTB/McGraw-Hill, Monterey, CAGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  1. 1.Counseling, Educational, Psychology, and Special Education DepartmentMichigan State UniversityEast LansingUSA

Personalised recommendations