Skip to main content

Historical Background for Multidimensional Item Response Theory (MIRT)

  • Chapter
  • First Online:
Multidimensional Item Response Theory

Part of the book series: Statistics for Social and Behavioral Sciences ((SSBS))

Abstract

Multidimensional item response theory (MIRT) is the result of the convergence of ideas from a number of areas in psychology, education, test development, psychometrics, and statistics. Two general themes underlie the influence of these ideas on the development of MIRT. The first theme is that as our understanding of these areas increases, it becomes clear that things are more complicated than originally thought. The second theme is that the complexity can be represented by models or theories, but these theories and models are idealizations of reality. Because they are idealizations, they can likely be proven false if tested using a large number of observations. Nevertheless, the models can give useful approximations with many practical applications.

It is common in the development of scientific theories to collect data about a phenomenon and then to develop an idealized model of the phenomenon that is consistent with the data. The idealized model is usually presented as a mathematical equation. An example of this approach to theory development is reported in Asimov (1972, p. 158). He describes Galileo in a church observing the swing of lamps hanging from the ceiling by long chains. These lamps were swinging like pendulums, and Galileo is reported to have recorded the length of time it took to make one full swing using his own pulse rate. From these observations, he developed a mathematical formula that related the length of the chain to the length of time for each swing (period) of a pendulum.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The approximation formula for the period of an ideal simple pendulum is \(p = 2\pi \sqrt{\frac{\mathrm{length } } {g}}\), where g is the acceleration due to gravity. The approximation holds when the angle of swing is small.

  2. 2.

    The original presentations of factor analytic and IRT models used a wide variety of symbols to represent the parameters of the models. To make comparisons of the models easier, the symbols have been changed to a common set. This means that the symbols used here may not match those in the original article.

  3. 3.

    Whitely now publishes under the name Embretson.

References

  • Ackerman TA (1992) A didactic explanation of item bias, item impact, and item validity from a multidimensional perspective. Journal of Educational Measurement 29:67–91

    Article  Google Scholar 

  • Asimov I (1972) Asimov’s new guide to science. Basic Books, New York

    Google Scholar 

  • Bejar II (1977) An application of the continuous response level model to personality measurement. Applied Psychological Measurement 1:509–521

    Article  Google Scholar 

  • Binet A, Simon T (1913) A method of measuring the development of intelligence in children (Translated from the French by CH Town). Chicago Medical Book Company, Chicago

    Google Scholar 

  • Bock RD, Gibbons R, Muraki E (1988) Full information item factor analysis. Applied Psychological Measurement 12:261–280

    Article  Google Scholar 

  • Burgess MA (1921) The measurement of silent reading. Russell Sage Foundation, New York

    Google Scholar 

  • Camilli G, Wang M, Fesq J (1995) The effects of dimensionality on equating the Law School Admissions Test. Journal of Educational Measurement 32:79–96

    Article  Google Scholar 

  • Carroll JB (1945) The effect of difficulty and chance success on correlations between items or between tests. Psychometrika 10:1–19

    Article  Google Scholar 

  • Carroll JB (1993) Human cognitive abilities: A survey of factor analytic studies. Cambridge University Press, New York

    Book  Google Scholar 

  • Christoffersson A (1975) Factor analysis of dichotomized variables. Psychometrika 40:5–32

    Article  MATH  MathSciNet  Google Scholar 

  • Davey TC, Oshima TC (1994) Linking multidimensional calibrations. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans

    Google Scholar 

  • Deese J (1958) The psychology of learning (2nd edition). McGraw-Hill, New York

    Google Scholar 

  • Ebbinghaus H (1885) Uber das Gedachtnis: Untersuchungen zur experimentalen Psychologie. Duncker and Humbolt, Leipzig

    Google Scholar 

  • Fischer GH, Molenaar IW (eds) (1995) Rasch models: Foundations, recent developments, and applications. Springer-Verlag, New York

    MATH  Google Scholar 

  • Galton F (1870) Hereditary genius: An inquiry into its laws and consequences. D. Appleton, London

    Google Scholar 

  • Glas CAW (1992) A Rasch model with a multivariate distribution of ability. In Wilson M (ed) Objective measurement: Theory into practice volume 1. Ablex, Norwood, NJ

    Google Scholar 

  • Glas CAW, Vos HJ (2000) Adaptive mastery testing using a multidimensional IRT model and Bayesian sequential decision theory (Research Report 00-06). University of Twente, Enschede, The Netherlands

    Google Scholar 

  • Gulliksen H (1950) Theory of mental tests. Wiley, New York

    Google Scholar 

  • Kelderman H (1994) Objective measurement with multidimensional polytomous latent trait models. In Wilson M (ed) Objective measurement: Theory into practice, Vol. 2. Ablex, Norwood NJ

    Google Scholar 

  • Kirisci L, Hsu T, Yu L (2001) Robustness of item parameter estimation programs to assumptions of unidimensionality and normality. Applied Psychological Measurement 25:146–162

    Article  MathSciNet  Google Scholar 

  • Lord FM (1980) Applications of item response theory to practical testing problems. Lawrence Erlbaum Associates, Hillsdale, NJ

    Google Scholar 

  • Lord FM, Novick MR (1968) Statistical theories of mental test scores. Addison-Wesley, Reading, MA

    MATH  Google Scholar 

  • McCall WA (1922) How to measure in education. The Macmillan Company, New York

    Google Scholar 

  • McDonald RP (1985) Factor analysis and related methods. Lawrence Erlbaum Associates, Hillsdale, NJ

    Google Scholar 

  • McKinley RL, Reckase MD (1982) The use of the general Rasch model with multidimensional item response data (Research Report ONR 82-1). American College Testing, Iowa City, IA

    Google Scholar 

  • Miller TR, Hirsch TM (1992) Cluster analysis of angular data in applications of multidimensional item response theory. Applied Measurement in Education 5:193–211

    Article  Google Scholar 

  • Millman J, Greene J (1989) The specification and development of tests of achievement and ability. In Linn RL (ed) Educational measurement (3rd edition). American Council on Education and Macmillan, New York

    Google Scholar 

  • Mulaik SA (1972) A mathematical investigation of some multidimensional Rasch models for psychological tests. Paper presented at the annual meeting of the Psychometric Society, Princeton, NJ

    Google Scholar 

  • Rasch G (1960) Probabilistic models for some intelligence and attainment tests. Danmarks Paedagogiske Institut, Copenhagen

    Google Scholar 

  • Rasch G (1962) On general laws and the meaning of measurement in psychology. Proceedings of the fourth Berkeley symposium on mathematical statistics and probability 4:321–334

    Google Scholar 

  • Reckase MD (1972) Development and application of a multivariate logistic latent trait model. Unpublished doctoral dissertation, Syracuse University, Syracuse, NY

    Google Scholar 

  • Reckase MD (1985) The difficulty of test items that measure more than one ability. Applied Psychological Measurement 9:401–412

    Article  Google Scholar 

  • Reckase MD, Ackerman TA, Carlson JE (1988) Building a unidimensional test using multidimensional items. Journal of Educational Measurement 25:193–204

    Article  Google Scholar 

  • Reckase MD, Hirsch TM (1991) Interpretation of number-correct scores when the true numbers of dimensions assessed by a test is greater than two. Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago

    Google Scholar 

  • Reckase MD, McKinley RL (1991) The discriminating power of items that measure more than one dimension. Applied Psychological Measurement 15:361–373

    Article  Google Scholar 

  • Samejima F (1974) Normal ogive model on the continuous response level in the multidimensional space. Psychometrika 39:111–121

    Article  MATH  MathSciNet  Google Scholar 

  • Stern W (1914) The psychological methods of testing intelligence. Warwick & York, Baltimore

    Book  Google Scholar 

  • Sympson JB (1978) A model for testing with multidimensional items. In Weiss DJ (ed) Proceedings of the 1977 Computerized Adaptive Testing Conference, University of Minnesota, Minneapolis

    Google Scholar 

  • Thissen D, Wainer H (2001) Test scoring. Lawrence Erlbaum Associates, Mahwah, NJ

    Google Scholar 

  • Thorndike EL (1904) An introduction to the theory of mental and social measurements. The Science Press, New York

    Google Scholar 

  • van der Linden WJ (2005) Linear models of optimal test design. Springer, New York

    Google Scholar 

  • van der Linden WJ, Hambleton RK (eds.) (1997) Handbook of modern item response theory. Springer, New York

    MATH  Google Scholar 

  • Vernon P (1950) The structure of human abilities. Methuen, London

    Google Scholar 

  • Wang W, Wilson M (2005) Exploring local item dependence using a random-effects facet model. Applied Psychological Measurement 29:296–318

    Article  MathSciNet  Google Scholar 

  • Whipple GM (1910) Manual of mental and physical tests. Warwick & York, Baltimore

    Book  Google Scholar 

  • Whitely SE (1980b) Multicomponent latent trait models for ability tests. Psychometrika 45: 479–494

    Article  MATH  Google Scholar 

  • Wilson M, Adams R (1995) Rasch models for item bundles. Psychometrika 60:181–198

    Article  MATH  Google Scholar 

  • Yoakum CS, Yerkes RM (1920) Army mental tests. Henry Holt, New York

    Book  Google Scholar 

  • Harman HH (1976) Modern factor analysis (3rd edition revised). The University of Chicago Press, Chicago

    Google Scholar 

  • Muthén B (1978) Contributions to factor analysis of dichotomous variables. Psychometrika 43:551–560

    Article  MATH  MathSciNet  Google Scholar 

  • Horst P (1965) Factor analysis of data matrices. Holt, Rinehart & Winston, New York

    Google Scholar 

  • McDonald RP (1967) Nonlinear factor analysis. Psychometric Monograph 15

    Google Scholar 

  • Rasch G (1960) Probabilistic models for some intelligence and attainment tests. Danmarks Paedagogiske Institut, Copenhagen

    Google Scholar 

  • Rijmen F, De Boeck P (2005) A relationship between a between-item multidimensional IRT model and the mixture Rasch model. Psychometrika 70:481–496

    Article  MathSciNet  Google Scholar 

  • Healy AF, McNamara DS (1996) Verbal learning and memory: Does the modal model still work? In Spence JT, Darley JM, Foss DJ (eds) Annual Review of Psychology 47:143–172

    Google Scholar 

  • Kallon AA (1916) Standards in silent reading. Boston Department of Educational Investigation and Measurement Bulletin No. 12, School Document 18. Boston

    Google Scholar 

  • Perie M, Grigg W, Donahue P (2005) The Nation’s Report Card: Reading 2005 (NCES 2006-451). U.S. Department of Education, National Center for Education Statistics, U.S. Government Printing Office, Washington, DC

    Google Scholar 

  • McDonald RP (1999) Test theory: A unified treatment. Lawrence Erlbaum Associates, Mahwah, NJ

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mark D. Reckase .

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Reckase, M.D. (2009). Historical Background for Multidimensional Item Response Theory (MIRT). In: Multidimensional Item Response Theory. Statistics for Social and Behavioral Sciences. Springer, New York, NY. https://doi.org/10.1007/978-0-387-89976-3_3

Download citation

Publish with us

Policies and ethics