Historical Background for Multidimensional Item Response Theory (MIRT)

Reckase, Mark D.

doi:10.1007/978-0-387-89976-3_3

Mark D. Reckase²

Part of the book series: Statistics for Social and Behavioral Sciences ((SSBS))

5374 Accesses
1 Citations

Abstract

Multidimensional item response theory (MIRT) is the result of the convergence of ideas from a number of areas in psychology, education, test development, psychometrics, and statistics. Two general themes underlie the influence of these ideas on the development of MIRT. The first theme is that as our understanding of these areas increases, it becomes clear that things are more complicated than originally thought. The second theme is that the complexity can be represented by models or theories, but these theories and models are idealizations of reality. Because they are idealizations, they can likely be proven false if tested using a large number of observations. Nevertheless, the models can give useful approximations with many practical applications.

It is common in the development of scientific theories to collect data about a phenomenon and then to develop an idealized model of the phenomenon that is consistent with the data. The idealized model is usually presented as a mathematical equation. An example of this approach to theory development is reported in Asimov (1972, p. 158). He describes Galileo in a church observing the swing of lamps hanging from the ceiling by long chains. These lamps were swinging like pendulums, and Galileo is reported to have recorded the length of time it took to make one full swing using his own pulse rate. From these observations, he developed a mathematical formula that related the length of the chain to the length of time for each swing (period) of a pendulum.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The approximation formula for the period of an ideal simple pendulum is \(p = 2\pi \sqrt{\frac{\mathrm{length } } {g}}\), where g is the acceleration due to gravity. The approximation holds when the angle of swing is small.
2.
The original presentations of factor analytic and IRT models used a wide variety of symbols to represent the parameters of the models. To make comparisons of the models easier, the symbols have been changed to a common set. This means that the symbols used here may not match those in the original article.
3.
Whitely now publishes under the name Embretson.

References

Ackerman TA (1992) A didactic explanation of item bias, item impact, and item validity from a multidimensional perspective. Journal of Educational Measurement 29:67–91
Article Google Scholar
Asimov I (1972) Asimov’s new guide to science. Basic Books, New York
Google Scholar
Bejar II (1977) An application of the continuous response level model to personality measurement. Applied Psychological Measurement 1:509–521
Article Google Scholar
Binet A, Simon T (1913) A method of measuring the development of intelligence in children (Translated from the French by CH Town). Chicago Medical Book Company, Chicago
Google Scholar
Bock RD, Gibbons R, Muraki E (1988) Full information item factor analysis. Applied Psychological Measurement 12:261–280
Article Google Scholar
Burgess MA (1921) The measurement of silent reading. Russell Sage Foundation, New York
Google Scholar
Camilli G, Wang M, Fesq J (1995) The effects of dimensionality on equating the Law School Admissions Test. Journal of Educational Measurement 32:79–96
Article Google Scholar
Carroll JB (1945) The effect of difficulty and chance success on correlations between items or between tests. Psychometrika 10:1–19
Article Google Scholar
Carroll JB (1993) Human cognitive abilities: A survey of factor analytic studies. Cambridge University Press, New York
Book Google Scholar
Christoffersson A (1975) Factor analysis of dichotomized variables. Psychometrika 40:5–32
Article MATH MathSciNet Google Scholar
Davey TC, Oshima TC (1994) Linking multidimensional calibrations. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans
Google Scholar
Deese J (1958) The psychology of learning (2nd edition). McGraw-Hill, New York
Google Scholar
Ebbinghaus H (1885) Uber das Gedachtnis: Untersuchungen zur experimentalen Psychologie. Duncker and Humbolt, Leipzig
Google Scholar
Fischer GH, Molenaar IW (eds) (1995) Rasch models: Foundations, recent developments, and applications. Springer-Verlag, New York
MATH Google Scholar
Galton F (1870) Hereditary genius: An inquiry into its laws and consequences. D. Appleton, London
Google Scholar
Glas CAW (1992) A Rasch model with a multivariate distribution of ability. In Wilson M (ed) Objective measurement: Theory into practice volume 1. Ablex, Norwood, NJ
Google Scholar
Glas CAW, Vos HJ (2000) Adaptive mastery testing using a multidimensional IRT model and Bayesian sequential decision theory (Research Report 00-06). University of Twente, Enschede, The Netherlands
Google Scholar
Gulliksen H (1950) Theory of mental tests. Wiley, New York
Google Scholar
Kelderman H (1994) Objective measurement with multidimensional polytomous latent trait models. In Wilson M (ed) Objective measurement: Theory into practice, Vol. 2. Ablex, Norwood NJ
Google Scholar
Kirisci L, Hsu T, Yu L (2001) Robustness of item parameter estimation programs to assumptions of unidimensionality and normality. Applied Psychological Measurement 25:146–162
Article MathSciNet Google Scholar
Lord FM (1980) Applications of item response theory to practical testing problems. Lawrence Erlbaum Associates, Hillsdale, NJ
Google Scholar
Lord FM, Novick MR (1968) Statistical theories of mental test scores. Addison-Wesley, Reading, MA
MATH Google Scholar
McCall WA (1922) How to measure in education. The Macmillan Company, New York
Google Scholar
McDonald RP (1985) Factor analysis and related methods. Lawrence Erlbaum Associates, Hillsdale, NJ
Google Scholar
McKinley RL, Reckase MD (1982) The use of the general Rasch model with multidimensional item response data (Research Report ONR 82-1). American College Testing, Iowa City, IA
Google Scholar
Miller TR, Hirsch TM (1992) Cluster analysis of angular data in applications of multidimensional item response theory. Applied Measurement in Education 5:193–211
Article Google Scholar
Millman J, Greene J (1989) The specification and development of tests of achievement and ability. In Linn RL (ed) Educational measurement (3rd edition). American Council on Education and Macmillan, New York
Google Scholar
Mulaik SA (1972) A mathematical investigation of some multidimensional Rasch models for psychological tests. Paper presented at the annual meeting of the Psychometric Society, Princeton, NJ
Google Scholar
Rasch G (1960) Probabilistic models for some intelligence and attainment tests. Danmarks Paedagogiske Institut, Copenhagen
Google Scholar
Rasch G (1962) On general laws and the meaning of measurement in psychology. Proceedings of the fourth Berkeley symposium on mathematical statistics and probability 4:321–334
Google Scholar
Reckase MD (1972) Development and application of a multivariate logistic latent trait model. Unpublished doctoral dissertation, Syracuse University, Syracuse, NY
Google Scholar
Reckase MD (1985) The difficulty of test items that measure more than one ability. Applied Psychological Measurement 9:401–412
Article Google Scholar
Reckase MD, Ackerman TA, Carlson JE (1988) Building a unidimensional test using multidimensional items. Journal of Educational Measurement 25:193–204
Article Google Scholar
Reckase MD, Hirsch TM (1991) Interpretation of number-correct scores when the true numbers of dimensions assessed by a test is greater than two. Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago
Google Scholar
Reckase MD, McKinley RL (1991) The discriminating power of items that measure more than one dimension. Applied Psychological Measurement 15:361–373
Article Google Scholar
Samejima F (1974) Normal ogive model on the continuous response level in the multidimensional space. Psychometrika 39:111–121
Article MATH MathSciNet Google Scholar
Stern W (1914) The psychological methods of testing intelligence. Warwick & York, Baltimore
Book Google Scholar
Sympson JB (1978) A model for testing with multidimensional items. In Weiss DJ (ed) Proceedings of the 1977 Computerized Adaptive Testing Conference, University of Minnesota, Minneapolis
Google Scholar
Thissen D, Wainer H (2001) Test scoring. Lawrence Erlbaum Associates, Mahwah, NJ
Google Scholar
Thorndike EL (1904) An introduction to the theory of mental and social measurements. The Science Press, New York
Google Scholar
van der Linden WJ (2005) Linear models of optimal test design. Springer, New York
Google Scholar
van der Linden WJ, Hambleton RK (eds.) (1997) Handbook of modern item response theory. Springer, New York
MATH Google Scholar
Vernon P (1950) The structure of human abilities. Methuen, London
Google Scholar
Wang W, Wilson M (2005) Exploring local item dependence using a random-effects facet model. Applied Psychological Measurement 29:296–318
Article MathSciNet Google Scholar
Whipple GM (1910) Manual of mental and physical tests. Warwick & York, Baltimore
Book Google Scholar
Whitely SE (1980b) Multicomponent latent trait models for ability tests. Psychometrika 45: 479–494
Article MATH Google Scholar
Wilson M, Adams R (1995) Rasch models for item bundles. Psychometrika 60:181–198
Article MATH Google Scholar
Yoakum CS, Yerkes RM (1920) Army mental tests. Henry Holt, New York
Book Google Scholar
Harman HH (1976) Modern factor analysis (3rd edition revised). The University of Chicago Press, Chicago
Google Scholar
Muthén B (1978) Contributions to factor analysis of dichotomous variables. Psychometrika 43:551–560
Article MATH MathSciNet Google Scholar
Horst P (1965) Factor analysis of data matrices. Holt, Rinehart & Winston, New York
Google Scholar
McDonald RP (1967) Nonlinear factor analysis. Psychometric Monograph 15
Google Scholar
Rasch G (1960) Probabilistic models for some intelligence and attainment tests. Danmarks Paedagogiske Institut, Copenhagen
Google Scholar
Rijmen F, De Boeck P (2005) A relationship between a between-item multidimensional IRT model and the mixture Rasch model. Psychometrika 70:481–496
Article MathSciNet Google Scholar
Healy AF, McNamara DS (1996) Verbal learning and memory: Does the modal model still work? In Spence JT, Darley JM, Foss DJ (eds) Annual Review of Psychology 47:143–172
Google Scholar
Kallon AA (1916) Standards in silent reading. Boston Department of Educational Investigation and Measurement Bulletin No. 12, School Document 18. Boston
Google Scholar
Perie M, Grigg W, Donahue P (2005) The Nation’s Report Card: Reading 2005 (NCES 2006-451). U.S. Department of Education, National Center for Education Statistics, U.S. Government Printing Office, Washington, DC
Google Scholar
McDonald RP (1999) Test theory: A unified treatment. Lawrence Erlbaum Associates, Mahwah, NJ
Google Scholar

Download references

Author information

Authors and Affiliations

Counseling, Educational, Psychology, and Special Education Department, Michigan State University, 461 Erickson Hall, East Lansing, MI, 48824-1034, USA
Mark D. Reckase

Authors

Mark D. Reckase
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mark D. Reckase .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Reckase, M.D. (2009). Historical Background for Multidimensional Item Response Theory (MIRT). In: Multidimensional Item Response Theory. Statistics for Social and Behavioral Sciences. Springer, New York, NY. https://doi.org/10.1007/978-0-387-89976-3_3

Download citation

DOI: https://doi.org/10.1007/978-0-387-89976-3_3
Published: 22 May 2009
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-89975-6
Online ISBN: 978-0-387-89976-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics