Measuring Achievement with Latent Structure Models

McArthur, David L.

doi:10.1007/978-94-009-3257-9_6

David L. McArthur PhD⁴

Part of the book series: Evaluation in Education and Human Services ((EEHS,volume 16))

128 Accesses

Abstract

The basic assumption in latent class models designed to measure achievement is that a student can be described as knowing or not knowing the answer to a test item, and that inferences about the student’s ability level should take this notion into account. The goals of a test might be to determine how many of the items an examinee knows, which items are known or which are not known, or what proportion of items among a domain of items are known. The problem is that examinees might give the correct response when they do not know, or they might carelessly give the wrong response when they actually do know. Latent class models can be used in an attempt to measure and correct the effects of these errors when addressing a particular measurement problem. Even if some other model is ultimately preferred, such as a latent trait model, latent class models are potentially useful.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baker, F.B., & Hubert, E.J. (1977). Inference procedures for ordering theory. Journal of Educational Statistics, 2, 217–233.
Article Google Scholar
Barlow, R., Bartholomew, D., Bremner, J., & Brunk, H. (1972). Statistical inference under order restrictions. New York: Wiley.
Google Scholar
Berk, R. (1980). Criterion-referenced measurement. (1980) Baltimore: The Johns Hopkins University Press.
Google Scholar
Bliss, L.B. (1980). A test of Lord’s assumption regarding examinee guessing behavior on multiple-choice tests using elementary school students. Journal of Educational Measurement, 17, 147–153.
Article Google Scholar
Bowman, K., Hutcheson, K., Odum, E., & Shenton, L. (1971). Comments on the distribution of indices of diversity. In G. Patil, E. Pielou, and W. Waters (Eds.) International Symposium on Statistical Ecology, Vol. 3. University Park: Pennsylvania State Press.
Google Scholar
Chacko, V.J. (1966). Modified chi-square test for ordered alternatives. Sankhya, Ser. B, 28, 185–190.
Google Scholar
Cliff, N. (1977). A theory of consistency of ordering generalizable to tailored testing. Psychometrika, 42, 375–399.
Article Google Scholar
Coombs, C.H., Milholland, J.E., & Womer, F.B. (1956). The assessment of partial information. Educational and Psychological Measurement, 16, 13–27.
Google Scholar
Cross, L.H., & Frary, R.B. (1977). An empirical test of Lord’s theoretical results regarding formula-scoring of multiple-choice tests. Journal of Educational Measurement, 14, 313–321.
Article Google Scholar
Dahiya, R.C. (1971). On the Pearson chi-squared goodness-of-fit test statistic Biometrika, 58, 685–686.
Google Scholar
Dayton, C M., & Macready, G.B. (1976). A probabilistic model for validation of behavioral hierarchies. Psychometrika, 41, 189–204.
Article Google Scholar
Dayton, CM., & Macready, G.B. (1980). A scaling model with response errors and intrinsically unscalable respondents. Psychometrika, 45, 343–356.
Article Google Scholar
Emrick J.A. (1971). An evaluation model for mastery testing. Journal of Educational Measurement, 8, 321–326.
Article Google Scholar
Frary, R.B. (1969). Reliability of multiple-choice test scores is not the proportion of variance which is true variance. Educational and Psychological Measurement, 29, 359–365.
Article Google Scholar
Goodman, L.A. (1979). On the estimation of parameters in latent structure analysis. Psychometrika, 44, 123–128.
Article Google Scholar
Hambleton, R.K., Swaminathan, H., Algina, J., & Coulson, D.B. (1978a). Criterion-referenced testing and measurement: A review of technical issues and developments. Review of Educational Research, 48, 1–48.
Google Scholar
Hambleton, R.K., Swaminathan, H., Cook, L.L., Eignor, D.R., & Gifford, J.A. (1978b). Developments in latent trait theory: Models, technical issues, and application. Review of Educational Research, 48, 467–510.
Google Scholar
Harnisch, D.L., & Linn, R.L. (1981). Analysis of item response patterns: Questionable test data and dissimilar curriculum practices. Journal of Educational Measurement, 18, 133–146.
Article Google Scholar
Harris, C.W., Houang, R.T., Pearlman, A.P., & Barnett, B. (1980). Final Report submitted to the National Institute of Education. Grant No. NIE-G-78-0085, Project No. 8-0244.
Google Scholar
Harris, C.W., & Pearlman, A. (1978). An index for a domain of completion or short answer items. Journal of Educational Statistics, 3, 285–304.
Article Google Scholar
Hartke, A.R. (1978). The use of latent partition analysis to identify homogeneity of an item population. Journal of Educational Measurement, 15, 43–47.
Article Google Scholar
Huynh, H. (1976a). On the reliability of decisions in domain-referenced testing. Journal of Educational Measurement, 13, 253–264.
Article Google Scholar
Huynh, H. (1976b). Statistical consideration of mastery scores. Psychometrika, 41, 65–78.
Article Google Scholar
Kale, B.K. (1962). On the solution of likelihood equations by iteration processes. The multiparametric case. Biometrika, 49 479–486.
Google Scholar
Keats, J.A. (1951). A statistical theory of objective test scores. Melbourne: Australian Council for Educational Research.
Google Scholar
Keats, J.A. (1964). Some generalizations of a theoretical distribution of mental test scores. Psychometrika, 29, 215–231.
Article Google Scholar
Knapp, T.R. (1977). The reliability of a dichotomous test item: A correlationless approach. Journal of Educational Measurement, 14, 237–252.
Article Google Scholar
Lord, F.M. (1965). A strong true-score theory, with applications. Psychometrika, 30, 239–270.
Article Google Scholar
Lord, F.M. (1980). Applications of item response theory to practical testing problems. Hillsdale, New Jersey: Erlbaum.
Google Scholar
Macready, G.B., & Dayton, C.M. (1977). The use of probabilistic models in the assessment of mastery. Journal of Educational statistics, 2, 99–120.
Article Google Scholar
McDonald, R.P. (1981). The dimensionality of tests. British Journal of Mathematical and Statistical Psychology, 34, 100–117.
Article Google Scholar
Messick, S. (1975). The standard problem: Meaning and values in measurement and evaluation. American Psychologist, 30, 955–966.
Article Google Scholar
Mislevy, R.J., & Bock R.D. (1982). Biweight estimates of latent ability. Educational and Psychological Measurement, 42, 725–737.
Article Google Scholar
Molenaar, I.W. (1981). On Wilcox’s latent structure model for guessing. British Journal of Mathematical and Statistical Psychology, 34, 79–89.
Article Google Scholar
Robertson, T. (1978). Testing for and against an order restriction on multinomial parameters. Journal of the American Statistical Association, 73, 197–202.
Article Google Scholar
Robertson, T., & Wright, F.T. (1981). Likelihood ratio tests for and against a stochastic ordering between multinomial populations. Annals of Statistics, 9, 1248–1257.
Article Google Scholar
Sathe, Y.S., Pradhan, M., & Shah, S.P. (1980). Inequalities for the probability of the occurrence of at least m out of n events. Journal of Applied Probability, 17, 1127–1132.
Article Google Scholar
Simpson, E. (1949). Measurement of diversity. Nature, 163, 688.
Article Google Scholar
Smith P.J., Rae, D.S., Manderscheid, R., & Silberg, S. (1979). Exact and approximate distributions of the chi-square statistic for equiprobability. Communications in Statistics Simulation and Computation, 88, 131–149.
Google Scholar
van den Brink, W.P., & Koele, P. (1980). Item sampling, guessing and decision-making in achievement testing. British Journal of Mathematical and Statistical Psychology, 33, 104–108.
Article Google Scholar
van der Linden, W. (1981). Estimating the parameters of Emrick’s mastery testing model. Applied Psychological Measurement. 5, 517–530.
Article Google Scholar
Wainer, H., & Wright, B.D. (1980). Robust estimation of ability in the Rasch model. Psychometrika, 45, 373–391.
Article Google Scholar
Wilcox, R.R. (1980a). An approach to measuring the achievement or proficiency of an examinee. Applied Psychological Measurement, 4, 241–251.
Article Google Scholar
Wilcox, R.R. (1980b). Determining the length of a criterion-referenced test. Applied Psychological Measurement, 4, 425–446.
Article Google Scholar
Wilcox, R.R. (1980c). Estimating the likelihood of false-positive and false-negative decisions in mastery testing: An empirical Bayes approach. Journal of Educational Statistics, 2, 289–307.
Article Google Scholar
Wilcox, R.R. (1980d). Some results and comments on using latent structure models to measure achievement. Educational and Psychological Measurement, 40, 645–658.
Article Google Scholar
Wilcox, R.R. (1981a). A review of the beta-binomial model and its extensions. Journal of Educational Statistics, 6, 3–32.
Article Google Scholar
Wilcox, R.R. (1981b). Solving measurement problems with an answer-until-correct scoring procedure. Applied Psychological Measurement, 5, 399–414.
Article Google Scholar
Wilcox, R.R. (1982a). Approaches to measuring achievement with an emphasis on latent structure models. Technical Report. Center for the Study of Evaluation, University of California, Los Angeles.
Google Scholar
Wilcox, R.R. (1982b). Bounds of the k out of n reliability of a test, and an exact test for hierarchically related items. Applied Psychological Measurement, 6, 327–336.
Article Google Scholar
Wilcox, R.R. (1982c). How do examinees behave when taking multiple-choice tests. Applied Psychological Measurement, 7, 239–240.
Article Google Scholar
Wilcox, R.R. (1982d). On a closed sequential procedure for categorical data, and tests for equiprobable cells. British Journal of Mathematical and Statistical Psychology, 35, 193–207.
Article Google Scholar
Wilcox, R.R. (1982e). Some empirical and theoretical results on an answer-until-correct scoring procedure. British Journal of Mathematical and Statistical Psychology, 35, 57–70.
Article Google Scholar
Wilcox, R.R. (1982f). Some new results on an answer-until-correct scoring procedure. Journal of Educational Measurement, 19, 67–74.
Article Google Scholar
Wilcox, R.R. (1982g). Using results on k out of n system reliability to study and characterize tests. Educational and Psychological Measurement, 42, 153–165.
Article Google Scholar
Zehna, P.W. (1966). Invariance of maximum likelihood estimation. Annals of Mathematical Statistics, 37, 744.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center for Student Testing, Evaluation and Standards, Graduate School of Education, University of California Los Angeles, Los Angeles, CA, 90024, USA
David L. McArthur PhD

Authors

David L. McArthur PhD
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Student Testing, Evaluation and Standards, Graduate School of Education, University of California Los Angeles, Los Angeles, CA, 90024, USA
David L. McArthur PhD

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

McArthur, D.L. (1987). Measuring Achievement with Latent Structure Models. In: McArthur, D.L. (eds) Alternative Approaches to the Assessment of Achievement. Evaluation in Education and Human Services, vol 16. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3257-9_6

Download citation

DOI: https://doi.org/10.1007/978-94-009-3257-9_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-7961-7
Online ISBN: 978-94-009-3257-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics