Abstract
The basic assumption in latent class models designed to measure achievement is that a student can be described as knowing or not knowing the answer to a test item, and that inferences about the student’s ability level should take this notion into account. The goals of a test might be to determine how many of the items an examinee knows, which items are known or which are not known, or what proportion of items among a domain of items are known. The problem is that examinees might give the correct response when they do not know, or they might carelessly give the wrong response when they actually do know. Latent class models can be used in an attempt to measure and correct the effects of these errors when addressing a particular measurement problem. Even if some other model is ultimately preferred, such as a latent trait model, latent class models are potentially useful.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baker, F.B., & Hubert, E.J. (1977). Inference procedures for ordering theory. Journal of Educational Statistics, 2, 217–233.
Barlow, R., Bartholomew, D., Bremner, J., & Brunk, H. (1972). Statistical inference under order restrictions. New York: Wiley.
Berk, R. (1980). Criterion-referenced measurement. (1980) Baltimore: The Johns Hopkins University Press.
Bliss, L.B. (1980). A test of Lord’s assumption regarding examinee guessing behavior on multiple-choice tests using elementary school students. Journal of Educational Measurement, 17, 147–153.
Bowman, K., Hutcheson, K., Odum, E., & Shenton, L. (1971). Comments on the distribution of indices of diversity. In G. Patil, E. Pielou, and W. Waters (Eds.) International Symposium on Statistical Ecology, Vol. 3. University Park: Pennsylvania State Press.
Chacko, V.J. (1966). Modified chi-square test for ordered alternatives. Sankhya, Ser. B, 28, 185–190.
Cliff, N. (1977). A theory of consistency of ordering generalizable to tailored testing. Psychometrika, 42, 375–399.
Coombs, C.H., Milholland, J.E., & Womer, F.B. (1956). The assessment of partial information. Educational and Psychological Measurement, 16, 13–27.
Cross, L.H., & Frary, R.B. (1977). An empirical test of Lord’s theoretical results regarding formula-scoring of multiple-choice tests. Journal of Educational Measurement, 14, 313–321.
Dahiya, R.C. (1971). On the Pearson chi-squared goodness-of-fit test statistic Biometrika, 58, 685–686.
Dayton, C M., & Macready, G.B. (1976). A probabilistic model for validation of behavioral hierarchies. Psychometrika, 41, 189–204.
Dayton, CM., & Macready, G.B. (1980). A scaling model with response errors and intrinsically unscalable respondents. Psychometrika, 45, 343–356.
Emrick J.A. (1971). An evaluation model for mastery testing. Journal of Educational Measurement, 8, 321–326.
Frary, R.B. (1969). Reliability of multiple-choice test scores is not the proportion of variance which is true variance. Educational and Psychological Measurement, 29, 359–365.
Goodman, L.A. (1979). On the estimation of parameters in latent structure analysis. Psychometrika, 44, 123–128.
Hambleton, R.K., Swaminathan, H., Algina, J., & Coulson, D.B. (1978a). Criterion-referenced testing and measurement: A review of technical issues and developments. Review of Educational Research, 48, 1–48.
Hambleton, R.K., Swaminathan, H., Cook, L.L., Eignor, D.R., & Gifford, J.A. (1978b). Developments in latent trait theory: Models, technical issues, and application. Review of Educational Research, 48, 467–510.
Harnisch, D.L., & Linn, R.L. (1981). Analysis of item response patterns: Questionable test data and dissimilar curriculum practices. Journal of Educational Measurement, 18, 133–146.
Harris, C.W., Houang, R.T., Pearlman, A.P., & Barnett, B. (1980). Final Report submitted to the National Institute of Education. Grant No. NIE-G-78-0085, Project No. 8-0244.
Harris, C.W., & Pearlman, A. (1978). An index for a domain of completion or short answer items. Journal of Educational Statistics, 3, 285–304.
Hartke, A.R. (1978). The use of latent partition analysis to identify homogeneity of an item population. Journal of Educational Measurement, 15, 43–47.
Huynh, H. (1976a). On the reliability of decisions in domain-referenced testing. Journal of Educational Measurement, 13, 253–264.
Huynh, H. (1976b). Statistical consideration of mastery scores. Psychometrika, 41, 65–78.
Kale, B.K. (1962). On the solution of likelihood equations by iteration processes. The multiparametric case. Biometrika, 49 479–486.
Keats, J.A. (1951). A statistical theory of objective test scores. Melbourne: Australian Council for Educational Research.
Keats, J.A. (1964). Some generalizations of a theoretical distribution of mental test scores. Psychometrika, 29, 215–231.
Knapp, T.R. (1977). The reliability of a dichotomous test item: A correlationless approach. Journal of Educational Measurement, 14, 237–252.
Lord, F.M. (1965). A strong true-score theory, with applications. Psychometrika, 30, 239–270.
Lord, F.M. (1980). Applications of item response theory to practical testing problems. Hillsdale, New Jersey: Erlbaum.
Macready, G.B., & Dayton, C.M. (1977). The use of probabilistic models in the assessment of mastery. Journal of Educational statistics, 2, 99–120.
McDonald, R.P. (1981). The dimensionality of tests. British Journal of Mathematical and Statistical Psychology, 34, 100–117.
Messick, S. (1975). The standard problem: Meaning and values in measurement and evaluation. American Psychologist, 30, 955–966.
Mislevy, R.J., & Bock R.D. (1982). Biweight estimates of latent ability. Educational and Psychological Measurement, 42, 725–737.
Molenaar, I.W. (1981). On Wilcox’s latent structure model for guessing. British Journal of Mathematical and Statistical Psychology, 34, 79–89.
Robertson, T. (1978). Testing for and against an order restriction on multinomial parameters. Journal of the American Statistical Association, 73, 197–202.
Robertson, T., & Wright, F.T. (1981). Likelihood ratio tests for and against a stochastic ordering between multinomial populations. Annals of Statistics, 9, 1248–1257.
Sathe, Y.S., Pradhan, M., & Shah, S.P. (1980). Inequalities for the probability of the occurrence of at least m out of n events. Journal of Applied Probability, 17, 1127–1132.
Simpson, E. (1949). Measurement of diversity. Nature, 163, 688.
Smith P.J., Rae, D.S., Manderscheid, R., & Silberg, S. (1979). Exact and approximate distributions of the chi-square statistic for equiprobability. Communications in Statistics Simulation and Computation, 88, 131–149.
van den Brink, W.P., & Koele, P. (1980). Item sampling, guessing and decision-making in achievement testing. British Journal of Mathematical and Statistical Psychology, 33, 104–108.
van der Linden, W. (1981). Estimating the parameters of Emrick’s mastery testing model. Applied Psychological Measurement. 5, 517–530.
Wainer, H., & Wright, B.D. (1980). Robust estimation of ability in the Rasch model. Psychometrika, 45, 373–391.
Wilcox, R.R. (1980a). An approach to measuring the achievement or proficiency of an examinee. Applied Psychological Measurement, 4, 241–251.
Wilcox, R.R. (1980b). Determining the length of a criterion-referenced test. Applied Psychological Measurement, 4, 425–446.
Wilcox, R.R. (1980c). Estimating the likelihood of false-positive and false-negative decisions in mastery testing: An empirical Bayes approach. Journal of Educational Statistics, 2, 289–307.
Wilcox, R.R. (1980d). Some results and comments on using latent structure models to measure achievement. Educational and Psychological Measurement, 40, 645–658.
Wilcox, R.R. (1981a). A review of the beta-binomial model and its extensions. Journal of Educational Statistics, 6, 3–32.
Wilcox, R.R. (1981b). Solving measurement problems with an answer-until-correct scoring procedure. Applied Psychological Measurement, 5, 399–414.
Wilcox, R.R. (1982a). Approaches to measuring achievement with an emphasis on latent structure models. Technical Report. Center for the Study of Evaluation, University of California, Los Angeles.
Wilcox, R.R. (1982b). Bounds of the k out of n reliability of a test, and an exact test for hierarchically related items. Applied Psychological Measurement, 6, 327–336.
Wilcox, R.R. (1982c). How do examinees behave when taking multiple-choice tests. Applied Psychological Measurement, 7, 239–240.
Wilcox, R.R. (1982d). On a closed sequential procedure for categorical data, and tests for equiprobable cells. British Journal of Mathematical and Statistical Psychology, 35, 193–207.
Wilcox, R.R. (1982e). Some empirical and theoretical results on an answer-until-correct scoring procedure. British Journal of Mathematical and Statistical Psychology, 35, 57–70.
Wilcox, R.R. (1982f). Some new results on an answer-until-correct scoring procedure. Journal of Educational Measurement, 19, 67–74.
Wilcox, R.R. (1982g). Using results on k out of n system reliability to study and characterize tests. Educational and Psychological Measurement, 42, 153–165.
Zehna, P.W. (1966). Invariance of maximum likelihood estimation. Annals of Mathematical Statistics, 37, 744.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1987 Kluwer Academic Publishers
About this chapter
Cite this chapter
McArthur, D.L. (1987). Measuring Achievement with Latent Structure Models. In: McArthur, D.L. (eds) Alternative Approaches to the Assessment of Achievement. Evaluation in Education and Human Services, vol 16. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3257-9_6
Download citation
DOI: https://doi.org/10.1007/978-94-009-3257-9_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-7961-7
Online ISBN: 978-94-009-3257-9
eBook Packages: Springer Book Archive