Abstract
An ongoing problem in the field of educational and psychological testing is that all measurements are contaminated by error. Consequently, measurements in this field are inaccurate, so it is of prime importance to assess the extent to which the measurements are contaminated. Classical test theory was developed in order to deal with this problem of measurement error. A central role in this theory is played by the concept of reliability, i.e., the extent to which observed variation reflects true variation in measurements or, more technically, the ratio of true to observed score variance. For the estimation of reliability, parallel measurements are needed-measurements with identical true scores and error variances. The correlation between two parallel measurement instruments in a sample from the population of interest, r xx, estimates the population reliability.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Box, G.E.P., & Tiao, G.C. (1973). Bayesian inference in statistical analysis. Reading, MA: Addison-Wesley.
Brennan, R.L. (1983). Elements of generalizability theory. Iowa City, IA: American College Testing.
Cardinet, J., & Allal, L. (1983). Estimation of generalizability parameters. In L. J. Fyans, Jr. (ed.), New directions for testing and measurement: Generalizability theory. San Francisco: Jossey-Bass, Inc., pp. 17–48.
Cardinet, J., Tourneur, Y., & Allal, L. (1981 and 1982). Extension of generalizability theory and its applications in educational measurement. Journal of Educational Measurement 18:183–204; and 19:331–332.
Coates, T. J., & Thoresen, C. E. (1978). Using generalizability theory in behavior observation. Behavior Therapy 9:605–613.
Cone, J. D., & Foster, S. L. (1982). Direct observation in clinical psychology. In L. Kendall, & J.N. Butcher (eds.), Handbook of research methods in clinical psychology. New York: Wiley.
Cronbach, L.J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika 16:297–334.
Cronbach, L.J., Gleser, G. C, Nanda, H., & Rajaratnam, N. (1972). The dependability of behavioral measurements. New York: Wiley.
Cronbach, L. J., Rajaratnam, N., & Gleser, G.C. (1963). Theory of generalizability: A liberalization of reliability theory. British Journal of Statistical Psychology 16:137–163.
De Gruijter, D.N.M., & Van der Kamp, L.J.Th. (1984). Statistical models in psychological and educational testing. Lisse: Swets & Zeitlinger.
Efron, B. (1979). Bootstrap methods: Another look at the jackknife. Annals of Statistics 7:1–26.
Endler, N. S., & Magnusson, D. (eds.) (1976). Interactional psychology and personality. Washington, DC: Hemisphere.
Fyans, L. J., Jr. (1983). Multilevel analysis and cross-level inference of validity using generalizability theory. In L.J. Fyans, Jr. (ed.), New directions for testing and measurement: Generalizability theory. San Francisco: Jossey-Bass, Inc., pp. 49–66.
Gray, H.L., & Schucany, W. R. (1972). The generalized jackknife statistic. New York: Marcel Dekker, Inc.
Guttman, L. (1953). A special review of Harold Gulliksen’s “Theory of Mental Tests.” Psychometrika 18:123–130.
Jackson, P. H. (1973). The estimation of true score variance and error variance in the classical test theory model. Psychometrika 38:183–201.
Jarjoura, D. (1983). Best linear prediction of composite universe scores. Psychometrika 48:525–539.
Jarjoura, D., & Brennan, R. L. (1983). Multivariate generalizability models for tests developed from tables of specifications. In L.J. Fyans, Jr. (ed.), New directions for testing and measurement: Generalizability theory. San Francisco: Jossey-Bass, Inc., pp. 83–102.
Joe, G.W., & Woodward, J.A (1976). Some developments in multivariate generalizability. Psychometrika 41:205–217.
Kane, M.T., & Brennan, R. L. (1980). Agreement coefficients as indices of dependability for domain-referenced tests. Applied Psychological Measurement 4:105–126.
Kirk, R. E. (1982). Experimental design: Procedures for the behavioral sciences. Belmont, CA: Wadsworth.
Lindley, D.V. (1969). Bayesian least squares. Bulletin of the International Statistical Institute 43:152–153
Lindley, D.V. (1971). The estimation of many parameters. In V. P. Godambe & D. A. Sprott (eds.), Foundation of statistical inference. Toronto: Holt, Rinehart and Winston.
Magnusson, D., & Endler, N.S. (eds.) (1977). Personality at the crossroads: Current issues in interactional psychology. Hillsdale, NJ: Erlbaum.
Maxwell, A. E., & Pilliner, A. E. G. (1968), Deriving coefficients of reliability and agreement for ratings. British Journal of Mathematical and Statistical Psychology 21:105–116.
Rajaratnam, N. (1960). Reliability formulas for independent decision data when reliability data are matched. Psychometrika 25:261–271.
Rajaratnam, N., Cronbach, L. J., & Gleser, G.C. (1965). Generalizability of stratified parallel tests. Psychometrika 30:39–56.
Shavelson, R., & Webb, N. (1981). Generalizability theory: 1973–1980. British Journal of Mathematical and Statistical Psychology 34:133–166.
Shavelson, R., Webb, N.M., & Rowley, G.L. (1989). Generalizability theory. American Psychologist 44:922–932.
Sirotnik, K., & Wellington, R. (1977). Incidence sampling: An integrated theory for “matrix sampling.” Journal of Educational Measurement 14:343–399.
Srivastava, A. B. L., & Webster, H. (1967). An estimation of true scores in the case of items scored on a continuous scale. Psychometrika 32:327–338.
Van Heck, G.L. M. (1988). Modes and models in anxiety. Anxiety Research 1:199–214.
Webb, N.M., Shavelson, R. J., & Maddahian, E. (1983). Multivariate generalizability theory. In L.J. Fyans, Jr. (ed.), New directions for testing and measurement: Generalizability theory. San Francisco: Jossey-Bass, Inc., pp. 67–82.
Wiggins, J.S. (1973). Personality and prediction: Principles of personality assessment. Reading, MS: Addison-Wesley.
Winer, B.J. (1971). Statistical principles in experimental design. New York: McGraw-Hill.
Editor information
Rights and permissions
Copyright information
© 1991 Springer Science+Business Media New York
About this chapter
Cite this chapter
de Gruijter, D.N.M., van der Kamp, L.J.T. (1991). Generalizability Theory. In: Hambleton, R.K., Zaal, J.N. (eds) Advances in Educational and Psychological Testing: Theory and Applications. Evaluation in Education and Human Services Series, vol 28. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-2195-5_2
Download citation
DOI: https://doi.org/10.1007/978-94-009-2195-5_2
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-7484-1
Online ISBN: 978-94-009-2195-5
eBook Packages: Springer Book Archive