Reliability Estimation in Mokken’s Nonparametric Item Response Model

Chapter in Sociometric Research

Abstract

In this chapter we investigate and improve two methods of estimating reliability that were originally proposed by Mokken (1971). Based on these methods, a third way of estimating reliability is proposed. All three methods estimate the classical reliability coefficient, that is, the ratio of true score variance to observed score variance (see, for example, Lord and Novick, 1968), in the context of nonparametric item response theory (Mokken, 1971; Henning, 1976; Stokman and van Schuur, 1980; Mokken and Lewis, 1982). Apart from new theoretical developments, results from a Monte Carlo study are reported and discussed. In this study the three methods are compared with four ‘classical’ methods of estimating the reliability of psychological measures: the well-known lower bounds alpha (Cronbach, 1951) and lambda-2 (Guttman, 1945), and the lower bounds mu-2 and mu-3 (ten Berge and Zegers, 1978). Finally, some remarks are made concerning the use of the estimation methods.
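As a rough illustration of two of the classical lower bounds named above, the following sketch computes coefficient alpha and lambda-2 from a respondents-by-items score matrix using their standard textbook formulas. It does not reproduce the Mokken-based methods of the chapter, which are not given in this abstract. The function names and the small data set are assumptions made only for this example.

```python
from itertools import product


def covariance_matrix(scores):
    """Sample covariance matrix (n - 1 denominator) of the item scores.

    `scores` is a list of rows: one row per respondent, one column per item.
    """
    n = len(scores)
    k = len(scores[0])
    means = [sum(row[j] for row in scores) / n for j in range(k)]
    cov = [[0.0] * k for _ in range(k)]
    for i, j in product(range(k), repeat=2):
        cov[i][j] = sum(
            (row[i] - means[i]) * (row[j] - means[j]) for row in scores
        ) / (n - 1)
    return cov


def cronbach_alpha(scores):
    """Coefficient alpha: k/(k-1) * (1 - sum of item variances / total variance)."""
    cov = covariance_matrix(scores)
    k = len(cov)
    total_var = sum(sum(row) for row in cov)  # variance of the sum score
    item_var = sum(cov[i][i] for i in range(k))
    return k / (k - 1) * (1 - item_var / total_var)


def guttman_lambda2(scores):
    """Lambda-2: (sum of off-diagonal covariances
    + sqrt(k/(k-1) * sum of squared off-diagonal covariances)) / total variance."""
    cov = covariance_matrix(scores)
    k = len(cov)
    total_var = sum(sum(row) for row in cov)
    off_diag = [cov[i][j] for i in range(k) for j in range(k) if i != j]
    sum_off = sum(off_diag)
    sum_sq_off = sum(c * c for c in off_diag)
    return (sum_off + (k / (k - 1) * sum_sq_off) ** 0.5) / total_var


# Hypothetical dichotomous responses: 6 respondents, 3 items.
example_scores = [
    [0, 0, 0],
    [1, 0, 0],
    [1, 1, 0],
    [1, 1, 0],
    [1, 1, 1],
    [1, 1, 1],
]

alpha = cronbach_alpha(example_scores)
lambda2 = guttman_lambda2(example_scores)
```

On any data set lambda-2 is at least as large as alpha, which is one reason it is often preferred as a lower bound to reliability.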

References

  • BERGE, J. M. F. ten and ZEGERS, F. E., ‘A Series of Lower Bounds to the Reliability of a Test’, Psychometrika, 43 (1978) 575–9.
  • BIRNBAUM, A., Part v in LORD, F. M. and NOVICK, M. R., Statistical Theories of Mental Test Scores (Reading, Mass.: Addison-Wesley, 1968).
  • CRONBACH, L. J., ‘Coefficient Alpha and the Internal Structure of Tests’, Psychometrika, 16 (1951) 297–334.
  • FELDT, L. S., ‘The Approximate Sampling Distribution of Kuder-Richardson Reliability Coefficient Twenty’, Psychometrika, 30 (1965) 357–70.
  • FISCHER, G. H., Einführung in die Theorie psychologischer Tests (Bern: Huber, 1974).
  • GUTTMAN, L., ‘A Basis for Analyzing Test-Retest Reliability’, Psychometrika, 10 (1945) 255–82.
  • —— ‘The Basis for Scalogram Analysis’, in STOUFFER, S. A., GUTTMAN, L., SUCHMAN, E. A., LAZARSFELD, P. F., STAR, S. A. and CLAUSEN, J. A. (eds), Measurement and Prediction (Princeton, N.J.: Princeton University Press, 1950).
  • HAMBLETON, R. K. and COOK, L. L., ‘Latent Trait Models and Their Use in the Analysis of Educational Test Data’, Journal of Educational Measurement, 14 (1977) 75–96.
  • HENNING, H. J., ‘Die Technik der Mokken-Skalenanalyse’, Psychologische Beiträge, 18 (1976) 410–30.
  • JACKSON, P. H. and AGUNWAMBA, C. C., ‘Lower Bounds for the Reliability of the Total Score on a Test Composed of Non-homogeneous Items: I. Algebraic Lower Bounds’, Psychometrika, 42 (1977) 567–78.
  • JANSEN, P. G. W., ‘Homogenitätsmessung mit Hilfe des Koeffizienten H von Loevinger: Eine Kritische Diskussion’, Psychologische Beiträge, 24 (1982a) 96–105.
  • —— ‘De Onbruikbaarheid van Mokkenschaal-analyse’, Tijdschrift voor Onderwijsresearch, 7 (1982b) 11–24.
  • —— Rasch Analysis of Attitudinal Data (Den Haag: Rijks Psychologische Dienst, 1983) (dissertation).
  • —— ROSKAM, E. E. Ch. I. and WOLLENBERG, A. L. van den, ‘De Mokkenschaal gewogen’, Tijdschrift voor Onderwijsresearch, 7 (1982) 31–42.
  • KRISTOF, W., ‘The Statistical Theory of Stepped-up Reliability Coefficients When a Test has been Divided into Several Equivalent Parts’, Psychometrika, 28 (1963) 221–38.
  • LOEVINGER, J., ‘The Technique of Homogeneous Tests Compared with Some Aspects of “Scale Analysis” and Factor Analysis’, Psychological Bulletin, 45 (1948) 507–30.
  • LORD, F. M., ‘Practical Applications of Item Characteristic Curve Theory’, Journal of Educational Measurement, 14 (1977) 117–38.
  • —— Applications of Item Response Theory to Practical Testing Problems (Hillsdale, N.J.: Lawrence Erlbaum, 1980).
  • —— and NOVICK, M. R., Statistical Theories of Mental Test Scores (Reading, Mass.: Addison-Wesley, 1968).
  • LUMSDEN, J., ‘Test Theory’, Annual Review of Psychology, 27 (1976) 251–80.
  • MOKKEN, R. J., A Theory and Procedure of Scale Analysis (The Hague: Mouton, 1971).
  • —— and LEWIS, C., ‘A Nonparametric Approach to the Analysis of Dichotomous Item Responses’, Applied Psychological Measurement, 6 (1982) 417–30.
  • MOLENAAR, I. W., ‘Mokken Scaling Revisited’, Kwantitatieve Methoden, vol. 3, 8 (1982a) 145–64.
  • —— ‘Een Tweede Weging van de Mokken-Schaal’, Tijdschrift voor Onderwijsresearch, 7 (1982b) 172–81.
  • —— ‘De Beperkte Bruikbaarheid van Jansen’s Kritiek’, Tijdschrift voor Onderwijsresearch, 7 (1982c) 25–30.
  • —— and SIJTSMA, K., ‘Internal Consistency and Reliability in Mokken’s Nonparametric Item Response Model’, Tijdschrift voor Onderwijsresearch, 9 (1984) 257–68.
  • SEDERE, M. U. and FELDT, L. S., ‘The Sampling Distributions of the Kristof Reliability Coefficient, the Feldt Coefficient, and Guttman’s lambda-2’, Journal of Educational Measurement, 14 (1977) 53–62.
  • SIJTSMA, K., ‘Useful Nonparametric Scaling: A Reply to Jansen’, Psychologische Beiträge, 26 (1984) 423–37.
  • —— and MOLENAAR, I. W., ‘Reliability of Test Scores in Nonparametric Item Response Theory’, Psychometrika, (in press).
  • STOKMAN, F. N. and SCHUUR, W. H. van, ‘Basic Scaling’, Quality and Quantity, 14 (1980) 5–30.
  • WEISS, D. J. and DAVISON, M. L., ‘Test Theory and Methods’, Annual Review of Psychology, 32 (1981) 629–58.
  • WOOD, R., ‘Fitting the Rasch Model — A Heady Tale’, British Journal of Mathematical and Statistical Psychology, 31 (1978) 27–32.

Copyright information

© 1988 Willem E. Saris and Irmtraud N. Gallhofer

About this chapter

Cite this chapter

Sijtsma, K. (1988). Reliability Estimation in Mokken’s Nonparametric Item Response Model. In: Saris, W.E., Gallhofer, I.N. (eds) Sociometric Research. Palgrave Macmillan, London. https://doi.org/10.1007/978-1-349-19051-5_10
