Validity, Reliability, and Acuity of Self-Assessment in Educational Testing

  • Conference paper
Item Banking: Interactive Testing and Self-Assessment

Part of the book series: NATO ASI Series (NATO ASI F, volume 112)

Abstract

When teachers use confidence marking, they should be aware that both the estimation and the expression of confidence are influenced by a series of factors. Some of these have been studied in detail, such as the general human capacity to estimate one’s own knowledge (that is, how sensitive, reliable, and valid people can be in appraising their own uncertainty).

This paper describes how some of these factors have been studied, the results obtained, and their implications for designing test instructions, proper scoring rules, and indices of the quality of self-assessment.

Copyright information

© 1993 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leclercq, D. (1993). Validity, Reliability, and Acuity of Self-Assessment in Educational Testing. In: Leclercq, D.A., Bruno, J.E. (eds) Item Banking: Interactive Testing and Self-Assessment. NATO ASI Series, vol 112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-58033-8_11

  • DOI: https://doi.org/10.1007/978-3-642-58033-8_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-63444-4

  • Online ISBN: 978-3-642-58033-8

  • eBook Packages: Springer Book Archive
