Skip to main content

An Option-Based Partial Credit Item Response Model

  • Conference paper
Book cover Quantitative Psychology Research

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 89))

Abstract

Multiple-choice (MC) tests have been criticized for allowing guessing and the failure to credit partial knowledge, and alternative scoring methods and response formats (Ben-Simon et al., Appl Psychol Meas 21:65–88, 1997) have been proposed to address this problem. Modern test theory addresses these issues by using binary item response models (e.g., 3PL) with guessing parameters, or with polytomous IRT models. We propose an option-based partial credit IRT model and a new scoring rule based on a weighted Hamming distance between the option key and the option response vector. The test taker (TT)’s estimated ability is based on information from both correct options and distracters. These modifications reduce the TT’s ability to guess and credit the TT’s partial knowledge. The new model can be tailored to different formats, and some popular IRT models, such as the 2PL and Bock’s nominal model, are special cases of the proposed model. Markov Chain Monte Carlo (MCMC) analysis was used to estimate the model parameters and it provides satisfactory estimates of the model parameters. Simulation studies show that the weighted Hamming distance scores have the highest correlation with TTs’ true abilities, and their distribution is also less skewed than those of the other scores considered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Based on the items from “Practice Book for the Paper-based GRE revised General Test,” 26 % of the verbal items and 10 % of the quantitative items are of this type.

  2. 2.

    In the text completion items, it is probably more justified to use grouped number correct scoring than it is for MC items with multiple correct options, since the choice for each blank depends on the other choices.

  3. 3.

    The GRE revised General Test has such items for which TTs are asked to choose all the options that apply.

  4. 4.

    Please note that the model is by no means restricted only to the scoring rules listed in the table.

  5. 5.

    Interested readers may email Yuanchao Emily Bo (ybo@fordham.edu) for the R and WinBugs code.

References

  • Andersen EB (1977) Sufficient statistics and latent trait models. Psychometrika 42:69–81

    Article  MATH  MathSciNet  Google Scholar 

  • Andrich D (1988) Rasch models for measurement. Sage Publications, Beverly Hills

    Google Scholar 

  • Bechger TM, Maris G, Verstralen HHFM, Verhelst ND (2005) The Nedelsky model for multiple choice items. In: van der Ark LA, Croon MA, Sijtsma K (eds) New developments in categorical data analysis for the social and behavioral sciences. Erlbaum, Mahwah, pp 187–206

    Google Scholar 

  • Ben-Simon A, Budescu DV, Nevo B (1997) A comparative study of measures of partial knowledge in multiple-choice tests. Appl Psychol Meas 21:65–88

    Article  Google Scholar 

  • Bereby-Meyer Y, Meyer J, Budescu DV (2003) Decision making under internal uncertainty: the case of multiple-choice tests with different scoring rules. Acta Psychol 112:207–220

    Article  Google Scholar 

  • Birnbaum A (1968) Some latent trait models and their use in inferring an examinee’s ability. In: Lord FM, Novick MR (eds) Statistical theories of mental test scores. Addison-Wesley, Reading

    Google Scholar 

  • Bock RD (1972) Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika 37:29–51

    Article  MATH  MathSciNet  Google Scholar 

  • Budescu DV, Bar-Hillel M (1993) To guess or not to guess: a decision theoretic view of formula scoring. J Educ Meas 30:227–291

    Article  Google Scholar 

  • Budescu DV, Bo Y (in press) Analyzing test-taking behavior: decision theory meets psychometric theory. Psychometrika

    Google Scholar 

  • Coombs CH, Milholland JE, Womer FB (1956) The assessment of partial knowledge. Educ Psychol Meas 16:13–37

    Google Scholar 

  • R Development Core Team (2013) R: a language and environment for statistical computing [computer software]. R Foundation for Statistical Computing, Vienna. Retrieved from http://www.R-project.org/

  • Dressel PL, Schmidt J (1953) Some modifications of the multiple choice item. Educ Psychol Meas 13:574–595

    Article  Google Scholar 

  • Echternacht GJ (1976) Reliability and validity of option weighting schemes. Educ Psychol Meas 36:301–309

    Article  Google Scholar 

  • Frary RB (1989) Partial-credit scoring methods for multiple-choice tests. Appl Meas Educ 2:79–96

    Article  Google Scholar 

  • Gibbons JD, Olkin I, Sobel M (1977) Selecting and ordering populations: a new statistical methodology. Wiley, New York

    MATH  Google Scholar 

  • Gulliksen H (1950) Theory of mental tests. Wiley, New York

    Book  Google Scholar 

  • Haladyna TM, (1988) Empirically based polychromous scoring of multiple choice test items: A review. Paper presented at the annual meeting of the American Educational Research Association, New Orleans

    Google Scholar 

  • Hambleton RK, Roberts DM, Traub RE (1970) A comparison of the reliability and validity of two methods for assessing partial knowledge on a multiple-choice test. J Educ Meas 7:75–82

    Article  Google Scholar 

  • Hamming RW (1950) Error detecting and error correcting codes. Bell Syst Tech J 29:147–160

    Article  MathSciNet  Google Scholar 

  • Hansen R (1971) The influence of variables other than knowledge on probabilistic tests. J Educ Meas 8:9–14

    Article  Google Scholar 

  • Holzinger KJ (1924) On scoring multiple response tests. J Educ Psychol 15:445–447

    Article  Google Scholar 

  • Hutchinson TP (1982) Some theories of performance in multiple-choice tests, and their implications for variants of the task. Br J Math Stat Psychol 35:71–89

    Article  MathSciNet  Google Scholar 

  • Jacobs SS (1971) Correlates of unwarranted confidence in responses to objective test items. J Educ Meas 8:15–19

    Article  Google Scholar 

  • Jaradat D, Tollefson N (1988) The impact of alternative scoring procedures for multiple-choice items on test reliability, validity and grading. Educ Psychol Meas 48:627–635

    Article  Google Scholar 

  • Kahneman D, Tversky A (1979) Prospect theory: an analysis of decisions under risk. Econometrica 47:313–327

    Article  Google Scholar 

  • Lunn DJ, Thomas A, Best N, Spiegelhalter D (2000) WinBUGS – a Bayesian modeling framework: concepts, structure, and extensibility. Stat Comput 10:325–337

    Article  Google Scholar 

  • Masters GN (1982) A Rasch model for partial credit scoring. Psychometrika 47:149–174

    Article  MATH  Google Scholar 

  • Michael JC (1968) The reliability of a multiple choice examination under various test-taking instructions. J Educ Meas 5:307–314

    Article  Google Scholar 

  • Muraki E (1992) A generalized partial credit model: application of an EM algorithm. Appl Psychol Meas 16:159–176

    Article  Google Scholar 

  • Pugh RC, Brunza JJ (1975) Effects of a confidence weighted scoring system on measures of test reliability and validity. Educ Psychol Meas 35:73–78

    Article  Google Scholar 

  • Rippey RM (1970) A comparison of five different scoring functions for confidence tests. J Educ Meas 7:165–170

    Article  Google Scholar 

  • Ruch GM, Stoddard GD (1925) Comparative reliabilities of objective examinations. J Educ Psychol 16:89–103

    Article  Google Scholar 

  • Samejima F (1969) Estimation of ability using a response pattern of graded scores. Psychometrika Monograph, No. 18

    Google Scholar 

  • Samejima F (1972) A general model for free-response data. Psychometrika Monograph, No, 18.

    Google Scholar 

  • Samejima F (1979) A new family of models for the multiple choice item (Research Report No. 79-4). University of Tennessee, Department of Psychology, Knoxville

    Google Scholar 

  • San Martin E, del Pino G, de Boeck P (2006) IRT models for ability-based guessing. Appl Psychol Meas 30:183–203

    Article  MathSciNet  Google Scholar 

  • Smith RM (1987) Assessing partial knowledge in vocabulary. J Educ Meas 24:217–231

    Article  Google Scholar 

  • Stanley JC, Wang MD (1970) Weighting test items and test item options, an overview of the analytical and empirical literature. Educ Psychol Meas 30:21–35

    Article  Google Scholar 

  • Swineford F (1938) Measurement of a personality trait. J Educ Psychol 29:295–300

    Article  Google Scholar 

  • Swineford F (1941) Analysis of a personality trait. J Educ Psychol 32:348–444

    Article  Google Scholar 

  • Sykes RC, Hou L (2003) Weighting constructed-response items in IRT-based exams. Appl Meas Educ 16:257–275

    Article  Google Scholar 

  • Thissen D, Steinberg L (1984) A response model for multiple choice items. Psychometrika 49:501–519

    Article  Google Scholar 

  • Thurstone LL (1919) A method for scoring tests. Psychol Bull 16:235–240

    Article  Google Scholar 

  • Tversky A, Kahneman D (1992) Advances in prospect theory: cumulative representation of uncertainty. J Risk Uncertainty 5:297–323

    Article  MATH  Google Scholar 

  • Wang MW, Stanley JC (1970) Differential weighting: a review of methods and empirical studies. Rev Educ Res 40:663–705

    Article  Google Scholar 

  • Yaniv I, Schul Y (1997) Elimination and inclusion procedures in judgment. J Behav Decis Mak 10:211–220

    Article  Google Scholar 

  • Yaniv I, Schul Y (2000) Acceptance and elimination procedure in choice: noncomplementarity and the role of implied status quo. Organ Behav Hum Decis Process 82:293–313

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuanchao (Emily) Bo .

Editor information

Editors and Affiliations

Appendices

Appendix 1

We can use the following form to derive the Fisher information for an item

$$ I\left(\theta \right)=E\left[\left.{\left(\frac{\partial }{\partial \theta } \log p\left(X;\theta \right)\right)}^2\right|\theta \right]. $$

The logarithm of the likelihood for the model given in Eq. (22) is

$$ log\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right)\right]={\displaystyle \sum_{k=1}^K}{r}_k log({x}_k)- log\left[{\displaystyle \sum_{\boldsymbol{r}\in R}}\left({\displaystyle \prod_{k=1}^K}{x}_k^{r_k}\right)\right]. $$

Note that

$$ \frac{\partial }{\partial \theta } \log \left({x}_k\right)={a}_k,\ \frac{\partial {x}_k^{r_k}}{\partial \theta }={r}_k{a}_k{x}_k^{r_k}. $$

So

$$ \frac{\partial }{\partial \theta}\left\{ log\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right)\right]\right\}={\displaystyle \sum_{k=1}^K}{r}_k{a}_k-{\left[{\displaystyle \sum_{\boldsymbol{r}\in R}}\left({\displaystyle \prod_{k=1}^K}{x}_k^{r_k}\right)\right]}^{-1}{\displaystyle \sum_{\boldsymbol{r}\in R}}\left[\frac{\partial }{\partial \theta}\left({\displaystyle \prod_{k=1}^K}{x}_k^{r_k}\right)\right]. $$

The derivative in the last term may be simplified as follows:

$$ \frac{\partial }{\partial \theta}\left({\displaystyle \prod_{k{=}1}^K}{x}_k^{r_k}\right)\!{=}\!{\displaystyle \sum_{h{=}1}^K}\left[\left(\frac{\partial {x}_h^{r_h}}{\partial \theta}\right){\displaystyle \prod_{k\ne h}^K}{x}_k^{r_k}\right]{=}{\displaystyle \sum_{h=1}^K}\left[\left({r}_h{a}_h{x}_h^{r_h}\right){\displaystyle \prod_{k\ne h}^K}{x}_k^{r_k}\right]{=}{\displaystyle \prod_{k{=}1}^K}{x}_k^{r_k}\left[{\displaystyle \sum_{h{=}1}^K}\left({r}_h{a}_h\right)\right]\!{.} $$

Thus, we may write the derivative of the log likelihood for an item as

$$ \begin{array}{lll}{}&\frac{\partial }{\partial \theta}\left\{ log\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right)\right]\right\}\\ &\quad ={\displaystyle \sum_{k=1}^K}{r}_k{a}_k-{\left[{\displaystyle \sum_{\boldsymbol{r}\in R}}\left({\displaystyle \prod_{k=1}^K}{x}_k^{r_k}\right)\right]}^{-1}{\displaystyle \sum_{\boldsymbol{r}\in R}}\left\{{\displaystyle \prod_{k=1}^K}{x}_k^{r_k}\left[{\displaystyle \sum_{h=1}^K}\left({r}_h{a}_h\right)\right]\right\}.\end{array} $$

Appendix 2

Start with the expression for the derivative of the log likelihood,

$$ \begin{array}{lll}{}&\frac{\partial }{\partial \theta}\left\{ log\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right)\right]\right\}\\ &\quad ={\displaystyle \sum_{k=1}^K}{r}_k{a}_k-{\left[{\displaystyle \sum_{\boldsymbol{r}\in R}}\left({\displaystyle \prod_{k=1}^K}{x}_k^{r_k}\right)\right]}^{-1}{\displaystyle \sum_{\boldsymbol{r}\in R}}\left\{{\displaystyle \prod_{k=1}^K}{x}_k^{r_k}\left[{\displaystyle \sum_{h=1}^K}\left({r}_h{a}_h\right)\right]\right\}\end{array} $$

and notice that the second term is actually the expected value of the quantity \( {\displaystyle \sum_{h=1}^K}\left({r}_h{a}_h\right) \). Specially, if we define

$$ s\left(\boldsymbol{r}\right)={\displaystyle \sum_{h=1}^K}\left({r}_h{a}_h\right), $$

we may write

$$ \begin{array}{lll} {} & E\left[s\left(\boldsymbol{r}\right)\Big|\theta,\ \boldsymbol{r}\in R\right]\\ & ={\displaystyle \sum_{r\in R}}\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right)s\left(\boldsymbol{r}\right)\right]={\displaystyle \sum_{r\in R}}\left\{\left[\frac{{\displaystyle {\prod}_{k=1}^K}{x}_k^{r_k}}{{\displaystyle {\sum}_{\boldsymbol{r}\in R}}\left({\displaystyle {\prod}_{k=1}^K}{x}_k^{r_k}\right)}\right]\left[{\displaystyle \sum_{h=1}^K}\left({r}_h{a}_h\right)\right]\right\}.\end{array} $$

This allows us to rewrite the derivative of the log likelihood as

$$ \begin{array}{lll}\frac{\partial }{\partial \theta}\left\{ log\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right)\right]\right\}&{=}\,{\displaystyle \sum_{k=1}^K}{r}_k{a}_k{-}E\left[s\left(\boldsymbol{r}\right)\Big|\theta,\ \boldsymbol{r}\in R\right]\\ &{=}\,s\left(\boldsymbol{r}\right){-}E\left[s\left(\boldsymbol{r}\right)\Big|\theta,\ \boldsymbol{r}\in R\right].\end{array} $$

The item information function then becomes

$$ \begin{array}{lll} I\left(\theta \right)&={\displaystyle \sum_{r\in R}}\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right){\left(\frac{\partial }{\partial \theta}\left\{ \log \left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right)\right]\right\}\right)}^2\right]\\ &={\displaystyle \sum_{r\in R}}\left[p\left(\boldsymbol{r}\Big|\theta, \boldsymbol{r}\in R\right){\left(s\left(\boldsymbol{r}\right)-E\left[s\left(\boldsymbol{r}\right)\Big|\theta,\ \boldsymbol{r}\in R\right]\right)}^2\right].\end{array} $$

Since the right-hand side of this expression is the conditional variance of s(r), we may write

$$ I\left(\theta \right)= var\left[s\left(\boldsymbol{r}\right)\Big|\theta, \boldsymbol{r}\in R\right]. $$

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Bo, Y.(., Lewis, C., Budescu, D.V. (2015). An Option-Based Partial Credit Item Response Model. In: Millsap, R., Bolt, D., van der Ark, L., Wang, WC. (eds) Quantitative Psychology Research. Springer Proceedings in Mathematics & Statistics, vol 89. Springer, Cham. https://doi.org/10.1007/978-3-319-07503-7_4

Download citation

Publish with us

Policies and ethics