An Option-Based Partial Credit Item Response Model

  • Yuanchao (Emily) Bo
  • Charles Lewis
  • David V. Budescu
Part of the Springer Proceedings in Mathematics & Statistics book series (PROMS, volume 89)


Abstract

Multiple-choice (MC) tests have been criticized for allowing guessing and for failing to credit partial knowledge, and alternative scoring methods and response formats (Ben-Simon et al., Appl Psychol Meas 21:65–88, 1997) have been proposed to address these problems. Modern test theory addresses these issues with binary item response models that include guessing parameters (e.g., the 3PL) or with polytomous IRT models. We propose an option-based partial credit IRT model and a new scoring rule based on a weighted Hamming distance between the option key vector and the option response vector. The test taker (TT)'s estimated ability draws on information from both correct options and distractors. These modifications reduce the benefit of guessing and give credit for partial knowledge. The new model can be tailored to different response formats, and some popular IRT models, such as the 2PL and Bock's nominal model, are special cases of it. Markov chain Monte Carlo (MCMC) methods were used to estimate the model parameters and yielded satisfactory estimates. Simulation studies show that the weighted Hamming distance scores have the highest correlation with TTs' true abilities and that their distribution is less skewed than those of the other scores considered.
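To make the scoring idea concrete, the following is a minimal sketch of a weighted Hamming distance score between an option key vector and an option response vector. The per-option weights and the mapping from distance to a partial-credit score are hypothetical choices made here for illustration; they are not the model's calibrated parameters.

```python
def weighted_hamming(key, response, weights):
    """Sum of weights over option positions where the response differs from the key."""
    if not (len(key) == len(response) == len(weights)):
        raise ValueError("key, response, and weights must have equal length")
    return sum(w for k, r, w in zip(key, response, weights) if k != r)


def item_score(key, response, weights):
    """Map the weighted distance to a partial-credit score in [0, 1]:
    1 for a perfect match, 0 for the maximally distant response.
    (This normalization is an illustrative assumption.)"""
    max_dist = sum(weights)
    return 1.0 - weighted_hamming(key, response, weights) / max_dist


# A 4-option item; the key vector marks the single correct option.
key = [1, 0, 0, 0]
weights = [1.0, 0.5, 0.5, 0.5]  # hypothetical per-option weights

print(item_score(key, [1, 0, 0, 0], weights))  # exact match -> 1.0
print(item_score(key, [0, 1, 0, 0], weights))  # wrong single choice: lower score
print(item_score(key, [1, 1, 0, 0], weights))  # correct option plus one extra mark
```

Because each marked or unmarked option contributes separately to the distance, a TT who eliminates most distractors but marks one extra option still receives partial credit, which is the sense in which the rule rewards partial knowledge.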


Keywords

Item response theory · MCMC · Partial credit · Partial knowledge · Hamming distance · Multiple choice · Scoring rule


References

  1. Andersen EB (1977) Sufficient statistics and latent trait models. Psychometrika 42:69–81
  2. Andrich D (1988) Rasch models for measurement. Sage Publications, Beverly Hills
  3. Bechger TM, Maris G, Verstralen HHFM, Verhelst ND (2005) The Nedelsky model for multiple choice items. In: van der Ark LA, Croon MA, Sijtsma K (eds) New developments in categorical data analysis for the social and behavioral sciences. Erlbaum, Mahwah, pp 187–206
  4. Ben-Simon A, Budescu DV, Nevo B (1997) A comparative study of measures of partial knowledge in multiple-choice tests. Appl Psychol Meas 21:65–88
  5. Bereby-Meyer Y, Meyer J, Budescu DV (2003) Decision making under internal uncertainty: the case of multiple-choice tests with different scoring rules. Acta Psychol 112:207–220
  6. Birnbaum A (1968) Some latent trait models and their use in inferring an examinee’s ability. In: Lord FM, Novick MR (eds) Statistical theories of mental test scores. Addison-Wesley, Reading
  7. Bock RD (1972) Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika 37:29–51
  8. Budescu DV, Bar-Hillel M (1993) To guess or not to guess: a decision theoretic view of formula scoring. J Educ Meas 30:277–291
  9. Budescu DV, Bo Y (in press) Analyzing test-taking behavior: decision theory meets psychometric theory. Psychometrika
  10. Coombs CH, Milholland JE, Womer FB (1956) The assessment of partial knowledge. Educ Psychol Meas 16:13–37
  11. R Development Core Team (2013) R: a language and environment for statistical computing [computer software]. R Foundation for Statistical Computing, Vienna
  12. Dressel PL, Schmidt J (1953) Some modifications of the multiple choice item. Educ Psychol Meas 13:574–595
  13. Echternacht GJ (1976) Reliability and validity of option weighting schemes. Educ Psychol Meas 36:301–309
  14. Frary RB (1989) Partial-credit scoring methods for multiple-choice tests. Appl Meas Educ 2:79–96
  15. Gibbons JD, Olkin I, Sobel M (1977) Selecting and ordering populations: a new statistical methodology. Wiley, New York
  16. Gulliksen H (1950) Theory of mental tests. Wiley, New York
  17. Haladyna TM (1988) Empirically based polytomous scoring of multiple choice test items: a review. Paper presented at the annual meeting of the American Educational Research Association, New Orleans
  18. Hambleton RK, Roberts DM, Traub RE (1970) A comparison of the reliability and validity of two methods for assessing partial knowledge on a multiple-choice test. J Educ Meas 7:75–82
  19. Hamming RW (1950) Error detecting and error correcting codes. Bell Syst Tech J 29:147–160
  20. Hansen R (1971) The influence of variables other than knowledge on probabilistic tests. J Educ Meas 8:9–14
  21. Holzinger KJ (1924) On scoring multiple response tests. J Educ Psychol 15:445–447
  22. Hutchinson TP (1982) Some theories of performance in multiple-choice tests, and their implications for variants of the task. Br J Math Stat Psychol 35:71–89
  23. Jacobs SS (1971) Correlates of unwarranted confidence in responses to objective test items. J Educ Meas 8:15–19
  24. Jaradat D, Tollefson N (1988) The impact of alternative scoring procedures for multiple-choice items on test reliability, validity and grading. Educ Psychol Meas 48:627–635
  25. Kahneman D, Tversky A (1979) Prospect theory: an analysis of decision under risk. Econometrica 47:263–291
  26. Lunn DJ, Thomas A, Best N, Spiegelhalter D (2000) WinBUGS – a Bayesian modeling framework: concepts, structure, and extensibility. Stat Comput 10:325–337
  27. Masters GN (1982) A Rasch model for partial credit scoring. Psychometrika 47:149–174
  28. Michael JC (1968) The reliability of a multiple choice examination under various test-taking instructions. J Educ Meas 5:307–314
  29. Muraki E (1992) A generalized partial credit model: application of an EM algorithm. Appl Psychol Meas 16:159–176
  30. Pugh RC, Brunza JJ (1975) Effects of a confidence weighted scoring system on measures of test reliability and validity. Educ Psychol Meas 35:73–78
  31. Rippey RM (1970) A comparison of five different scoring functions for confidence tests. J Educ Meas 7:165–170
  32. Ruch GM, Stoddard GD (1925) Comparative reliabilities of objective examinations. J Educ Psychol 16:89–103
  33. Samejima F (1969) Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph, No. 17
  34. Samejima F (1972) A general model for free-response data. Psychometrika Monograph, No. 18
  35. Samejima F (1979) A new family of models for the multiple choice item (Research Report No. 79-4). University of Tennessee, Department of Psychology, Knoxville
  36. San Martin E, del Pino G, de Boeck P (2006) IRT models for ability-based guessing. Appl Psychol Meas 30:183–203
  37. Smith RM (1987) Assessing partial knowledge in vocabulary. J Educ Meas 24:217–231
  38. Stanley JC, Wang MD (1970) Weighting test items and test item options, an overview of the analytical and empirical literature. Educ Psychol Meas 30:21–35
  39. Swineford F (1938) Measurement of a personality trait. J Educ Psychol 29:295–300
  40. Swineford F (1941) Analysis of a personality trait. J Educ Psychol 32:348–444
  41. Sykes RC, Hou L (2003) Weighting constructed-response items in IRT-based exams. Appl Meas Educ 16:257–275
  42. Thissen D, Steinberg L (1984) A response model for multiple choice items. Psychometrika 49:501–519
  43. Thurstone LL (1919) A method for scoring tests. Psychol Bull 16:235–240
  44. Tversky A, Kahneman D (1992) Advances in prospect theory: cumulative representation of uncertainty. J Risk Uncertainty 5:297–323
  45. Wang MW, Stanley JC (1970) Differential weighting: a review of methods and empirical studies. Rev Educ Res 40:663–705
  46. Yaniv I, Schul Y (1997) Elimination and inclusion procedures in judgment. J Behav Decis Mak 10:211–220
  47. Yaniv I, Schul Y (2000) Acceptance and elimination procedure in choice: noncomplementarity and the role of implied status quo. Organ Behav Hum Decis Process 82:293–313

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Yuanchao (Emily) Bo (1)
  • Charles Lewis (1)
  • David V. Budescu (1)

  1. Department of Psychology, Fordham University, Bronx, USA