Abstract
This chapter briefly describes some recent developments in test equating and provides examples of how they are performed using the R packages kequate (Andersson et al., J Stat Softw 55(6):1–25, 2013) and SNSequate (González, J Stat Softw 59(7):1–30, 2014). The chapter begins with recent developments within the kernel equating framework, including different bandwidth selection methods, the use of different kernels in the continuization step, and IRT kernel equating. A Bayesian approach to test equating and the assessment of equating transformations are also discussed. The chapter ends with some concluding reflections on the future of equating research connected to the use of R.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
More details are given in Sect. B.8.
- 2.
Results for test scores (2, …, 78) have been omitted.
- 3.
In the examples that follow, the dependence is accounted in the form of a regression on the \(\boldsymbol{z}\) covariates.
- 4.
The BNP.eq() function makes use of other functions from the R package DPpackage (Jara et al. 2011), which is also downloaded and installed when installing SNSequate.
- 5.
Running the BNP.eq() function can take a considerable amount of computation time.
- 6.
Note that the covariates that are not directly involved in the equating of interest are automatically integrated out by the function.
References
Andersson, B. (2016). Asymptotic standard errors of observed-score equating with polytomous IRT models. Journal of Educational Measurement, 53(4), 459–477.
Andersson, B., Bränberg, K., & Wiberg, M. (2013). Performing the kernel method of test equating with the package kequate. Journal of Statistical Software, 55(6), 1–25.
Andersson, B., & von Davier, A. A. (2014). Improving the bandwidth selection in kernel equating. Journal of Educational Measurement, 51(3), 223–238.
Andersson, B., & Wiberg, M. (2014). IRT observed-score kernel equating with the R package kequate. R: Vignette. Retrieved from https://cran.r-project.org/web/packages/kequate/vignettes/irtguide.pdf
Andersson, B., & Wiberg, M. (2016). Item response theory observed-score kernel equating. Psychometrika. doi: 10.1007/s11336--016--9528--7.
Barrientos, A. F., Jara, A., & Quintana, F. A. (2012). On the support of MacEachern’s dependent Dirichlet processes and extensions. Bayesian Analysis, 7, 277–310.
Chalmers, R. P. (2012). Mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1–29.
Cid, J. A., & von Davier, A. A. (2015). Examining potential boundary bias effects in kernel smoothing on equating: An introduction for the adaptive and Epanechnikov kernels. Applied Psychological Measurement, 39(3), 208–222.
Dorans, N., & Feigenbaum, M. (1994). Equating issues engendered by changes to the SAT and PSAT/NMSQT. Technical issues related to the introduction of the new SAT and PSAT/NMSQT (pp. 91–122).
Epanechnikov, V. (1969). Non-parametric estimation of a multivariate probability density. Theory of Probability and Its Applications, 14, 153–158.
Gelman, A., Carlin, J., Stern, H., & Rubin, D. (2003). Bayesian data analysis (2nd ed.). Boca Raton: Chapman and Hall.
Ghosh, J., & Ramamoorthi, R. (2003). Bayesian nonparametrics. New York: Springer.
González, J. (2014). SNSequate: Standard and nonstandard statistical models and methods for test equating. Journal of Statistical Software, 59(7), 1–30.
González, J., Barrientos, A. F., & Quintana, F. A. (2015). A dependent Bayesian nonparametric model for test equating. In R. Millsap, D. Bolt, L. van der Ark, & W.-C. Wang (Eds.), Quantitative psychology research (pp. 213–226). Cham: Springer International Publishing.
González, J., Barrientos, A. F., & Quintana, F. A. (2015). Bayesian nonparametric estimation of test equating functions with covariates. Computational Statistics & Data Analysis, 89, 222–244.
González, J., & von Davier, A. A. (2017). An illustration of the Epanechnikov and adaptive continuization methods in kernel equating. In L. A. van der Ark, M. Wiberg, S. A. Culpeppe, J. A. Douglas, & W.-C. Wang (Eds.), Quantitative psychology – 81st annual meeting of the psychometric society, Asheville, North Carolina, 2016. New York: Springer.
Häggström, J., & Wiberg, M. (2014). Optimal bandwidth selection in observed-score kernel equating. Journal of Educational Measurement, 51(2), 201–211.
Hall, P., Marron, J., & Park, B. U. (1992). Smoothed cross-validation. Probability Theory and Related Fields, 92(1), 1–20.
Harris, D. J., & Crouse, J. D. (1993). A study of criteria used in equating. Applied Measurement in Education, 6(3), 195–240.
Hjort, N. L., Holmes, C., Müller, P., & Walker, S. (2010). Bayesian nonparametrics. Cambridge: Cambridge University Press.
Jara, A., Hanson, T., Quintana, F., Müller, P., & Rosner, G. (2011). Dppackage: Bayesian non-and semi-parametric modelling in R. Journal of Statistical Software, 40, 1–30.
Karabatsos, G., & Walker, S. (2009). A Bayesian nonparametric approach to test equating. Psychometrika, 74(2), 211–232.
Liang, T., & von Davier, A. A. (2014). Cross-validation: An alternative bandwidth-selection method in kernel equating. Applied Psychological Measurement, 38(4), 281–295.
Lord, F., & Wingersky, M. (1984). Comparison of IRT true-score and equipercentile observed-score “equatings”. Applied Psychological Measurement, 8(4), 453–461.
Müller, P., & Quintana, F. (2004). Nonparametric Bayesian data analysis. Statistical Science, 19, 95–110.
Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement, 16, 159–176.
Rizopoulos, D. (2006). ltm: An R package for latent variable modeling and item response theory analyses. Journal of Statistical Software, 17(5), 1–25.
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. (Psychometrika Monograph No. 17). Richmond, VA: Psychometric Society.
Silverman, B. (1986). Density estimation for statistics and data analysis (Vol. 3). London: Chapman and Hall.
Thissen, D., Pommerich, M., Billeaud, K., & Williams, V. S. L. (1995). Item response theory for scores on tests including polytomous items with ordered responses. Applied Psychological Measurement, 19(1), 39–49.
Wiberg, M., & González, J. (2016). Statistical assessment of estimated transformations in observed-score equating. Journal of Educational Measurement, 53(1), 106–125.
Yuan, K.-H., Cheng, Y., & Patton, J. (2014). Information matrices and standard errors for MLEs of item parameters in IRT. Psychometrika, 79(2), 232–254.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
González, J., Wiberg, M. (2017). Recent Developments in Equating. In: Applying Test Equating Methods. Methodology of Educational Measurement and Assessment. Springer, Cham. https://doi.org/10.1007/978-3-319-51824-4_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-51824-4_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51822-0
Online ISBN: 978-3-319-51824-4
eBook Packages: EducationEducation (R0)