Hypothesis Testing of Equating Differences in the Kernel Equating Framework
Test equating methods are used to produce scores that are interchangeable across different test forms (Kolen & Brennan, 2004). In practice, often more than one equating method is applied to the data stemming from a particular test administration. If differences in estimated equating functions are observed, the question arises as to whether these differences reflect real differences in the underlying “true” equating functions or merely reflect sampling error. That is, are observed differences in equating functions statistically significant?
KeywordsWald Test Kernel Method Score Distribution Score Point Equate Function
The authors would like to thank Tim Moses for sharing SAS macros to carry out kernel equating and to compute the standard error of the equating difference at a given score point. The results reported in this paper were obtained through an adaptation of Tim’s SAS macros. Any opinions expressed in this chapter are those of the authors and not necessarily of Educational Testing Service.