Modeling Parental Involvement

Punter, R. Annemiek; Glas, Cees A. W.; Meelissen, Martina R. M.

doi:10.1007/978-3-319-28064-6_4


Part of the book series: IEA Research for Education (IEAR, volume 1)


Abstract

This chapter presents a psychometric framework aimed at identifying and modeling cultural differential item functioning (CDIF) in multiple ways. One line of modelling uses the residual approach to identify CDIF, and country-specific and random item parameters for the affected items. A second approach uses a non-standard application of the bi-factor model. The results of all approaches for each of the five parental involvement components provide insights into the extent to which they are affected by CDIF.


In this chapter, we first outline the models used and the estimation and testing procedures employed, and then summarize the results revealed by these models.

4.1 Estimation and Testing Procedures

The procedures we used for parameter estimation and evaluation of model fit are based on marginal maximum likelihood (MML). Most of the procedures we discuss are documented in more detail elsewhere (see Bock and Aitkin 1981; Bock et al. 1988; Gibbons and Hedeker 1992; Glas 1999; Adams and Wu 2006; De Jong et al. 2007; Jennrich and Bentler 2011; Glas and Jehangir 2014). We used the public domain software package MIRT (Glas 2010) in the calculations. Additional estimation and testing procedures were used for the bi-factor model, with unidimensional models as special cases, and random item parameters as a generalization.

4.1.1 MML Estimation

The bi-factor model used in this study has two parts: a measurement model (i.e., an IRT model) and a structural model. The measurement model pertains to the polytomously scored response of a student $n$ to an item $i$. The possible item scores range from 0 to $m_i$, and the score of student $n$ on item $i$ is denoted by the indicator variables $x_{nij}$ ($j = 1, \ldots, m_i$), where $x_{nij} = 1$ if the response is in category $j$ and zero otherwise. Note that $m_i$ has an index $i$, which indicates that the maximum score can differ across items.

We describe the procedure for the bi-factor model, combined with the partial credit model (PCM; Masters 1982) and generalized partial credit model (GPCM; Muraki 1992) as IRT models, since these two models were the ones we selected for the present study. However, the theory also applies to other IRT models, such as the unidimensional PCM and GPCM, the graded response model (Samejima 1969), the sequential model (Tutz 1990), and other versions of these models with random item parameters instead of fixed item parameters.

In the bi-factor GPCM, the probability of scoring in category $j$ ($j = 0, \ldots, m_i$) is given by

$$ p_{ij}(\theta_{n}) = p(x_{nij} = 1 \mid \theta_{n}, a, b) = \frac{\exp\left( \sum\limits_{h = 1}^{j} \left( a_{i0}\theta_{n0} + a_{ig(n)}\theta_{ng(n)} - b_{ih} \right) \right)}{1 + \sum\limits_{k = 1}^{m_{i}} \exp\left( \sum\limits_{h = 1}^{k} \left( a_{i0}\theta_{n0} + a_{ig(n)}\theta_{ng(n)} - b_{ih} \right) \right)} $$

(4.1)

where $\theta_{n0}$ is the score of student $n$ on the latent scale pertaining to all countries, $\theta_{ng(n)}$ is the score on a country-specific latent dimension, and the index $g(n)$ indicates the country to which student $n$ belongs. Further, $a_{i0}$ and $a_{ig(n)}$ are the factor loadings of item $i$ on these two dimensions, and $b_{ih}$ ($h = 1, \ldots, m_i$) are the item location parameters. Each location parameter $b_{ih}$ is a position on the latent scale. Empty summations, such as a sum running from $h = 1$ to 0, are assumed to equal zero, so the numerator for category $j = 0$ equals one. The unidimensional GPCM lacks the country-specific dimensions $\theta_{ng(n)}$ and the associated factor loadings $a_{ig(n)}$. Further, the PCM is obtained by fixing all item parameters $a_{i0}$ to one.

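The formula for the response probability and the subsequent derivations can be simplified by introducing the re-parametrization $d_{ij} = \sum_{h = 1}^{j} b_{ih}$ and by defining $a_{ig}^{t}\theta_{n}$ as the inner product of the vectors $(a_{i0}, a_{ig(n)})$ and $(\theta_{n0}, \theta_{ng(n)})$. Because this inner product does not depend on $h$, summing it over $h = 1, \ldots, j$ yields $j\,a_{ig}^{t}\theta_{n}$. Thus, Eq. (4.1) becomes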

$$ p_{ij} (\theta_{n} ) \, = \, \frac{{\exp \left( {ja_{ig}^{t} \theta_{n} - d_{ij} } \right)}}{{1 + \sum\limits_{k = 1}^{{m_{i} }} {\exp \left( {ka_{ig}^{t} \theta_{n} - d_{ik} } \right)} }} $$

(4.2)
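As an illustration of Eq. (4.2), the following minimal sketch computes the category probabilities of a single item. The function name `gpcm_probs` and all parameter values are hypothetical and chosen only for demonstration; they are not taken from the study or from the MIRT package.

```python
import math

def gpcm_probs(theta, a, d):
    """Category probabilities of one item under Eq. (4.2).

    theta: latent scores (theta_n0, theta_ng)
    a:     factor loadings (a_i0, a_ig)
    d:     cumulative locations [d_i1, ..., d_im]
    Returns [p_i0, ..., p_im].
    """
    eta = sum(a_k * t_k for a_k, t_k in zip(a, theta))  # inner product a^t theta
    # Category 0 has exponent 0, which gives the "1 +" term in the denominator.
    exponents = [0.0] + [j * eta - d_j for j, d_j in enumerate(d, start=1)]
    top = max(exponents)  # subtract the maximum for numerical stability
    weights = [math.exp(e - top) for e in exponents]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical item with four categories (m_i = 3).
b = [-0.5, 0.3, 1.1]                             # location parameters b_ih
d = [sum(b[:j]) for j in range(1, len(b) + 1)]   # d_ij = sum of b_ih up to j
probs = gpcm_probs(theta=(0.4, -0.2), a=(1.2, 0.5), d=d)
```

Setting the second loading to zero gives the unidimensional GPCM, and additionally setting the first loading to one gives the PCM.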

The $\theta_{0}$-dimension is the general dimension that pertains to all countries and is the basis for the comparison of the countries. The $\theta_{g}$-dimensions are the country-specific dimensions, and the factor loadings on these dimensions give an indication of country-by-item interaction. It is assumed that within each country, the dimensions $\theta_{0}$ and $\theta_{g}$ have a bivariate normal distribution $ N(\theta_{n0} ,\theta_{ng} ;\mu_{g} ,\varSigma_{g} ) $. The two-dimensional country mean $\mu_{g}$ has a first component $\mu_{g0}$ for the general dimension and a second component for the country-specific dimension; the second component is fixed at zero. The covariance matrix is given by

$$ \varSigma_{g} = \left[ {\begin{array}{*{20}c} {\sigma_{g}^{2} } & 0 \\ 0 & 1 \\ \end{array} } \right] $$

In the unidimensional GPCM and PCM, the latent student parameters $\theta_{0}$ have a univariate normal distribution with mean $\mu_{g}$ and variance $\sigma_{g}^{2}$. Finally, random item parameters are obtained by introducing independent multivariate normal distributions on the parameters for each item (for further details, please consult De Jong et al. 2007).

The present application of the bi-factor model is not standard, but an extension of the basic model. Thus, the technical details on the estimation equations, expressions for the covariance matrix of the estimates, and tests of model fit, are also provided (see Appendix A).
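To make the idea of MML concrete, the sketch below approximates the marginal probability of one response pattern under the unidimensional GPCM by integrating the latent variable out over a grid of equally spaced quadrature points. It is a simplified illustration under stated assumptions, not the estimation routine used in the study: the grid size, the grid width, the function names and the item parameters are all hypothetical.

```python
import math

def gpcm_probs_1d(theta, a, d):
    """Unidimensional GPCM category probabilities for one item."""
    exponents = [0.0] + [j * a * theta - d_j for j, d_j in enumerate(d, start=1)]
    top = max(exponents)
    weights = [math.exp(e - top) for e in exponents]
    total = sum(weights)
    return [w / total for w in weights]

def marginal_pattern_prob(responses, items, mu, sigma, n_points=81, width=6.0):
    """Approximate the integral of prod_i p_i(x_i | theta) * N(theta; mu, sigma^2).

    responses: observed category per item
    items:     list of (a_i, [d_i1, ..., d_im]) tuples
    The grid spans mu +/- width * sigma; weights are normalized to sum to one.
    """
    step = 2.0 * width * sigma / (n_points - 1)
    weighted_sum = 0.0
    weight_total = 0.0
    for q in range(n_points):
        theta = mu - width * sigma + q * step
        density = math.exp(-0.5 * ((theta - mu) / sigma) ** 2)
        likelihood = 1.0
        for x, (a, d) in zip(responses, items):
            likelihood *= gpcm_probs_1d(theta, a, d)[x]
        weighted_sum += likelihood * density
        weight_total += density
    return weighted_sum / weight_total

def marginal_loglik(data, items, mu, sigma):
    """Marginal log-likelihood of the response patterns of one country."""
    return sum(math.log(marginal_pattern_prob(x, items, mu, sigma)) for x in data)

# Hypothetical two-item test: one item with three categories, one with two.
items = [(1.0, [-0.3, 0.4]), (0.8, [0.2])]
data = [(2, 1), (0, 0), (1, 1)]
loglik = marginal_loglik(data, items, mu=0.0, sigma=1.0)
```

An MML routine would maximize this log-likelihood, summed over countries, with respect to the item parameters and the country means and variances.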

4.1.2 Detection and Modeling of Differential Item Functioning

Part of the process of establishing the construct validity of a scale may consist of showing that the scale fits an IRT model. In the present study, the focus is on country-specific CDIF. CDIF can be detected using Lagrange multiplier (LM) test statistics (Rao 1947; see also Aitchison and Silvey 1958) and can be modeled using country-specific item parameters. Glas and Jehangir (2014) showed the feasibility of the method using PISA data, although in the slightly simpler framework of unidimensional IRT models. The method is implemented in the public domain software package MIRT (Glas 2010). LM tests have previously been applied in IRT frameworks (Glas 1999; Glas and Falcón 2003; Glas and Dagohoy 2007). Our primary interest is not in the actual outcome of the LM test: because of the very large sample sizes in educational surveys, even the smallest model violation, that is, the smallest amount of differential item functioning (DIF), will be significant. The reason for adopting the framework of the LM test is that it clarifies the connection between the model violations and the observations and expectations used to detect DIF. Further, because it produces comprehensible and well-founded expressions for model expectations, the value of the LM test statistic can be used as a measure of the effect size of DIF, and the procedure can easily be generalized to a broad class of IRT models.

To define the test and the associated residuals, we define a background variable

$$ y_{nc} = \left\{ {\begin{array}{*{20}l} 1 \hfill & {{\text{if person }}n{\text{ belongs to country }}c ,} \hfill \\ 0 \hfill & {{\text{if person }}n{\text{ does not belong to country }}c .} \hfill \\ \end{array} } \right. $$

The LM test targets the null-hypothesis of no DIF, namely $ \delta_{i} = 0 $, where $ \delta_{i} $ is a parameter that shifts item $i$ for the country under consideration (see the generalized model below). The LM test statistic is computed using the MML estimates of the null-model, in which $ \delta_{i} $ is not estimated. The test is based on the first-order derivative of the marginal likelihood with respect to $ \delta_{i} $, evaluated at $ \delta_{i} = 0 $ (see Glas 1999). If the first-order derivative at this point is large, the MML estimate of $ \delta_{i} $ is far removed from zero, and the test is significant. If the first-order derivative at this point is small, the MML estimate of $ \delta_{i} $ is probably close to zero, and the test is not significant. The actual LM statistic is the squared first-order derivative divided by its estimated variance, and it has an asymptotic chi-squared distribution with one degree of freedom. However, as already discussed, the primary interest is not so much in the test itself as in the information it provides regarding the fit between the data and the model.
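The one-degree-of-freedom statistic described above can be sketched as follows. The function name and the input values are hypothetical, and the derivative is assumed here to be that of the marginal log-likelihood, as is usual for LM (score) tests. The p-value uses the identity that the upper tail of a chi-squared distribution with one degree of freedom at $x$ equals $\operatorname{erfc}(\sqrt{x/2})$.

```python
import math

def lm_statistic_1df(derivative, variance):
    """LM statistic and p-value for a single DIF parameter delta_i.

    derivative: first-order derivative with respect to delta_i,
                evaluated at delta_i = 0 under the null-model
    variance:   estimated variance of that derivative
    """
    if variance <= 0.0:
        raise ValueError("variance must be positive")
    lm = derivative ** 2 / variance
    # Upper tail of the chi-squared distribution with one degree of freedom.
    p_value = math.erfc(math.sqrt(lm / 2.0))
    return lm, p_value

# Hypothetical values for one item.
lm, p_value = lm_statistic_1df(derivative=2.0, variance=1.0)
```

With the very large samples of international surveys, even a tiny shift produces a large statistic, which is why the residuals are inspected alongside the test outcome.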

For a general definition of the approach, which also pertains to polytomously scored items, the covariates $y_{nc}$ ($c = 1, \ldots, C$) should be defined. Special cases leading to specific DIF statistics are given later. The covariates may be separately observed person characteristics, but they may also depend on the observed response pattern, provided that the response to the targeted item $i$ is excluded.

The LM approach can be outlined using the bi-factor GPCM; the special cases for the unidimensional PCM and GPCM are obtained if the restrictions denoted above are invoked. The probability of a response is given by a generalization of the bi-factor GPCM, namely,

$$ p_{ij} (\theta_{n} ) \, = \, \frac{{\exp \left( {ja_{ig}^{t} \theta_{n} - d_{ij} + j\sum\limits_{c}^{{}} {y_{nc} \delta_{ic} } } \right)}}{{1 + \sum\limits_{k = 1}^{{m_{i} }} {\exp \left( {ka_{ig}^{t} \theta_{n} - d_{ik} + k\sum\limits_{c}^{{}} {y_{nc} \delta_{ic} } } \right)} }} $$

For one so-called reference country, the covariates $y_{nc}$ are all equal to zero. This country serves as a baseline where the bi-factor GPCM with item parameters $a$ and $b$ holds. For students in each of the other $C-1$ countries, the covariate $y_{nc}$ of their own country is equal to one. It can be shown (see Glas 1999) that the test statistic is based on the residuals

$$ \frac{\sum\limits_{n = 1}^{N} \sum\limits_{j = 1}^{m_{i}} y_{nc}\, j\, x_{nij}}{\sum\limits_{n = 1}^{N} y_{nc}} - \frac{\sum\limits_{n = 1}^{N} \sum\limits_{j = 1}^{m_{i}} y_{nc}\, j\, E\left( p_{ij}(\theta_{n}) \mid x_{n}; \lambda \right)}{\sum\limits_{n = 1}^{N} y_{nc}} $$

(4.3)

for $c = 1, \ldots, C-1$. Because both terms are divided by the number of respondents in country $c$, $\sum_{n} y_{nc}$, the residuals are the differences between the observed and expected average item-total score in country $c = 1, \ldots, C-1$. The residual gauges so-called uniform DIF; in other words, it indicates whether the item total function (ITF) $\sum_{j} j\,p_{ij}(\theta)$ is shifted for the item in that country, that is, whether there is item-by-country interaction.
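The residual of Eq. (4.3) can be illustrated with a small sketch for one item and one country. It assumes that the posterior expected item scores have already been computed under the null-model; the function name and the data are hypothetical.

```python
def dif_residual(scores, expected_scores, in_country):
    """Observed minus expected average item score in country c, Eq. (4.3).

    scores[n]:          observed item score of student n (sum_j j * x_nij)
    expected_scores[n]: model-based expected item score of student n
    in_country[n]:      covariate y_nc (1 if student n is in country c, else 0)
    """
    n_c = sum(in_country)
    if n_c == 0:
        raise ValueError("country c has no respondents")
    observed = sum(y * s for y, s in zip(in_country, scores)) / n_c
    expected = sum(y * e for y, e in zip(in_country, expected_scores)) / n_c
    return observed - expected

# Hypothetical data for five students, three of whom belong to country c.
scores = [2, 1, 0, 3, 2]
expected_scores = [1.5, 1.2, 0.4, 2.1, 1.8]
in_country = [1, 1, 0, 0, 1]
residual = dif_residual(scores, expected_scores, in_country)
```

A positive residual means that students in country c score higher on the item than the model predicts, which points to uniform DIF in favor of that country.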

The LM statistic for the null-hypothesis $ \delta_{ic} = 0 $ ($c = 1, \ldots, C-1$) is a quadratic form in the $(C-1)$-dimensional vector of residuals and the inverse of their covariance matrix (for details, see Glas 1999). It has an asymptotic chi-squared distribution with $C-1$ degrees of freedom.
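The quadratic form itself is simple to compute once the residual vector and its covariance matrix are available. The sketch below uses a small Gaussian elimination routine so that it needs no external libraries; the function names and the numerical values are hypothetical, and the derivation of the covariance matrix (see Glas 1999) is not reproduced here.

```python
def solve_linear(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [[float(v) for v in row] + [float(r)] for row, r in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (aug[row][n] - tail) / aug[row][row]
    return x

def lm_quadratic_form(residuals, covariance):
    """LM statistic r' W^{-1} r with C-1 degrees of freedom."""
    w_inv_r = solve_linear(covariance, residuals)
    statistic = sum(r * v for r, v in zip(residuals, w_inv_r))
    return statistic, len(residuals)

# Hypothetical residuals for C - 1 = 2 countries and their covariance matrix.
lm, df = lm_quadratic_form([1.0, 1.0], [[2.0, 1.0], [1.0, 2.0]])
```

The statistic is then compared with a chi-squared distribution with `df` degrees of freedom.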

A special case of this procedure is obtained if one country serves as the focal country and all other countries serve as reference. Then the model under the alternative hypothesis has only one additional parameter, $ \delta_{i} $, and the associated LM statistic has an asymptotic chi-squared distribution with one degree of freedom.

Items that show the worst misfit, based on their value of the LM statistic and residuals, are given country-specific item parameters. From a practical point of view, defining country-specific item parameters is equivalent to defining an incomplete design where the DIF item is split into a number of virtual items, and where each virtual item is considered as administered in a specific country. The resulting design can be analyzed using IRT software that supports the analysis of data collected in an incomplete design. We here refer to items with country-specific parameters as split items.
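The recoding of a DIF item into virtual items can be sketched as follows. The data layout (one dictionary per student), the item names, the country codes, and the `item@country` naming scheme are hypothetical choices for illustration; `None` marks an item that is treated as not administered.

```python
def split_item(rows, item, dif_countries):
    """Replace `item` by country-specific virtual items in selected countries.

    rows:          list of dicts with a 'country' key and item scores
    item:          name of the item that shows CDIF
    dif_countries: countries that receive their own item parameters
    Students in other countries keep the common (anchor) version of the item.
    """
    virtual_names = {c: f"{item}@{c}" for c in dif_countries}
    recoded = []
    for row in rows:
        new_row = dict(row)
        for name in virtual_names.values():
            new_row[name] = None  # not administered by design
        country = row["country"]
        if country in virtual_names:
            new_row[virtual_names[country]] = row[item]
            new_row[item] = None  # common version not administered here
        recoded.append(new_row)
    return recoded

# Hypothetical responses of three students to two items.
rows = [
    {"country": "NLD", "cars": 1, "books": 2},
    {"country": "DEU", "cars": 2, "books": 1},
    {"country": "NOR", "cars": 0, "books": 3},
]
recoded = split_item(rows, item="cars", dif_countries=["NLD"])
```

The recoded data form an incomplete design: the item without CDIF (`books`) links the countries, while the split item receives separate parameters where needed.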

The method is motivated by the assumption that a substantial part of the items function the same in all countries and a limited number of items have CDIF. In the IRT model, it is assumed that all items pertain to the same latent variable θ. Items without CDIF have the same item parameters in every country. However, items with CDIF have item parameters that differ across countries. These items refer to the same latent variable θ as all the other items, but their location on the scale differs across countries. For instance, the number of cars in the family may be a good indicator of wealth, but the actual number of cars at a certain level of wealth may vary across countries, or even within countries. Having a car in the inner city of Amsterdam is clearly a sign of wealth, but, in the rural eastern part of the Netherlands, an equivalent level of wealth would probably result in the ownership of three cars.

The number of items given country-specific item parameters is a matter of choice, in which two considerations are relevant. First, a sufficient number of anchor items should remain in the scale. Second, the model including the split items should fit the data. DIF statistics no longer apply to the split items. However, the fit of the item response curve of an individual item, say item i, can be evaluated using the test for non-uniform DIF described earlier, but using a model including country-specific item parameters. So, in this application too, test-score ranges are used as proxies for locations on the θ scale, and the test evaluates whether the model with the country-specific item parameters can properly predict the ITF.

4.2 Results of Modeling Country-Specific Differential Item Functioning

Here we provide descriptive statistics at country level for each of the five parental involvement components under the PCM and GPCM, including sample size and estimated global reliability (Tables 4.1, 4.2, 4.3, 4.4 and 4.5). Sample sizes for the first four components (early literacy activities, help with homework, school practices on parental involvement from a parental perspective, and parental involvement from a student perspective) were taken from the PIRLS home and student data, providing a considerably larger sample than that available for the last component (school practices on parental involvement, school perspective), where data were derived from the PIRLS school questionnaire. The GPCM rarely improved global reliability. Components 1 (early literacy activities), 2 (help with homework), and 5 (school practices on parental involvement, school perspective) were evaluated using nine, eight, and 15 items, respectively (see also Table 3.2). Their global reliability is generally above 0.70, which is an acceptable level for country inferences. A value of 0.80 is generally considered an acceptable reliability level for individual inferences, and for many combinations of components and countries, this level was attained. Components 3 (school practices on parental involvement, parental perspective) and 4 (parental involvement from a student perspective) were evaluated using three items and five items, respectively; the global reliability of these estimates was thus correspondingly lower.

Table 4.1 Country characteristics component 1: early literacy activities before beginning primary school
