Background

Research has demonstrated that contextual factors such as gender, food security (e.g., limited or uncertain availability of nutritionally appropriate foods), and ethnicity play a role in the development of pediatric obesity [1, 2, 3]. For example, research has found that, compared with boys, girls have more fat mass with a different fat distribution pattern, are less sensitive to insulin across childhood, and are more susceptible to family and environmental risk factors that contribute to pediatric obesity [4]. Boys, in turn, are more physically active throughout childhood and adolescence, receive more benefits from physical activity, and tend to have lower leptin levels than girls [4]. This suggests that risk factors and susceptibility may differ across groups defined by characteristics such as sex. Research on ethnicity has demonstrated that Latino children and adolescents have higher rates of overweight and pediatric obesity than their non-Latino White counterparts [3]. Similarly, there is an association between food insecurity and higher rates of overweight and obesity in children [2, 5, 6, 7]. Despite studies demonstrating the influence of these contextual factors on pediatric obesity, limited research has been conducted to ensure that questionnaires on key risk factors for pediatric obesity are invariant across gender, ethnicity, and food security. Measurement invariance is important because construct validity is threatened when items of a scale function inconsistently across groups.

Existing cross-sectional and longitudinal research suggests that parental beliefs and feeding practices contribute to pediatric obesity [8, 9, 10, 11, 12]. Two commonly used measures of parental beliefs and feeding practices are the Child Feeding Questionnaire [9] and the Children’s Eating Behaviour Questionnaire [13]. Both questionnaires have been used to make group comparisons despite limited psychometric research examining the appropriateness of these measures across key contextual factors related to pediatric obesity.

The Child Feeding Questionnaire (CFQ), one of the most widely used scales in the child feeding literature, assesses parents’ concern about a child’s weight, responsibility for feeding a child, and the extent to which a parent pressures a child to eat or restricts a child’s food intake [9]. The CFQ was initially developed with a 7-factor model and validated among an ethnically diverse sample of mothers and fathers with children ranging from 2 to 11 years of age [9]. However, a study of low-income Latino and African American families with boys and girls failed to replicate the original factor structure and proposed an alternate model [14]. This same study also found cross-cultural conceptual problems, which led the authors to drop the perceived weight subscales, as well as a number of items in each of the remaining subscales, in order to achieve cross-cultural equivalence [14]. Despite these issues, the CFQ has been used to make comparisons across a number of different groups, such as parents with boys versus girls [15], Latino versus European Americans [11, 16], food secure versus food insecure households [5, 17], and among low-income families without an assessment of food security [18, 19].

The Children’s Eating Behaviour Questionnaire (CEBQ) is a parent-report questionnaire that assesses individual eating styles of children that have been found to relate to pediatric obesity [12, 13]. The CEBQ was initially developed with an 8-factor model and validated among mothers and fathers with children between the ages of 3 and 8 years in the United Kingdom [13]. Additional studies have validated the original factor structure among children, with only a slight variation in which food responsiveness and emotional overeating at times load onto the same factor [20, 21]. Within the United States, one study replicated the original factor structure among low-income families with preschool-aged children and found measurement equivalence across White and Black participants [22]. Yet another study of low-income Hispanic and African American families failed to replicate the original factor structure [23]. An additional study of minority low-income families has suggested there may be conceptual issues with some of the scales in the CEBQ [24]. Despite these concerns, the CEBQ has been used to make group comparisons across gender [21, 25] and ethnicity [22, 23, 25]. In addition, the CEBQ has been used across socioeconomic status [25, 26]. If there are problems with measurement invariance, the validity of the inferences and interpretations drawn from results obtained with the measure may be threatened.

Research has suggested there may be cross-cultural conceptual problems with the CFQ and CEBQ, and has highlighted the need to examine measurement equivalence of these widely used scales, particularly among low-income minority groups [24]. The goal of the current study was to examine measurement equivalence of the CFQ and CEBQ across key contextual factors that influence pediatric obesity (gender, ethnicity, food security). To facilitate cross-study comparisons of both measures, the current study targeted caregivers with children between the ages of 4 and 6 years. Both measures have psychometric studies that recruited families from preschool centers, thereby increasing our ability to compare our findings to the extant literature [14, 22, 23, 24, 27].

Method

Participants and procedures

This study included 243 caregivers (169 maternal caregivers) with children between the ages of 4 and 6 years who resided in the home with the children the majority of the time. Table 1 displays the sample characteristics. Approximately 51% of the children were male, and 33.6% were Latino. The majority of caregivers (51.5%) reported a monthly household income of $3000 or below. There were 72 children whose caregiver reported household food insecurity. The number of persons per household ranged from 2 to 10 (M = 4.27, SD = 1.14). Child body mass index (BMI) was calculated as BMI-for-age (age- and sex-specific) using experimenter-measured child weight and height with Centers for Disease Control and Prevention (CDC) growth charts [28]. Among the children, 66.7% had a BMI percentile score below the 85th percentile and were considered to be of healthy weight, 23.8% had a BMI percentile score between the 85th and 95th percentiles and were considered overweight, and 9.5% had a BMI percentile score at or above the 95th percentile and were considered obese.
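The weight categories described above can be sketched as a simple rule on the BMI-for-age percentile. This is a minimal illustration, not the study's code: the function name is hypothetical, the percentile is assumed to have been computed already from the CDC growth charts, and, following the text, every child below the 85th percentile is grouped as healthy weight (no separate underweight category).

```python
def weight_category(bmi_percentile: float) -> str:
    """Map a BMI-for-age percentile (0-100) to the categories used in the text.

    Assumption: a percentile of exactly 95 counts as obese and exactly 85 as
    overweight, matching "95th or greater" and "between 85th and 95th".
    """
    if not 0 <= bmi_percentile <= 100:
        raise ValueError("percentile must be between 0 and 100")
    if bmi_percentile >= 95:
        return "obese"
    if bmi_percentile >= 85:
        return "overweight"
    return "healthy weight"


if __name__ == "__main__":
    for p in (50, 85, 94.9, 95):
        print(p, weight_category(p))
```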

Table 1 Means, standard deviation, and sample characteristics

Using flyers, participants were recruited from waiting rooms of pediatricians’ offices, daycare centers, preschools, and local stores or businesses that were frequented by families. Families called if they were interested in participating in the study and were screened by phone. Caregivers were excluded if they (1) were unable to use English fluently, (2) had a significant disability, such as blindness, that would prevent them from completing the study tasks, or (3) did not have a child between the ages of 4 and 6 years. Parents completed online questionnaires on their behavior patterns and those of their children, and were paid for their participation.

Measures

Child Feeding Questionnaire (CFQ)

The CFQ is a 28-item measure given to parents that assesses parents’ perceived responsibility for feeding, perceived parent weight across development, concern about child weight and risk for being overweight, food restriction, and pressure to eat [12, 13]. Items are rated on a scale from 1 to 5. Scores on the subscales range from 4 to 20 for Perceived Parent Weight, 3 to 15 for Concern about Child Weight, 3 to 15 for Parental Responsibility, 8 to 40 for Restriction, and 4 to 20 for Pressure to Eat. Confirmatory factor analyses have tested the factor structure of this measure across Caucasian and Latino samples [14]. Among Caucasian and Latino samples, internal reliability coefficients range from .70 to .92 [14]. In the current study, internal reliability coefficients were .64 for Perceived Parent Weight, .85 for Concern about Child Weight, .88 for Parental Responsibility, .81 for Restriction, and .75 for Pressure to Eat.
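The subscale score ranges above follow directly from summing items rated 1 to 5. The sketch below makes that arithmetic explicit; the item counts per subscale are not stated in the text and are inferred here from the reported ranges (for example, a range of 8 to 40 implies 8 items), so they should be read as an assumption.

```python
# Item counts are inferred from the reported score ranges, not taken from the
# questionnaire itself (assumption for illustration).
CFQ_SUBSCALE_ITEMS = {
    "Perceived Parent Weight": 4,
    "Concern about Child Weight": 3,
    "Parental Responsibility": 3,
    "Restriction": 8,
    "Pressure to Eat": 4,
}


def score_range(n_items: int, low: int = 1, high: int = 5) -> tuple:
    """Return the (minimum, maximum) possible sum score for n_items items."""
    return (n_items * low, n_items * high)


if __name__ == "__main__":
    for name, n_items in CFQ_SUBSCALE_ITEMS.items():
        print(name, score_range(n_items))
```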

Children’s Eating Behaviour Questionnaire (CEBQ)

The Children’s Eating Behaviour Questionnaire is a 35-item measure that assesses parents’ perceptions of their child’s eating behaviors, with items rated on a scale from 1 to 5. The subscales include child’s responsiveness to food (scores range 5–25), enjoyment of food (scores range 4–20), satiety responsiveness (scores range 5–25), slowness in eating (scores range 4–20), food fussiness (scores range 6–30), emotional overeating (scores range 4–20), emotional undereating (scores range 4–20), and desire to drink (scores range 3–15) [13]. Past research reports internal reliability coefficients ranging from .74 to .91 [13]. In the current study, internal reliability coefficients were .79 for Food Responsiveness, .86 for Emotional Overeating, .82 for Enjoyment of Food, .87 for Desire to Drink, .71 for Satiety Responsiveness, .74 for Slowness in Eating, .73 for Emotional Undereating, and .88 for Food Fussiness.
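The text does not name the internal reliability coefficient that was used. Assuming it is Cronbach's alpha, the most common choice for summed rating scales, the computation for one subscale can be sketched as follows. The function name and the toy ratings are illustrative only and are not study data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for one subscale.

    responses: list of respondents, each a list of k item ratings
    (complete cases only; this sketch does not handle missing data).
    """
    k = len(responses[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    items = list(zip(*responses))                      # one tuple per item
    sum_item_variances = sum(variance(col) for col in items)
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum_item_variances / total_variance)


if __name__ == "__main__":
    # Toy ratings for three respondents on two items (not study data).
    toy = [[1, 2], [2, 1], [3, 3]]
    print(round(cronbach_alpha(toy), 3))
```

Alpha rises as items covary more strongly relative to their individual variances, which is why a subscale whose items move together yields a value close to 1.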

Food security

The United States Department of Agriculture Household Food Security questionnaire was used to assess food security [29, 30]. This is an 18-item questionnaire that categorizes families into high food security (score of 0), marginal food security (scores of 1–2), low food security (scores of 3–7), and very low food security (scores of 8 or more). This measure has been used with different ethnic groups [31]. For this study, individuals with scores of 2 or less were considered food secure, and individuals with a score of 3 or more were considered food insecure. However, caregivers with a score of 1 or 2 who reported skipping meals or not eating so that their children could eat were classified as food insecure.
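The scoring rules above can be sketched in a few lines. The function and parameter names are hypothetical, and the flag `caregiver_skipped_meals` stands in for the caregiver reports of skipping meals or not eating so that their children could eat; how that report was extracted from the questionnaire is not described in the text.

```python
def usda_category(raw_score: int) -> str:
    """Four-level category from the 18-item raw score, using the cut-offs in the text."""
    if not 0 <= raw_score <= 18:
        raise ValueError("raw score must be between 0 and 18")
    if raw_score == 0:
        return "high food security"
    if raw_score <= 2:
        return "marginal food security"
    if raw_score <= 7:
        return "low food security"
    return "very low food security"


def study_group(raw_score: int, caregiver_skipped_meals: bool = False) -> str:
    """Two-level grouping used in this study, including the marginal-score override."""
    if not 0 <= raw_score <= 18:
        raise ValueError("raw score must be between 0 and 18")
    if raw_score >= 3:
        return "food insecure"
    if raw_score in (1, 2) and caregiver_skipped_meals:
        return "food insecure"
    return "food secure"


if __name__ == "__main__":
    print(usda_category(2), "|", study_group(2), "|", study_group(2, True))
```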

Data analytic approach

Descriptive analyses were conducted using SPSS software. Descriptive statistics were calculated by gender, ethnicity, and household food security, and groups were compared using independent-samples t tests. Internal consistency was calculated and reported across all of the groups.

Power analyses

Research has determined that no general rule of thumb will suffice when determining the needed sample size for CFA [32, 33]. Research has found that communality of indicators (i.e., reliability of the indicators) and factor overdetermination (i.e., number of factors/number of indicators) are important when determining sample size requirements for CFAs [33, 34]. MacCallum et al. suggested communalities of .6 or greater and a minimum of 3 indicators per factor [34]. Using Monte Carlo data simulation techniques, Wolf and colleagues [32] found that CFAs with one factor, loadings of .50, and 3 or 4 indicators required a sample size of 190, or a sample size of 90 if the number of indicators increased to 6 or 8. Given the findings from Wolf and colleagues [32], our sample size should be sufficient. To further ensure sufficient power, the RMSEA values from all analyses were retrospectively entered into the Preacher and Coffman online software, which yielded power estimates ranging from .80 to .98 [35].

Measurement invariance

To conduct the measurement equivalence analyses, confirmatory factor analyses (CFAs) were conducted in Mplus [36] following the procedures recommended by Mulaik and Millsap [37]. For all analyses, we used full information maximum likelihood to handle missing data, as this method produces less biased results [38]. Little’s MCAR tests with expectation-maximization methods were performed to evaluate whether data were missing completely at random. For the CFQ, missing data across items ranged from 1.2 to 3.2%, and the analysis indicated that the data were missing completely at random, χ2 (247) = 225.451, p = .83. For the CEBQ, missing data across items ranged from 1.6 to 2.8%, and the analysis indicated that the data were missing completely at random, χ2 (101) = 116.648, p = .14. We evaluated model fit using several fit statistics, including the chi-square significance test [39], the root-mean-square error of approximation (RMSEA) [40], and the comparative fit index (CFI) [41]. The Akaike Information Criterion (AIC) was used to compare models, with the model having the lowest AIC value considered optimal [41]. Adequate fit was defined as a lack of significance on the chi-square difference test, an RMSEA < .08, and a CFI > .90 [41, 42]. To examine measurement invariance across groups (males vs. females, Latino vs. non-Latino Whites, food secure vs. food insecure), we used a step-wise approach (instead of constraining all the parameters at once) to identify the point at which invariance is no longer achieved between the two groups [43]. The first step entailed examining single-group solutions of each subscale of the CFQ and CEBQ for each subgroup. For example, we examined the Pressure to Eat subscale of the CFQ among males and females separately. If model fit was adequate for each of these samples, then we proceeded to the next step; otherwise, we stopped. The second step involved examining configural invariance, which assesses whether the number of factors and pattern of indicator-factor loadings fit both groups equally well.
Both the factor loadings and item thresholds were allowed to be freely estimated in each group. If model fit was adequate, then we proceeded to the next step; otherwise, we stopped. The third step examined loading invariance, which constrained loadings to be equal across both groups. Differences in factor loadings would suggest that items were not assessing the same construct across groups. For example, this test would examine whether the items in the Pressure to Eat subscale of the CFQ had comparable relationships to the latent construct (parents pressuring their children to eat) across the gender groups in this sample. If model fit was adequate, then we proceeded to the next step; otherwise, we stopped. The fourth step examined item intercept invariance, which constrained loadings and intercepts to be equal across both groups. Lack of intercept invariance would suggest that the groups had different thresholds for endorsing a particular item, such that one group endorsed the item at higher severity despite having similar levels of the latent construct. For example, Latino parents may produce different raw scores on the items that comprise the Pressure to Eat subscale than non-Latino parents despite having similar global Pressure to Eat subscale scores.
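The step-wise decision rule described above can be sketched as follows. This illustrates the logic only and is not the Mplus analysis itself: the class and function names are hypothetical, the fit values in the example are invented for illustration, and the .05 significance level is an assumption because the text does not state one.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Sequence


@dataclass(frozen=True)
class Fit:
    chi2_p: float   # p-value of the chi-square test for this model
    rmsea: float
    cfi: float
    aic: float


STEPS = ("configural", "loading", "intercept")


def is_adequate(fit: Fit, alpha: float = 0.05) -> bool:
    """Criteria from the text: non-significant chi-square, RMSEA < .08, CFI > .90."""
    return fit.chi2_p > alpha and fit.rmsea < 0.08 and fit.cfi > 0.90


def highest_invariance(single_group: Sequence[Fit],
                       multigroup: Dict[str, Fit]) -> Optional[str]:
    """Return the last step with adequate fit, or None if testing stops at step one."""
    if not all(is_adequate(f) for f in single_group):
        return None                      # single-group solutions fit poorly: stop
    achieved = None
    for step in STEPS:
        fit = multigroup.get(step)
        if fit is None or not is_adequate(fit):
            break                        # stop at the first step that fits poorly
        achieved = step
    return achieved


def lowest_aic(multigroup: Dict[str, Fit], achieved: Optional[str]) -> Optional[str]:
    """Among the steps achieved, return the model with the lowest AIC."""
    if achieved is None:
        return None
    candidates = STEPS[: STEPS.index(achieved) + 1]
    return min(candidates, key=lambda step: multigroup[step].aic)


if __name__ == "__main__":
    # Invented fit values for one subscale in two groups (not study results).
    single = [Fit(0.40, 0.03, 0.99, 1200.0), Fit(0.25, 0.05, 0.97, 1250.0)]
    multi = {
        "configural": Fit(0.30, 0.04, 0.98, 2450.0),
        "loading": Fit(0.28, 0.04, 0.98, 2446.0),
        "intercept": Fit(0.01, 0.09, 0.88, 2460.0),
    }
    level = highest_invariance(single, multi)
    print(level, lowest_aic(multi, level))
```

Restricting the AIC comparison to steps that already met the fit criteria is a design choice of this sketch; the text reports the AIC-optimal model separately from the invariance decision.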

Results

Means and standard deviations on all subscales across all groups are reported in Table 1.

Invariance across male and female samples

CFQ

Independent CFAs indicated poor fit of the single latent factor for either females or males on the Perceived Parent Weight, Restriction, and Pressure to Eat subscales. Throughout the gender invariance analyses, males were used as the reference group. Results support configural, loading, and intercept invariance for the Concern about Child Weight and Parental Responsibility subscales. This indicates that these subscales appear to be assessing the same underlying constructs across males and females, and that the groups are endorsing items at similar thresholds. Comparatively, the AIC values indicate that the intercept invariance model appears to be the optimal model for both subscales (Table 2).

Table 2 Independent and multi-group CFAs for parent feeding in males and females

CEBQ

Independent CFAs indicated poor fit of the single latent factor for either females or males on Emotional Overeating, Slowness in Eating, Emotional Undereating, and Food Fussiness. Results support configural, loading, and intercept invariance for the Food Responsiveness, Enjoyment of Food, and Satiety Responsiveness subscales, indicating that these subscales appear to be assessing the same underlying constructs across males and females. In addition, males and females appear to be endorsing items at similar thresholds. The AIC values indicated the intercept invariance model to be the optimal model for all of these subscales. However, Desire to Drink achieved only configural and loading invariance, with the intercept invariance model fitting poorly. The AIC value further confirms the configural invariance model to be the optimal model for Desire to Drink (Table 3).

Table 3 Independent and multi-group CFAs for child eating in males and females

Invariance across Latino and non-Latino samples

CFQ

Independent CFAs indicated poor fit of the single latent factor for either Latinos or Non-Latinos on Perceived Parent Weight, Restriction, and Pressure to Eat subscales. Results support configural, loading, and intercept invariance for Concern about Child Weight subscale. This subscale appears to be assessing the same underlying construct across ethnic groups, and the groups appear to be endorsing items at similar thresholds. Comparatively, the loading invariance model appears to be the optimal model for this subscale. Parental Responsibility achieved configural invariance but not loading invariance (Table 4).

Table 4 Independent and multi-group CFAs for parent feeding in non-Latino and Latino samples

CEBQ

Independent CFAs indicated poor fit of the single latent factor for either Latinos or non-Latinos on Slowness in Eating, Emotional Undereating, and Food Fussiness. Results support configural, loading, and intercept invariance for all other scales. For Food Responsiveness, the best fitting model was the loading invariance model when compared to the other models. For Emotional Overeating, the configural invariance model was the best fitting of the three invariance models. For Enjoyment of Food, Desire to Drink, and Satiety Responsiveness, the intercept invariance model was the optimal model with the lowest AIC values (Table 5).

Table 5 Independent and multi-group CFAs for child eating in non-Latino and Latino samples

Invariance across food secure and insecure households

CFQ

Independent CFAs indicated poor fit of the single latent factor for either food secure or insecure households on Perceived Parent Weight, Restriction, and Pressure to Eat. Results support configural, loading, and intercept invariance for Concern about Child Weight, with the intercept invariance model being the best fitting based on AIC values. For the Parental Responsibility subscale, only configural invariance was achieved, with the loading invariance model fitting poorly (Table 6).

Table 6 Independent and multi-group CFAs for parent feeding in food secure and insecure households

CEBQ

Independent CFAs indicated poor fit of the single latent factor for either food secure or insecure households on Food Responsiveness, Emotional Overeating, Slowness in Eating, Emotional Undereating, and Food Fussiness. The models for Enjoyment of Food, Desire to Drink, and Satiety Responsiveness all supported the configural, loading and intercept invariance. Thus, these subscales appear to be assessing the same underlying constructs and samples are endorsing items at similar threshold levels. Based on the AIC index, the intercept invariance model for Enjoyment of Food, the loading invariance model for Desire to Drink, and configural invariance model for Satiety Responsiveness are the optimal models (Table 7).

Table 7 Independent and multi-group CFAs for child eating in food secure and insecure households

Discussion

Although there have been extensive studies that make group comparisons on child eating and feeding practices, few studies have examined measures to ensure group comparisons are equivalent across important group characteristics related to childhood obesity. Of note, there have been association studies relating minority groups’ responses and scores on child eating and feeding practices measures to childhood obesity with limited research examining the appropriateness of these measures among minority groups. To further strengthen the research base for assessing child feeding practices and eating behaviors, we sought to evaluate the factor structure and measurement invariance of the CFQ and CEBQ across gender and ethnicity. A unique contribution of our study was the examination of household food security. It is important to ensure that child eating and feeding practices measures perform consistently across diverse environments.

Overall, analyses of the factor structure yielded mixed results for each measure and highlight some important issues to consider in assessing child eating and feeding practices. For the CFQ, the factor structures did not differ across any of the groups for the subscales Concern about Child Weight and Parental Responsibility. Our study is consistent with and adds to the existing psychometric literature. Cumulatively, Concern about Child Weight and Parental Responsibility are invariant across Latinos (current study, [14]), African Americans [14], preschool-aged boys and girls (current study, [14, 27]), and diverse food secure environments [current study]. Most notably, the factor structures for Restriction and Pressure to Eat from the CFQ varied across the ethnic and food security groups in the current study. There have been cross-cultural conceptual issues for the Restriction subscale among samples of Latinos (current study, [14]), African Americans [14, 40], an Australian sample [27], and diverse food secure groups [current study]. It is important to note that all of these studies, including the current study, assessed parents of preschool-aged children. For the Pressure to Eat subscale, cross-cultural issues were found in the current study and by Boles and colleagues [44], but Anderson and colleagues [14] found this scale to be invariant. In our study, the Perceived Parent Weight factor fit poorly across all the groups. Consistent with our findings, Anderson et al. [14] found issues with this factor and subsequently dropped it altogether. Given all of these findings, conclusions derived from studies that used these subscales across food secure samples [17, 18] should be interpreted with caution.
Similarly, the lack of measurement invariance on Pressure to Eat and Restriction suggests that findings from published studies demonstrating higher rates among Latinos compared to other ethnic and racial groups (e.g., [11, 16]) should be interpreted with caution. Overall, results of the current study suggest that research should continue to validate the CFQ.

Results on the CEBQ revealed that three of the eight factors (Enjoyment of Food, Desire to Drink, and Satiety Responsiveness) performed well and did not vary across any of the groups. However, the intercepts did vary for Food Responsiveness where Latinos report lower thresholds for endorsing these items. In contrast, the intercepts varied for Emotional Overeating as well, but Latinos reported higher thresholds for endorsing these items when compared to non-Latino Whites. Importantly, the Food Fussiness factor showed poor fit across all of the groups. Other factors (i.e., Desire to Drink, Slowness in Eating, and Emotional Undereating) varied across the groups, suggesting the items in these subscales do not assess the same underlying construct across groups.

Cumulatively, the findings from the psychometric research on the CEBQ are mixed. Domoff et al. [22] conducted a validation study of the CEBQ among an ethnically diverse sample of low-income parents of preschool-aged children within the United States and replicated the original factor structure. A second study, conducted with predominantly Hispanic and Black parents of preschool-aged children within the United States, failed to replicate the original factor structure of the CEBQ and proposed three new factors [23]. Consistent with Domoff et al. [22], our study found that three of the food approach subscales (Food Responsiveness, Emotional Overeating, Enjoyment of Food) and one food avoidance subscale (Satiety Responsiveness) performed well across gender and ethnicity. However, consistent with Sparks & Radnitz [23], Food Fussiness, Slowness in Eating, and Emotional Undereating had cross-cultural conceptual problems. It is important to note that all studies used parents of preschool-aged children but differed in mode of administration. The current study, along with Sparks & Radnitz [23], administered self-report questionnaires, while Domoff and colleagues [22] administered the questionnaire orally. At best, the collective research within the United States is inconclusive with regard to the construct validity of the CEBQ and highlights the need for further research.

A unique contribution of this study was an examination of measurement invariance across food secure and insecure households. For the CEBQ, Food Responsiveness, which assesses external eating (e.g., responsiveness to sight, smell, and taste of palatable foods), and Food Fussiness, which assesses picky eating, failed to fit the data adequately. A closer examination of participants from food insecure households and their responses to Food Responsiveness and Food Fussiness items revealed higher rates of “untrue” endorsements. In addition, some parents wrote comments on these items stating that those behaviors or situations did not occur. This might suggest that within food insecure environments, items from these subscales do not apply or fail to capture the living context of these families. In other words, the behaviors assessed in these items might be specific to households with consistent and stable food availability. Similarly, parental food restriction may also be applicable only in food secure environments, given that within food insecure households the family economics and resources are placing food restrictions on the family. It is important to highlight that household food security is associated with income; however, food insecurity in children can still be quite high at incomes that are two or three times the poverty level [2]. Similarly, caregiver disability can influence the risk of food insecurity in children, but high rates can still be evident among households with employed caregivers [2]. Recent research has found that numerous factors aside from income and employment can influence household food insecurity, including caregiver incarceration, immigration status, and caregivers’ mental and physical health (see [2] for a review). Since these analyses are unique to our study, replication is needed. Further research is needed to explore how the presence of poor caregiver mental health or disability influences parent report of child eating behaviors, if at all.

The results of this study should be interpreted in light of several limitations. First, the findings are limited to White and Latino samples fluent in English who are parents of preschool-aged children. Second, the data are parent-report and may be influenced by context-specific eating behaviors and/or the desire to respond with socially expected answers. Third, the very low and low food security groups were combined into one food insecure group due to sample size. Future research should consider separating these groups, as the very low food security group may differ significantly in feeding practices from the other groups. A larger sample size would have allowed for examining the entire factor structure of the scales within one CFA. Relatedly, the number of analyses conducted increases the family-wise error rate and thus the probability of making a Type I error. This means that some of the p-values may have been significant simply by chance. Furthermore, this study was cross-sectional in nature and cannot address longitudinal measurement invariance or distinguish between important differences among individuals that exist within specific racial groups, genders, or food security groups (e.g., genetics or individual experiences). Future research should consider examining longitudinal measurement invariance with these variables, as this can address whether changes in scores over time reflect individual change or change in the properties of the measurement instrument.

Conclusions

In summary, continued psychometric research and scale refinement are needed on the CFQ and CEBQ. For the CFQ, the Concern about Child Weight and Parental Responsibility subscales perform consistently across gender, ethnicity, and food secure environments. For the CEBQ, the Enjoyment of Food, Desire to Drink, and Satiety Responsiveness subscales were invariant across all the contextual factors. Given that pediatric obesity is influenced by contextual factors such as gender, food security, and ethnicity, it is imperative that assessments perform consistently across these factors. The ability to assess risk factors for pediatric obesity, or to detect change across time in treatment studies or longitudinal studies, is compromised if the measures used are influenced by contextual factors.