Introduction

Child maltreatment is a serious problem that affects many children around the world. A recent series of meta-analyses showed that worldwide prevalence rates of child maltreatment ranged from 0.3% based on studies using maltreatment reports of professionals to 36.6% based on self-report studies (Stoltenborgh et al. 2014). Child maltreatment is associated with serious short-term and long-term negative consequences, such as physical, behavioral, and psychological problems, leading to high costs for individuals and society (Alink et al. 2012; Gilbert et al. 2008; Jonson-Reid et al. 2012) . Given the high prevalence rates and serious consequences of child maltreatment, effective prevention of child maltreatment is essential. The number of interventions aimed at preventing or reducing child maltreatment has grown over the years. However, several meta-analyses on the effectiveness of these interventions showed only a limited effect (e.g., Euser et al. 2015; Geeraerts et al. 2004; MacLeod and Nelson 2000; Sweet and Appelbaum 2004). Therefore, it is very important to unravel intervention components that contribute to intervention effectiveness from components not contributing to (or even negatively affecting) intervention effectiveness. Consequently, new promising interventions may be developed and existing interventions may be improved by integrating effective components in interventions and/or eliminating ineffective components from interventions. To enhance knowledge on how specific components of child maltreatment interventions affect the effectiveness of these interventions, a comprehensive meta-analysis was performed in the present study.

The number of interventions aimed at preventing or reducing child maltreatment has increased exponentially over the last decades (Daro 2011). Most interventions target a clearly defined population that is identified on the basis of risk factors for child maltreatment, such as teenage mothers, drug or alcohol addicted parents, or multiproblem families. Interventions may be aimed at reducing the incidence of child maltreatment in maltreating families or at preventing the occurrence of child maltreatment in at-risk, but non-maltreating families. The former are often system interventions, whereas the latter often comprise home visitation interventions in which parents are visited at home and provided with information, support, and/or training regarding child health, development, and care. These interventions often begin prenatally and continue during the child’s first 2 years of life, but may also begin after birth and last only a few months. In addition, some interventions are aimed at preventing the first occurrence of child maltreatment in the general population, for example by providing a short parental skills training to parents who visit a well-baby clinic.

A number of meta-analyses have synthesized results on the effectiveness of interventions aimed at preventing or reducing child maltreatment (e.g., Euser et al. 2015; Filene et al. 2013; Geeraerts et al. 2004; Guterman 1999; Layzer et al. 2001; Pinquart and Teubert 2010; Sweet and Appelbaum 2004). These studies generally found minor effects of interventions for reducing or preventing child maltreatment. In specific, Geeraerts et al. (2004) found a small effect (d = .29) of early prevention interventions for families with young children at risk for physical child abuse and neglect. Filene et al. (2013) found a small effect (d = .20) of home visitation interventions, and Pinquart and Teubert (2010) found a very small but significant effect (d = .13) of parent education interventions for expectant and new parents. Euser et al. (2015) also found a very small but significant effect (d = 0.13) of interventions aimed at preventing or reducing child maltreatment. However, this last effect was no longer significant after publication bias was taken into account by applying the trim-and-fill approach.

A few meta-analyses have attempted to identify characteristics of child maltreatment interventions associated with intervention effectiveness by examining potential moderators of the mean effect of interventions. However, none of these meta-analyses extensively examined whether and how specific intervention components, such as different content or delivery methods, influence intervention effectiveness. In studies on child maltreatment interventions, different terms are used for intervention components, such as practice elements (Chorpita and Daleiden 2009), kernels (Embry and Biglan 2008), behavior change techniques (Michie et al. 2013), and core components (Blase and Fixsen 2013). Blase and Fixsen (2013) classify core components into: (1) contextual factors, such as types of families served (e.g., low-income teenage parents, parents with alcohol or drug problems) and delivery settings (e.g., clinic-based therapy, home visits), (2) structural elements, such as the required number and specific sequence of sessions, and (3) specific intervention practices, such as teaching problem-solving behavior, practicing communication skills, and reinforcing appropriate behavior. Specific intervention practices can be further classified based on intervention content (such as increasing knowledge of typical child development, increasing parenting self-efficacy, and improving discipline and/or behavior management strategies) and delivery techniques used to engage parents and teach relevant content (such as group discussions, homework assignments, role-playing, and modeling).

The main objective of this meta-analysis was to determine whether and how intervention components (i.e., intervention content and delivery techniques) influence the effectiveness of child maltreatment interventions. In doing so, important insight can be gained into what components contribute to intervention effectiveness and, consequently, how child maltreatment interventions can best be designed to reach optimal effectiveness. For this purpose, we conducted a three-level meta-analysis in which we tested intervention components as potential moderators of the (mean) effect of child maltreatment interventions. As we aimed for a comprehensive meta-analysis, we included (a) two types of interventions: preventive interventions targeting the general population or families at risk for child maltreatment and curative interventions targeting maltreating families that are aimed at reducing maltreatment, (b) randomized controlled trials (RCTs) as well as high quality quasi-experimental studies, and (c) recently conducted studies, as previous meta-analyses included studies that were published until 2013.

Determining which components appear to be essential (or nonessential) across a variety of interventions aimed at reducing or preventing child maltreatment has important implications for clinical practice. First, when choosing among interventions to implement, such information could be used to select interventions containing components associated with greater intervention effectiveness. Second, the effectiveness of existing interventions could be improved by integrating specific components associated with greater effectiveness into interventions. Third, it may be possible to eliminate components associated with less effective interventions, thereby minimizing the burden on practitioners and families.

In the present study, we aimed to examine the effect of contextual factors (i.e., general aim of the intervention, types of families served, delivery setting, type of intervention, specific intervention, age of the children), structural elements (i.e., type of worker, duration of intervention, number and interval of intervention sessions), content (parenting skills, personal skills parents, self-confidence of parents, attitudes/expectations toward parenting, knowledge of typical child development, social network of family, relationship between parents, relationship between parent and child, parental mental health problems, parental empowerment, social/emotional support, well-being child, child skills, practical support, motivation to change), and delivery methods (i.e., modeling, role-playing, monitoring, (psycho) education, homework assignments, cognitive skills training, and family group conferencing). We also examined the effect of study design characteristics such as sample characteristics (i.e., sample size, age of parents, percentage of cultural minorities), design of the study (randomized controlled trial versus quasi-experimental), and outcome characteristics (follow-up duration, type of outcome measure).

Method

Inclusion Criteria

Studies were selected if they met the following three criteria. First, studies had to report on the effect of at least one intervention for preventing or reducing child maltreatment. In specific, we included two types of interventions: preventive interventions targeting the general population or targeting families at risk for child maltreatment and curative interventions targeting maltreating families aimed at reducing maltreatment or recurrence of maltreatment. In this review, we followed the definition of child maltreatment as formulated by the Centers for Disease Control and Prevention: “any act or series of acts of commission or omission by a parent or other caregiver that results in harm, potential for harm, or threat of harm to a child.” Given this broad definition, we included studies that reported on interventions for physical abuse, sexual abuse, and neglect. In addition, studies examining the effect of child maltreatment interventions on harsh parenting (such as corporal/physical punishment or parental aggression toward children) and out-of-home placement were also included. Second, experimental studies as well as quasi-experimental studies (in which a treatment condition is compared to a control condition) were included. Although randomized controlled trials can be regarded as the “golden standard” for studies examining the effectiveness of interventions (Farrington 2003), we decided to also include quasi-experimental studies, since RCTs are rather scarce in the field of child protection due to practical and ethical concerns for true experimental designs. Third, studies had to report at least one effect size or sufficient information to calculate at least one effect size.

Selection of Studies

The electronic databases PsychINFO, ERIC, PubMed, Web of Science, ScienceDirect, and Google Scholar were searched for articles, books, chapters, dissertations, and reports. Until March 2017, studies were collected using keywords regarding study design, intervention features, study outcomes, and participants in different combinations: ‘(quasi-)experiment’, ‘randomized control*’, ‘trial’, ‘RCT’ ‘child*’, ‘abus*’, ‘maltreat*’, ‘neglect*’, ‘interven*’, ‘preven*’, ‘home visit*’, ‘recur*’, ‘recidiv*’, ‘relaps*’, ‘family group conferencing’ ‘randomized’, ‘evaluat*’, and ‘experiment*’.

Next, manual searches of reference sections of the retrieved articles, reviews, and book chapters were conducted. Finally, we contacted authors by email to request for studies and unpublished manuscripts on the effect of child maltreatment interventions that may be relevant for inclusion in the present review. The search procedure is depicted in the flow diagram presented in Fig. 1. The search yielded 546 relevant studies of which 130 studies met the inclusion criteria.

Fig. 1
figure 1

Flowchart of selection of studies for inclusion in meta-analysis

Study Coding

Following the guidelines of Lipsey and Wilson (2001), a coding scheme was developed to code all study design and intervention characteristics that could moderate the effect of child maltreatment interventions.

First, with regard to study design characteristics, we collected information on publication year, sample size, parental age, percentage of cultural minorities in samples, research design (RCT, quasi-experimental design), pilot/feasibility study (yes/no), follow-up length (in months), and type of outcome measure (self-report of parents, official reports, observations, hospital data, self-report of children). Second, we retrieved information on intervention characteristics, which we divided into contextual factors, structural elements, and specific intervention practices (content and delivery techniques). The contextual factors included the general aim of the intervention (reducing risk for child maltreatment versus reducing actual child maltreatment), type of families served (maltreating parents, general population, risk group identified by means of a questionnaire, risk group identified based on the presence of one or two risk factors), delivery setting (home/ambulant, treatment center, combination of locations), post-/prenatal intervention (postnatal, prenatal, postnatal and prenatal), type of intervention (cognitive behavioral therapy, home visitation interventions [in which parents are visited at home and provided with information, support and/or training regarding child health, development and care], parent training interventions [aimed at learning specific parenting skills], family-based/multisystemic interventions [aimed at the whole family system/multiple social systems], substance abuse intervention [primarily targeting substance abuse of parents], before-school interventions, general prevention interventions [aimed at preventing the occurrence of child maltreatment in the general population], crisis interventions [aimed at solving acute problems], and combined interventions), specific individual interventions (ACT Parenting Raising Safe Kids, (Early) Head Start, PCIT, FGC/FCDM, Healthy Families, Healthy Start, Healthy Steps, Intensive Family Preservation Services, Incredible Years, MST (CAN/BSF), Nurse Family Partnership (NFP), Triple P, Parents as Teachers, Project 12-Ways, Safe Environment for Every Kid, Child Parent Enrichment Program, SafeCare, other), and age of the child (unborn child/baby (< 2), infant/toddler (2–5), primary school (6–12), high school (> 12). The structural elements included type of worker (professionals only versus professionals and others), the duration of the intervention (0–6 months, 7–12 months, 13–24 months, > 24 months), the minimum duration of the intervention, the maximum duration of the intervention, the average number of sessions, and the interval of the sessions (weekly, multiple sessions a week, every other week/monthly, ascending/descending intensity, every 3 months). The specific intervention practices (content) included parenting skills (yes/no), personal skills of parents (yes/no), parents’ self-confidence (yes/no), attitudes or expectations about parenting (yes/no), knowledge of typical child development (yes/no), social network of the family (yes/no), relationship between parents (yes/no), relationship between parent and child (yes/no), mental health problems parents (yes/no), parental empowerment (yes/no), social or emotional support (yes/no), well-being of the child (yes/no), child skills (yes/no), practical support (yes/no), and motivation (yes/no). Finally, we examined the following delivery techniques: modeling (yes/no), role-playing (yes/no), monitoring (yes/no), (psycho)education (yes/no), homework assignments (yes/no), cognitive skills training (yes/no), and family group conferencing (yes/no). Inter-rater agreement was based on a double-coding of 14 studies by two independent coders. An inter-rater agreement of 97% was found between the two coders on all coded variables.

Calculation of Effect Sizes

All outcomes of the primary studies were transformed into the standardized difference between two means, also referred to as Cohen’s d. The effect sizes were calculated using formulas of Ferguson (1966), Lipsey and Wilson (2001), and Rosenthal (1994). In most instances, proportions, means and standard deviations, and odd ratios were transformed into Cohen’s d. If insufficient statistical information was provided for calculating an effect size, we contacted the study authors and asked for the required information. In calculating each effect size, it was important that the direction of the effect (positive or negative) corresponded with the statistical data reported in the primary study. A positive effect indicated less child maltreatment (or levels of other factors, such as harsh parenting) was found in the intervention group than in the control group, whereas a negative effect indicated more child maltreatment in the intervention group than in the control group. If results were reported to be nonsignificant and additional statistical information required for calculating an effect size was not reported, the value of zero was assigned to an effect and added to the data set (Durlak and Lipsey 1991).

All coded data and calculated effect sizes were entered in SPSS version 22. Before the analyses were performed, continuous variables were centered around their mean, and categorical variables were recoded into dummy variables for each category of a variable. Further, extreme values of effect sizes and sample sizes (Z > 3.29 or Z < − 3.29; Tabachnik and Fidell 2013) were identified.

Statistical Analyses

For estimating the overall effect for all included child maltreatment interventions, only preventive interventions, and only curative interventions, as well as for examining potential moderating variables, a three-level meta-analysis technique was used. By applying a multilevel approach to meta-analysis, there is no need for aggregating or selecting data, implying that all relevant effect sizes can be extracted from primary studies (see also Assink et al. 2015; Assink and Wibbelink 2016). As a result, all information from primary studies is preserved and maximum statistical power can be achieved. In our meta-analytic model, three forms of variance were taken into account: random sampling variation of observed effect sizes (level 1), variance within studies (level 2), and variance between studies (level 3) (Cheung 2014; Hox 2002; Van den Noortgate et al. 2013, 2014). The sampling variance (level 1) was not estimated, but considered to be known and calculated using the formula of Cheung (2014, p. 2015). Because we considered the primary studies to be a random sample from a larger population of studies, we built random-effects models.

Estimating the overall effects was done in separate three-level intercept-only models. Effect sizes were weighted by the inverse of their variance (i.e., sampling error), so that effect sizes derived from studies using samples of larger size contributed more to the overall effect size estimate than effect sizes derived from studies using samples of smaller size. Next, the significance of the variance distributed on levels 2 and 3 were tested by conducting two separate one-tailed log-likelihood ratio tests. In these tests, the deviance of a model in which the variance on either level 2 or level 3 was set to zero, was compared to the deviance of the full model in which level 2 and level 3 variances were freely estimated. In case the level 2 and/or level 3 variance was significant, the distribution of effect sizes was considered to be heterogeneous. This indicates that the effect sizes could not be treated as estimates of one common effect size, and thus, moderator analyses were performed to search for variables that can explain the variance. Potential moderating variables (i.e., study design characteristics, contextual factors, structural elements, and specific intervention practices) were examined by testing them in a three-level meta-analytic model as covariates. In all meta-analytic models, the Knapp and Hartung correction (Knapp and Hartung 2003) was applied, implying that the significance of coefficients was tested using the t- and F- distributions rather than the z-distribution.

The statistical software package R (version 3.2.0) and the metafor package (Viechtbauer 2010) were used to build the 3-level meta-analytic models. We used the syntax as described by Assink and Wibbelink (2016). In all analyses, a 5% significant level was used.

Publication Bias and Sensitivity Analyses

A common problem in conducting a meta-analysis is that studies with nonsignificant or negative results are less likely to be published than studies with positive and significant results. This phenomenon is called publication bias and is often referred to as the ‘file drawer problem’ (Rosenthal 1995). So, the effects extracted from primary studies included in this meta-analysis may not be an adequate representation of the actual effect of child maltreatment interventions. Besides publication bias, the results may be affected by other forms of bias, such as coding or selection bias. Therefore, we examined the degree to which our results were affected by (different forms of) bias by conducting the nonparametric and funnel plot-based trim-and-fill analysis as described by Duval and Tweedie (2000a, b). A funnel plot is a scatter plot of the effect sizes against the effect size’s precision (1 divided by the standard error). In this analysis, the symmetry of the funnel is tested, as the plot would be asymmetric when bias is present. In case of an asymmetric plot, the asymmetry is restored by imputing “missing” effect sizes that are estimated on the basis of existing effect sizes in the data set. Subsequently, a “corrected” overall effect can be estimated in a sensitivity analysis using the data set to which the imputed effect sizes produced by the trim-and-fill algorithm have been added. In this way, the degree to which the results were affected by bias can be made visible.

As we identified seven outlying effect sizes (see above), we performed a sensitivity analysis in which we estimated an overall effect of child maltreatment interventions using the data in which the outlying effect sizes were excluded. In this way, we could determine the degree to which the initially estimated overall effect was robust against outliers in effect sizes. In addition, we estimated the overall effect of child maltreatment interventions excluding the results of pilot/feasibility studies, because these studies are more likely to show an effect that is larger than effects produced by well-powered trials.

Results

Descriptive Characteristics

The present meta-analysis included 130 studies with k = 121 non-overlapping samples, comprising N = 39.044 participants. A total of 352 effect sizes were extracted from all primary studies, which examined the effect of home visitation interventions (k = 50), parent training interventions (k = 29), family-based/multisystemic interventions (k = 17), substance abuse interventions (k = 4), before-school interventions (k = 4), general prevention interventions (k = 4), crisis interventions (k = 3), cognitive behavioral therapy (k = 2), and combined interventions (k = 15). These studies were published between 1977 and 2017 (with 2007 being the median publication year) and were conducted in the USA (k = 93), Europe (k = 11), Canada (k = 6), Australia or New Zealand (k = 6), and in various other countries (k = 5). The characteristics of the studies are presented in “Appendix 1,” including the name of the child maltreatment intervention, publication year, sample size, age of the child at the start of the intervention, study design (RCT or quasi-experimental), and type of families served (risk group, general population, or maltreating parents). The distribution of the moderators for preventive and curative interventions is presented in Table 1.

Table 1 Distribution of moderators separately for preventive and curative interventions

Overall Effect, Heterogeneity in Effect Sizes, and Sensitivity Analyses

A significant overall effect was found of interventions aimed at preventing or reducing child maltreatment with a Cohen’s d of 0.287; 95% CI [0.226; 0.348], t(351) = 9.209, p < .001 (see Table 2). According to the criteria formulated by Cohen (1988), this effect is small in magnitude. The two log-likelihood ratio tests showed that significant variance was present both at level 2 (χ 2(1) = 178.015, p < .001; one-sided) and level 3 (χ 2(1) = 48.182, p < .001; one-sided) of the meta-analytic model. Of the total variance, 32.8 and 56.1% were distributed at levels 2 and 3, respectively, and 11.1% was the percentage of sampling variance that was calculated using the formula of Cheung (2014, p. 2015).

Table 2 Overall Effect for all interventions, preventive interventions, curative interventions, and all interventions excluding pilot/feasibility studies

The results of the trim-and-fill analysis showed that bias may be present in the data, as the distribution of all effect sizes (produced by both preventive and curative interventions) was asymmetrical. Figure 2 shows that a small number of effect sizes had to be imputed in the right side of the funnel to restore its symmetry. In specific, 3 effect sizes from 2 independent studies were added to the data and after re-estimating the overall effect, a slightly higher (Δd = 0.014) significant effect was found (d = 0.301 [95% CI 0.238; 0.365], t(354) = 9.343, p < .001). Besides asymmetry in the distribution of effect sizes, there were a number of effect sizes that seemed to be “outliers” compared with other effect sizes (i.e., the effects in the most right part of Fig. 2). To determine whether and how these “outlying” effect sizes influenced our estimated overall effect, we performed a sensitivity analysis in which we re-estimated an overall effect after excluding 7 effect sizes with a Z score exceeding 3.29 (Tabachnik and Fidell 2013) from the data. The results produced an overall effect of d = 0.257; 95% CI [0.204, 0.311], t(344) = 9.499, p < .001, which is slightly below our initial estimated overall effect (Δd = 0.030). In addition, we estimated the overall effect of all interventions using data without effects derived from the pilot studies (see Table 2). The influence of pilot/feasibility studies on the overall summary effect was only small (d = 0.287 with pilot/feasibility studies and d = 0.278 without pilot/feasibility studies; Δd = 0.009). All three sensitivity analyses showed that the estimated overall effect of d = 0.287 was rather robust.

Fig. 2
figure 2

Funnelplot

Next, we estimated an overall effect of only preventive interventions, and this yielded a Cohen’s d of 0.263; 95% CI [0.197; 0.329], which is a small effect according to the criteria of Cohen (1988). The two log-likelihood ratio tests showed once again that significant variance was present at level 2 (χ 2(1) = 141.121, p < .001; one-sided) and level 3 (χ 2(1) = 45.038, p < .001; one-sided) of the meta-analytic model. Of the total variance, 26.4 and 64.1% were distributed at levels 2 and 3, respectively, and 9.4% was distributed at level 1. As these results indicated substantial heterogeneity in effect sizes, we could test study design and intervention characteristics as potential moderators of the effect of preventive interventions. The estimated overall effect of curative interventions was also significant with a Cohen’s d of 0.364; 95% CI [0.227; 0.502] and was small in magnitude according to Cohen’s criteria. Both the level 2 variance (χ 2(1) = 55.467, p < .001; one-sided) and the level 3 variance (χ 2(1) = 3.975, p = .023; one-sided) were significant, and of the total variance, 21.3, 55.7, and 23.0% were distributed at levels 1, 2, and 3, respectively. So, also for curative interventions, moderator analyses could be performed to examine whether and how the intervention effect was influenced by study design and intervention characteristics.

Moderator Analyses Bivariate Models

Table 3 shows the results of the moderator analyses performed for the preventive interventions, and Table 4 shows the results of the moderator analyses performed for the curative interventions.

Table 3 Results of the moderator analyses for preventive interventions (bivariate models)
Table 4 Results of the moderator analyses for curative interventions (bivariate models)

Study design characteristics For preventive interventions, a trend significant moderating effect was found for research design (larger effect sizes were found in quasi-experimental designs (d = .374) versus RCTs (d = .226)), and a significant positive moderating effect was found for follow-up period (intervention effects increased as follow-up length increased). For curative interventions, a trend significant negative moderating effect was found for sample size (larger sample sizes yielded smaller effect sizes), and a trend significant negative moderating effect was found for percentage of cultural minorities in samples (higher percentages of minorities yielded smaller effect sizes). A significant effect was found for type of outcome measure (larger effect sizes were found for official reports compared to self-reports of parents).

Contextual factors For preventive interventions, a significant moderating effect was found for the type of intervention: The effect of parent training interventions (d = .428) and substance abuse interventions (d = 1.852) were significantly higher than home visitation interventions (d = .210). The types of preventive interventions that were effective in preventing child maltreatment were: home visitation interventions (d = .210), parent training interventions (d = .428), family-based/multisystemic interventions (d = .343), substance abuse interventions (d = 1.852) and combined interventions (d = .174). Before-school interventions (d = .148), general prevention interventions (d = .024), and crisis interventions (d = .407) did not have a significant effect on preventing child maltreatment (the latter probably due to lack of power). Types of curative interventions that were effective in reducing child maltreatment were home visitation interventions (d = .344; trend significant), parent training interventions (d = .415), family-based/multisystemic interventions (d = 0.346), substance abuse interventions (d = .385; trend significant), and cognitive behavioral therapy (d = .445). Crisis interventions (d = .335) did not have a significant effect on reducing child maltreatment, probably due to lack of power.

Further, the moderator analysis showed that the effect of any of the specific individual interventions did not significantly deviate from the effect of the reference category (i.e., other interventions). Specific individual interventions were tested as a separate category only if the effect of the intervention has been examined in at least two studies. Specific individual interventions with a (trend) significant effect on preventing or reducing child maltreatment were: MST-CAN/BSF (d = .545), Triple P (d = .417), ACT-Parent’s Raising Safe Kids Program (d = .377), and Healthy Start (d = .339).

Structural elements For preventive interventions, higher effect sizes were found for interventions delivered by professionals only (d = .334) compared to interventions delivered by professionals and others (such as paraprofessionals or non-professionals; d = .184). Higher effect sizes were found for interventions with a shorter duration (d = .361 for 0–6 months) compared to interventions with a longer duration (d = −.051 for 7–12 months; d = .236 for 13–24 months (trend significant difference); and d = .190 for longer than 24 months). For curative interventions, none of the structural elements were significantly related to effect size.

Specific interventions practices and delivery techniques For preventive interventions, larger effect sizes were found for interventions focusing on increasing the self-confidence of parents (d = .397 versus d = .229). For curative interventions, larger effect sizes were found for improving parenting skills (d = .430 versus d = .190), improving personal skills of parents (d = .440 versus d = .177; trend significant difference), addressing mental health problems parents (d = .521 versus d = .258; trend significant difference), providing social and/or emotional support (d = .649 versus d = .296), and improving child well-being (d = .539 versus d = .272; trend significant difference). Smaller effect sizes were found for interventions focusing on empowerment (d = .203 versus d = .440; trend significant difference).

Moderator Analyses Multiple Moderator Models

Finally, the moderators that were significant in the bivariate models were tested jointly in a multiple moderator model for preventive and curative interventions separately. In this way, the unique contribution of moderators to the prediction of effect size could be examined (see Tables 5 and 6). In the multiple moderator model for preventive interventions, heterogeneity in intervention effects was particularly explained by follow-up duration, parent training interventions (versus other interventions; trend significant), substance abuse interventions (versus other interventions), and a focus on increasing self-confidence of parents (versus interventions without this focus). In the multiple moderator model for curative interventions, heterogeneity in intervention effects was particularly explained by outcome measure, a focus on improving parenting skills (versus interventions without this focus), and providing social/emotional support (versus interventions without this practice; trend significant). In both models, multicollinearity did not seem to be a problem, since all variance inflation factors were below 1.302 and all tolerance statistics were above 0.768.

Table 5 Results of the multiple moderator model for preventive interventions
Table 6 Results of the multiple moderator model for curative interventions

Discussion

The aim of the present meta-analysis was to gain insight into components contributing to the effectiveness of child maltreatment interventions. Overall, a small but significant effect was found of interventions aimed at preventing or reducing child maltreatment (d = .287), which is in line with findings of previously conducted meta-analyses on the effect of these interventions (e.g., Geeraerts et al. 2004; Filene et al. 2013; MacLeod and Nelson 2000). The results of the trim-and-fill analysis suggested that bias was present in the data, and therefore, a “corrected” overall effect was estimated, resulting in an effect size of d = 0.301. Because there are several methodological shortcomings regarding the trim-and-fill method (see the Limitations section), this corrected effect size should not be interpreted as a true effect size, but only as an indicator of (possible) bias in the data.

Larger effect sizes were found for curative interventions targeting maltreating parents (d = .364) than for preventive interventions targeting at-risk families/the general population (d = .263), but this difference did not reach significance. Euser et al. (2015) did find a significantly higher effect for interventions aimed at reducing child maltreatment in maltreating families than for interventions aimed at preventing child maltreatment in at-risk families/the general population. The finding that curative interventions are more effective than preventive interventions may be explained by a lower prevalence of child maltreatment in at-risk families/the general population than in maltreating families, making it “more difficult” to find significant differences between intervention and control groups (because of lower statistical power) and consequently, to prove the effectiveness of an intervention. Furthermore, the detection of child maltreatment in the general population/at-risk families is particularly difficult when official records are used, as many maltreatment occurrences are not officially reported to child protection services. So, in primary studies examining the effect of a preventive intervention and in which official records are used produce rather small effect sizes. Therefore, relatively large research groups and long follow-up periods are needed to increase the power. This may also be a possible explanation for the significant effect of the variable follow-up period (higher effect sizes were produced in longer follow-up periods): the longer the follow-up period, the higher the prevalence of child abuse in the research groups, the more likely a possible effect is detected.

Study Design Characteristics

For preventive interventions, higher effect sizes were found for studies with quasi-experimental designs (d = .347) than for RCTs (d = .226; trend significant difference). Previous meta-analyses that included only RCT designs generally showed smaller effect sizes than meta-analysis including both RCTs and quasi-experimental designs, which is in line with the findings of our meta-analysis. For example, Pinquart and Teubert (2010) found a very small effect (d = .13) of RCTs on parenting interventions for families with newborns. Euser et al. (2015) also found a very small effect (d = 0.13) of RCTs on interventions for preventing or reducing child maltreatment. Ultimately, exclusively reviewing RCT studies is desirable, because effects of child maltreatment interventions can best be determined in RCT’s, as random assignment of participants to an experimental and a control group (theoretically) equalizes both groups on all other variables. Therefore, RCT’s are considered to be the most powerful study design in intervention research. However, at present, it is too early for such a review, because only a few RCTs have been performed on the effectiveness of child maltreatment interventions and their components. Consequently, a lot of essential information would be missing in a review of only RCTs and as a result, little knowledge could be obtained about effective components. Additionally, when matching procedures are properly applied in quasi-experimental studies, reasonable control groups can be formed so that generalization of results is quite feasible and realistic.

Further, for preventive interventions, larger positive intervention effects were found at later follow-up than at immediate post-intervention, which may be attributed to sleeper effects of interventions (Maurer et al. 2007). Sleeper effects in the context of child maltreatment interventions imply that positive intervention effects—at least to some extent—need time to emerge after interventions have ended. For example, parents may be unsure about applying newly learned parenting practices in their own home environment without being helped or supervised and, consequently, time passes until parents are able to apply these practices effectively. As soon as parents have practiced and acquired more confidence, the newly learned practices may be reinforced by positive responses of the child, other family members, and/or members of the social network of the family. It takes a considerable amount of time until positive parenting practices sink in with parents, simply because replacing adverse parenting practices and/or beliefs with positive parenting strategies cannot be expected to occur within a short time period. In general, child maltreatment interventions aim for a sustained change in parent–child interactions over time by improving parenting practices and/or beliefs, so that (the recurrence of) child maltreatment is prevented. Therefore, it is important in primary research to thoroughly conduct follow-up evaluations of considerable length, as the true effects of child maltreatment interventions may be particularly expressed in follow-up rather than in post-treatment evaluations.

For curative interventions, a moderating effect was found for type of outcome (smaller effect sizes were found for studies using self-report data obtained from parents compared to studies using official reports). This may be explained by two difficulties that arise when using self-report methods, such as interviews or questionnaires. First, it may be difficult for parents to be honest about the way they raise their children and to admit any neglectful or abusive parenting practices. As a result, parents may give socially desirable answers. Second, parents may be biased toward presenting a more favorable image of their own parenting behavior (see, for instance, Schwarz et al. 1985). Both issues may result in an underestimation of the true number of maltreatment occurrences, which in turn influences the magnitude of effect sizes. On the other hand, it can be assumed that official reports also lead to an underrepresentation of child abuse because researchers found that many occurrences of child maltreatment do not appear in official records (e.g., Fergusson et al. 2000; Finkelhor 2008; MacMillan et al. 2003).

Effective Components

As for intervention duration, we found that preventive interventions with a short duration (up to 6 months) were more effective than preventive interventions of longer duration, whereas for curative interventions, no significant effect of intervention duration was found. This finding is in line with findings of a meta-analysis on effective ingredients of prevention programs for youth at risk for juvenile delinquency, which also showed that preventive interventions of shorter duration were more effective than preventive interventions of longer duration (De Vries et al. 2015). In contrast, Euser et al. (2015) concluded in their meta-analysis that interventions with an average length (6–12 months) were most effective, but this conclusion seems too strong. There was hardly any absolute difference in mean effect between their 0–6 month category (d = .22) and their 6–12 month category (d = .23). The “less is more” effect in attachment-based interventions found by Bakermans-Kranenburg et al. (2003) seems also applicable to interventions aimed at reducing or preventing child maltreatment. However, intervention duration was no longer a significant predictor of effect size in the multiple moderator model, which implies that intervention duration does not make a unique contribution to the prediction of effect size above follow-up duration (in months), and whether or not interventions were a parent training intervention, a substance abuse intervention, and focusing on improving parents’ self-confidence.

Moreover, it seems to be important to adjust the intensity of an intervention to the level of risk present in a family. In case of high risk, it may be important to intervene intensively, whereas in case of low risk, less intensive interventions are appropriate. This principle is known as the risk principle of the risk–need–responsivity (RNR) model (e.g., Andrews and Bonta 2010a, b; Andrews et al. 1990). This model is widely used for preventing recidivism in the (juvenile) criminal justice system, as various meta-analyses have shown that judicial interventions aimed at behavioral change are most effective when delivered according to this model (Andrews et al. 1990; Andrews and Dowden 1999). It can be assumed that the RNR principles also apply to interventions aimed at preventing or reducing child maltreatment, because child maltreatment, like delinquent behavior, is caused by the accumulation of risk factors in multiple systems (Belsky 1980, 1993; Brown et al. 1998). Furthermore, many risk factors of delinquent behavior resemble risk factors for child maltreatment, like poverty, stress in the family, and psychiatric problems of parents, including alcohol or drug problems (Stith et al. 2009).

For preventive interventions, lager effect sizes were found for interventions focusing on increasing self-confidence of parents. For curative interventions, larger effect sizes were found for improving parenting skills, improving personal skills of parents (trend significant), addressing mental health problems of parents (trend significant), providing social and/or emotional support, and improving a child’s well-being (trend significant). These findings offer possibilities to improve interventions. For curative interventions, smaller effect sizes were found for interventions focusing on empowerment (trend significant). This finding is in line with studies reporting that children from high-risk families benefit less from the presence of protective factors (e.g., Luthar and Goldstein 2004; Miller et al. 1999; Vanderbilt-Adriance and Shaw 2008), indicating that interventions aimed at increasing or strengthening protective factors in high-risk families do not necessarily lead to a decrease in child maltreatement. Vanderbilt-Adriance and Shaw (2008) concluded in their review of studies on resilience that in high-risk families, there should be a focus on both strengthening protective factors and reducing risks, because there are limits to the amount of risk that can be mitigated.

Implications for Policy and Clinical Practice

First, when implementing best practices, clinical professionals and policy makers should opt for interventions that produce the largest effects on preventing or reducing child maltreatment. Cognitive behavioral therapy, home visitation, parent training, family-based/multisystemic, substance abuse, and combined interventions were effective in preventing and/or reducing child maltreatment. Specific individual interventions with a (trend) significant effect on preventing or reducing child maltreatment that were examined in at least two independent studies were: MST-CAN/BSF (intensive family therapy), Triple P (a parent training), ACT-Parent’s Raising Safe Kids Program (a short-term parent training), and Healthy Start (a home visitation intervention). Implementing these (types of) interventions in clinical practice may therefore be fruitful.

Second, the effectiveness of existing interventions could be improved by integrating specific components associated with greater effectiveness into interventions, such as focusing on improving parenting skills and providing social and/or emotional support in curative interventions, and increasing self-confidence of parents in preventive interventions. For curative interventions, it seems important to screen for mental health problems in parents and if present, to address these problems in interventions. Interventions targeting substance-abusing parents appeared to be very effective, as well as interventions addressing other mental health problems. Furthermore, improving child well-being and providing social and/or emotional support seems to be effective components of interventions aimed at maltreating families.

Limitations

Several limitations need to be discussed. First, a large amount of literature is available on the effectiveness of interventions aimed at preventing or reducing child maltreatment, and therefore, it is possible that we may have missed primary studies. However, we included a rather large number of studies in our meta-analysis (121 independent studies) and this led us to assume that our sample of studies is representative of all studies on the effect of child maltreatment interventions that have been conducted to date. Second, the reported information of the studies included in the meta-analysis was sometimes limited. A relatively large number of studies failed to report important information on intervention characteristics, such as specific intervention practices and delivery techniques.

A third limitation is related to the outcome measure that was assessed in primary studies. In some studies, it is assumed that a report, investigation, or a recurrence in child protection is indicative of abuse or neglect. However, the relationship between contact with the child protection system and maltreatment is not straightforward (Jenkins et al. 2017). Several studies showed that a large proportion of child maltreatment is not reported to child protection authorities (Cyr et al. 2013; Finkelhor et al. 2005, 2009). Population-based surveys showed that rates of maltreatment are more than ten times the rates of substantiated maltreatment in those same jurisdictions (e.g., Fergusson et al. 2000; Finkelhor 2008; MacMillan et al. 2003). Further, in many cases reported to child protection authorities, maltreatment is not substantiated.

Fourth, there are several methodological difficulties regarding the trim-and-fill method. The performance of the trim-and-fill method is limited when effect sizes prove to be heterogeneous (Peters et al. 2007; Terrin et al. 2003). In addition, the application of the trim-and-fill method could mean adding and adjusting for nonexistent effect sizes in response to funnel plots that are asymmetrical, simply because of random variation (Egger et al. 2001). Finally, Nakagawa and Santos (2012) mentioned that this method has originally been designed for meta-analytic reviews in which independence of effect sizes can be assumed. Despite these shortcomings, there is no best method for detecting and handling missing data in meta-analysis, and therefore, the results from the trim and fill method should be interpreted with caution. In the present study, we only used the trim-and-fill method to calculate a “corrected” overall effect. A fifth limitation is that several moderator analyses were based on a rather small number of effect sizes, implying a low statistical power in the moderator analyses. Finally, it should be kept in mind that the present review was mainly based on primary studies that were conducted in Western countries, and in particular the USA. Since countries differ in social and political climate, organization of child welfare health services, and in ethnic and cultural background of clients served by child welfare, it is questionable whether the present results are representative for nonwestern countries.

Despite these limitations, our study provides important knowledge for science and practice. Our findings show that interventions can be effective in preventing or reducing child maltreatment. The results of this meta-analysis can be used to improve existing interventions, for example by integrating effective components in interventions, or by developing new promising interventions comprising of the most effective components.