Caring for a child with autism spectrum disorder (ASD) can be challenging. Compared to caregivers of typically developing children and of children with other developmental disabilities (DD), primary caregivers of individuals with ASD report greater stress (Abbeduto et al. 2004; Hayes and Watson 2013), depression, anxiety (Pakenham et al. 2005), and caregiver strain (McGrew and Keyes 2014), and lower levels of life satisfaction, psychological and physical well-being (Benson and Kersh 2011), quality of life, and marital satisfaction (Siman-Tov and Kaniel 2011).

Most interventions for ASD focus exclusively on the needs of the child with ASD and rarely target the needs of the caregiver. “Direct” interventions work directly with the child in targeting the core social and communication deficits or problem behaviors (e.g., Early Start Denver Model; Dawson et al. 2010). Parent-mediated (parent-implemented) interventions target the child “indirectly,” teaching parents to deliver the intervention strategies that manage problem behaviors and promote social communication skills (Green et al. 2010; Wong and Kasari 2012).

However, across both direct and indirect interventions, the experiences and psychosocial adjustment of caregivers are generally neglected (Ergüner-Tekinalp and Akkök 2004), even when parents are employed as the intervention agents. For example, when McConachie and Diggle (2007) systematically reviewed parent-implemented interventions for young children with ASD, 8 of the 12 studies reviewed did not even report parent-focused outcomes. This failure to address parent needs is concerning because of the missed potential to ameliorate parent stress and to increase parental well-being–variables that may also increase the effectiveness of interventions for the child with ASD (Wainer et al. 2017). Further, attention to parent stress may have spillover effects that positively impact the psychological well-being of other children within the family (Rossiter and Sharpe 2001).

A few intervention studies have been conducted in ASD or related non-ASD populations to address negative caregiver psychosocial outcomes (Barlow et al. 2002). For example, in an early review of group interventions for parents of children with intellectual disabilities (ID), Hastings and Beck (2004) found that cognitive behavioral group interventions helped reduce stress. In addition, four more recent reviews have summarized the literature on caregiver interventions for the DD and ASD population, using both qualitative (Cachia et al. 2016; Da Paz and Wallander 2017) and quantitative methods (i.e., meta-analytic; Lindo et al. 2016; Singer et al. 2007). Da Paz and Wallander (2017) reviewed 13 studies examining the effectiveness of interventions primarily targeting caregivers’ mental health, 9 of which focused exclusively on ASD caregivers. Studies included in the review examined a variety of treatment strategies (e.g., cognitive behavioral therapy [CBT], expressive writing, acceptance and commitment therapy [ACT]) and caregiver outcomes (e.g., stress, anxiety, quality of life). Overall, the treatment elicited better results than the control conditions across intervention strategies and produced small to large effects in improving caregivers’ general health and reducing stress, depression, and anxiety.

The second narrative review (Cachia et al. 2016) focused exclusively on parents of children with ASD and examined ten studies of interventions using either mindfulness-based training, ACT, or both. The studies using mindfulness-based training and ACT demonstrated generally positive results (e.g., reduced stress; Cachia et al. 2016).

The two meta-analyses reviewed interventions targeting ASD and related disorders more broadly. Singer et al. (2007) used meta-analysis (n = 17 studies) to examine parenting and stress management interventions for caregivers of individuals with DD and found that cognitive behavioral training and stress management interventions were effective. Similarly, Lindo et al. (2016) examined six studies of the impact of various caregiver-focused interventions on reducing the stress in parents of children with cognitive/developmental disabilities and found a moderate effect size.

Taken together, these reviews provide preliminary evidence for the effectiveness of interventions targeting parents of children with ASD and DD generally. However, each of the reviews had limitations. First, with the exception of Cachia et al. (2016), the reviews focused on parents of those with DD generally and often did not separately examine results for caregivers of those with ASD (e.g., Benn et al. 2012). For example, only 6 of the 22 studies in the two meta-analysis reviews were restricted to ASD, and these studies were not analyzed separately, limiting their value in understanding outcomes for parents of those with ASD. Further, although Cachia et al. (2016) did focus exclusively on ASD caregivers, their use of a qualitative approach was limited to providing descriptive information only, and could not estimate effect sizes, quantitatively compare the effectiveness of different types of interventions, or examine potential moderators of effective interventions. Second, none of the reviews were comprehensive. For example, Lindo et al. (2016) examined only six studies, Da Paz and Wallander (2017) missed six studies reviewed by Cachia et al. (2016) and Cachia et al. (2016) missed 9 studies reviewed by Da Paz and Wallander (2017).

Current Study

The current study used meta-analytic techniques to comprehensively review interventions that directly target psychosocial outcomes in caregivers of those with ASD, also examining potential moderators of effectiveness, including child, parent, intervention, and methodological variables.

Two child moderators were examined: age and diagnosis of Asperger syndrome. Although neither moderator had been directly examined in an intervention study of caregiver adjustment, both have been related to caregiver adjustment cross-sectionally. Fitzgerald et al. (2002) reported that better maternal mental health was associated with having an older child with ASD. In addition, caregivers of children with ASD with cognitive and language delays, which are indicators of increased severity (Karst and van Hecke 2012) but are absent in children with Asperger syndrome (American Psychiatric Association 2000), report greater parenting stress. Because most studies did not measure or report symptom severity or presence of cognitive or language delay, we used the diagnosis of Asperger syndrome as a proxy of severity. Accordingly, we hypothesized that studies including a greater percentage of older offspring with ASD or of offspring with Asperger syndrome would yield higher effect sizes.

One parent moderator was examined. In a meta-analysis of parent training programs, disadvantaged caregivers (low SES) of children with disruptive behaviors were found to demonstrate less positive change (Lundahl et al. 2006). Accordingly, we expected that caregivers with lower SES would demonstrate less treatment change.

Four treatment moderators were examined: treatment type, number of components, format, and dose. With respect to treatment type, as noted earlier, both cognitive and mindfulness based interventions have been effective in improving outcomes for caregivers of those with DD and ASD (Cachia et al. 2016; Singer et al. 2007). Cognitive treatments (e.g., mindfulness) also have tended to produce greater effect sizes in psychological adjustment than non-cognitive approaches in interventions for dementia caregivers (Pinquart and Sörensen 2006). With respect to number of components, multicomponent interventions (e.g., psychoeducation and social support) have been shown to produce greater effectiveness in caregivers of those with DD (Singer et al. 2007). With respect to format, group interventions have produced smaller reductions in distress than individual interventions for elderly caregivers (Knight et al. 1993) and for parents of those with cancer (Northouse et al. 2010). With respect to treatment dose, smaller effects have been found for interventions with shorter durations in a meta-analysis of DD caregivers (Lindo et al. 2016), and with lower treatment intensity (fewer sessions) in caregivers of older adults (Sorensen et al. 2002) and cancer patients (Northouse et al. 2010). Accordingly, we expected greater effectiveness for cognitive and mindfulness treatment approaches and for interventions using multiple components, individual formats, and of higher intensity.

Finally, we examined three methodological moderators: type of dependent measure, publication status, and study quality. Specifically, in a meta-analytic review of interventions for caregivers (e.g., children or spouse of elderly), Knight et al. (1993) found that measures of parent emotional outcome (e.g., depression and anxiety inventories) were more sensitive to change than measures of burden, as assessed by the Zarit Burden Interview (Zarit et al. 1980). Also, for intervention studies generally, effect sizes tend to be smaller for unpublished studies (Polanin et al. 2016) and for high quality studies (Cuijpers et al. 2010). Accordingly, we expected that published or low quality studies would yield larger effect sizes. The moderating effect of dependent measures was exploratory.

In summary, the current meta-analysis examined the effectiveness of caregiver-focused interventions in improving psychosocial outcomes in caregivers of individuals with ASD. The major aims were to: (1) determine the overall effectiveness of caregiver-focused interventions, (2) compare effectiveness across intervention approaches, and (3) examine possible moderators of intervention effectiveness.

Methods

Literature Search

Multiple databases were used to search for relevant studies including: ERIC, MEDLINE, PubMed, Web of Science, PsycARTICLES, PsycINFO, Dissertations and Theses (Proquest), CINAHL, and EMBASE. Backward and forward searches were conducted for additional relevant studies (Lipsey and Wilson 2001). E-alerts to receive weekly notifications of relevant studies were set up in both PsycARTICLES and PsycINFO at the time of the initial literature search (August 2017). Articles generated from these additional searches were screened for inclusion criteria until July 2018. The search included published and unpublished empirical studies, systematic reviews, and conference proceedings. A variety of search term combinations were used, including types of participants (e.g., “parent”), types of diagnoses (e.g., “autism”), types of interventions (e.g., “psychoeducation”), and types of psychosocial outcomes (e.g., “depression”, “anxiety”).

Inclusion and Exclusion Criteria

Studies were included when: (1) participants were primary caregivers of offspring of any age diagnosed with ASD. Because there was no restriction on year of publication, studies focusing on a(ny of the DSM-IV or DSM-5 diagnoses related to ASD were deemed acceptable (e.g., ASD, autism, Asperger’s syndrome, PDD-NOS); (2) group comparisons were used, either within-subjects (i.e., pre- vs. post-) or between-subjects designs (treatment vs. control); (3) the group sample size receiving treatment was larger than five (see Blodgett et al. 2014; Liu et al. 2014; Virués-Ortega 2010); (4) caregivers were the main focus of the intervention, not the child with ASD; (5) outcome data on caregiver psychosocial adjustment was provided and was the target of the intervention; (6) effect sizes or information that allowed calculation of effect sizes was provided; and (7) the study was reported in English.

No date or publication format limits were set, which allowed access to the full body of research addressing the issue. Studies that included a mixed sample of caregivers (e.g., ASD, ADHD, ID) and did not provide separate results for the ASD caregivers were excluded. Interventions targeting the offspring’s adaptive and/or challenging behaviors directly (child-focused) or teaching caregivers child-focused intervention strategies (caregiver-mediated/implemented) were excluded from the review.

Coding

Based on recommendations from Lipsey and Wilson (2001) two codebook modules were created, one coded information applicable to the whole study (e.g., basic study information, sample descriptors, study design) and the other coded effect size information for individual study findings. The codebooks were created, tested and then revised as needed by the first author, a masters degree doctoral student in clinical psychology. To confirm reliability, 20% of eligible studies were independently coded by a second rater, a masters degree student in clinical psychology. Interrater reliability ranged from 80% to 100% across codes. Discrepancies between coders were discussed until consensus was reached. Codebooks were updated to reflect the consensus coding, as needed. Authors were contacted for additional information when necessary.

Study Quality

The Cochrane Collaboration Guideline and Quality Assessment Tool for Quantitative Studies (QATQS, The Cochrane Collaboration 2011; Thomas 2009) was used to measure study quality. The 14-item QATQS examines five components: selection bias, study design, confounding variables, blinding of the outcome assessors and participants, data collection methods, and dropout rates in the study. Components were rated using a three point scale (i.e., strong, moderate, weak). A global quality rating was assigned to each study based on the ratings of the five components (see details in Thomas 2009).

Statistical Analyses

Descriptive Statistics

Participant (e.g., age, gender, SES), intervention (e.g., sample size, number of intervention sessions), and methodological characteristics (e.g., study design, study quality measures) were examined using descriptive statistics using SPSS Version 23.0 (IBM Corp. Released 2015).

Main Meta-analytic Analyses

To examine effectiveness, two main meta-analyses were conducted using the within- and between-subjects data, separately. First, for within-subjects data, the mean effect size (ES) was calculated with pre- and post-test data representing the standardized mean gain in outcomes from both within-subjects design studies and the experimental groups from the between-subjects design studies. Pre- and post-test data were used because they were the most commonly available measurement time points across studies conforming to the pretest and follow-up scores (discussed in more details in Results). To control for statistical dependence between pre- and post-test scores, pre-post correlations were used to calculate the standardized mean (Lipsey and Wilson 2001). When available, the pre-post correlations were extracted directly from the studies. However, because most studies do not report data on the correlations between the pre- and post-treatment measures (Lipsey and Wilson 2001), when unavailable, previous published studies were used to estimate the pre-post correlation (i.e., test–retest reliability; Lipsey and Wilson 2001; Rosenthal 1991). When correlation information was not available from the published literature, a conservative estimate (r = .70) was used according to recommendations by Rosenthal (1991), Hofmann et al. (2010), and Khoury et al. (2013).

For between-subjects data, the post-intervention data from the treatment and control groups were used to calculate a between-subjects ES, representing the standardized mean difference in outcomes between the experimental and control groups (Hedges and Olkin 1985; Singer et al. 2007; Sorensen et al. 2002). If groups had substantially different average scores on the dependent measures at pretest, change scores were used with the appropriately adjusted standard deviation (Singer et al. 2007; Sorensen et al. 2002).

The second major goal of this meta-analysis was to examine whether effectiveness differed across intervention approaches. First, moderator analyses were conducted to examine if the mean ES depended on type of interventions. If significant, separate individual meta-analyses were conducted, using both the within- and between-subjects data, to examine mean ES restricted to a specific intervention approach.

Data analyses were conducted using the Comprehensive Meta-analysis 3 program (CMA 3; Borenstein et al. 2014). ESs were calculated using a random-effects model, which increases the ability to generalize results by accounting for within- and between-subjects variability of the individual study data (Borenstein et al. 2009; Lipsey and Wilson 2001). Each study was constrained to contribute one ES per construct per study to ensure statistical independence of the ESs (Lipsey and Wilson 2001), i.e., multiple outcome measures of a single construct of interest were averaged (Card 2015).

Hedges’ g was used for the calculation of mean ESs instead of Cohen’s d because Cohen’s d tends to overestimate ESs in studies with small samples, which was true for the majority of the studies in this meta-analysis. ESs were weighted by the inverse of its variance to account for study differences in sample size (Lipsey and Wilson 2001; Borenstein et al. 2009). Ninety-five percent confidence intervals (CI) were calculated to indicate the degree of precision of the estimate and the significance of the mean ESs. The interpretation of Hedges’ g is equivalent to Cohen’s d (i.e., ESs of .20. 50, and .80 are considered small, medium, and large, respectively; Cohen 1992; Lipsey and Wilson 2001). Because it is inadvisable to combine ESs from within-subjects and between-subjects design studies (Borenstein et al. 2009; Lipsey and Wilson 2001), within- and between-subjects data were analyzed separately.

Outliers

Both within-subject and between-subject study effect sizes were examined for potential outliers. This is a common procedure in meta-analysis to detect potentially unrepresentative studies whose inclusion may bias mean effect size estimates. Study effect sizes were examined initially using visual inspection through forest plots using the CMA 3 program. Sensitivity analyses also were conducted to determine whether removal of studies with highly discrepant effect sizes (i.e., outliers) significantly altered the results (Card 2015). Outliers were retained if their removal did not significantly alter the results.

Publication Bias

Publication bias is concerned with whether the sample of studies included in a meta-analysis is representative of the population of studies theoretically available for inclusion. The key concern is that published studies are both easier to access and more likely to report significant positive results. Tests for publication bias examine the distribution of effect sizes across the sample of studies included in the meta-analysis. We employed two tests of publication bias. First, bias was examined visually using a funnel-plot of study ESs. An asymmetric plot indicates probable missing studies due to publication bias (Lipsey and Wilson 2001). Second, Egger’s tests were used, by regressing the standardized ES onto an estimate of ES precision (the inversion of the standard error; Card 2015). A significant Egger’s test would indicate publication bias (Card 2015). Egger’s test required at least 17 studies to have sufficient power to detect publication bias (Card 2015).

Heterogeneity

Hedges’ Q statistic and the I2 index were used to examine homogeneity of the ES distribution. A statistically significant Q indicates a heterogeneous distribution. However, the sensitivity of the Q statistic is strongly impacted by the number of studies. That is, it has low power when the number of studies is small and may detect unimportant heterogeneity when the number of studies is large (Fu et al. 2011). Accordingly, the I2 index, which quantifies the extent of the heterogeneity, was also computed and used as a heterogeneity indicator. Specifically, I2 indices larger than 25% are usually interpreted to indicate heterogeneity (Huedo-Medina et al. 2006). For the current study, when the Q statistic was significant (p < .10; Fu et al. 2011; Higgins and Thompson 2002) or the I2 index was greater than 25%, the ES distributions were viewed as heterogeneous, and moderator analyses were conducted to examine sources of the heterogeneity (Lipsey and Wilson 2001).

Moderator Analyses

The third goal of this meta-analysis was to examine potential moderators that contributed to any detected heterogeneity in the data. Moderator analyses were conducted using subgroup analyses for categorical moderators and meta-regressions for continuous moderators. For subgroup analyses, Hedges’ g and Q statistics were computed across subgroups (Borenstein et al. 2009). For meta-regressions (Huedo-Medina et al. 2006), a random-effects model was used, which is generally preferred in meta-regressions (Higgins and Green 2011) because it allows true effects to vary between studies (Borenstein et al. 2009). A minimum of four studies per subgroup was required for categorical moderator analyses and 10 for meta-regression analyses (Fu et al. 2011; Higgins and Green 2011).

Results

Characteristics of Included Studies

The literature search identified 5995 articles, producing 41 unique studies that were included in the meta-analyses (see Fig. 1). Studies were conducted from 1998 to 2018, and over half were published (63.4%). The review included a diverse group of participants from the United States (51.2%) and other countries (e.g., Australia, China, Turkey, Iran). The number of participants per study ranged from 5 to 212 (M = 43.20, SD = 40.93).

Fig. 1
figure 1

Search results and screening process

Of the 41 studies, 18 studies used a within-subjects design and 23 studies used a between-subjects design, of which 10 were randomized controlled trials (RCT). The majority of the comparison groups were treatment as usual or waitlist controls (91.3%). About a third of the studies reported follow-up outcome data (31.7%) after the first posttest.

A total of 1771 participants were included in the meta-analysis (Ntreatment= 1156, Ncontrol= 708). The average caregiver age was 39.92 (SD = 4.40; range 33–52). Over half of the participants had a bachelor’s degree or above (60.9%), and the majority were females (85.8%). One study focused exclusively on fathers of children with ASD (Elfert and Mirenda 2015), and 16 studies included only mothers.

The mean age of the individuals with ASD across studies ranged from 4.5 months to 16.13 years old, with the majority being male (82.58%). Fewer than 20% of the studies reported sample information about the level of problem behaviors (k = 7), ASD symptom severity (k = 4), or comorbid ID of children with ASD (k = 3).

Number of intervention sessions ranged from 1 to 60 (M = 7.88, SD = 9.12) with a mode of 8 (k = 11). The duration of interventions ranged from one day to three months (M = 56.56 days, SD = 40.26). Intervention approaches included social support (k = 5), psychoeducational programs (k = 10), ACT (k = 6), mindfulness-based therapy (k = 3), CBT (k = 7), positive psychotherapy (k = 1), written emotional disclosure (k = 4), and multicomponent interventions (k = 6). A variety of psychosocial outcomes were reported, including parenting stress, caregiver burden, depression, anxiety, general well-being, quality of life, and family and marital adjustment. Based on the study quality assessment, over half of the studies (63.4%) were categorized as weak, with 29% rated as moderate quality (k = 12) and 7.3% rated as strong (k = 3; see Tables 1 and 2).

Table 1 Overview of key study features
Table 2 Overview of study features across studies

Outliers and Publication Bias

One-study removal sensitivity analysis showed that the findings were robust to outliers. Both funnel plots and Egger’s test did not support publication bias for either the within- or between-subjects ESs (Within-subjects: Intercept= − 3.04, SE = 1.76, p = .09, 95% CI − 6.60, .53; Between-subjects: Intercept= − .53, SE = .99, p = .59, 95% CI − 2.59, .54).

The Effectiveness of Caregiver-Focused Interventions

Using a composite psychosocial outcome, which adjusted for total score direction and outcome valence of the measure, the mean within-subject ES was .44 (k = 40, SE = .10, 95% CI .24, .64), see Fig. 2. There was significant heterogeneity across studies (Q = 844.38, p < .001, I2= 95.38) supporting further moderator analyses. The pretest to follow-up ES was nonsignificant (Hedges’ g = .26, SE = .15, 95% CI − .03, .55), indicating no significant change between pretest and follow-up; however, only one-third of the studies reported follow-up data. The remaining within-subjects analyses were conducted using only pre- and post-test data.

Fig. 2
figure 2

Forest plot of studies included in within-subjects overall psychosocial outcome meta-analysis

The mean between-subjects ES was small (k = 22, Hedges’ g = .28, SE = .10, 95% CI .10, .45), indicating that caregivers in the intervention groups had slightly better overall psychosocial outcomes, see Fig. 3. There was significant heterogeneity across studies, suggesting the need for moderator analyses (Q = 40.80, p = .006, I2= 48.53), see Table 3.

Fig. 3
figure 3

Forest plot of studies included in between-subjects overall psychosocial outcome meta-analysis

Table 3 Summary of mean effect sizes

Comparison of Intervention Approaches

Interventions were categorized into six approaches: ACT/Mindfulness-based, CBT, positive psychotherapy (e.g., defined as “cultivating strengths, developing optimism, and expressing gratitude as a means of improving well-being”; LaPlante 2013, p. 46)/written emotional disclosure, psychoeducation, social support, and multicomponent interventions.

The six ACT and three mindfulness-based intervention studies were combined into one category because of the small number of studies and the overlap in the two intervention approaches (e.g., ACT includes a combination of acceptance and mindfulness methods; Hayes et al. 1999); also, several studies using ACT included mindfulness-based strategies (Blackledge and Hayes 2006; Hahs 2013; Kowalkowski 2013; Montgomery 2016). Multicomponent interventions included psychoeducation and either social support (Chiang 2014; Farmer and Reupert 2013; Samadi et al. 2013; Zimmerman 2013) or behavior management (e.g., Tonge et al. 2006). The mean overall psychosocial ES was examined within each intervention category separately for both within- and between-subjects data.

Intervention type was a significant moderator of within-subjects ES differences across studies (Qbetween= 23.97, df = 5, p < .001). When examined separately, four intervention approaches had significant small to medium within-subjects ESs in improving caregiver psychosocial outcomes. CBT had the highest mean ES (k = 6, Hedges’ g = .62, SE = .17, 95% CI .30, .95, p < .001), followed by multicomponent interventions (k = 5, Hedges’ g = .55, SE = .15, 95% CI .26, .84, p < .001), ACT/Mindfulness-based interventions (k = 9, Hedges’ g = .49, SE = .07, 95% CI .36, .62, p < .001), and positive psychotherapy/written emotional disclosure (k = 5, Hedges’ g = .19, SE = .05, 95% CI .10, .28, p < .001). Psychoeducational programs (k = 10, Hedges’ g = .52, SE = .34, 95% CI − .14, 1.18, p = .12) and social support when provided alone (k = 5, Hedges’ g = .06, SE = .13, 95% CI − .19, .32, p = .63) did not significantly improve caregiver psychosocial outcomes. Although cross-study heterogeneity was found for two intervention approaches (CBT: Q = 33.02, p < .001, I2= 84.86; multicomponent interventions: Q = 26.65, p < .001, I2= 84.99), the small number of studies available within intervention type precluded conducting additional moderator analyses, see Table 4.

Table 4 Summary of mean effect sizes across intervention types, within-subjects

Intervention type was not a significant moderator of the between-subjects ES (Qbetween= 7.14, df = 6, p = .31). Therefore, no separate individual meta-analyses were conducted for intervention type using the between-subjects data.

Moderators of Effectiveness across Interventions

Categorical Moderators

As reported above, the significant cross-study heterogeneity in effectiveness for both the within- and between-subjects ESs supported the need for further moderator analyses. The following categorical moderators were examined: intervention delivery format, publication status, number of treatment components, and study quality. Although three intervention delivery formats were reported (Individual, Group, mixed), only group and individual delivery formats were utilized often enough to be analyzed as moderators (k > 4; Fu et al. 2011). Number of treatment components (multicomponent approach vs. single-component) was examined as a potential moderator limited to the within-subjects data because only three between-subjects studies used a multicomponent intervention approach (k < 4; Fu et al. 2011). Several exploratory variables also were examined as potential moderators: country of data collection, the use of telehealth, and the use of multiple vs. single psychosocial outcome measures. As mentioned earlier, in calculating the mean intervention ES, a composite score was used in studies that assessed more than one psychosocial outcome. A moderator analysis was conducted to test whether using one ES from a combination of multiple psychosocial outcomes vs. a single outcome impacted the mean ES.

Both within- and between-subjects results indicated that intervention delivery format, publication status, study quality, and number of intervention components (p >.05) were not significant moderators. Moreover, none of the exploratory variables were significant moderators, including country of data collection, use of telehealth techniques, and using a composite vs. a single outcome indicator (p >.05). However, the moderating effect of telehealth in the within-subjects analysis was at a trending level (p = .09). Studies that did not use telehealth were more effective (Hedges’ g = .51, SE = .14, 95% CI .24, .78) than studies that used some form of telehealth (Hedges’ g = .25, SE = .05, 95% CI .15, .36).

Continuous Moderators

Meta-regressions using simple regression and a random-effects model were used to examine the relationship between the continuous variables and the study ESs. Variables examined included age of the offspring, a diagnosis of Asperger syndrome, education level, intervention duration, and number of sessions. Given the small number of studies, only single covariate analyses were conducted because there was insufficient power to conduct analyses with multiple covariates (Borenstein et al. 2009).

None of the aforementioned variables were significant continuous moderators in either the within- or between-subjects analyses. That is, the average age of the child with ASD, the percentage of the sample with Asperger syndrome, caregiver education level, intervention duration, and number of sessions failed to explain heterogeneity in the mean ES. Although dose as measured by treatment length was not significant, another possible indicator of intervention intensity suggested in previous studies is total intervention hours (Northouse et al. 2010). Accordingly, we also examined the potential moderation effect of total intervention hours on outcomes and found that increased intervention hours were related to larger mean ES (B = .0003, SE = .00, 95% CI .0001, .001) in the within-subjects meta-analysis.

Type of Dependent Measures

Due to statistical dependence, neither meta-regression nor categorical analysis was suitable to examine the statistical significance of dependent measure type as a moderator. As an alternative, ESs using different types of psychosocial outcomes were calculated separately and compared.

For the within-subjects data, the largest ESs were found for decreased perceived stress/distress (k = 13, Hedges’ g = .56, SE = .09, 95% CI .39, .74) and parenting stress (k = 18, Hedges’ g = .55, SE = .17, 95% CI .22, .87) and improved general well-being (k = 8, Hedges’ g = .54, SE = .11, 95% CI .32, .76). Smaller ESs were found for decreasing caregiver depressive symptoms (k = 14, Hedges’ g = .34, SE = .08, 95% CI .19, .49), improving positive and negative affect (Hedges’ g = .29, SE = .07, 95% CI .15, .42), and decreasing anxiety (k = 7, Hedges’ g = .25, SE = .10, 95% CI .05, .45). Effect sizes for quality of life and life satisfaction were not significant (k = 10, Hedges’ g = .22, SE = .14, 95% CI − .04, .49). Additionally, few studies measured caregiver burden (k = 5), family adaptation (k = 3), and marital adjustment (k = 3), and all elicited small nonsignificant ESs.

For the between-subjects data, the largest ESs were for improved general well-being (k = 6, Hedges’ g = .65, SE = .26, 95% CI .14, 1.15) and decreased parenting stress (k = 6, Hedges’ g = .49, SE = .22, 95% CI .05, .93), anxiety level (k = 5, Hedges’ g = .42, SE = .12, 95% CI .18, .66), and depressive symptoms (k = 9, Hedges’ g = .32, SE = .12, 95% CI .09, .55). The ESs for general subjective stress/distress (k = 7, Hedges’ g = .24, SE = .17, 95% CI − .08, .57) and quality of life were not significant (k = 7, Hedges’ g = .01, SE = .24, 95% CI − .46, .47). Few studies measured caregiver burden (k = 3), positive and negative affect (k = 3), and family (k = 2) and marital adjustment (k = 1), and none of the results reached significance.

Discussion

This study was the first meta-analysis examining the effectiveness of interventions exclusively focused on caregivers of individuals with ASD. A total of 41 studies using both within- and between-subjects group designs, and including 1771 participants were identified. The comprehensiveness of our sampling is supported by the fact that the number of studies included is more than double the number reported in any prior meta-analysis or narrative review. Below we discuss our findings with respect to the three main research questions: (1) were interventions effective in improving overall psychosocial outcomes in caregivers of those with ASD?; (2) did effectiveness differ across intervention approaches?; (3) what were the potential moderators of effectiveness?

The Overall Caregiver-Focused Interventions Effectiveness

Overall, interventions were effective in improving caregivers’ psychosocial outcomes with a small, significant ES. This finding is similar to results from three prior meta-analyses in non-ASD caregiver samples. Singer et al. (2007) conducted a meta-analysis on group interventions for parents of children with DD and found a small ES, Maughan et al. (2005) examined the impact of parenting programs on parental psychosocial outcomes in the general population and found improvement in a variety of psychosocial outcomes (e.g., anxiety, depression, stress) with small ESs, and Brodaty, Green, and Koschera (2003) examined intervention effectiveness for dementia caregivers and found significant benefits for caregivers (e.g., mood, burden, psychological morbidity), with a small ES.

The ESs varied with study design. Consistent with previous studies of meta-analyses (Lipsey and Wilson 2001), the within-subjects ESs (Hedges’ g = .44) were larger than the between-subjects ESs (Hedges’ g = .28). Lipsey and Wilson (2001) examined 45 meta-analyses and noted that on average within-subjects designs yielded ESs 61% larger than those using between-subjects designs and suggested that within-subjects design studies could potentially inflate mean ESs. Because within-subject studies lack a control group, they are subject to more internal validity threats (e.g., maturation effects, history effects) so that factors unrelated to the intervention could inflate the apparent effectiveness (Higgins and Green 2011) potentially biasing the results.

Effectiveness Across Intervention Approaches

A variety of intervention approaches were identified and used across the studies: CBT, ACT, mindfulness-based intervention, psychoeducational intervention, social support, positive psychotherapy, written emotional disclosure, and multicomponent interventions. Psychosocial outcome ES depended on intervention approaches in the within-subjects analyses. Although there were no differences across intervention type in the between-subjects analyses, this may have been due to the limited number of studies available. Moreover, because the within-subjects analyses included all of the identified studies, it represents a more comprehensive and externally valid sample.

ACT/mindfulness-based interventions were effective and exhibited a medium positive impact on caregiver overall psychosocial outcome in the within-subjects meta-analysis. This finding validates the results reported earlier by Cachia et al. (2016) that mindfulness was effective for caregivers of those with ASD and also is consistent with the findings of Pinquart and Sörensen (2006) that cognitive treatments, including mindfulness, have tended to produce greater effects sizes in psychological adjustment than non-cognitive approaches in interventions for dementia caregivers. The results also are consistent with findings from the general psychotherapy literature that mindfulness-based interventions are moderately effective in improving anxiety and mood for people diagnosed with psychological or physical disorders (e.g., cancer, anxiety, depression; Hofmann et al. 2010).

CBT was moderately effective in improving caregiver overall psychosocial outcome in the within-subjects meta-analysis. In fact, compared to the other intervention approaches, CBT produced the largest effect size. This is consistent with prior studies showing that CBT is effective in improving psychosocial outcomes in parents of children with DD (Hastings and Beck 2004; Singer et al. 2007). The findings also are consistent with the general psychotherapy literature that CBT is effective in reducing depression, anxiety, and stress in the general population (e.g., Richardson and Rothstein 2008).

Multicomponent interventions also were moderately effective in improving caregiver psychosocial outcomes in the within-subjects meta-analysis. This result is consistent with the findings of Hastings and Beck (2004)that multicomponent interventions are effective in improving psychological well-being in parents of children with ID and in reducing stress in parents of children with DD (Singer et al. 2007).

Finally, positive psychotherapy/written emotional disclosure also had a small effect in improving caregiver outcomes in the within-subject analyses. However, the ES was the smallest of the significant interventions. The results indicate that focusing on positive aspects in life or sharing thoughts and feelings about a stressful event in a safe environment may have a small positive impact on psychological well-being although the limited number of studies and inconsistency in the results temper our confidence in these findings.

The results for psychoeducation and social support were nonsignificant for the within-subjects analyses. However, the number of studies was limited and should be interpreted with caution. Previous meta-analyses have reported small effects for psychoeducation in reducing stress, depression, and anxiety in the general population (Donker et al. 2009; Van Daele et al. 2012) and in caregivers of people with dementia (Pinquart and Sörensen 2006).

In summary, four of the six intervention approaches demonstrated some evidence of effectiveness. ACT, mindfulness-based interventions, and CBT were moderately effective in improving caregiver psychosocial outcomes in pre-post comparisons. Although multicomponent, psychosocial and positive psychotherapy/written emotional disclosure interventions had modest support, these findings are tempered by small samples and small effects in the within-subjects results. Social support and psychoeducational interventions, however, were not effective in improving caregiver psychosocial outcomes. However, as already mentioned, due to the small number of studies in some of the intervention categories, these results should be taken with caution. Moreover, it should again be emphasized that these results refer to the within-subjects analyses. These modest effect sizes are likely to be even smaller when examined using a between-subjects design.

Moderators of Intervention Effectiveness

Group level sample characteristics related to child and caregiver had little impact on the outcomes of interventions. The lack of moderation effects for study sample composition is in part explicable because studies exhibited a range restriction in mean values on many of these variables. Also, because only about a third of the studies reported the detailed ASD diagnostic categories based on DSM-IV criteria (American Psychiatric Association 2000), and even fewer studies reported the presence of comorbid ID and problem behaviors in the offspring, there was limited ability to examine the potential moderating effects of these child characteristics on effectiveness.

Apart from the moderating impact of intervention type noted earlier, other intervention characteristics (i.e., group vs. individual delivery format, multi- vs. single-component, intervention duration, number of sessions) had little impact on the outcomes. Specifically, although group interventions were more effective than individual interventions in the within-subjects meta-analysis, the difference was not significant. Prior findings concerning delivery format have been inconsistent. For example, group interventions were less effective in improving burden and well-being than individual interventions in caregivers of older adults and dementia caregivers (Sorensen et al. 2002), whereas interventions delivered in group format were more effective in cancer caregivers (Northouse et al. 2010). Moreover, the moderation analysis was complicated by the fact that format was confounded with type of interventions (i.e., all the ACT/mindfulness-based studies and the majority of the CBT and multicomponent studies were conducted in groups, whereas all the positive psychotherapy/written emotional disclosure studies were in individual format).

In contrast to predictions that higher intervention intensity should have a positive impact on caregiver outcomes (Sorensen et al. 2002), neither total intervention duration nor the number of intervention sessions moderated the effectiveness of interventions. However, in a supplemental analysis, effectiveness was found to be related to total number of intervention hours. That is, similar to findings from Northouse et al. (2010), total intervention hours (i.e., in person, online, through phone) predicted higher mean ES in the within-subjects meta-analysis. However, the moderating effect was small and explained only 7% of the variance in the model. Because total hours combines the impact of intervention duration and frequency, it may be a more sensitive measure of intervention intensity. Other factors like attendance, homework practice, and compliance could also impact intervention outcomes (Khoury et al. 2013).

ESs differed based on the outcome measure used, suggesting that some measures were more sensitive to the effect of caregiver-focused interventions. Specifically, interventions were most effective in reducing parenting stress, depression, and anxiety and improving general well-being in both within- and between-subjects meta-analyses. In contrast, both caregiver burden and quality of life produced nonsignificant small ESs, suggesting that caregiver burden and quality of life are less sensitive to the effect of interventions, which is similar to the findings in Knight et al. (1993). However, these conclusions should be tempered with caution because of the small number of studies utilizing some of the outcome measures. Ideally, future studies should compare the sensitivities of the outcome measures within the same studies.

Limitations of the Reviewed Studies

The reviewed studies suffered from several limitations. First, the majority of the studies included small sample sizes. Although small samples have less impact on the results of meta-analyses, small sample sizes threaten internal and external validity of the findings in individual studies. Second, very few studies used randomized controlled trials, with over half of the between-subjects design studies using quasi-experimental designs, which limited the quality and reliability of the results. However, when explicitly examined, the use of an RCT design was not a significant moderator of effectiveness. Third, the majority of the studies used convenience sampling allowing participants to self-refer to the study, which may introduce selection bias to the results. It is possible that caregivers who volunteer differ from the general population of caregivers in important ways that could limit generalizability, e.g., experiencing either greater or fewer negative psychosocial outcomes, have children with either more or less severe problem behaviors. For example, the proportion of participants with a college degree was roughly twice the proportion in the general population (Ryan and Bauman 2016), marking the sample as particularly well educated. Moreover, because many studies provided insufficient information on offspring’s characteristics (e.g., comorbidities, problem behaviors), there was limited ability to evaluate external validity of the individuals with ASD. Fourth, as mentioned earlier, over half of the studies were categorized as of weak quality. Many studies did not provide important information needed to evaluate study quality, such as allocation concealment and dropout rates, contributing to the high percentage of studies rated as having weak quality. Lastly, few studies reported long term follow-up, reducing the ability to examine the long term or maintenance effect of the interventions.

Limitations of the Current Meta-analysis

Although this is the first study to systematically review interventions that specifically target psychosocial outcomes of caregivers of individuals with ASD using meta-analytic methods, there were several limitations. First, there are inherent problems when using meta-analysis to compare intervention effectiveness, as proposed by the second research question, because caregivers in each study may nonrandomly select to participate in a particular type of intervention (Dura and Kiecolt-Glaser 1990; Robinson et al. 2005), confounding preference with intervention type.

Second, because some of the moderators (e.g., outcome variables) were used in very few studies (e.g., caregiver burden, family adjustment), it was not possible to reliably examine intervention impact on these outcomes. In addition, the power to detect significance in any of the moderator analyses was likely to be low (Borenstein et al. 2009) due to the small number of studies, which may have contributed to some of the nonsignificant findings (e.g., treatment intensity, delivery format).

A couple of strengths of the current meta-analysis are worth noting. The first strength is its generalizability across countries. The review included a diverse group of participants from the US and other countries around the world, which supports the effectiveness of caregiver-focused interventions regardless of country of origin. Second, efforts to include unpublished studies to minimize the “file drawer” problems were largely successful. There was no evidence of publication bias and 37% of the studies reviewed were dissertations.

Implications and Future Research Directions

This is a developing research area compared to caregiver interventions in other populations (e.g., dementia, cancer). More studies examining interventions that target psychosocial outcomes in ASD caregivers are needed, especially studies with a rigorous study design (i.e., RCT), larger sample sizes, and long-term follow-ups. Moreover, the effectiveness of these studies was quite modest, suggesting that critical features impacting caregiver outcomes are not being addressed. For example, it is possible that interventions that focus on improving outcomes both for the child and the caregiver may have greater impact for the caregiver than interventions that focus solely on the caregiver (such as those investigated here) or focus solely on the child (assuming they also impact caregiver outcomes).

Future research should be more comprehensive in reporting details about sample characteristics (both the caregivers’ and the offspring’s) and intervention descriptions. For example, offspring characteristics, such as comorbid intellectual disability or problem behaviors, could affect severity of caregiver stress or ability to participate in the intervention and thus would be interesting and meaningful to explore.

Also, there was a restricted range of offspring’s age in the current meta-analysis, thus the effectiveness of interventions in different age groups is not clear. Studies of interventions that target ASD caregivers across the lifespan are needed. Caregiver stress tends to wax and wane, and parental psychosocial outcomes may change coinciding with the offspring’s key transition periods, e.g., receiving an ASD diagnosis, transitioning into adulthood (Stuart and McGrew 2009; Yu et al. 2018). Research is needed to examine whether caregiver-focused interventions are effective in improving psychosocial outcomes during or after the key transition periods, and whether incorporating strategies specific to the transition periods will have positive impacts on caregivers.

A key future challenge for researchers is to identify the moderators of intervention effectiveness. In addition, although not the focus of this meta-analysis, caregiver characteristics such as coping strategies, appraisal styles, and parental self-efficacy may act as moderators. For example, avoidance coping and negative appraisal styles have been identified as being particularly problematic in increasing caregiver stress and burden (Stuart and McGrew 2009; McGrew and Keyes 2014; Yu et al. 2018). Interventions that focus more closely on modifying these known risk factors may lead to greater improvement in caregivers’ psychosocial outcomes.

Methodological quality was low across a variety of quality measures. Future studies should include fidelity assessment even when the intervention is well developed, manualized, and implemented by experienced facilitators. Ensuring model adherence in intervention studies is important because it will allow better communication among researchers (Bond et al. 2000), and monitoring intervention delivery for factors like consistency, content, and quality can limit validity threats.

Conclusion

This meta-analysis provides evidence for the effectiveness of interventions specifically targeting psychosocial outcomes in caregivers of individuals with ASD. The caregiver-focused interventions on average exhibited small but positive effects in improving overall psychosocial outcomes in caregivers. Specifically, they were successful in alleviating parenting stress, depression, and anxiety and increasing general well-being. CBT and Mindfulness/ACT approaches appear to be more effective than others. As a developing field, more research is needed to examine interventions to improve caregiver outcomes in this population and potential variables that contribute to the effectiveness.