Introduction

It is well established through both cross-sectional and longitudinal studies that youth who have experienced maltreatment are overrepresented among those who self-report delinquency and have juvenile justice involvement (Esposti et al., 2020; Malvaso et al., 2018; Widom et al., 2018). Children in the child welfare system may be at particular risk. A recent study found that, of all youth younger than age 18 arrested and booked in California between 2014 and 2015, 60.2% had prior child protective services involvement (Eastman et al., 2019). Foster care placement is also strongly associated with delinquency (Goodkind et al., 2020); one longitudinal study found that 57% of youth who emancipated from foster care had been incarcerated by age 25 or 26 (Courtney et al., 2011). Of the young people arrested in the California study, over half had a child protective services’ report of maltreatment before the age of 10 (average 5.4 years; Eastman et al., 2019). Thus, delinquency prevention efforts with this population should focus on working with youth before adolescence, when delinquency typically peaks (Moffitt, 1993).

There have been many efforts to prevent delinquency among youth; common strategies include after-school programming, skills training, and youth mentoring (Welsh & Farrington, 2007). Systematic reviews and meta-analyses have found mixed results for these programs, with some indicating no benefit on delinquency (Herrera et al., 2013; Taheri & Welsh, 2016), whereas others have found such programs to have small effects (with effect sizes clustered around .20) on the prevention or reduction of delinquency (Beelmann & Lösel, 2020; Farrington et al., 2017; Lösel & Beelmann, 2003; Raposa et al., 2019; Tolan et al., 2013; van der Stouwe et al., 2020). Very few studies demonstrate long-term effects on delinquency (Hawkins et al., 2020), although there are some notable exceptions (Conduct Problems Prevention Research Group, 2010; Karcher, 2020; van der Stouwe et al., 2020). Each of the reviews calls for more rigorous studies to be conducted given the extreme heterogeneity in findings, which is likely due to the diverse populations served, different program components, variable fidelity to the models, and striking differences in rates of participant engagement and retention (Taheri & Welsh, 2016).

The reviews are consistent, however, in identifying some important delinquency prevention strategies, including social skill and mentoring interventions. Skills training programs for indicated populations that were more intensive, utilized a cognitive behavioral approach, and focused on social competencies were most effective (Beelmann & Lösel, 2020; Lösel & Beelmann, 2003; Welsh & Farrington, 2007). Across the meta-analyses of youth mentoring, effects sizes were strongest when (1) programs targeted youth with problems and/or those who had been exposed to substantial environmental risk; (2) mentors were motivated by gaining experience for their professional development, and there was a fit between their experience and the goals of the program; and (3) programs emphasized emotional support and provided structure for mentors to engage in instrumental and advocacy activities with mentees (Christensen et al., 2020; Dubois et al., 2011; Lyons et al., 2019; Raposa et al., 2019; Tolan et al., 2013).

Only one known prior study has conducted a preventive intervention with children in foster care in order to reduce delinquency. Middle School Success was designed for girls in foster care who were entering middle school. The intervention consisted of a 6-session summer group program for foster parents (caregiver management training) and a parallel girls’ skills group. During the subsequent academic year, parents and children were provided coaching to reinforce what was learned over the summer. Although only about 50% of the coaching sessions were attended, the program evidenced a marginally significant effect on a composite measure of self-reported delinquency and association with delinquent peers (Kim & Leve, 2011).

No prevention studies with children in foster care have used court records to assess delinquency outcomes, yet a recent systematic review of the association between maltreatment and offending concluded: “it has been shown that experiences of maltreatment can exert different influences on officially measured and self-reported offending outcomes” (Malvaso et al., 2018, p. 30). The authors suggest the use of multiple methods to examine delinquency in order to reduce the risk of measurement biases. Self-reports of delinquency can provide a more accurate picture of the scope of delinquency for several reasons: (1) only a fraction of delinquent behavior is detected by officials, (2) there is bias in who has contact with law enforcement, and (3) youth might have moved out of the area from which records are being collected. On the other hand, youth may deny or exaggerate their involvement with delinquent behaviors, and this may differ as a function of intervention status (e.g., those in the intervention may feel more comfortable reporting problem behaviors or they may underreport behaviors to please the investigator). Issues of study attrition are also more problematic in self-reports, as youth at the highest risk for delinquency are at the highest risk for attrition in longitudinal studies (Bosick, 2009; Brame & Piquero, 2003). Therefore, the use of both delinquency measurement approaches is warranted, especially given the low concordance between methods used in prior studies (Conduct Problems Prevention Research Group, 2010; Farrington et al., 2007; Murray & Farrington, 2010).

Fostering Healthy Futures

Fostering Healthy Futures (FHF) is a 9-month preventive intervention designed for preadolescent children placed in court-ordered foster care as a result of maltreatment. The program consists of one-on-one mentoring and skills groups, and it incorporates many of the key elements (described above) that have been shown to prevent delinquent behavior and other adverse outcomes (Taussig et al., 2007). Weekly groups employ a cognitive behavioral approach, with an explicit focus on teaching children social skills. Mentors support children in practicing and generalizing these skills in their homes, schools, and communities. There is a focus on both establishing a strong mentor-mentee relationship and on engaging in instrumental, goal-focused activities during mentoring visits, which has been shown to produce maximal benefit (Christensen et al., 2020; Lyons et al., 2019; Rhodes, 2020). FHF has demonstrated positive effects on mental health symptoms and service utilization and has also led to a reduction in placement changes, foster care costs, and residential treatment up to 1-year post intervention (Taussig & Culhane, 2010; Taussig et al., 2012, 2019; Winokur & Crawford, 2014). Longer-term effects have yet to be examined, and this is the first study to evaluate FHF’s impact on delinquency.

Current Study

The current study sought to contribute to the literature by employing a rigorous randomized controlled trial of the FHF program to examine changes in both self-reported and court records of delinquency charges over 7–12 years. The study used multilevel modeling analyses with an intent-to-treat approach to evaluate trajectories across age for total, non-violent and violent delinquency, given the well-established age-crime curve for delinquency (Moffitt, 1993) and the fact that other intervention studies have found program effects only at certain developmental periods and for specific types of delinquency (Bosick, 2009; Conduct Problems Prevention Research Group, 2010; Duncan et al., 2019; Farrington et al., 2007).

Method

Participants

Study recruitment began in August 2002 in Denver, Colorado and expanded to three additional metro-area counties in 2007 at the start of the second clinical trial. Recruitment letters were sent to families of all children between the ages of 9 and 11 years who were living in out-of-home care within the participating counties. Children were eligible for the study if they (a) had been placed in any type of out-of-home care (i.e., foster care, congregate care, kinship care) by court order due to maltreatment within the preceding year, (b) had lived in their current placement setting for at least 3 weeks, (c) resided within a 35-min drive to the intervention group sites at the time of recruitment, (d) did not have a developmental disability that would preclude them from participating in group, and (e) demonstrated adequate proficiency in English (caregivers, however, could be monolingual Spanish speaking). Participants were recruited in 10 cohorts over the course of 10 consecutive summers; the first five cohorts comprised the “pilot trial” and the second five cohorts comprised the “efficacy trial.” When multiple siblings were eligible in the pilot trial, one sibling was randomly selected to participate in the RCT. In the efficacy trial, eligible siblings were paired for randomization, and both were included in the trial. Overall, 22 sibling pairs were included in the RCT. Participation in this study was voluntary and could not be court-ordered.

For participants’ self-reports of delinquency, the follow-up time points differed between the pilot and efficacy trials, but all youth had the opportunity to participate in both a time 2 follow-up interview (6 months post intervention) and a time 3 follow-up interview (1.5 years post intervention for the pilot trial and 2.5 years post intervention for the efficacy trial). Finally, for a subgroup of participants, there was a long-term follow-up interview (called time 4) that took place when participants were between the ages of 18 and 22 (with variable amounts of time post intervention). Court records of charges (including charges through age 18) were queried for 7 consecutive years beginning 3 months after the intervention started for each cohort.

As shown in the CONSORT diagram (see Fig. 1), 90.1% of recruited children and families agreed to participate. Seventeen percent of the participants did not meet eligibility criteria based on the baseline interview (for additional details, see Taussig et al., 2019). Of the 426 children who were randomized to intervention and control groups, 89.0% were retained at time 2, 84.3% were retained at time 3, and 91.8% of those eligible for the time 4 interview were retained. Baseline characteristics of the 426 children who participated in the RCT are presented in Table 1. As shown, the sample was balanced on gender and was racially/ethnically diverse. Approximately half of the children self-identified as Hispanic, half self-identified as White, and over a quarter self-identified as Black (categories were not mutually exclusive).

Fig. 1
figure 1

FHF Consolidated Standards of Reporting Trials (CONSORT) diagram

Table 1 Baseline characteristics

The analytic sample included 391 participants (174 control and 217 intervention) for the self-reported delinquency outcomes and 425 participants (193 control and 232 intervention) for the court records of delinquency charges. Figure 1 provides reasons for missing data at any given follow-up assessment and details the exclusion reasons for the analytic sample. Not shown are the exclusion of data from 18 participants (9 control and 9 intervention) subsequent to their participation in a booster mentoring program when they were between the ages of 14 and 17.

Study Protocol

The current study was approved by the researchers’ institutional review board. Written informed consent from legal guardians and youth assent were obtained prior to each interview. Children and their legal guardians also provided assent/consent to obtain court records. Following the baseline interview, children were randomized to condition after stratifying on gender and county. Within each cohort, all children were manually randomized in a single block. Participants were interviewed at each time point by trained research assistants in a private place, typically at their residence, and research assistants were masked to condition. Participants received between $40 and $100 as compensation for each interview.

FHF is a 9-month program consisting of weekly skills groups and one-on-one, community-based mentoring. Skills groups meet for 1.5 h/week and utilize a cognitive behavioral curriculum with units addressing emotion recognition, perspective taking, problem solving, anger management, cultural identity, change and loss, healthy relationships, peer pressure, abuse prevention, and future orientation. Mentors are graduate students in social work or psychology who meet weekly with their mentees to build healthy relationships and support their mentees in practicing social skills, engaging in extracurricular activities, and thinking positively about their futures. FHF continues programming regardless of whether children change placements, reunify with their birth families, or are adopted. Mentors do not make recommendations during juvenile or family court proceedings. High rates of engagement and fidelity were achieved in the two trials: 95% of those offered FHF started the program, 92% completed it, and there was over 85% attendance at mentoring and skills groups (for additional details on the program design and fidelity, see Taussig et al., 2007, 2019).

Measures

Demographic, Maternal, and Child Welfare Characteristics

Demographic data (i.e., sex, age, race/ethnicity, county, and placement type) were obtained from child welfare records and child and caregiver reports. Trained research assistants coded Child Protection Services’ intake reports and court records of dependency and neglect petitions using a modified version of the Maltreatment Classification System (Barnett et al., 1993) to obtain maternal characteristics (i.e., substance use, criminal history, mental illness, and maltreatment history) and children’s maltreatment history (i.e., physical, sexual, and emotional abuse, physical neglect, supervisory neglect, educational neglect, and moral-legal maltreatment).

Adverse Childhood Experiences

Because adverse childhood experiences (ACEs) are associated with delinquency in both general and foster care samples (Duke et al., 2010; Garrido et al., 2017), we controlled for ACEs. ACEs were assessed with a 6-item measure developed for children in foster care comprised of physical abuse, sexual abuse, removal from a single parent household, violence exposure, caregiver, and school transitions (Raviv et al., 2010). Children’s scores of 0 or 1 for each of the six ACE items were summed to form a composite ACE index. In this sample, 87% of children were exposed to at least one ACE, and scores ranged from 0 to 5 (M = 1.6, SD = 1.1).

Self-Reported Delinquency

Delinquency was measured with The Adolescent Risk Behavior Survey (ARBS; Taussig, 1998), a youth-report measure that assesses past-year and lifetime engagement in risk behaviors including delinquency. The ARBS is a compilation of scales from two risk behavior surveys that have been used with racially and ethnically diverse samples and have shown adequate reliability and validity: the National and Denver Youth Surveys (Huizinga, 2017) and the National Adolescent Student Health Survey (American School Health Association, 1989). Table S1 in Online Resource 1 lists the delinquent behaviors that were assessed at each timepoint, whether they were classified as non-violent or violent, and the percent of youth who endorsed each behavior at each time point. Separate total, non-violent, and violent delinquency composite variables were constructed at each time point by computing the percentage of acts endorsed (yes/no) within each category. As shown in the table, between 23.0 and 37.3% of the sample endorsed at least one delinquent behavior at each time point.

Court Records of Delinquency Charges

Court records of delinquency or criminal charges filed by a prosecutor’s office in county or district court were queried for participants in the years prior to their study enrollment and then for 7 consecutive years beginning 3 months after the intervention started. Each cohort had the same number of years in which delinquency charges were coded post intervention. Court records for all participants were extracted from a state-wide electronic court case management system allowing for all filings (both juvenile and adult, as some individuals were 18 years old) to be matched on name and date of birth. Court records were coded for each year, and each offending date was counted only once. That is, if a participant was involved in a single incident that resulted in charges of assault, illegal possession of a weapon, and robbery on the same date, this incident was coded as one charge in that year (one count recorded for total, one count for violent, and one count for non-violent charges). Non-violent charges comprised a range of behaviors, including possession of a controlled substance, trespassing, theft, speeding, and criminal mischief. Violent charges involved behaviors such as menacing, assault, robbery, and murder. Variables representing the number of total, non-violent, and violent charge dates each year were created. As shown in Table S2 in Online Resource 2, between 0.2 and 8.7% of the sample had at least one delinquent charge in any study year.

Analytic Strategy

Baseline Differences and Attrition

Equivalence between intervention and control groups on baseline characteristics was assessed using chi-square tests for categorical variables and independent samples t-tests for continuous variables. Next, baseline predictors of attrition (defined as having no self-report data modeled) across condition were examined using a series of logistic regressions. A chi-square test was then used to assess whether the rate of attrition varied by intervention condition. Finally, logistic regression models examined whether interactions between each baseline variable and intervention status predicted attrition status.

Multilevel Models

Analyses proceeded in two parts, examining intervention effects on the trajectories of (1) self-reported delinquency and (2) court records of delinquency charges over age. In each instance, a separate model investigated the trajectory of total, non-violent, and violent delinquency. Multilevel models examined a quadratic trajectory of within-individual changes in delinquency over age. At the between-individual level, the models tested for intervention effects on the intercept, linear slope, and quadratic coefficients, while controlling for covariates (including those that differed at baseline), as described below. All analyses used an intent-to-treat approach and utilized maximum likelihood estimation; thus, trajectories were analyzed for all participants, including those with some missing outcome data.

Trajectories of Self-Reported Delinquency

Delinquency variety variables were operationalized as the percent of different delinquent behaviors endorsed by a participant at each wave. Given the non-normal shape of the distribution for these variables, we employed a multilevel model with a Poisson distribution. We tested the model for Poisson outcomes vs continuous outcomes and found a better fit (based on AIC and BIC criteria) for the Poisson models. The four waves of data were modeled in the following way:

Level 1: \(\mathrm{E}({\mathrm{Delinquency}}_{\mathrm{ij}}| {\upbeta }_{\mathrm{j}}) = {\uplambda }_{\mathrm{ij}}\)

$$\mathrm{log}[{\uplambda }_{\mathrm{ij}}] = {\upbeta }_{0\mathrm{j}} + {\upbeta }_{1\mathrm{j}} {\mathrm{Age}}_{\mathrm{ij}} + {\upbeta }_{2\mathrm{j}} {{\mathrm{Age}}^{2}}_{\mathrm{ij}}$$

Level 2: \({\upbeta }_{0\mathrm{j}} = {\upgamma }_{00} + {\upgamma }_{01} {\mathrm{Int}}_{\mathrm{j}} + {\upgamma }_{02} {\mathrm{Male}}_{\mathrm{j}} + {\upgamma }_{03} {\mathrm{MatCrime}}_{\mathrm{j}}\)\(+ {\upgamma }_{04}{\mathrm{ACEs}}_{\mathrm{j}} + {\upgamma }_{05} {\mathrm{BaselineDel}}_{\mathrm{j}} + {\upgamma }_{06} {\mathrm{Black}}_{\mathrm{j}} + {\upgamma }_{07} {\mathrm{Hisp}}_{\mathrm{j}} + {\mathrm{u}}_{0\mathrm{j}}\)

$${\upbeta }_{1\mathrm{j}} = {\upgamma }_{10} + {\upgamma }_{11} {\mathrm{Int}}_{\mathrm{j}} + {\upgamma }_{12} {\mathrm{Male}}_{\mathrm{j}} + {\upgamma }_{13} {\mathrm{BaselineDel}}_{\mathrm{j}} + {\mathrm{u}}_{1\mathrm{j}}$$
$${\upbeta }_{2\mathrm{j}} = {\upgamma }_{20} + {\upgamma }_{21} {\mathrm{Int}}_{\mathrm{j}} + {\upgamma }_{22} {\mathrm{Male}}_{\mathrm{j}}$$

Models controlled for baseline delinquency (collected prior to the intervention), sex, race/ethnicity (i.e., Hispanic, Black), maternal criminal history (which differed between groups at baseline), and ACEs. Subsequent trajectories of delinquency were based on three follow-up waves that spanned assessments between 10 and 22 years of age. The age variable was centered at age 10, intervention (Int) was coded as 0 = control, 1 = intervention, and all other covariates were grand-mean centered.

Trajectories of Delinquency Court Charges

Court record variables encoded for the number of total, non-violent, and violent charge dates each year. Once again, a Poisson model was used to estimate quadratic trajectories for these non-normally distributed variables. Charges were calculated for each year following the intervention, and analyzed over age (centered at youngest reported age of 8.75 at baseline):

Level 1: \(\mathrm{E}({\mathrm{Charges}}_{\mathrm{ij}} | {\upbeta }_{\mathrm{j}}) = {\uplambda }_{\mathrm{ij}}\)

$$\mathrm{log}[{\uplambda }_{\mathrm{ij}}] = {\upbeta }_{0\mathrm{j}} + {\upbeta }_{1\mathrm{j}} {\mathrm{Age}}_{\mathrm{ij}} + {\upbeta }_{2\mathrm{j}} {{\mathrm{Age}}^{2}}_{\mathrm{ij}}$$

Level 2: \({\upbeta }_{0\mathrm{j}} = {\upgamma }_{00} + {\upgamma }_{01} {\mathrm{Int}}_{\mathrm{j}} + {\upgamma }_{02} {\mathrm{Male}}_{\mathrm{j}} + {\upgamma }_{03} {\mathrm{MatCrime}}_{\mathrm{j}}\)\(+ {\upgamma }_{04}{\mathrm{ACEs}}_{\mathrm{j}} + {\upgamma }_{05} {\mathrm{Priors}}_{\mathrm{j}} + {\upgamma }_{06} {\mathrm{Black}}_{\mathrm{j}} + {\upgamma }_{07} {\mathrm{Hisp}}_{\mathrm{j}} + {\upgamma }_{08} {\mathrm{County}1}_{\mathrm{j}}\)\(+ {\upgamma }_{09} {\mathrm{County}2}_{\mathrm{j}} + {\upgamma }_{10} {\mathrm{County}3}_{\mathrm{j}} + {\mathrm{u}}_{0\mathrm{j}}\)

$${\upbeta }_{1\mathrm{j}} = {\upgamma }_{10} + {\upgamma }_{11} {\mathrm{Int}}_{\mathrm{j}} + {\upgamma }_{12} {\mathrm{Male}}_{\mathrm{j}} + {\mathrm{u}}_{1\mathrm{j}}$$
$${\upbeta }_{2\mathrm{j}} = {\upgamma }_{20} + {\upgamma }_{21} {\mathrm{Int}}_{\mathrm{j}} + {\upgamma }_{22} {\mathrm{Male}}_{\mathrm{j}}$$

In addition to sex, race/ethnicity, maternal criminal history, and ACEs, the model controlled for participant’s county at the time of placement in foster care (four counties dummy-coded with three variables) because counties differed in their methods of reporting of charges to the state.

In order to examine the impact of non-independent data from siblings, analyses were run with the full sample and then replicated with a sample that contained only one randomly selected sibling. Because no significant differences were found after dropping one sibling, results are reported for the full sample. In addition, the analyses contain data from two separate clinical trials (pilot and efficacy); therefore, we tested whether the trial moderated the significant impact of the intervention on outcomes.

Results

Differences on Baseline Characteristics

Across the 28 estimated baseline comparisons (see Table 1), there were three statistically significant and two marginally significant differences. Intervention youth were more likely than control youth to have mothers with criminal histories, \({\chi }^{2}(1, {\rm N}=426)=4.64\), p = .03. They also had lower rates of self-reported total, t(324.82) = 2.49, p = .01, and non-violent, t(301.89) = 2.70, p = .007, delinquency; in contrast, there was a statistical trend for intervention youth to have more total court charges, t(231.00) = –1.90, p = .06, and non-violent court charges, t(231.00) = –1.74, p = .08, compared to control youth (only intervention youth had charges at baseline).

Attrition

Attrition analyses were conducted for self-reported delinquency only. We first compared those with any data modeled (n = 391) to those whose data were not modeled (n = 35). There was no difference in the rate of attrition by intervention condition, \({\chi }^{2}(1, {\rm N}=426)=1.24\), p = .27. Further, there were no significant predictors of attrition across conditions. Interaction analyses examined whether intervention status moderated the impact of any baseline characteristics on attrition. Of the 28 analyses conducted, only one was statistically significant. Specifically, there was a significant intervention group × maternal criminal history interaction predicting attrition, b = −1.58, SE = 0.74, p = .03. Follow-up analyses indicated that there was a trend for intervention youth with maternal criminal history to be less likely to attrit, b = −.97, SE = .52, p = .06; control youth with maternal criminal history were nonsignificantly more likely to attrit.

Outcome Analyses

Self-Reported Delinquency

Table 2 reports estimates of change in total, non-violent, and violent self-reported delinquency over age. Overall, the significant positive effect of age and the significant negative effect of quadratic age coefficients indicated that youth delinquency followed a curvilinear trajectory that increased during early-to-mid adolescence and declined during mid-to-late adolescence. Intervention status was not associated with the delinquency intercept or linear slope for any outcome. However, intervention status was associated with lower quadratic coefficients for total (b = −.03, p < .01; ER = .97, 95% CI for ER = [0.95, 0.99]) and non-violent delinquency (b = −.05, p < .001; ER = .95, 95% CI for ER = [0.95, 0.96]). That is, the intervention was associated with a 3% decline in the event ratio for the quadratic coefficient for total delinquency and a 5% decline in the event ratio for the quadratic coefficient for non-violent delinquency. Given the non-linear nature in the interpretation of the event ratio (similar to an odds ratio) results, we also provide an illustration of the intervention effects on the predicted trajectories of delinquency over age. As can be seen in Fig. 2a, total self-reported delinquency began to decelerate at an earlier age and at a faster rate for the intervention group as compared to the control group, resulting in the overall lower mid-adolescent peak of delinquent behavior. Youth in the intervention group had 30% less self-reported delinquency than those in the control group at age 14, 59% less at age 16, and 82% less at age 18. The pattern of effects was similar for non-violent delinquency (see Fig. 2b), albeit at overall lower rates. Compared to the control group, youth in the intervention group evidenced 33% less delinquency at age 14, 51% at age 16, and 76% at age 18. There was no significant intervention effect for self-reported violent delinquency.

Table 2 Estimated coefficients for intervention effects on the trajectories of self-reported delinquency over age
Fig. 2
figure 2

Trajectories of total (a) and non-violent (b) self-reported delinquency over age by intervention condition

Court Records of Delinquency Charges

Table 3 reports estimates of change in total, non-violent, and violent delinquency court charges over age. Similar to self-reported delinquency, a significant positive effect of age and a significant negative effect of quadratic age coefficients indicated that rates of charges followed a curvilinear trajectory that increased during early-to-mid adolescence and declined during mid-to-late adolescence. The intervention group experienced a higher intercept for violent charges (i.e., they had higher rates of pre-baseline charges at age 9; b = 4.58, p < .01; ER = 97.6, 95% CI for ER = [3.6, 2.651]). However, intervention status was associated with a lower linear rise in total charges (b = −.97, p < .05; ER = .38, 95% CI for ER = [0.15, 0.99]) and violent charges (b = −1.48, p < .01; ER = .23, 95% CI for ER = [0.08, 0.69]). That is, the intervention was associated with a 62% decline in the event ratio for the linear slope for total charges and a 77% decline in the event ratio for the linear slope for violent charges. Figure 3 provides an illustration for how these non-linear intervention effects translate into counts of delinquency charges at each age. As can be seen in Fig. 3a, the intervention group had higher counts of early (ages 9–12) modeled total charges, but lower rates during the peak offending years (around ages 15 and 16). Compared to the control group, youth in the intervention group had 15% fewer total delinquency charges at age 15 and 16. For charges involving violent behavior (see Fig. 3b), the control group experienced a sharp rise that peaked around age 16, whereas the intervention group experienced a more shallow rise in violent charges over age. Compared to the control group, youth in the intervention group evidenced 30% fewer violent delinquency charges at age 15 and 28% fewer at age 16. For both total and violent delinquency charges, the pattern of differences was reversed by age 18. However, due to scarcity of data at the upper age range and the tendency of the quadratic equations to overestimate the symmetric pattern of the outcomes (i.e., the robust rise before peak offending is necessarily matched by the symmetric robust decline after peak offending), we hesitate to make decisive conclusions about the nature of the intervention effect at age 18. There was no intervention effect on non-violent charges.

Table 3 Estimated coefficients for intervention effects on the trajectories of court reported charges of delinquency over age
Fig. 3
figure 3

Trajectories of total (a) and violent (b) delinquency court charges over age by intervention condition

Moderating Effect of Study Trial.

There were no significant intervention × trial interactions in predicting delinquency.

Discussion

This study sought to examine whether the Fostering Healthy Futures (FHF) program had an impact on the longitudinal course of delinquency for children who had been in foster care during preadolescence. Despite the heterogeneity of their life experiences and the multitude of factors that contribute to engagement in delinquency, results suggest that the FHF program weakened the pipeline that is too often found between child maltreatment and delinquency as well as between the child welfare and juvenile justice systems. The effects of the program were not immediate nor were they invariant across types of delinquency and measurement method. For self-reported total and non-violent delinquency, quadratic effects suggested that children in the intervention (relative to the control group) had a slower and lower rise in their delinquent behavior until the age of 14, which was followed by a much more precipitous drop until young adulthood. For court records of delinquency charges, there was a steady rise in both groups until the age of 16, but the control group had a higher linear rise, especially for violent delinquency. We hypothesize, based on prior studies, that the different trajectories are due to youth engaging in and endorsing more delinquent behaviors prior to coming to the attention of the authorities (Bosick, 2009; Farrington et al., 2007). The different impact of the intervention on non-violent and violent outcomes for the two different measures of delinquency is difficult to understand, although there was not 1:1 item correspondence between the measures. Other studies have found differences by informant and subtype of delinquency, most without clear explanation (Cernkovich et al, 1985; Conduct Problems Prevention Research Group, 2010; Cuervo et al., 2018; Malvaso et al., 2018; Mersky et al., 2012; van der Stouwe et al., 2020). Certainly, this is an important area for further investigation. Given the strengths and limitations of both types of data collection, the positive findings in the current study across both methods of data collection for total delinquency are encouraging. Other strengths of the study include the fact that full intent-to-treat analyses were conducted, there was no differential attrition across groups, and the models included baseline delinquency and covariates that differed between groups at baseline.

Although it is difficult to compare effects sizes in the current study to those of studies which have not employed multiple time points, the small to medium size of the effects replicates what has been found across systematic reviews and meta-analyses of programs intended to reduce delinquent behavior and in evaluations of programs that utilize skills training and mentoring as core intervention components (Beelmann & Lösel, 2020; Dubois et al., 2011; Farrington et al., 2017; Raposa et al., 2019; Sandler et al., 2014). Indeed, it may be that the combination of these preventive intervention components and high rates of program fidelity enabled the FHF program to evidence this impact on delinquency (Christensen et al., 2020; Taheri & Welsh, 2016). It is rare in studies with high-risk and highly mobile populations to achieve such high rates of program engagement and completion. For example, in a review of mentoring programs for children of incarcerated parents, almost half of children did not participate for the expected duration. Even more concerning were the recruitment and retention rates; in one study, there was a 33% enrollment rate, a less than 80% completion rate for the baseline assessment, and a 50% retention rate for the post-assessment (Jarjoura, 2016). These patterns of attrition underscore the need to collect court record data in addition to self-reports of delinquency in order to model change over time as a function of intervention status.

Another strength of the current study was its external validity. In reviews of prevention programming for youth, a self-selection bias has been noted (Taheri & Welsh, 2016). In the current study, all children who had recently been placed in out-of-home care within the target age range, with few exceptions, were recruited for the study; of those, 90% enrolled and 95% of those randomized to the intervention began it. Since children did not sign up for the program and they were not referred by caseworkers, the results can be generalized to children whose families or caseworkers might not recognize a need to enroll their child in a preventive intervention or who believe that a child/family might not engage.

The study was not, however, without limitations. There remain many questions about the long-term efficacy of skills training and mentoring on reducing delinquency, and these program components cannot be disentangled. Furthermore, children and families were not masked to condition and the impact of adjunct services or baseline placement type were not examined. There were some baseline differences on delinquency which were accounted for in the models, but the impact of these differences on the trajectories is unknown. Court records of delinquency charges were modeled, which does not account for all law enforcement contact related to delinquent behavior nor reflect an official finding of guilt on the charges. Youth may have been arrested but never had a case filed, or had a case filed that was ultimately dismissed or expunged. Additionally, the sentence (e.g., diversion, probation, or detention/incarceration) a youth received if found guilty was not included. Importantly, individuals who are incarcerated are less likely to have repeat charges due to restricted opportunities to offend (Cernkovich et al., 1985; Widom et al., 2018). This might have contributed to the drop in number of charges for the control group at older ages, as the control group may have had more and longer periods of detention due to their higher levels of delinquency at earlier ages. Finally, as is the case with most longitudinal studies of delinquency prevention programs, program process (including dose-response effects), moderating and mediating effects were not examined (Sandler et al., 2014); these remain directions for future inquiry.

Conclusions and Implications

Given that entry into the child welfare system typically occurs before children have contact with the juvenile justice system, there is an important window of opportunity to intervene with children who have experienced maltreatment and foster care to prevent delinquency and justice system involvement. Despite this prospect, there are few “true” delinquency prevention programs in the USA. Moreover, few have demonstrated long-term efficacy, and only a fraction are designed for our highest-risk youth (Conduct Problems Prevention Research Group, 2010; Lab, 2018). Despite the drop in youth crime in the early part of the twenty-first century, the costs of confinement of children and adolescents ranges between $8 and $21 billion a year in the USA (Justice Policy Institute, 2014). A decade ago, it was estimated that diverting a “14-year-old high-risk juvenile from a life of crime” would save between $2.6 and $5.3 million (Cohen & Piquero, 2009). Although there is scant evidence for the long-term efficacy of delinquency prevention programs (Farrington & Hawkins, 2019), and some suggestion that mentoring programs for high-risk youth might be iatrogenic (cf. McCord, 1978), there has been enormous public and private investment in these programs (Justice Policy Institute, 2009; Taheri & Welsh, 2016). Public policy, it seems, has been operating independent of the evidence, despite frequent calls to only fund programs “that work.”

FHF, which was implemented before the overwhelming majority of children had entered the juvenile justice system, is the only known prevention program to have demonstrated a reduction in delinquency over a 7–12-year period for youth who had experienced the deleterious impacts of child maltreatment and foster care placement. This study exemplifies the importance of investing funding in replication trials and longitudinal follow-up; without such support, the findings presented in this paper would not be possible. Programs such as FHF are an important part of the prevention continuum for children and youth who have experienced child welfare involvement. The Family First Prevention Services Act of 2018 provides a federal reimbursement mechanism for programs which are evidence-based. Such funding mechanisms, when paired with effective programming and community support, provide key infrastructure for larger scale dissemination. Attention to critical implementation strategies is vitally important to ensure high-quality transportability of interventions from research-based settings into community-based settings (Fagan et al., 2008; Mihalic & Irwin, 2003). Best practices for training providers, user-friendly manuals, availability of pragmatic approaches for fidelity monitoring, and sustainable funding are necessary in order to bring more innovative prevention programs, such as FHF, to scale. When such implementation practices are developed concurrently with the intervention (Curran et al., 2012), engaging and effective prevention programming can reach the intended population sooner, enabling more youth to chart a healthy course through adolescence and adulthood.