Introduction

Key theories of the development of substance use highlight cascading biological and environmental influences that are both independently and jointly associated with later adolescent substance use (Dodge et al. 2009). These developmental cascades are likened to individual streams (e.g., particular influences, like genetics and prenatal exposures) that flow into each other and grow larger. Later, new streams (e.g., parenting environment) add to the growing rivers, and as the developmental influences come together and grow stronger, they gain momentum leading to outcomes (e.g., adolescent substance use) that seem unavoidable, at least near the end of the developmental cascade (Dodge et al. 2009; Masten et al. 2005). A feature of this theory is that the earliest influences are small, observable mainly through more proximal or intermediary influences and processes that involve both the child and their environment, whereas ‘downstream’ developmental influences have much stronger and seemingly less tractable associations with the developmental outcome.

Developmental cascades have been particularly important for understanding adolescent substance use, which is influenced by multiple factors often conceptualized as working together to increase risk via accumulating, dynamic processes (Dodge et al. 2009). The earliest influences on adolescent substance use include genetics (Hopfer et al. 2003) and prenatal risk (particularly substance use exposure; Irner 2012). Intermediary factors contributing to risk for adolescent substance use include parenting during childhood and adolescence (i.e., harsh discipline, low warmth, and low monitoring; Donaldson et al. 2016; Neiderhiser et al. 2013), peers during later childhood and adolescence (e.g., delinquency, substance use; Schelleman-Offermans et al. 2013) and pubertal timing (Marceau and Jackson 2017). The most proximal and strongest predictors of adolescent substance use are internalizing (Hussong et al. 2017) and especially externalizing problems during middle childhood and early adolescence (Hicks et al. 2011).

The present study used the Avon Longitudinal Study of Parents and Children (ALSPAC) to assess developmental cascades from polygenic risk and prenatal substance use through harsh discipline, pubertal onset, and externalizing and internalizing problems to late adolescent substance use outcomes (see Fig. 1, conceptual model). Examining these multiple influences together will elucidate how pubertal timing and parental harsh discipline operate within a larger developmental cascade, or highlight which factors emerge as particularly important in the context of a larger model of the development of substance use. Externalizing and internalizing problems in adolescence can be either positively correlated in the case of comorbidity, or negatively correlated in the case of differentiated problems within individuals. This phenomenon makes it difficult to understand the relative role of problem type vs. severity of comorbid problems in many studies. A novel contribution of this study is our conceptualization of externalizing and internalizing problems using a severity–directionality model (Essex et al. 2003). This model provides a unique and important within-person perspective on comorbidity by reorganizing externalizing and internalizing problems into a score reflecting the severity of comorbid problems, which is measured orthogonally to a second score reflecting the differentiation or specificity of problem type as predominantly externalizing or internalizing. The severity–directionality model of externalizing and internalizing symptoms is expected to clarify the role of behavioral problems in developmental cascades to adolescent substance use.

Fig. 1
figure 1

Conceptual model. System severity and directionality are orthogonal constructs. Ext > Int denotes a preponderance of externalizing problems rather than internalizing problems. Int > Ext denotes a preponderance of internalizing rather than externalizing problems. (+) Indicates hypothesized positive correlations (e.g., a risk factor). (−) Indicates hypothesized inverse associations (e.g., a protective factor). Hashed lines denote correlations among predictor that are present in families where the same parents provide genetic, prental and parenting influences, as in the present study

Pubertal timing and substance use

Puberty unfolds during late childhood/early adolescence and encompasses sex-specific hormone changes and visible changes in secondary sex characteristics (e.g., females: breasts, males: facial hair) that are driven by sex-specific biology (Grumbach and Styne 2003). There is strong evidence that in females, starting puberty earlier is linked to more externalizing and especially internalizing problems (Ullsperger and Nikolas 2017), as well as increased adolescent substance use (Marceau et al. 2015a). In the ALSPAC cohort, girls’ earlier pubertal timing has been linked to alcohol and cannabis use (but not smoking; Hummel 2014). These findings are primarily explained by the developmental readiness hypothesis, stating that youth who go through puberty early are not cognitively or emotionally prepared for the biological and social changes associated with the physical maturation of puberty, which increases risk for maladjustment (Ge and Natsuaki 2009). Early pubertal onset is thought to influence adolescent substance use through biological mechanisms (i.e., hormone changes) and by socially-driven mechanisms (e.g., shifting responsibilities and relationships at home; associating with older peers; Mendle et al. 2007).

For males, both earlier and later pubertal timing have been related to externalizing problems as well as earlier and faster progressions though a variety of substance use milestones (Mendle and Ferrero 2012). The social compensation hypothesis suggests that some boys who are, or who perceive themselves to be, less physically mature due to delayed pubertal onset may engage in more risky behaviors (e.g., substance use) to ‘prove’ their maturity despite their more juvenile appearance (Marceau et al. 2015a). This mechanism is likely to be environmentally mediated, given evidence later puberty is more often associated with substance use outcomes in boys when assessed via visual cues (i.e., self-report or secondary sex characteristics). This is in contrast to measures of hormone levels or changes, which tend to show effects of earlier maturation on substance use involvement (Marceau et al. 2019a). The measure of puberty used in ALSPAC is parent- and youth-reported ratings of secondary sex characteristics, and thus profiles of pubertal risk for late adolescent substance use are expected to be earlier onset for girls but later onset for boys. Finally, based on recent findings suggesting that earlier externalizing and internalizing problems do not predict the timing of puberty (Beltz et al. 2020; Deardorff et al. 2021), pubertal timing is placed before internalizing and externalizing problems in the present developmental cascade (Fig. 1).

Initiation of the proposed developmental cascade

Pubertal timing is genetically influenced (Day et al. 2017), and associated with key prenatal insults, specifically substance use during pregnancy (Irner 2012; Maisonet et al. 2010). Twin studies have found genetic influences common to pubertal timing and adolescent substance use in girls (Corley et al. 2015), and externalizing problems in one study (Vaughan et al. 2015) but not in others (e.g., Dick et al. 2000; Eriksson et al. 2005). However, twin studies (Marceau et al. 2012) and LD regression studies (Zheng et al. 2017) have found that genetic influences on pubertal maturation are largely non-overlapping with those on internalizing phenotypes. Two recent studies found that pubertal timing did not mediate the association of family history of substance use with adolescent substance use (Marceau et al. 2019a; Weigard et al. 2020). These findings suggest that genetic risk for substance use is unlikely to initiate developmental cascades involving pubertal timing (e.g., in contrast to Corley et al. 2015, which was a bivariate twin model first estimating genetic influences on pubertal timing). Further, in the ALSPAC cohort, timing of girls’ breast development partially mediated associations of polygenic risk for early pubertal timing and self-reported depressive symptoms at age 16.5 years (Horvath et al. 2020). Thus, available evidence points to genetic risk for earlier pubertal timing specifically as a potential initiator of developmental cascades involving pubertal timing. We extend this work to explore whether substance use during pregnancy is also a potential initiator of developmental cascades involving pubertal timing. Our manuscript uses the same measure of genetic risk as presented in Horvath et al. (2020), but embedded in a larger developmental model (with prenatal risk and parenting), including a different construct for child behavior (severity and directionality) that is parent-reported, a different quantification of pubertal timing, and extending out to youth-reported substance use.

Intermediary influences in the proposed developmental cascade

Parental harsh discipline

Parental harsh discipline has emerged as a key, intermediary predictor of externalizing problems and adolescent substance use (Brenner and Fox 1998; Riggins-Caspers et al. 2003). Even in the context of developmental cascades including genetic and prenatal risks and stress biology, adoptive parents’ harsh discipline has emerged as a key influence predicting more externalizing and internalizing problems in childhood (Marceau et al. 2015b). There is empirical support of associations of physical discipline with later adolescent externalizing and internalizing problems in studies of biologically-related families that were stronger than other parenting measures (e.g., Bender et al. 2007). Although there is evidence of reciprocal relations (e.g., Lansford et al. 2011) as well as evocative gene–environment correlation processes (Button et al. 2008) underlying associations of parents’ discipline and child behavior problems, harsh discipline goes beyond contingent responsivity. Indeed, harsh discipline is considered an ineffective strategy that is a key feature of coercive cycles whereby youths’ externalizing behaviors elicit ineffective punishment from parents, which backfires and instead leads to worse externalizing problems repeatedly over time (Patterson 1982). Our focus is on this latter pathway, of harsh discipline predicting later problems, although we acknowledge that the associations likely arise due to prior transactional, evocative processes. Thus, based on a large body of work, parental harsh discipline is expected to play a key role in developmental cascades to adolescent substance use (Dodge et al. 2009) primarily through more proximal associations with adolescent externalizing (Bailey et al. 2009) and, to a lesser extent, internalizing problems (Rose et al. 2018). Additionally, by including a measure of harsh discipline in the developmental cascade from genetic influences on puberty through pubertal onset to transitions to substance use, we may identify whether puberty-related genes represent a specific gene–environment correlation in this process.

Externalizing and internalizing problems

One of the strongest predictors of adolescent substance use is earlier behavior problems (Colder et al. 2013; Lansford et al. 2008). Externalizing problems are more often associated with substance use, but internalizing problems are as well (Hussong et al. 2017). Further, there is evidence supporting developmental cascades such that internalizing and especially externalizing problems mediate links from genetic and environmental (e.g., parenting) risk factors for adolescent alcohol, cigarette, and marijuana use (Marceau et al. 2020a; Trucco et al. 2016, 2018; Weymouth et al. 2019). Although the role of externalizing problems is clear and robust, the role of internalizing problems is less clear, and may be substance-specific or model-dependent (Marceau et al. 2020a; Trucco et al. 2018). A test of competing hypotheses for the developmental influence of internalizing problems on adolescent alcohol use suggested that internalizing symptoms that co-occur with externalizing problems put children at higher risk for alcohol use than chronically elevated “pure” internalizing problems (Scalco et al. 2020). This finding is consistent with the idea that the common etiology shared between externalizing and internalizing (Marceau and Neiderhiser 2020; Rhee et al. 2015) may drive positive associations between internalizing problems and substance use during adolescence. Findings of protective effects of internalizing problems for some forms of substance use (including in the ALSPAC cohort for early adolescent alcohol use; Edwards et al. 2014) may reflect the effect of specific, “pure” internalizing problems: for example, anxiety-related behaviors could lead to reduced involvement with substance-using peers (Colder et al. 2013). Novel approaches to understanding the role of internalizing problems in developmental models and in relation to externalizing problems are needed to extend this literature. One such novel approach is to leverage the severity–directionality model of comorbid internalizing and externalizing problems (Essex et al. 2003).

Severity and directionality

In the severity-directional model, the dimension of symptom severity reflects the severity of total problems (Marceau and Neiderhiser 2020). To achieve very high severity, youth must have both types of problems, and so high symptom severity can be thought of as a measure of the within-person comorbidity of externalizing and internalizing problems. Symptom directionality is an orthogonal dimension, indicating the preponderance of symptoms on a scale from pure internalizing (i.e., more negative values) to pure externalizing (i.e., more positive values). Scores near zero on this dimension reflect balanced or comorbid externalizing and internalizing problems, regardless of severity of symptoms (if any symptoms are present) or a lack of either (if no symptoms are present).

A recent twin study found that both severity and directionality of externalizing and internalizing problems were highly heritable (~ 60–65%; Marceau and Neiderhiser 2020). Identified predictors of symptom severity include familial factors (i.e., early life stress, family conflict/resolution; Essex et al. 2011; Marceau et al. 2015c; Neiderhiser et al. 2016) likely reflecting both genetic and shared environmental influences. However, there is some evidence that the prenatal environment may be particularly important for the directionality of problems experienced in childhood and early adolescence. In an adopted-at-birth design, exposure to obstetric complications in the context of genetic risk for substance use predicted a preponderance of internalizing problems at 4.5 years (Neiderhiser et al. 2016) whereas smoking during pregnancy predicted a preponderance of externalizing problems in 7–16 year olds in a sibling comparison study of maternal smoking during pregnancy (Ekblad et al. 2020). To date, no studies have examined whether severity or directionality of symptoms predict adolescent substance use, or whether severity or directionality plays a role in developmental pathways to adolescent substance use. Based on the studies of transitions to substance use reviewed above, we expect that symptom severity (regardless of type) would predict increased adolescent substance use, and that directionality would independently predict substance use such that a preponderance of internalizing problems would be protective, but a preponderance of externalizing problems would be risky for adolescent substance use.

Present study

The present study uses data from the Avon Longitudinal Study of Parents and Children (ALSPAC), a large cohort of youth followed beginning in the prenatal period and assessed through late adolescence, in order to test a novel developmental cascade from genetic and prenatal substance use through pubertal timing and parenting to behavioral risk and subsequently adolescent substance use. A key novel contribution of this study is our attention to early developmental influences, particularly genetic risk for early pubertal timing and prenatal risks. A second novel contribution of this study is that we leverage the severity–directionality model of comorbidity to shed a new and complementary light on the literature examining the unique and overlapping roles of internalizing and externalizing problems on the development of substance use. We examine alcohol, cigarette, and cannabis use progression (e.g., status along a series of use milestones for each of these three substances that are commonly used during late adolescence) in order to explore whether the developmental pathways assessed here are substance-specific or general. Finally, as is standard in studies of puberty, we examine boys and girls separately because of qualitative differences in the underlying biology, physical manifestations, and timing of puberty across sexes, and because of established sex differences in the robustness and direction of associations of pubertal timing and substance use.

Hypotheses

Several general predictions emerge from our conceptual model (Fig. 1). We expect that genetic risk for pubertal timing will predict pubertal onset. We expect that prenatal substance use exposure will predict earlier pubertal onset, increased symptom severity and directionality (specifically a preponderance of externalizing problems following Ekblad et al. 2020), and increased substance use progression. We expect parents’ harsh discipline will predict higher symptom severity based on findings from studies showing that familial factors predict severity but not directionality (e.g., Marceau et al. 2015c; Neiderhiser et al. 2016) and more speculatively, substance use. We expect that earlier pubertal onset will predict higher externalizing/internalizing symptom severity, directionality (a preponderance of internalizing problems) and substance use progression in girls, but that later pubertal onset will predict higher symptom severity, symptom directionality (a preponderance of externalizing problems) and substance use progression in boys. Finally, we hypothesize that both symptom severity and directionality (specifically a preponderance for externalizing problems) will predict more substance use progression. We expect that youth with directionality scores indicating a strong preponderance of internalizing problems will have particularly low rates of substance use. We expect to find indirect effects across these influences in line with Fig. 1, supporting developmental cascades.

Method

Participants

The current study used data from the ALSPAC (Boyd et al. 2013; Fraser et al. 2013). Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees. The initial ALSPAC cohort included 14,541 pregnancies with expected delivery dates between April 1st, 1991 and December 31st, 1992; of these, 13,988 children were alive at 12 months. This cohort was bolstered when the children reached 7 years of age, resulting in 913 additional children (although these children were missing for prenatal risk). In the total ALSPAC sample (the initial sample plus 913 children from the age 7 bolster), there are data available for 15,589 fetuses (14,901 alive at 1 year of age). Of these children, 96% were White. The analytic sample for the current study was restricted to youth from the total sample who were of European ancestry and had both genetic data and any phenotypic data (n = 8791). Phenotypic data included prenatal substance use data, data on pubertal timing, parent discipline at age 9.5 or 11.5 years, internalizing and externalizing problems at age 13 years, substance use at 18.5 years, and covariates (i.e., birth weight, gestational age, and family adversity at 2–4 years of age, sex, and age at the age 13 and 18.5 year assessments). Missing data patterns are presented after the measures. The study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool: http://www.bristol.ac.uk/alspac/researchers/our-data/. Analysis of these data was also approved by the Purdue University Institutional Review Board.

Measures

Polygenic risk for later pubertal timing

A total of 9912 children were genotyped using the Illumina HumanHap550 quad genome-wide SNP genotyping platform, generated by Sample Logistics and Genotyping Facilities at Wellcome Sanger Institute and LabCorp (Laboratory Corporation of America) using support from 23andMe. Data were imputed to 1000 genomes phase 1 version 3 using the Impute2 software (see ALSPAC documentation for further details). Standard quality control was conducted on the genome-wide SNP data using Plink (Purcell et al. 2007); we excluded samples with sex mismatch, call rate < 95%, unexpected first-degree relatedness, too much or too little heterozygosity, and non-European ancestry. This resulted in a total n of 8952 individuals with genetic data.

The polygenic risk score (PRS) was calculated using summary statistics from the largest available GWAS on age at menarche to date: Day et al. (2017) which included data collected from women in 40 studies from the ReproGen consortium, 23AndMe, and Biobank studies (N = 328,345). This sample did include ALSPAC (~ 3% of the sample), however, prior work found that this inclusion of ALSPAC data did not overly inflate effects of the PRS based on sensitivity analyses using weights from the Day et al. (2017) replication sample (deCODE; N = 39,543 women; Horvath et al. 2020). Day et al. (2017) identified 389 SNPs and corresponding effect weights associated with age at menarche at the genome-wide level (p < 5e−8) with available summary statistics. Of those, 372 were included in ALSPAC, and 337 remained after quality control (e.g., minor allele frequency < 0.01, missingness per marker > 0.01). All 337 SNPs showed adequate genotyping and Hardy–Weinberg equilibrium. There was no evidence of linkage disequilibrium (> 0.3) among these 337 SNPs. Thus, the PRS was calculated by summing (across SNPs for each individual) the product of the effect of each included SNP as identified by Day et al. (2017) and the number of effect alleles present for each individual (Maier et al. 2018). These scores were created and used previously (Horvath et al. 2020). Notably, Day et al. (2017) also provided summary statistics for age at voice break in boys. Here, we used only the PRS for age at menarche, as Horvath et al. (2020) showed that this score better predicted pubertal timing (estimated age at Tanner Stage 3 from growth models) for both boys and girls than the corresponding PRS for males’ age at voice break.

Prenatal substance use

Mothers were assessed twice during pregnancy: (i) prior to 18 weeks gestation, and (ii) around 32 weeks gestation. In the first questionnaire, mothers reported on the number of cigarettes per day they smoked “at the moment”, as well as how many drinks of beer or lager (half-pints), wine (glasses), spirits (pub-measures), and other alcohol drinks (pub-measures) on week days and weekend days. The ALSPAC team created a variable for total weekly alcohol consumption by summing the responses on these alcohol measures. In the second assessment, women reported on the number of cigarettes smoked per day, binned into 0 cigarettes = 0, 1–4 cigarettes = 1, 5–9 cigarettes = 5, 10–14 cigarettes = 14, 15–19 cigarettes = 15, 20–24 cigarettes = 20, 25–29 cigarettes = 25, and 30+ cigarettes = 30 in the first 3 months of pregnancy (e.g., 1st trimester), and in the past 2 weeks (e.g., 3rd trimester). Women reported on how often they drank alcohol drinks (never, < 1 glass/week, at least 1 glass/week, 1–2 glasses every day, at least 3–9 glasses every day, or at least 10 glasses) “in the first three months of this pregnancy” (e.g., 1st trimester), as well as “around the time you first felt the baby move” (typically during the 2nd trimester) with instructions to report on the past 2 weeks if she had not yet felt the baby move. Women also reported on the frequency with which they smoked cannabis (everyday, 2–4 times/week, once/week, < once/week, not at all) “in the first three months of this pregnancy” and “between then and now”. Finally, women also reported on amphetamine, barbiturate, crack, cocaine, heroin, methadone, ecstasy, and other drug use “during this pregnancy” on a scale including the following options: nearly every day, once+ per week, once+ per month, or not at all.

Because we were interested in substance use that could potentially harm the fetus, and because the metrics and number of assessments for each substance differed, we used the McNeil–Sjöstrom scale to create a weighted risk total score (McNeil et al. 1994). Specifically, this scale assigns risks values (1 = Not harmful or relevant; 2 = Not likely harmful or relevant; 3 = Potentially but not clearly harmful or relevant; 4 = Potentially clearly harmful or relevant; 5 = Potentially clearly greatly harmful/relevant; 6 = Very great harm to or deviation in offspring). For substance use during pregnancy these scores are based on both the substance used and frequency of use (see “Appendix” Part A, and Marceau et al. 2016 for details). Because some more commonly used substances were reported multiple times during pregnancy, while other more harmful substances were reported only about the whole pregnancy, we aggregated the repeated measures by taking the maximum score across assessments (e.g., for alcohol, smoking, and cannabis). The weighted severity score is the sum of the weighted scores for each individual substance across pregnancy, representing the severity of risk to the fetus of cumulative poly-substance use exposure.

Parental harsh discipline

Parents’ harsh discipline was measured at the 9.5 and 11.5 year assessments via parent report with the two items from the Discipline and Lifestyle questionnaire “How often is he/she punished?” and “How often do you slap or hit him/her?”. Each item was scored as follows: 0 = never, 1 = rarely, 2 = once or twice a month, 3 = once or twice a week, 4 = several times a week, 5 = every day. Items were averaged first within-assessment, but because of high correlations of the scales across assessments, r = 0.64, and better reliability of a four-item scale than a two-item scale, we averaged all four items into a single discipline item reflecting late childhood (age 9.5 and 11.5 years).Footnote 1 Items were correlated r = 0.31–0.57 within and across assessments, Cronbach’s α = 0.73.

Pubertal timing

Pubertal maturation was assessed via Tanner Stage line drawings (Morris and Udry 1980) 9 times (approximately yearly) from age 8 to 17 years. Raters could be parent only, child only, parent and child, “other” only, parent and other, child and other, or parent, child and other, and varied across assessments; most typically parents and children worked together in the early years, and children reported alone in the later years. Samples sizes by reporter and stage at every wave have been previously reported in the Supplemental Materials of Horvath et al. (2020). We operationalized pubertal timing as the age at the assessment in which Tanner Stage 2 was achieved from the repeated measures.

First, we located the first assessment at which Tanner Stage 2 was endorsed and coded the age at that assessment as our indicator of pubertal onset. Then, data cleaning (detailed in “Appendix” Part B) included removing individuals who never reached Tanner Stage 2 in the observed data and did not have data past the 6th wave. If there were data after wave 6, we top-coded so that they received an age at Tanner Stage 2 one year after their last observed assessment of Tanner Stage 1 in order to include them as relatively later pubertal timing (1–8 individuals across phenotypes). As is common in longitudinal puberty data, there were significant reports of regression (see “Appendix” Part B). If individuals regressed back to Tanner Stage 2, or regressed back to Tanner Stage 1 and subsequently reached Tanner Stage 2 again, we used the age at the second instance of reaching Tanner Stage 2. That is, we assumed the second instance was more accurate, based on data showing that younger children are less accurate reporters than older children since they are not aware of the changes yet to come and tend to over-estimate their stage (Shirtcliff et al. 2009). This was a particularly large problem for boys’ genital development (see “Appendix” Part B). If an individual regressed back to Tanner Stage 1 and never attained Tanner Stage 2, we removed the cases (n = 7–46 across phenotypes) because this pattern is not consistent with the biology of pubertal development and thus more likely to reflect measurement error. Finally, we removed individuals who had a missing assessment prior to reporting reaching Tanner Stage 2 (i.e., stage 1 at wave 1, a missing assessment, and stage 3 at wave 3), since differences of a year in age for attaining Tanner Stage 2 in the middle of the assessment period could have ramifications for rank order in the sample and therefore bias results. The exception was for if youth were only missing the first assessment and reported Tanner Stage 2 at the second assessment. n’s affected at each stage of cleaning for each of the four phenotypes are presented in “Appendix” Part B. The sample sizes with data judged to be of reasonable quality for age at Tanner Stage 2 were: girls’ breast development n = 3521; girls’ pubic hair development n = 3121; boys’ genital development n = 3300; boys’ pubic hair development n = 2573.

Externalizing and internalizing problems

Parents rated youths’ externalizing and internalizing problems in the last 6 months during mid-adolescence, at the age 13-year assessment. Externalizing problems were operationalized as the average of the hyperactivity (5 items, i.e., restless, distracted) and conduct problems (5 items, i.e., tantrums, fights) subscales of the Strengths and Difficulties Questionnaire (Goodman 1997; Goodman et al. 1998), and internalizing problems were operationalized as the emotional problems subscale (5 items, i.e., worries, unhappiness) of the same questionnaire. Following prior work (Marceau et al. 2015c; Marceau and Neiderhiser 2020), the externalizing and internalizing scores were simultaneously entered into a principal component analysis (PCA) and exactly two (orthogonal) factor scores were extracted and saved. The first score reflects severity, or what the scores have in common, and the second directionality—what differentiates the scores coded such that more positive scores indicate a preponderance of externalizing problems and more negative scores indicate a preponderance of internalizing problems. Findings from the PCA showed that severity accounted for 68% of the variance in externalizing and internalizing problems. As expected, externalizing and internalizing both loaded strongly on the severity factor (0.61). Externalizing problems loaded strongly and positively (0.88) on the directionality factor, and internalizing problems loaded equally and negatively on the directionality factor (− 0.88). Further details aiding in interpretation of severity and directionality (e.g., correlations with externalizing and internalizing) are presenting in “Appendix” Part C.

Substance use progression

Several substance use indicators were self-reported by adolescents at age 18.5 years. ALSPAC included items on the quantity and frequency of alcohol, cigarette, and cannabis use, as well as the Alcohol Use Disorders Identification Test (AUDIT; Saunders et al. 1993) and the Cannabis Abuse Screening Test (CAST; Legleye et al. 2007). From the available data, we created measures of substance use progression that assess how far along a continuum or progression of use milestones adolescents are at a particular assessment (Marceau et al. 2020b). For each of these measures, youth were coded into the highest category that they endorsed. See Table 1 for categories with N’s.

Table 1 Frequencies for late adolescent substance use progression categories

Covariates

We included birth weight and gestational age (the best estimate according to ALSPAC from multiple informants). We also included family adversity in early childhood (age 2–4 years) as a covariate. Family adversity was measured by the Family Adversity Index (short version), a sum score of binary indicators for (adversity absent = 0 vs. present = 1) early parenthood/pregnancy, housing: inadequacy, hosing does not meet basic living needs, housing defects/infestation, low education of mother and partner, financial difficulty, single parenthood (lack of partner), no partner affection, partner cruelty, large family size, major family problems, psychopathology of mother, drug and alcohol use of mother, mothers’ trouble with police, and mothers’ criminal convictions (see Bowen et al. 2005 for details).

Missing data

Compared to the full sample of 14,701 children alive at 1 year of age, our analytic sample—which included youth with genetic data and any of the measured phenotypic data—included 8791 (60%). Our analytic sample was more likely to have adequate housing, better education, and less financial difficulties (statistics available upon request). Despite p-values < 0.0001, these effects were relatively small (Cramer’s V < 0.13). Within our analytic sample, 25% were missing data on pubertal onset, 14% were missing data on parental harsh discipline, 38% were missing data on severity and directionality, and 70% were missing data on substance use. Compared to the full analytic sample, attrition was related to lower socioeconomic indicators, and being male (statistics available upon request). We also used a series of Kruskal–Wallis one-way analysis of variance tests to determine whether scores on the key study variables were related to missingness on other key study variables. Families that provided data on parental harsh discipline had youth with slightly later age at pubertal onset (except boys’ pubic hair) and lower severity, but no difference in directionality of externalizing and internalizing problems or substance use outcomes. Youth with puberty data had slightly lower severity of problems but no difference in directionality or substance use outcomes. Missingness on severity/directionality of externalizing and internalizing symptoms did not predict substance use outcomes. The overall pattern of findings suggests that the model is somewhat biased towards later pubertal timing and less severe problem behavior, which may lead to smaller associations among these constructs, and limited generalizability in processes for youth with particularly early puberty and severe problems. Despite these patterns of missing data, the sample was large for each construct, and missing data was accommodated using Full Information Maximum Likelihood in the models described below.

Analytic strategy

Hypotheses were tested using path analysis conducted in R(lavaan) (Rosseel 2012) using the se = “bootstrap” command to accommodate the skewed nature of the outcome data and to reduce bias in the p-values for the indirect effects. The models are based on n = 4504 boys and n = 4287 girls. Separate models were run for each pubertal onset phenotype (1 Girls’ breast development; 2 Girls’ pubic hair development; 3 Boys’ genital development; 4 Boys’ pubic hair development). Models included the paths shown in Fig. 1, but also included age at the 13-year assessment as a predictor of severity and directionality, age at the 18.5-year assessment as a predictor of the substance use outcomes, as well as family adversity, birth weight, and gestational age as predictors of or associated with all main study variables. Hypotheses were tested via the presence or absence of hypothesized pathways at p < 0.05. There were 70 direct paths estimated in each model, although 59 were identical across models 1 and 2 (with 11 unique paths in each—those involving the pubertal timing variable which differed across models), and 59 identical across models 3 and 4 (again with 11 unique paths that included a pubertal timing variable and thus differed across models). A Bonferroni adjustment for 162 unique tests yields p < 0.0003. We also tested for indirect effects from genetic and prenatal substance use through pubertal onset to severity, directionality, and substance use outcomes, and from prenatal substance use, pubertal onset and parental harsh discipline through severity and directionality to substance use outcomes. Because of the complexity of models and hypotheses, we interpret findings holistically, weighing information based on nominal and Bonferroni-adjusted p-values, effect size (e.g., correlation coefficients, standardized estimates), consistency with zero-order correlations, and consistency across models.

Results

Frequencies for late adolescent substance use progression categories are presented in Table 1. The majority of girls and boys had progressed at least to binge drinking (6+ in a day) in late adolescence, with nearly half showing signs of hazardous drinking or likely dependence. About half of girls and boys never smoked cigarettes. Whereas about 35 to 40% of youth were in the low stages of progression (ever smoked or recently smoked), only ~ 10% smoked daily or heavily daily. About two-thirds of girls and boys never used cannabis, with the majority of users in the initial stages of progression: only having tried cannabis. About 10% of the sample used cannabis recently, on a monthly basis, or showed signs of dependence.

Descriptive statistics for the other study variables are presented in Table 2. 20% of girls and 22% of boys were exposed to substance use during pregnancy that constituted at least a potential risk to the fetus. For parents’ discipline, observed values matched the possible range from 0 to 5, but low average levels for girls (0.97) and boys (1.16) indicated that on average this sample experienced low levels of harsh discipline. Average age at breast development onset was earlier than pubic hair onset for girls, as expected (Grumbach and Styne 2003). Girls’ average age at pubic hair onset was earlier than boys’, as expected (Grumbach and Styne 2003). However, average age at genital development onset was earlier than average age at breast development onset, reflecting that boys’ perceptions of genital development onset are particularly biased (e.g., reported as earlier than probably true).

Table 2 Descriptive statistics for key study variables

Hypothesis testing

Bivariate correlations are presented in Table 3. Generally, findings in the hypothesis-testing model matched what was observed in the bivariate correlations. We note any instances of differing results in the text below. All models fit the data well (see Figs. 2, 3). Figure 2 presents the findings from direct and indirect path estimates across the two models for girls: including (1) breast development and (2) pubic hair development. Figure 3 presents the findings from direct and indirect paths estimated across the two models for boys: including (3) genital development and (4) pubic hair development. Standardized parameter estimates and p-values for key direct effects (excluding covariate effects) are presented in Table 4, and indirect effects are presented in Table 5. Full model results including unstandardized parameter estimates, standard errors and confidence intervals and covariate effects are provided in Supplemental Tables S1 (breast development) and S2 (pubic hair development) for girls, and Tables S3 (genital development) and S4 (pubic hair development) for boys. In the text below, if effects are specified as ‘nominal’ in the text below, this indicates p < 0.05 but not p < 0.0003. Effects presented without the qualifier of ‘nominal’ survived Bonferroni adjustment (p < 0.0003).

Table 3 Bivariate correlations among key study variables
Fig. 2
figure 2

Model results: girls. BD indicates breast development. PH indicates pubic hair development. Any statistics not represented with BD or PH applied to both models. All paths are positive associations unless noted with (−) to indicate an inverse association. Hashed lines denote association significant at p < 0.05, bold lines denote association significant at p < 0.0003 (adjusted for multiple testing). Gray lines are covariate effects

Fig. 3
figure 3

Model results: boys. GD indicates genital development. PH indicates pubic hair development. Any statistics not represented with GD or PH applied to both models. All paths are positive associations unless noted with (−) to indicate an inverse association. Hashed lines denote association significant at p < 0.05, bold lines denote association significant at p < 0.0003 (adjusted for multiple testing). Gray lines are covariate effects

Table 4 Standardized estimates of hypothesized direct effects
Table 5 Standardized estimates of indirect effects

Girls

Genetic risk for later pubertal timing predicted later timing of both breast and pubic hair development onset, as expected. Genetic risk was not directly related to prenatal substance use, harsh discipline, externalizing/internalizing symptom severity and directionality, and substance use outcomes. Prenatal substance use exposure did not predict pubertal timing in either model. Prenatal substance use exposure was not directly related to parental discipline, externalizing/internalizing symptom severity, or directionality. More severe prenatal substance use had direct effects on increased cannabis and smoking use, and a nominal direct effect on increased alcohol use.

Parental harsh discipline predicted higher symptom severity as well as a preponderance of externalizing problems. A modest bivariate correlation of earlier age at breast development onset with more severe problems did not persist in the context of the model, likely due to shared variance with parental harsh discipline (as earlier onset of breast but not pubic hair development was nominally associated with more harsh discipline). Earlier pubertal onset of pubic hair (but not breast) development nominally predicted directionality in the hypothesized direction (a preponderance for internalizing problems) but not severity. There was also a nominal indirect effect of genetic risk for pubertal timing via the timing of pubic hair onset to symptom directionality. Probes of these relations (“Appendix” Part C) suggest that this effect is best interpreted such that genetic risk for early puberty indirectly predicted a preponderance of internalizing problems via earlier pubic hair development onset.

Higher symptom severity was related to more smoking, and nominally to more cannabis use, but not to alcohol use progression. Symptom directionality, specifically a preponderance of externalizing problems, was related to more smoking and cannabis use, and nominally to more alcohol use. The modest bivariate correlations of more late adolescent smoking with both earlier age at breast development onset and parental harsh discipline did not persist in the context of the model. Pubertal onset was not directly associated with substance use outcomes. Harsh discipline was nominally related to more cannabis use, but not to alcohol use or smoking. Thus, shared variance between girls’ breast development and parental discipline noted above likely explain the null findings of breast development, whereas the association of harsh discipline may have been mediated by symptom severity and directionality.

Indeed, there were indirect effects from more parental harsh discipline via both having more severe problems and having relatively more pure externalizing problems (directionality) to more smoking. There were also nominal indirect effects from more parental harsh discipline via more severe and more pure externalizing (directionality) problems to more cannabis use. There was a nominal indirect effect from more parental harsh discipline via relatively more pure externalizing problems (directionality, but not severity of problems) to more alcohol use. Further, there were nominal indirect effects from earlier timing of pubic hair development to a preponderance of internalizing problems to more smoking, and alcohol and cannabis use. Probes of these relations (“Appendix” Part C) suggest that these effects are best interpreted such that girls with earlier timing of pubic hair onset are at increased risk for internalizing problems which then exert a protective effect against alcohol use. However, for cannabis and smoking, the evidence (“Appendix” Part C) is more consistent with an interpretation such that girls with relatively later timing of puberty were somewhat more likely to have more externalizing-type problems, which put them at particular risk of more cannabis use and smoking.

Boys

For boys, genetic risk for later pubertal timing predicted pubic hair development onset, but not genital development onset. Genetic risk was not directly related to prenatal substance use exposure, harsh discipline, externalizing/internalizing symptom severity and directionality, and substance use outcomes. Prenatal substance use exposure nominally predicted earlier timing of genital but not pubic hair development onset. Prenatal substance use exposure predicted a preponderance of externalizing symptoms (directionality, but not severity of problems), but was not directly related to parental discipline or substance use outcomes. That is, the moderate bivariate correlation of more severe prenatal substance use exposure and more smoking did not persist in the context of the model, suggesting that the association was fully mediated by symptom directionality. Indeed, there were nominal indirect effects from more prenatal substance use exposure via directionality (differentiation towards exhibiting specifically externalizing problems) to more smoking as well as to alcohol and cannabis use.

Parental harsh discipline predicted higher symptom severity as well as a preponderance of externalizing problems (directionality). Pubertal onset was unrelated to symptom severity in either model. A modest bivariate correlation of earlier age at pubic hair development onset with a preponderance of externalizing problems did not persist in the full model, likely due to shared variance with the early life family adversity covariate. Onset of genital development did nominally predict directionality, although not entirely as expected: genital development onset predicted problem type differentiation such that later genital development was related to experiencing more internalizing problems and earlier genital development was related to experiencing more externalizing problems (“Appendix” Part C).

Higher symptom severity was nominally related to more smoking and cannabis use, but not to alcohol use progression. Symptom directionality, specifically a preponderance of externalizing problems, was related to more smoking and nominally to more alcohol and cannabis use. Neither harsh discipline nor pubertal onset was directly associated with substance use outcomes. However, there were nominal indirect effects from genital development onset via directionality to smoking and cannabis use. Probes of these relations (“Appendix” Part C) revealed that the best interpretation of these indirect effects was that earlier genital development was indirectly associated with progression to more advanced stages of smoking and cannabis use via differentiation towards exhibiting more externalizing problems. Further, the modest bivariate correlation between parental harsh discipline and smoking did not persist in the context of the model, suggesting that the association was fully mediated by symptom severity and directionality. Specifically, there was an indirect effect from more parental harsh discipline via a preponderance of externalizing problems to more smoking. There were also nominal indirect effects from more parental harsh discipline via a preponderance of externalizing problems to more alcohol and cannabis use, as well as from more parental harsh discipline via higher symptom severity to more smoking and cannabis use.

Discussion

The present study leveraged the Avon Longitudinal Study of Parents and Children (ALSPAC) to test for developmental cascades associated with the progression of alcohol, smoking, and cannabis use in late adolescence, consistent with current theories of the development of adolescent substance use (Dodge et al. 2009). Specifically, we modeled developmental cascades that included biological and putatively environmental influences beginning with polygenic influences related to pubertal timing and severity of exposure to prenatal substance use, and including intermediary influences of parental harsh discipline and the timing of pubertal onset on transitions from novel features of proximal behavioral risk for substance use (severity and directionality of mid-adolescent externalizing and internalizing problems) with later adolescent substance use progressions. We found robust evidence of developmental cascades involving the severity and directionality of externalizing and internalizing problems as intermediaries between associations of parental harsh discipline with substance use progressions for girls and boys. We also found evidence for some sex-specific cascades of pubertal timing and (for boys only) exposure to prenatal substance use via symptom directionality to late adolescent substance use.

Initiation of developmental cascades

Genetic influences

We provide some support to emerging evidence that genetic risk for early puberty may initiate developmental cascades for girls, indirectly influencing differentiation towards internalizing problems via earlier pubertal onset. For example, in the same sample (using a different measure of pubertal timing), Horvath et al. (2020) found an indirect effect from the same polygenic score via age at the midpoint of breast development (as opposed to at the initiation of pubic hair development found here) to later adolescent depressive symptoms—and no indirect effects in boys. It is unclear why pubic hair onset was a stronger predictor in this study than the more visible changes related to breast development onset. Reported breast development without areolar staging is more biased in youth with more adipose tissue, as fat can be mistaken for breast development (Dorn et al. 2006). Speculatively, if hormone changes do underlie the association of genetic influences of puberty-related genes with differentiation toward internalizing symptoms, this association may be clearer for pubic hair development because self- and parent- reports of pubic hair development could contain less of this over-estimate of early stages of puberty found for breast development. Or, adrenal hormones (contributing to the development of pubic hair) may be more important than gonadal hormones (contributing to breast development) for problem differentiation in girls. It will be critical to replicate this indirect effect from puberty-related genes to differentiation towards internalizing problems in other samples with other measures of puberty and that include hormone ascertainment in order to understand when and why different pubertal phenotypes mark this potential genetic influence. However, in the context of the larger developmental cascades, wherein differentiation towards externalizing problems played a much more prominent role, the sparse findings with regard to genetic risk for pubertal timing and downstream correlates of pubertal timing indicate that genetic risk related to pubertal timing is not a critical initiator of developmental cascades to substance use progressions in late adolescence.

Prenatal substance use exposure

In contrast, our findings suggest that severity of prenatal substance use exposure may be a key differentiating influence that pushes boys towards externalizing problems in particular. This is consistent with recent findings from a sibling-comparison design (that examined boys and girls together) wherein the sibling whose mother smoked (or smoked more) during pregnancy had a greater preponderance of externalizing problems than their co-sibling who was exposed to no (or less severe) smoking during pregnancy (Ekblad et al. 2020). More severe exposure to substance use during pregnancy was also related to severity of problems in both boys and girls. Therefore, we hypothesize that smaller correlations noted in the literature of substance use during pregnancy with internalizing-type problems (Khoury et al. 2018; Min et al. 2020) are potentially the result of comorbid externalizing problems, with prenatal substance use exposure operating to simultaneously differentiate youth problems in the direction of externalizing problems while also making those problems more severe. This is, of course, a causal hypothesis, and the present study—based entirely on correlations within families where parents and children share genes, prenatal, and postnatal environments—cannot support such claims. Future work designed to test causal mechanisms are important for testing this hypothesis.

It was also interesting that prenatal substance use exposure was directly related to adolescent substance use progression for girls, but indirectly via specifically a preponderance of externalizing problems for boys. Recent accumulating evidence suggests that prenatal risk exposures could be more influential for boys’ behavioral development than girls’, and our findings are consistent with studies finding effects on boys but not girls when studying associations of prenatal substance use exposure with externalizing problems earlier in development (Marceau et al. 2019b). However, the presence of direct effects (i.e., outside of developmental cascades), in this study for girls suggests an unknown mechanism by which prenatal exposures have longer-term impacts that emerge later for girls. That is, by examining severity and directionality in the context of developmental cascades to substance use, we provide evidence that prenatal substance use exposure may indeed be particularly problematic for boys’ substance use because it serves as a differentiator pushing boys towards exhibiting externalizing problems earlier in childhood. However, prenatal substance use during pregnancy does not similarly serve as a differentiator of problem type for girls, and more work is needed to determine how prenatal substance use is associated with adolescent substance use for girls.

Intermediate influences

Parental harsh discipline

We found that parental discipline was indirectly associated with smoking and cannabis use in boys and girls via both severity and differentiation towards more externalizing problems, and with alcohol use progression in boys and girls via differentiation towards more externalizing problems. Critically, harsh discipline emerged as a robust predictor of severity and directionality in the context of the other constructs and covariates (i.e., even early life adversity). Harsh discipline is known to be heritable, and because of the composition of the sample (i.e., parents who are biologically related to and provide the caregiving environment for youth) the associations cannot be interpreted as entirely environmental. Based on prior literature, the associations we find for parental harsh discipline with severity and directionality are possibly, in part, a proxy for a transactional evocative gene–environment correlation process that unfolds over time (Button et al. 2008; Elam et al. 2018). We show that these processes are important not only as a non-specific risk factor for externalizing and internalizing problems, but also as a differentiator for externalizing problems. This is consistent with the notion of coercive cycles, developed specifically to explain transactional roles of harsh discipline and conduct problems (Patterson 1982). Further, we found no evidence of direct effects of harsh discipline on substance use, further corroborating the importance of coercive cycles as a part of a developmental cascade greatly increasing risk, indirectly through externalizing problems, for substance use progression in late adolescence (Dodge et al. 2009). This was true even in a complex developmental model that included other key influences on adolescent substance use identified in the literature.

The polygenic score was not associated with harsh discipline; therefore, the genes that are involved in gene–environment correlation processes serving to increase severity and push youth towards externalizing problems are unlikely to be puberty-related genes, specifically. However, the (small) observed correlations between parental discipline and some measures of pubertal onset may indicate that pubertal maturation perceived by parents and youth may be a contributing input for evocative gene–environment correlation processes that ultimately contributes to adolescent substance use progression. That is, as parents and youth notice pubertal onset, parents may directly treat youth differently/with more rules, responsibilities, and consequences, or may adapt their parenting strategies in response to children’s own autonomy-seeking reactions to pubertal onset (Marceau et al. 2020b).

Pubertal timing

There are several considerations regarding the measurement of pubertal onset in the current study that are important to keep in mind prior to interpreting findings. A strength was the repeated measures, allowing for relative precision in prospectively measuring pubertal onset (defined as attaining Tanner Stage 2), at least as perceived by parents and youth. ALSPAC is not the only study to use a combination of parent and youth report particularly at earlier ages (e.g., see the Wisconsin Study of Families and Work; Ellis and Essex 2007). This strategy is thought to prevent some of the bias related to youth who do not yet know what puberty is (because they have not yet experienced it) thinking they are more mature than they would be as rated by a professional. However, overlaying patterns of regression and raters suggest that parents and youth together over-rated boys as beginning genital development early. Even when using the second time of reaching Tanner Stage 2 for 948 boys, the age of attaining genital development Tanner Stage 2 was just under 10 years old. This is surprisingly early, and likely reflects poor measurement or understanding of this component of development. This extra bias for genital development is consistent with findings from Shirtcliff et al. (2009) showing that correlations between youth ratings of Tanner Stages vs. physical examination were somewhat lower for boys’ genital development (r = 0.60) than for boys’ pubic hair development (r = 0.71) or either girls’ phenotypes (r’s > 0.83). In general, findings regarding boys’ genital development especially, but indeed all measures of pubertal onset in this study should be interpreted as perceived onset rather than ‘actual’ onset.

With regard to our findings, the findings for girls were highly consistent with the literature that shows a clear role for pubertal maturation (of pubic hair onset at least) in increasing specifically internalizing problems, likely in part due to hormonal mechanisms (Copeland et al. 2019). For boys, perceived timing of genital development onset equally differentiated boys who thought they matured early towards externalizing-type problems, and boys who thought they matured late towards internalizing-type problems. This is a novel finding in the literature, consistent with some reports that early puberty is related to externalizing problems in boys (particularly White boys; Marceau et al. 2019a) that support the developmental readiness hypothesis (Ullsperger and Nikolas 2017), but not others that support the social compensation hypothesis (Marceau and Jackson 2017). Typically, studies examining associations of pubertal timing and externalizing and internalizing do not consider comorbidity (see Stice et al. 2001 for an exception in girls). Examining directionality of problems lead to several insights that can serve as the basis for novel hypotheses. (1) Boys’ perceptions may be particularly critical (above actual development) to understanding the role of pubertal maturation (Mendle 2014) in terms of differentiation of problems, given that these findings were not present for pubic hair development, only for genital development which showed clearer evidence of perceptual bias. (2) These perceptions may have more to do with the type of problem boys exhibit than the severity of problems. If this is the case, it is unsurprising that mixed, inconsistent, and small effects riddle the literature, as most measures of externalizing and internalizing problems assess the severity of each rather than the extent to which they are differentiated regardless of severity (i.e., directionality). Future work may confirm this new hypothesis that perceived puberty may differentiate problem type in boys, with longer-term consequences for downstream developmental outcomes like substance use.

Severity and directionality

Our analysis of severity and directionality yielded several new insights for transitions across adolescence from behavioral (externalizing and internalizing) risk to substance use progression. For boys and girls, severity of problems played a critical role in developmental cascades from more severe prenatal substance use exposure and late childhood parental discipline to further progression of the less normative substances: smoking and cannabis use. Severity of problems as measured in the severity–directionality model are analogous to the total level of symptoms regardless of type. Youth may have moderately high severity scores if they have a lot of symptoms of only one problem type, but to score very highly on this dimension, youth must be at least somewhat comorbid for externalizing and internalizing problems (Marceau and Neiderhiser 2020). Predictors of severity can be thought of as non-specific risk factors for behavioral problems (Shirtcliff and Essex 2008). Predicting directionality can be thought of as predicting problem differentiation (Marceau and Neiderhiser 2020). Our findings suggest a role of severity regardless of problem type in developmental cascades, as expected. However, there were more and stronger effects of directionality with all three substances, and particularly for boys. These effects often corroborated evidence from other designs suggesting that externalizing problems in particular posed a specific risk for substance use. Overall, the observed pattern of findings suggests that the primary role of internalizing symptoms for substance use outcomes is not protective, but rather risky through its comorbidity with externalizing problems (except, perhaps, for alcohol).

Alcohol, smoking, and cannabis use progressions

Alcohol use was nearly ubiquitous in late adolescence in the ALSPAC sample, reflecting the legality and popularity/normative nature of alcohol use in this population. This likely contributed to weaker findings of the various risk factors and especially problem behaviors predicting alcohol progression: most youth, well-adjusted or otherwise, use alcohol to a certain degree. However, for alcohol, we did see some protective effect of internalizing problems against alcohol use problems for girls and especially for boys (see “Appendix” Part C for fuller explanation). The inhibitory effect of anxiety on social drinking (Hussong et al. 2011) may have only been strong enough to observe in this sample for alcohol use because only youth with a tendency toward more pure internalizing problems were more likely to stop at early stages of alcohol use. In contrast, smoking and cannabis use were less often endorsed (with 50% and 40% at any stage of smoking and cannabis use, respectively). In this sample, therefore, smoking and cannabis use reflected somewhat less normative substances, and were more strongly linked to developmental cascades of risk that included prenatal substance use exposure and parental discipline, and transitions through earlier psychopathology. As adolescent substance use is highly predictive of future problems especially if use has progressed beyond more normative stages, these cascades may indicate pathways of lasting risk (e.g., Sartor et al. 2016). Future research tracking this sample through emerging adulthood may reveal whether these pathways do index lasting risk.

Limitations

The large and well-characterized ALSPAC cohort allowed us to test complex models of developmental cascades to substance use progression. However, these findings are quite limited in generalizability, given the homogeneity and relative privilege of sample, especially of those who continued through the end of the study. Our sample consists of White European youth, and the findings presented here likely are culturally specific to that group, especially given race/ethnic differences in discipline practices (Silveira et al. 2021) and evidence that harsh discipline is not linked to externalizing problems in race/ethnic minority groups as strongly as in White samples (Lansford 2010). Indeed, there is evidence of race/ethnic differences in other developmental cascades—namely pathways from genetic and/or prenatal risk through cortisol reactivity for externalizing and substance use have been found in Black (Buckingham-Howes et al. 2016; Conradt et al. 2014) but not White adolescents (Marceau et al. 2020a). It is critical to identify and test the appropriate developmental cascades within underrepresented groups to move our field forward.

Further, although we included several theoretically important factors in our developmental cascade models, other influences are also important to consider, including peer influences. Similarly, our cascade models are also over-simplified in terms of processes, examining only snapshots on the longest timescale of child and adolescent development. For example, our inclusion of a single measure of parental harsh discipline and a single measure of severity and directionality years later grossly over-simplifies the complex transactional associations expected at shorter timescales (e.g., over the course of conversations which accumulate and canalize into habitual patterns across development; Lougheed 2020). And, that measure of harsh discipline reflected the combination of two items, only one of which has face validity as a measure of harsh discipline. Sensitivity analyses (available upon author request) found that the present analyses do likely represent harsh discipline, although use of more nuanced measures will be important for future work. Another methodological limitation is that ALSPAC was included in the GWAS from which the PRS was formed, which may have slightly inflated results of the PRS (see Horvath et al. 2020). Finally, our pathway-based cascade model does not incorporate contextual influences (i.e., moderators) that are known to be important for the development of substance use. Despite these limitations, our models provide a valuable perspective on adolescent substance use development.

Conclusions and future directions

Using an alternative measurement strategy for examining externalizing/internalizing symptom comorbidity, the present study most strongly supported developmental cascades involving differentiation towards externalizing problems and total symptom severity as intermediaries between associations of parental harsh discipline with substance use progressions for girls and boys. Our findings make three main novel contributions to the literature. First, the strongest role of internalizing problems was to increase risk of substance use via comorbidity with externalizing problems, adding to general symptom severity, especially for less normative substances. There was very limited evidence of protective effects of relatively pure internalizing and only for the normative outcome: alcohol use. Second, the presence of more and stronger effects of directionality than severity suggest that the mechanism of action of the diverse predictors examined here in terms of risk for adolescent substance use is more due to pushing youth towards a specific type of (externalizing) behavior than generally increasing the level of problems. Finally, there was support of developmental cascades from more severe exposure to prenatal substance use to adolescent substance use progression specifically for boys, whereas for girls the associations were direct. Considering that many studies earlier in childhood find that prenatal risks may disproportionately affect males, and specifically link to externalizing problems, our findings provide suggest that prenatal effects for girls may operate through other pathways and/or may surface later in development—a new hypothesis in need of interrogation.