Introduction

In about 40–80 % of children with ADHD, the disorder is chronic and continues into adolescence (Barkley 2006; Barkley et al. 1990; Brown and Borden 1986; Hill and Schoener 1996; Wender 1995; Wilens et al. 2002; Wilson and Marcotte 1996). Despite being highly informative regarding course of ADHD, early longitudinal studies of ADHD into adulthood (including the Milwaukee Study, the New York Study, the Swedish study, and the Montreal Study) report wide ranges of rates of adult ADHD, likely related to methodological challenges (Barkley et al. 2002a; Mannuzza et al. 1993, 1998; Rasmussen and Gillberg 2000; Weiss et al. 1985). This wide range has been attributed to attrition of participants, use of different informants, use of varied diagnostic criteria, and a variation in methods of assessment and ascertainment (e.g., asking about persistence of symptoms vs. current diagnostic status, use of self-ratings vs. observations) (Faraone et al. 2006; Mannuzza et al. 2003). Most recent longitudinal studies of ADHD into adulthood suggest symptoms persist in approximately 66 % of cases (Uchida et al. 2015; Karam et al. 2015). ADHD continues to be described as a lifespan disorder in a majority of studies (Barkley 2006; Halperin and Healey 2011; Spencer et al. 2007; Turgay et al. 2012), though a recent, unreplicated longitudinal investigation suggests that childhood and adulthood ADHD may be different disorders with different etiologies (Moffitt et al. 2015).

A number of longitudinal investigations about childhood ADHD have informed persistence, but not without limitations. Empirical studies have underscored the importance of comorbid and early childhood conduct problems as a significant predictor of persistence of ADHD through adolescence (Gittelman et al. 1985; Hart et al. 1995; Lahey et al. 2000; Loney et al. 1981; Taylor et al. 1991). In addition to psychiatric comorbidity, Rutter’s adversity indicators (such as family size and conflict, paternal criminality, social status and maternal psychopathology) have been suggested to contribute to ADHD persistence into adolescence (Biederman et al. 1992, 1995). An 11-year follow-up study also suggested that persistence was related not only to comorbidity of conduct, mood, and anxiety disorders, but also to ADHD impairment at baseline and exposure to maternal psychopathology; however, the study combined adolescent and adult outcomes (participants were 15–31 yo at follow-up) making it difficult to disentangle outcomes specific to adulthood (Biederman et al. 2011). Another study combining adolescent and adult outcomes of ADHD suggested that IQ and socioeconomic status (SES) acted as moderators of ADHD persistence (Cheung et al. 2015). In addition to combining outcomes of adolescence and adulthood, longitudinal investigations of ADHD into adulthood include use of referred populations, limiting generalizability to less severe populations (Biederman et al. 2010b). Furthermore, many past samples have focused exclusively on males, and only more recently included a focus on females (Biederman et al. 2010a; Hinshaw et al. 2012; Monuteaux et al. 2010).

What remains under investigated is which factors characterize individuals with ADHD who will remit or resolve with development versus experience a chronic and persistent course into adulthood (Karam et al. 2015; Lara et al. 2009; Nigg et al. 2002). A number of longitudinal studies of ADHD have tracked outcomes over 10 years, but there has been less focus on temperament and other features of participants that might contribute to persistent course (Barkley et al. 2002b; Mannuzza et al. 1993, 1998, 2003; Pingault et al. 2011; Rasmussen and Gillberg 2000; Weiss et al. 1985). This is important to elucidate because personality factors including low agreeableness and high neuroticism have been associated with persistent ADHD (Miller et al. 2008). Studies distinguishing key factors that characterize persistence versus remission of ADHD into adulthood are still needed (Barkley et al. 2008; Halperin et al. 2008). The present study benefits from low attrition of participants with ADHD over a 10-year course into adulthood. Understanding how to distinguish these groups is important to determine who are in need of early and ongoing intervention versus those more likely to have symptoms that resolve with development.

The current study capitalized on a large 10-year longitudinal dataset that tracked ADHD severity in participants with ADHD alone and in those with a bipolar phenotype with comorbid ADHD (as well as healthy controls) from childhood into adulthood. The aim was to investigate trajectories of ADHD severity across development and investigate factors that characterized the variation in the longitudinal course of ADHD into adulthood using growth mixture modeling, while carefully controlling for bipolar comorbidity. Personality factors and substance use disorders (SUD) were included given their established relationship to ADHD (Miller et al. 2008; Molina et al. 2009; Wilens and Upadhyaya 2007).

Methods

Participants

N = 268 participants ages 7–16 at baseline were comprehensively assessed using the Washington University in St. Louis Kiddie Schedule for Affective Disorders and Schizophrenia (WASH-U-KSADS) as part of the longitudinal Phenomenology and Course of Pediatric BP-I study (Geller, PI MH-53063). Inclusion criteria for all participants at baseline were 7–16 years of age, males and females, and good physical health. Exclusion criteria were IQ < 70, epilepsy or other major medical or neurological disorders, pervasive developmental disorders, schizophrenia, baseline substance dependency, and pregnancy. Participants who developed SUD or became pregnant after baseline were retained in the study. Additional details of and rationale for study inclusion/exclusion criteria have been previously reported (Geller et al. 2004). Participants were comprehensively assessed (see measures below) every 2 years for a 10- or 12-year period, as detailed in Table 1. The study ended mid-way through data collection at the 12-year wave.

Table 1 Number of subjects assessed at each wave by baseline diagnostic group

ADHD participants, like BP-I participants, were consecutively ascertained from outpatient pediatric and psychiatric clinics. Screenings for exclusion criteria for all new consecutive cases were conducted by non-blind research nurses who were different than the blinded nurses who conducted in-laboratory psychiatric assessment once telephone screenings occurred. Healthy control participants were obtained through a random survey that matched them to BP-I participants by age, gender, SES, ethnicity, and zip code. The Human Studies Committee at Washington University School of Medicine in St. Louis approved the study.

There were N = 81 participants in the ADHD group at baseline. These participants met DSM-IV criteria for ADHD (with hyperactivity, i.e., hyperactive/impulsive subtype [H] or combined type [C], not inattentive type [I]) at baseline without mood comorbidity and were clinically impaired, as evidenced by a CGAS score of ≤60. ADHD participants were required to have onset of ADHD symptoms prior to age 7 and duration greater than or equal to 6 months. ADHD-H and ADHD-C, but not ADHD-I, were included as a psychiatric comparison group (given focused study goals). ADHD-I was an exclusion for the ADHD group at baseline and was not an exclusion for the BP-I group at baseline; however, there was only n = 1 BP-I subjects with ADHD-I at baseline. Children in the ADHD group could not have BP-I or MDD (based on original study aims), but could have conduct disorder (CD) and/or oppositional defiant disorder (ODD), given their common comorbidity with ADHD in children. ADHD participants could have BP-I and/or MDD at follow-up.

There were N = 93 participants in the BP-I group at baseline. Child BP-I participants in this study were required to have elated mood and/or grandiosity as one of the mania criteria. Because high rates of ADHD comorbidity have been observed in child BP-I disorder, an ADHD diagnosis was not exclusionary in the BP-I group. These participants had a Children’s Global Assessment Scale (CGAS) (Bird et al. 1987; Shaffer et al. 1983) score of ≤60, signifying definite clinical impairment.

There were N = 94 healthy control participants at baseline. Healthy control subjects were required to not have BP-I, MDD, or ADHD at baseline. It was found that no healthy control subjects had any Axis I disorders at baseline.

Measures

Diagnostic Assessment The WASH-U-KSADS (Geller et al. 1996) is a semi-structured interview that was administered by experienced research clinicians to mothers about their children and to children about themselves. It was developed from the KSADS (Puig-Antich and Ryan 1986) by adding onset and offset of lifetime and current symptoms for DSM-IV diagnoses. The WASH-U-KSADS has established validity (Geller et al. 1998, 2001). To score the WASH-U-KSADS, child and parent responses were combined by using the most severe rating, in accordance with the methods described by Bird et al. (1992). Of note, the correlation between combined parent and child report and parent-only report of number of baseline ADHD symptoms was 0.97. This is consistent with children under-reporting ADHD symptoms. Combined parent and child report was therefore not separated in the analyses. Teacher ratings were not addressed in the current study as they were inconsistently available. All research materials, including school reports and separate videotapes of mothers and children, were reviewed in consensus conference with research nurses and a senior clinician. Raters were blind to group status at baseline assessment. Clinicians were trained to inter-rater reliability (kappa = 0.82–1.00) and recalibrated yearly (Geller et al. 2001).

Personality: temperament and character The Junior Temperament and Character Inventory (JTCI) measures four temperament and four character traits of children. Temperament traits include novelty-seeking, harm avoidance, reward dependence, and persistence. Character traits include self-directedness, cooperativeness, fantasy, and spirituality. The JTCI is made up of 108 true/false items to assess children’s behaviors, opinions, and feelings (Luby et al. 1999). Temperament traits are considered relatively stable after the preschool years, though dynamic processes with the environment continue (Roberts and DelVecchio 2000; Shiner et al. 2012). The JTCI was administered at the 2-year follow-up separately to children and their parents. Data from the two informants was combined (after reverse scoring negatively phrased items) by considering any items endorsed as true by either informant as true.

Global functioning The CGAS measures severity based on global impairment from psychiatric symptoms and related adaptation in psychosocial functioning in school, social, work, and family contexts. On this scale, 0 is worst, 100 is best, and ≤60 is definite clinical impairment. The CGAS score is the lowest level of functioning during the rating period.

SES was established from the Hollingshead Four-Factor Index of Social Status (Hollingshead 1976).

Substance use disorders (SUD) were defined using DSM-IV (American Psychiatric Association 1994) criteria including for alcohol, marijuana, and other illicit drug use (e.g., sedatives, opioids, cocaine, other). A subset of N = 155 participants were assessed as adults approximately three and a half years after the original study was completed. SUD was assessed at this time with the NetSCID (Structured Clinical Interview for DSM Disorders).

Statistical analysis

Growth mixture modeling (GMM) in Mplus version 7.3 was used to group participants into subgroups based on ADHD severity scores across the longitudinal course of the study. Specifically, an ADHD severity score was calculated at each of the six waves (baseline through 10-year follow-up) as the number of ADHD symptoms endorsed at each wave (maximum of 18). A quadratic growth mixture model with these six severity scores as the outcome variables was used to determine a categorical latent class variable for grouping participants with similar ADHD severity trajectories. Mania severity was calculated as the number of mania symptoms endorsed at each wave (maximum of 8) and was included in the model as a time-varying covariate. Baseline age and gender were also included as covariates in the growth mixture model. Each subject’s probability of belonging to each of the latent classes was evaluated, and participants were assigned to the latent class with the greatest probability.

N = 17 participants were not included in the analysis because they dropped out of the study, so GMM was conducted on N = 251 of the original N = 268 participants. Several growth mixture models with varying numbers of classes for the latent class variable were compared, and the best model according to the Bayesian information criterion (BIC) and the Lo–Mendell–Rubin adjusted likelihood ratio test was selected.

Once latent class assignment was complete, baseline demographic and diagnostic characteristics were compared across classes using post hoc linear regression for continuous variables and logistic regression for categorical variables.

Psychiatric diagnoses through the 10-year follow-up, adult SUD, and temperament and character factors obtained at the 2-year follow-up were compared across classes using post hoc logistic regression for dichotomous outcomes and linear regression for continuous outcomes. These models included baseline age, SES, gender, race, baseline mania, and use of stimulants or other medication for ADHD at any time during the study as covariates. Contrast statements were included in these models to make pairwise comparisons of the classes. Bonferroni correction was used to account for the six pairwise class comparisons, resulting in a significant p value of p < 0.0083. All analyses other than GMM were completed in SAS version 9.3.

Results

Of the N = 268 baseline study participants, N = 17 (6.3 %) dropped out (N = 6 ADHD, N = 9 BP-I, N = 2 healthy controls) through the 10-year assessment. The only missed assessments were those occurring after a subject discontinued from the study by design, i.e., no subject missed an assessment but was then assessed at a later data collection wave. As previously noted, the N = 17 participants who discontinued the study were not included in these analyses. At baseline, 87.1 % of the N = 93 BP-I subjects had comorbid ADHD.

Fit statistics for growth mixture models with latent class variables with 1, 2, 3, 4, and 5 classes were compared. The BIC was lowest in the 4-class model (BIC = 8158.8) compared to the others (1-class BIC = 17129.3, 2-class BIC = 8579.6, 3-class BIC = 8253.0, 5-class BIC = 8235.9). The 4-class model also fit significantly better than the 3-class model according to the Lo–Mendell–Rubin adjusted likelihood ratio test (p = 0.0037), while the 5-class model was not a significant improvement over the 4-class model (p = 0.4727). Therefore, the model that best fit the data included a latent class variable with four classes; one of these had significant intercept, linear, and quadratic components, and the other three classes had significant intercepts and linear components. Details of the growth mixture model with four latent classes are shown in Table 2. There were N = 251 total participants: N = 96 in class 1 (persisting low), N = 39 in class 2 (quickly remitting), N = 60 in class 3 (gradually remitting), and N = 56 in class 4 (persisting high). Figure 1 illustrates the trajectories of ADHD severity scores in the four latent classes.

Table 2 Growth mixture model with four latent classes in N = 251 participants
Fig. 1
figure 1

Four latent class trajectories of ADHD severity across 10-year follow-up in N = 251 participants

Baseline demographic and diagnostic characteristics of the N = 251 participants by latent class are shown in Table 3. Participants in the persistently high class were significantly younger than participants in all three other classes. Also, subjects in the gradually remitting class were significantly younger than quickly remitting subjects. In addition, there were more males in the gradually remitting and persisting high classes compared to the persisting low and quickly remitting classes. The racial breakdown of the gradually remitting class was significantly different than those of the persisting low and persisting high classes, with fewer white subjects and more black and other race subjects. There were no differences in SES. Baseline CGAS scores were higher, and mania and ADHD severity were significantly lower in the persisting low class compared to the other three classes. Additionally, baseline ADHD severity was significantly lower in the quickly remitting class compared to the gradually remitting and persisting high classes. All N = 92 baseline healthy control participants were in the persisting low class. Rate of stimulant use was significantly lower in the persisting low class compared to the three other classes, and participants in the quickly remitting class used stimulants significantly less than participants in the persisting high class.

Table 3 Baseline demographic and diagnostic characteristics of N = 251 participants by latent class

Table 4 shows psychiatric diagnoses through 10-year follow-up, adult SUD, and temperament and character factors by latent class. Comorbid disorders were considered anytime through 10-year follow-up, and temperament assessments took place 2 years after baseline. They were correlates of course. In general, the persisting low class had the least psychopathology. Because this class contained primarily baseline healthy controls who remained consistently healthy over the 10 years of study, comparisons of interest are those between the quickly remitting, gradually remitting, and persisting high latent classes.

Table 4 Characteristics of N = 251 participants by latent class controlling for baseline age, socioeconomic status, gender, race, baseline mania, and stimulant use

The persisting high class had significantly higher rates of MDD and ODD than the gradually remitting class and higher rates of MDD than the quickly remitting class. Rates of mania and social phobia were higher in the persisting high class compared with the gradually remitting class, but significance did not remain after Bonferroni correction. Similarly, the persisting high class had higher rates of panic attack than the quickly remitting class, but these results did not hold up to correction for multiple comparisons. The persisting high class had the highest rate of SUD in adulthood (60 %), although this rate was not significantly greater than the rates in the other classes (PL: 19.4 %, QR: 18.5 %, GR: 36.4 %). There were no significant differences in rates of alcohol use disorder or cannabis use disorder. However, the persisting high class had marginally higher rates of both alcohol and marijuana use than the quickly remitting class and marginally higher rates of cannabis use disorder than the gradually remitting class.

Temperament and character differences were evident between the quickly remitting and persisting high classes, with the persisting high class having less persistence, less self- directedness, and more fantasy.

Discussion

Study findings elucidate key psychopathological features that characterize severe trajectories of ADHD that persist into adulthood. A persisting high ADHD trajectory had the highest rates of comorbid major depressive disorder and oppositional defiant disorder. These findings are in line with a number of studies finding comorbid psychopathology among persisting cases of ADHD in both clinical and non-clinical samples (Barbaresi et al. 2013; Barkley et al. 2008; Kessler et al. 2011; Lara et al. 2009; Mannuzza et al. 1998). In contrast to other longitudinal studies, CD was not found to be more common in the persisting high class. This may be because of the inclusion of females in the sample and the low prevalence of CD found in the overall sample at baseline (see Geller et al. 2002). The lack of SUD findings may also be related to the low base rate of CD, as CD symptoms are thought to account for the relationship of ADHD and SUD in some larger longitudinal investigations (Lynskey and Fergusson 1995); however, alcohol and cannabis use were marginally significant in the persisting high group compared to the others, suggesting power limitations could also account for the lack of SUD in persistent ADHD.

Importantly, the persisting high ADHD group had less persistence, less self-directedness, and more fantasy on temperament and character measures than the quickly remitting group, whose severity of ADHD diminished into adulthood. It is not altogether surprising that the group characterized by persisting high ADHD had low self-directedness and persistence, as lack of persistence in effortful tasks has been well described in laboratory and actual settings and speculated to be related to motivational and reward sensitivity differences in ADHD (for review, see Barkley 2006; Rettew et al. 2004). Similarly, children with ADHD may be described by teachers as seemingly off-task or fantasizing, though fantasy specifically assessed as a character trait (e.g., tendency to imagine/daydream) in ADHD is not empirically well investigated (Rettew et al. 2004).

Limitations of the study include the secondary analysis of a dataset originally ascertained for the investigation of BP-I; most of the affected subjects at baseline had BP-I as their primary diagnosis, which is a possible confounder. However, the vast majority of the BP-I sample had comorbid ADHD, and mania severity was included as a time-varying covariate in the growth mixture models that determined the ADHD trajectories. Further, baseline BP-I diagnosis was controlled for in statistical comparisons of the ADHD trajectory classes. Given the richness and comprehensive nature of this large longitudinal dataset, the comparison ADHD group and the ADHD severity in the BP group meaningfully informs trajectories of ADHD. Despite the rigorous statistical control, ADHD in the BP-I group could reflect a unique pathology, and this should be considered in future studies. The study did not include the inattentive type of ADHD and study findings are not generalizable to this group. A potential limitation was the use of medications in the sample. However, this was also addressed statistically by covarying for stimulant use. Another limitation is the lack of teacher informed ADHD in children; however, because the study tracked individuals over 10 years, teacher reports became inconsistently available. This is not an uncommon problem in longitudinal studies of ADHD into adulthood and does not minimize the merit of the study in adding to a critical body of literature informing long term clinical outcomes. A recent study suggested teacher reports did not predict persistence of ADHD (Cheung et al. 2015). Finally, the higher SES of the overall sample may limit generalizability to other populations.

Conclusions

Despite the limitations, the present investigation tracked participants over 10 years from middle childhood into early adulthood, informing characteristic trajectories of ADHD severity into adulthood using advanced statistical methodology. Current findings add to the literature on the characteristics of childhood ADHD that persist into adulthood and begin to elucidate what factors may emerge in such a consistently severe group of individuals with ADHD. Findings suggest that key temperament features may be important markers of a persistent course and therefore might be key targets for early intervention. Taken together, findings highlight the need for additional, larger investigations that could shed light on ADHD subjects most at risk of adverse adult outcomes so that limited public health resources can be optimized.