Introduction

Comorbidity and symptom overlap are common among psychiatric disorders of childhood (Frick et al. 2020). Consequently, differential diagnosis is often a challenge, which in turn hinders accurate treatment planning and treatment efficacy. The high comorbidity rate among children affected by both ASD and ADHD serves as an exemplar of the challenges of differential diagnosis in child psychopathology (Rodriguez-Seijas et al. 2019).

Furthermore, recent research has demonstrated that the comorbidity of ADHD and ASD persists into adulthood (Solberg et al. 2019a, b). At the same time, however, this same research highlights the importance of differential diagnosis. For example, individuals with ADHD were found to be three times more likely than those with ASD to suffer a substance use disorder.

The presence of Intellectual Disability (ID) further complicates differential diagnosis of ADHD from ASD as this condition often co-occurs with both syndromes. In a sample of over 2000 children with ASD, over 18% also had ID (Levy et al. 2010; Tonnsen et al. 2016), and 3.5% of children and adolescents with ID also have ADHD, which is similar to the 3.9% in children and adolescents without ADHD (Platt et al. 2018).

The research literature cited documents the difficulties clinicians face when disentangling the diagnoses, symptom overlap, and symptom differentiation when evaluating children and adolescents referred for both ADHD and ASD symptomatology. Thus, the current study seeks to address this clinical challenge by determining whether or not widely used parent and teacher rating scales can aid this task. The specific aims of this investigation were to (a) determine whether or not the Parent and Teacher Rating Scales of the BASC-3 could be used to differentiate ADHD from ASD accurately and (b) identify the BASC-3 obtained scores that were most useful for making this clinical differentiation. The BASC-3 holds promise for making this diagnostic differentiation because of prior research indicating that this measure has functioned well to diagnose ASD in young children (Bradstreet et al. 2017) and ADHD in children and adolescents (Reynolds and Kamphaus 2015).

Methods

Subjects

The clinical sample in this study consists of 118 children diagnosed with autism spectrum disorders (ASD) and 118 children diagnosed with attention-deficit/hyperactivity disorder (ADHD). The children were evaluated for participation who met the specified inclusion criteria by trained recruiters and independent examiners from various schools and clinical settings (this process is described in detail in the BASC-3 professional Manual; Reynolds and Kamphaus 2015). For the ADHD as well as the ASD samples, information on IQ was not available. Information regarding subtypes of ADHD represented in the clinical sample was not available, and future studies may wish to address this shortcoming. The ADHD group has demographically similar characteristics as the ASD group with respect to the age, parents’ education level, gender, and ethnicity. Table 1 presents the demographic characteristics of the two groups. For parent education level (i.e., the highest school grade completed by the child’s mother), Level 1 indicates grade 11 or less, Level 2 for high school graduate, Level 3 for 1 to 3 years of college or technical school, and Level 4 for 4 years of college or more. For race/ethnicity, AA represents African American, HI for Hispanic, OT for Asian and Other, and WH for White.

Table 1 Sample demographic characteristics

Measure

Child (ages 6 through 11) and Adolescent (ages 12 through 21) forms of the Behavior Assessment System for Children-Third Edition (BASC-3) Parent Rating Scales (PRS) and Teacher Rating Scales (TRS) are used (Reynolds and Kamphaus 2015). The PRS and TRS each provide a comprehensive measure of a child’s adaptive and problem behaviors in community/home and school settings, respectively. Both forms contain descriptors of behaviors that informants rate on a four-point scale of frequency, ranging from Never to Almost Always (i.e., 0 = Never, 1 = Sometimes, 2 = Often, and 3 = Almost Always).

Psychological profiles with strong theoretical relationships to ASD symptomology on BASC-3 include the Developmental Social Disorders, Atypicality, and Withdrawal scales. The Developmental Social Disorders content scale provides a global indication of deficits in social skills, communication, interests, and activities, and an elevated score may indicate the presence of any symptoms reflective of ASD. The Atypicality clinical scale measures a child’s tendency to behave in ways that are considered odd and unusual. The Withdrawal clinical scale measures a child’s tendency to avoid contact and to lack interest in making contact in social settings as well as children who do not attach well to others. Previous BASC research has shown that Atypicality and Withdrawal scale scores are markedly elevated in children diagnosed with an ASD (Volker et al. 2009). The Autism Probability Index (AUI), a completely actuarially derived index, which is new to the BASC-3, was also selected for analysis. The AUI provides an indication of the similarity between the obtained behavioral ratings and the ratings of children identified as having an ASD.

The Attention Problems and Hyperactivity clinical scales were used to assess a subset of core ADHD symptoms. The Attention Problems scale measures an inability to maintain attention and the tendency to be easily distracted from tasks requiring attention. The Hyperactivity scale assesses the hyperactivity and impulsivity aspects of ADHD. ADHD Probability Index (ADI) provides an indication of the similarity between the obtained behavioral ratings and the ratings of children identified as having ADHD. Since the items included in this index are not limited to attention problems and hyperactivity, it is of interest to examine its power in discriminating ASD from ADHD groups.

Extensive psychometric properties of the selected scales are reported in the BASC-3 Manual (Reynolds and Kamphaus 2015). Most alpha coefficients are above .85 for all ages. Concurrent validity as indicated by the correlation between similar BASC-3 and Autism Spectrum Rating Scales (ASRS; Goldstein and Naglieri 2010) scales is reported as moderate to high.

Procedures

To compare ability of the selected BASC-3 profiles to the discriminate between the ASD and ADHD profiles, taking into account various model assumptions and theoretical framework, v logistic regression models were applied to examine the relationship between the scales selected and clinical diagnosis. Each model’s resulting area under the Receiver Operating Characteristics Curve (AUC) is reported. In addition, using the predicted probability .60 as the cut point, the predictive accuracy is obtained as indicated by the percentage of the correct prediction, the false-positive (FP) prediction errors, and the false-negative (FN) errors.

In addition, independent t tests and Cohen’s d effect size are calculated to validate the statistical and clinical significance of the score differences between the two groups. Since norms using ADHD groups are available in BASC-3, these tests are conducted using both general combined norms and clinical norms established using ADHD sample.

Results

Table 2 presents the logistic models’ results for the PRS and TRS. The sensitivity of the models including ASD scales is above .80. The prediction accuracy rate is around .72, and FP rate is low. On the other hand, the models including ADHD scales in diagnosis of ASD do not show discrimination with AUC being around .50, and high false-negative rate.

Table 2 Predictive accuracy

Tables 3 and 4 show consistent results using general combined and clinical norms. First, the ASD and ADHD groups show significant differences in all ASD-related scales with the effect sizes ranging from − 1.1 to − 1.5. A large number of t tests are reported in these tables; however, the majority of the differences seen on the key scales investigated were significant at the p < .01 level, and the effect sizes, reported in these same tables, are substantial and of even greater importance in diagnostic decision-making than traditional statistical significance testing. Second, the two groups demonstrate negligible variations in Attention Problems and Hyperactivity scales. Except for the PRS, the differences between the two groups are insignificant with small effect sizes (d = − .01 to − .02). This suggests ADHD indicators do not differentiate the ASD and ADHD groups well.

Table 3 Differentiation of Autism from ADHD: General Combined Norm Sample
Table 4 Differentiation of autism from ADHD: clinical norm sample

Conclusions

One way to help clinicians meet the challenges of differential diagnosis is to create improved assessment methods and procedures. Considerable attention, for example, was paid improving the item content, that is, content validity of the BASC-3 over the previous edition of the BASC (Reynolds and Kamphaus 2015). Items were added to assess more fully the complete range of symptoms associated with ASD, new empirically derived scales were created to provide the clinician with more options for making differential diagnoses of ASD, and existing BASC clinical scales were re-analyzed with new items to ensure greater homogeneity of construct assessment. This latter procedure resulted in increases in internal consistency coefficients for most scales. The current results, demonstrating considerably improved differential validity for ASD and ADHD syndromes, may be explained by using these construct assessment improvements in the revision process.

For now, the BASC-3 ASD-related diagnostic scales show considerable promise for two purposes, (1) diagnosis of ASD and (2) differential diagnosis of ADHD and ASD. Clinicians may have further confidence in these findings due to the fact that the same results were found for relatively large clinical and general normative samples. Both parent and teacher ratings’ of the behavioral profiles using the content and clinical scales of the BASC-3 have the potential for ASD-sensitive and ASD-specific screening. The Autism Probability Index is also highly sensitive to the presence of ASD but has somewhat less specificity. Compared to the far lengthier ADOS-2, while the differences are considered by us to be small, the BASC-3 performed overall at a higher level of accuracy with this clinical sample than has been reported in an updated study of the diagnostic accuracy of the ADOS-2 in a referred sample. In a large clinical sample of children and adolescents (N = 1080, age 1.7 to 20.5), updated actuarial diagnostic algorithms were derived in-sample to determine the “best” possible diagnostic algorithms associated with ASD for the ADOS-2. The revised, best algorithms resulted in values of sensitivity = .85 and specificity = .86 (Kamp-Becker et al. 2017). These algorithms were improved over the original ADOS-2 Manual algorithms but also the algorithms pertained especially to cases with core autism, and to girls. Kamp-Becker et al.’s results also suggested less effective diagnostic differentiation for children and adolescents with internalizing disorders and conduct disorder. The BASC-3 scales show superior sensitivity to the detection of ASD when it is present with only small sacrifices to specificity, or the detection of the absence of ASD in referral samples that are also fraught with children where ADHD is a key differentiator. The practical efficacy of the BASC-3 scales in terms of both clinician time and costs offer a clear alternative to lengthier assessments with no loss of diagnostic accuracy. Further research however is suggested to look at combining the results of BASC-3 with assessments such as the ADOS-2 in hopes that the combined assessment results can further improve the accuracy of what can be life changing diagnostic decisions for children and their families. The use of IQ as a moderator variable in differential diagnosis in referral samples also may be a fruitful area for future research on discriminating persons with ASD (or the detection of comorbidities of various forms) from persons with disorders that have overlapping symptoms. The interpretation of test scores and particularly the application of diagnostic labels must be considered in context of the referral base and base rates of various disorders, which can vary considerably by setting (i.e., see discussion by Reynolds (2016)).