Introduction

Autism spectrum disorder (ASD) is a neurodevelopmental disorder that affects more than one in 70 individuals (Centers for Disease Control and Prevention 2014). Social communication difficulties and restricted, repetitive behaviours are defining features of ASD (American Psychiatric Association 2013). Structural language skills (including use of grammar, syntax and phonology) in children with ASD vary widely. Some children with ASD never develop the ability to speak while others demonstrate superior language ability (Tager-Flusberg et al. 2011, 1990). In addition, children with ASD may show atypical language presentation such as regression in communication skills in the first few years of life (see Barger et al. 2013 for a review) and persistent or unusual repetitive, stereotyped language (Tager-Flusberg et al. 2011). Several studies have reported children with ASD to have stronger expressive relative to receptive language ability (e.g., Ellis Weismer and Kover 2015; Pickles et al. 2014), however other studies have not found this to be the case (Kwok et al. 2015). It has been argued that the apparent receptive-expressive discrepancy may relate to the use of standardized language tests (Tek et al. 2014; Wittke et al. 2017). For example, standardised tests may not fully capture the child’s language ability because children with ASD do not always respond well to structured testing due to attention and motivation difficulties (Condouris et al. 2003). Verbal skills in childhood play an important role in predicting long-term outcomes in children with ASD in wellbeing, adaptive functioning and mental health (Billstedt et al. 2011; Gillespie-Lynch et al. 2012; Howlin and Moss 2012; Howlin et al. 2013).

Children with ASD have differing trajectories of language development, which include loss of skills, plateau, improvement and acceleration (Anderson et al. 2007; Ellis Weismer and Kover 2015; Kjelgaard and Tager-Flusberg 2001; Tager-Flusberg et al. 1990). Children with ASD and stronger verbal ability have been reported to have trajectories comparable to those with typically developing language on most measures while children with low verbal skills have made slower progress (Tager-Flusberg et al. 1990; Tek et al. 2014;). Substantial variability in communication trajectories has also been reported in the first few years of life in ASD (Brignell et al. 2016; Landa et al. 2013; Lombardo et al. 2015) with more stable and predictable patterns of development reported from 6 to 19 years (Pickles et al. 2014).

Variability in trajectories seen in ASD may not be unique to the condition. Studies of language growth in children with specific language impairment, have also reported greater variability in trajectories before 6 years and relative consistency from 6 years of age onwards (Conti-Ramsden et al. 2012; Law et al. 2008). Variability has also been found in population studies of children under 8 years (McKean et al. 2016; Taylor et al. 2013) and in children under 9 years with non-spectrum developmental disabilities (Anderson et al. 2007). However, clearly-defined comparison groups within the same cohort are needed to test whether this pattern of language trajectory is indeed unique to ASD.

Language trajectories may differ according to the severity of the child’s language problems and the time period being studied (Tek et al. 2014; Toth et al. 2006). Studies of children with ASD have examined a range of factors that may impact on later language outcome, focusing both on environmental (e.g., parent interaction style, socio-economic status and intervention; Ellis Weismer and Kover 2015; McDuffie and Yoder 2010; Siller et al. 2013) and child factors (e.g., play, gesture, joint attention, imitation, IQ, language ability, gender and ASD symptoms; Baghdadli et al. 2012; Ellis Weismer and Kover 2015; Luyster et al. 2007; Toth et al. 2006). Nonverbal IQ is the most consistently reported significant predictor of later language outcome (Thurm et al. 2015; Wodka et al. 2013). Earlier language ability has also been found to be important (Ellis Weismer and Kover 2015; Turner et al. 2006). There have been inconsistent findings around the importance of ASD symptoms in predicting language outcomes. Severity of ASD symptoms are reported as significant by some studies (Baghdadli et al. 2012; Ellis Weismer and Kover 2015; Magiati et al. 2011). One study that grouped children into categories of no words/single words, phrase or fluent speech based on ADOS module found severity of ASD symptoms was a stronger predictor than non-verbal IQ for the age at which children acquired speech (Wodka et al. 2013). By contrast, others have noted social ability is less important after factors such as nonverbal IQ are taken into account (Sigman and McGovern 2005; Thurm et al. 2015).

Despite an increase in the number of studies that have investigated patterns and predictors of language outcome in ASD, clinicians are still unable to provide timely, accurate prognostic information about language outcomes to parents based on their child’s individual skill profile. Policy makers and service providers also require information about communication outcomes to inform decisions regarding allocation of resources and services. The aims of this study were to (a) examine individual patterns of receptive and expressive language development in children with ASD from 4 to 7 years; (b) compare the proportions of children with ASD who had declining, stable and accelerating patterns of language development between 4 and 5 and 7 years to children with typical development (TD) and language impairment (LI); (c) compare mean language development for children with ASD over time to children with LI and TD; (d) investigate whether children with ASD have relative weakness in receptive compared to expressive language (e) investigate predictors of language outcomes from 4 to 7 years, in a representative community sample.

Methods

Participants

Participants were drawn from the Early Language in Victoria Study (ELVS), a longitudinal, community-based study of language development in children (n = 1910). Participants were recruited from 6 of 31 local government areas around metropolitan Melbourne to represent a range of social economic advantage and disadvantage. Exclusion criteria were known developmental delay (e.g., Down syndrome), cerebral palsy or other serious medical conditions when the child was 8 months, and if parents had inadequate English to complete questionnaires (see Reilly et al. 2006, 2007 for further details). Several subgroups were selected from the ELVS for comparison. Communication growth from 1 to 2 years for these same groups have been reported in a prior publication (Brignell et al. 2016).

Ethical approval was obtained from the Royal Children’s Hospital (#23018) and La Trobe University, Human Ethics Committee (#03-32). All parents provided written, informed consent.

Measures

Demographic details were collected when participants were aged 8–12 months. The Clinical Evaluation of Language Fundamentals-Preschool-Second Edition (CELF-P2; Wiig et al. 2006) was administered to participants at 4 years and the Clinical Evaluation of Language Fundamentals-Fourth Edition (CELF-4; Semel et al. 2003) at 5 and 7 years. This comprehensive standardised tool measures receptive and expressive language. Language domains including morphology (grammar), syntax (sentence structure), semantics (word meanings) and vocabulary were assessed and included for both plotting change over time and as the outcome measures for the regression analysis. Trained psychology graduates and speech pathologists administered the direct language assessments. Receptive and expressive index standard scores from the CELF-4 were used.

The Kaufman Brief Intelligence Test-2 (KBIT-2; Kaufman and Kaufman 2004) and Wechsler Abbreviated Scale of Intelligence (WASI; Wechsler 1999) were administered at 4 and 7 years, respectively. Standard scores were used from each tool. The Pediatric Quality of Life Inventory 4.0: parent report form (PedsQL; Varni et al. 2001) and the Strengths and Difficulties Questionnaire (SDQ; Goodman 1997) were completed by parents when participants were aged 4 years. A critical review of studies that used tools to assess quality of life in children with ASD found the PedsQL was the most frequently used quality of life tool for children with ASD. It was also the only tool where both reliability and validity had been established for children with ASD. Significant differences were found in social functioning scores between individuals with ASD and chronic health conditions and individuals with ASD and healthy individuals (i.e. no chronic condition) (Ikeda et al. 2014). Several studies have reported on the performance of children with ASD on the SDQ (e.g. Iizuka et al. 2010; Russell et al. 2013). These studies found children with ASD had more difficulties than those without ASD (and children with other co-morbidities) in the peer problems and prosocial behaviour subscales.

Identification of Subgroups

Identification of Participants with Autism Spectrum Disorder

From 4 years of age parents were asked if their child had received a diagnosis of ASD. Following parent report a psychologist (MP), experienced in the diagnosis of ASD, conducted a telephone interview to verify the diagnosis and collect information on: type of diagnosis, age of diagnosis, comorbidities and where the diagnosis was made, who made the diagnosis and some general questions about schooling and progress. The psychologist did not use an autism diagnostic tool during this interview. Forty four children were reported to have ASD, yielding an estimated prevalence between 2.3–3.0% with denominators representing those participants who remained in the study at 7 years (n = 1438) and those who were originally enrolled in the study (n = 1910). The prevalence of ASD in the ELVS cohort at 7 years is consistent with another population-based Australian study (the Longitudinal Study of Australian Children) that reported a parent-reported estimated ASD prevalence of 2.5% at 7 years and 2.4–3.9% at 10 years of age (May et al. 2017; Randall et al. 2015). Similar estimates have also been reported in several international studies ranging from 2.2 to 2.6% (Kim et al. 2011; Waugh 2016; Zablotsky et al. 2013). The majority of children (n = 41/44; 93%) received a diagnosis of ASD from a multidisciplinary team with at least two professionals (typically paediatrician, psychologist, speech pathologist). The remaining children (n = 3) were diagnosed by a paediatrician. Families of children with ASD also completed the Social Communication Questionnaire (SCQ; Rutter et al. 2003) at 7 years of age. Mean scores on the SCQ for the children with ASD in this study were 13.3 (SD 4.08; range 5–20). Mean SCQ scores were lower than might be expected for children diagnosed with ASD in this study indicating milder ASD symptoms. This may be related to the sample being community-based compared with clinically-derived samples. The SCQ is a parent reported measure of ASD symptoms and is not equivalent to a comprehensive diagnostic assessment that takes into account multiple assessment tools, informants and child behaviour in different environments. Forty-four children in the sample were diagnosed with ASD but in the present study only those children who had completed the CELF at 4 or 5 and 7 years (n = 26–27) were included in the analysis.

Identification of Language Impaired and Typical Language Development

Children were allocated to the language impaired (LI) and typical language (TD) groups based on their performance on the CELF and the WASI. Group allocation was conducted at 7 years. Children in the LI group had a WASI score ≥ 70 and a CELF score at two time points (7 years and 4 or 5 years) ≥ 1.25 standard deviations below the mean on either the receptive and/or expressive index. For the TD group children were required to have a WASI standard score ≥ 85 and a CELF score at 7 years and 4 or 5 years of ≤ 1.25 standard deviations below the mean on both the receptive and expressive index. Children were excluded from these groups if they had ASD. These criteria for group assignment were based on previous literature (Reilly et al. 2014; Tomblin et al. 1996). Children were only included in the current study if their language skills were measured on the CELF at 4 or 5 and 7 years. This produced 119 children in the LI group and 861 children in the TD group. Seventeen children with ASD were excluded from the analyses because they did not complete the language assessment at two time points. Reasons for missing data included: withdrew from the ELVS (n = 4), verbal ability too limited to complete CELF (n = 3), unable to schedule (n = 3), child non-compliant to testing (n = 4) and parents too busy with other interventions to schedule assessment (n = 3). We tested whether there were differences between the characteristics of these 17 children and those with ASD who were included in the study. There were no significant differences (p > 0.05) for the eight demographic factors (gender, number of children at home, English main language spoken at home, married/defacto, socio-economic advantage, maternal age at birth, primary caregiver completed high school and CDI total language score at 2 years). However, mean scores on early language and IQ measures were lower for the children with ASD who were not included in the analysis (see Supplementary Table A for detail).

Predictor Measures for Whole Sample

Nine predictors were used in this study. Baseline receptive and expressive language ability was based on a child’s standard score at 4 or 5 years on the CELF-P or the CELF-4. Standard scores from the receptive and expressive index were used. Social communication skills were measured using the social functioning subscale from the PedsQL and peer relationship problems and prosocial behaviour subscales from the SDQ. Autism was assigned as described above. Nonverbal IQ was measured using the KBIT-2 matrices subtest. Standard scores were used. The ELVS sample at 7 years is skewed toward families who are more socio-economically advantaged (McKean et al. 2015) thus Socio-Economic Indexes for Areas (SEIFA) quintiles were used based on the Australian Bureau of Statistics reference. We were not able to assess intervention as a predictor of outcome because data were not consistently collected at each wave on intervention.

Statistical Analysis

All analyses were conducted using Stata version 13.1. Individual patterns of language growth from 4 to 7 years were plotted for the children with ASD. The proportions of children with ASD who had declining, increasing and stable patterns were compared to children with LI and TD using the Chi square statistic. Various methods for studying language and developmental trajectories are available (e.g., Landa et al. 2012; Venker et al. 2014), however, the small sample size of children with ASD in the current study limited the range and complexity of language development analyses that could be conducted. We defined declining and increasing patterns as more than one standard deviation change (±15 points) in standard scores between 4 and 7 years on the CELF. A one standard deviation change is denoted as a clinically meaningful change in the CELF test manual and is consistent with cut points used in at least one previous study examining language trajectories (e.g., Snowling et al. 2016). Data were also analysed using the Generalised Estimating Equations (GEE) method for fitting the marginal models. An exchangeable correlation structure was used and robust standard errors, which take the dependence of the multiple responses from each participant, were taken into account. The mean change in language from 4 to 7 years for each group was plotted for these models. To assess whether a diagnosis of ASD predicted a greater gap between receptive and expressive language at 4 and 7 years of age we used linear regression with an autism diagnosis as the explanatory variable and the difference between CELF scores at 4 and 7 years as the outcome variable. Cohen’s d effect sizes are reported to indicate the strength of the differences in receptive-expressive language gap between children with and without ASD.

We also used linear regression to analyse predictors of language outcome. Predictors were included in the model if they were found to be significant univariate predictors of CELF-4 standard scores. Multicollinearity between the social communication measures (i.e. subscales of the SDQ, PedsQL) was assessed to prioritise the measures. A variance inflation factor of 1.2 indicated low correlation so all three social communication measures were kept in the model. Finally, to assess whether social communication may influence language outcomes differentially by group we conducted bivariate linear regression analyses. We included each social communication measure separately for each group for each language outcome. We then added IQ to the regression model to test whether social communication remained a significant predictor of language receptive or expressive outcome.

Results

Participant characteristics are described in Table 1. There was no evidence of a difference (p > 0.05) between any of the three groups (ASD, LI, TD) in number of indigenous children, families where English was the main language spoken at home and maternal age at birth. A higher proportion of children with ASD were male compared to the other groups (ASD vs TD: χ2 17.05, p < 0.001; ASD vs LI χ2 6.61, p = 0.01). Children with LI were more likely to have a greater number of children living in the home (t = 3.0, p < 0.001) with parents who had not completed high school (χ2 11.02, p < 0.001) and who were not married/defacto (χ2 11.88, p = 0.001) compared to the TD group. The LI group was more socially disadvantaged than both the TD (t = 2.25, p = 0.02) and ASD (t = 6.95, p < 0.001) groups. Mean IQ was 99.71 (SD 14.35) for children with ASD included in the analysis with no child having a non-verbal IQ < 70. Children with TD had significantly higher non-verbal IQs than children with ASD (t = 3.01, p < 0.001) and children with ASD had significantly higher nonverbal IQ than children with language impairment (t = 2.74, p = 0.007). Children with ASD had significantly higher scores on social measures (SDQ pro-social, t = 2.09, p = 0.037; PedsQL social, t = 2.12, p = 0.0345; and SDQ peer problems, t = 3.36; p = 0.0008) indicating they had greater social difficulties. Children with LI had significantly higher scores than those with TD on the PedsQL social (t = 2.46, p = 0.014) and SDQ peer problems (t = 4.78, p = 0.001) measures.

Table 1 Participant characteristics

Individual Change in Language Development for Children with ASD

Changes in language development for individual children with ASD (n = 26–27) are shown in Fig. 1. Three main patterns: declining, accelerating and stable were found for receptive and expressive language in children with ASD from 4 to 7 years. The majority of children with ASD had stable patterns with scores, remaining within one SD of their previous standard score from 4 to 7 years. Three children with ASD could not be tested on the CELF at all three time points due to limited language abilities. One child was minimally verbal (≤ 10 words) at 4 years and continued to have limited receptive and expressive language at 7 years. Another child was verbal but did not have adequate language to complete testing at 4 and 7 years. The third child could only use a limited number of phrases and communicated primarily using sign at 4 and 5 years. This child was verbal by 7 years and able to complete the CELF-4.

Fig. 1
figure 1

Individual patterns of language development from 4 to 7 years of children with ASD (n = 26–27). Standard scores on the CELF are presented

Patterns of Language Development

Language development patterns were defined using changes in standard scores for children who were able to complete the CELF (n = 27). Three children who attended assessments at required ages but were not able to complete the CELF are described separately and are not included in the sample of 27. Overall there were no significant differences between the three groups (ASD, LI, TD) in the proportions of children who demonstrated declining, stable and accelerating patterns in receptive language (χ2 4.0932; p = 0.394) and expressive language (χ2 5.9358; p = 0.204) (Table 2). Eighty-one and eighty-five percent of children with ASD had relatively stable standard scores in receptive and expressive language respectively from 4 to 7 years. Eight and eleven percent of children had increasing and 7 and 8% declining patterns, in receptive and expressive language respectively.

Table 2 Patterns of development (decline, maintain and accelerate) for children with ASD, LI and TD

Mean Course of Language Development for Each Group

Receptive Language

The estimated mean standard scores for the LI group were 24.6 units lower than the TD group across the three waves of data collection (p < 0.001; 95% CI − 26.1, − 23.1). The estimated mean standard scores for the ASD group were 18.0 units lower than the TD group across the three waves of data collection (p < 0.001; 95% CI − 23.2, − 12.7). There was a small effect of gender in the whole cohort (ASD, LI and TD) with girls having a mean standard score 1.6 units higher than boys (p = 0.005; 95% CI 0.5, 2.8). There was no evidence of effect modification by gender on language grouping (i.e. ASD, LI, TD).

Expressive Language

Estimated mean standard scores were 24.4 units lower for the LI than the TD group across the three waves of data collection (p < 0.001; 95% CI − 26.1, − 22.6). Estimated mean standard scores were 18.1 units lower for the ASD than the TD group across the three waves of data collection (p < 0.001; 95% CI − 24.0, − 12.2). There was no evidence of an effect of gender on the mean standard scores for the whole cohort nor was there any evidence of an interaction between gender and grouping.

Mean patterns of development from 4 to 7 years for children with ASD, LI and TD are presented in Fig. 2a, b. While language ability varied between the three groups, the slopes of change over time in each group were generally flat, indicating the development of language was generally at a comparable rate. Exceptions to this were girls with ASD who demonstrated variable patterns of growth and the LI group who demonstrated a decrease of approximately 7 standard score points from 5 to 7 years in expressive language. Of note, girls with ASD achieved lowest scores of all three groups in receptive language. In order to assess whether using standard scores may have masked variability in language development we compared language development using standard scores to raw scores. Patterns of development for raw scores were similar to those for standard score, with children with LI and ASD having lower mean scores but tracking in parallel to children with TD.

Fig. 2
figure 2

Mean language trajectories (CELF standard scores) for male and female subgroups of children with ASD, TD, LI in receptive language (a) and expressive language (b)

Receptive-Expressive Language Profiles for ASD and non-ASD

There was a wide range of receptive/expressive difference scores (i.e. CELF receptive index standard score minus CELF expressive index standard score) observed in children with and without ASD at 4 years (− 30 to 36), 5 years (− 43 to 43) and 7 years (− 35 to 51). There was no evidence that having a diagnosis of ASD predicted greater receptive-expressive difference scores at 4 years (ASD n = 30, non-ASD n = 1530 at 4 years), 5 years (ASD n = 23, non-ASD n = 959) or 7 years of age (ASD n = 30, non-ASD n = 1174). Children with ASD aged 4 years had on average a 0.45 unit lower difference between expressive and receptive language than children without ASD (95% CI -3.1, 4.0; p = 0.81, d = 0.05). At 5 years children with ASD had on average a mean difference 2.0 units higher than children without ASD (95% CI − 2.9, 6.8; p = 0.43, d = − 0.17) and at 7 years children with ASD had on average a mean difference 1.7 units lower than children without ASD (95% CI − 5.8, 2.3; p = 0.40, d = 0.15). See Fig. 3.

Fig. 3
figure 3

Distribution of the receptive-expressive difference scores for non-ASD (n = 1204–1560) and ASD (n = 30) children at 4 (a) and 7 years (b)

Predictors of Receptive and Expressive Language Outcomes at 7 Years for Whole Sample

Five of nine predictors examined using multivariate analysis made a statistically significant independent contribution to variance in receptive language outcome. These included receptive (B = 0.38, p < 0.001) and expressive baseline scores (B = 0.23, p < 0.001), gender (B = 1.63, p = 0.01), non-verbal IQ (B = 0.17, p < 0.001) and socio-economic disadvantage (B = 0.63, p = 0.02). For expressive language, only receptive (B = 0.19, p < 0.001) and expressive (B = 0.56, p < 0.001) baseline scores and IQ (B = 0.07, p = 0.002) were significant. A diagnosis of autism and social ability at 4 years were not significant predictors of either receptive or expressive language ability at 7 years once gender, baseline receptive language skills, baseline expressive language skills and IQ at 4 years of age were taken into account. The model explained 44% of the variance in receptive language outcome (p < 0.001) and 58% of the variance in expressive language outcome (p < 0.001). See Table 3.

Table 3 Predictors of language outcomes

We investigated the social communication variables further to test whether they influenced language outcomes for the three groups differentially. First, we conducted bivariate linear regression with each social communication measure for each language outcome by group. Here the PedsQL social and SDQ peer problems variables were found to be significant predictors for receptive language in children with ASD (β = 0.46, p = 0.038 and β = − 4.1, p = 0.032, respectively). Pro-social behaviour was a significant predictor (β = 0.43, p = 0.017) of receptive language for the TD group. None of the social variables reached significance for children with LI. For expressive language, SDQ-peer problems was a significant predictor (β = 5.46, p = 0.011) for the ASD group and no other social communication variables reached significance for either language outcome in any other group. Next we added non-verbal IQ (KBIT-2) to the model for those analyses where social variables were found to be significant predictors of language outcome. No social variables maintained significance for any of the three groups once non-verbal IQ was added to the regression model.

Discussion

To our knowledge this study is the first to report on verbal children with ASD (IQ > 70) using clinician administered, comprehensive, standardised language assessments in a longitudinal community sample that has used the same measures for children with LI and TD. The community sample minimizes ascertainment bias and enables important comparisons.

This study found individual variation in the type of language pattern observed for children across all three groups (LI, TD and ASD), however, the groups were remarkably similar in the proportions of children with declining, maintaining and accelerating patterns from 4 to 7 years. For children with ASD, 81–85% had relatively stable standard scores in receptive and expressive language respectively from 4 to 7 years. Only 7–8% of children had standard scores that had decreased more than one standard deviation from their previous scores and 8–11% of children had more than one standard deviation increase in receptive and expressive language standard scores (acceleration).

This finding is consistent with two other studies that used different methodology to the current study. Fountain et al. (2012) mapped trajectories of communication functioning based on interview questions from an annual Client Development Evaluation from 3 to 14 years. Most children with ASD improved their communication functioning over time with around 7.5% described as “bloomers” (communication functioning accelerated over time). For the whole sample, most rapid development was found to occur prior to 6 years with functioning becoming more stable beyond this time (Fountain et al. 2012). Pickles et al. (2014) investigated parent-reported adaptive communication skills in children with ASD from 2 to 9 years and found that despite substantial heterogeneity in language development in the early years, progress beyond 6 years was mostly uniform. Pickles et al. (2014) hypothesised that heterogeneity in early language development may be related to increased brain plasticity and greater sensitivity to environmental stimuli that facilitate early language development. This pattern of development over time is supported by a prior study that used the same cohort as the current study and mapped language from 1 to 2 years (Brignell et al. 2016). Language change from 1 to 2 years was found to be more variable than that reported in the current study from 4 to 7 years. There are, however, other possible explanations for his change in development over time including less accurate measurement of language ability in the early years or multiple different language development pathways that all progress to the same endpoint in later childhood. Furthermore, our study used more than one standard deviation change (i.e. decline, maintain, accelerate) to indicate a clinically meaningful change, consistent with some other studies (e.g., Snowling et al. 2016). Criteria used to assess variation in patterns has differed from study to study and the threshold used in the current study may be wider and less likely to find more subtle differences in development compared to others.

Children with ASD had lower mean scores in receptive and expressive language compared with typically developing children but scored higher than children with language impairment across the three time periods. Despite some heterogeneity in language development, most children with ASD demonstrated rates of language development comparable to those with LI and TD. In other words, progress over time for the ASD group was similar to the LI and TD group, albeit with the ASD and LI groups starting from a lower baseline. Stability in communication progress has also been reported to occur in two other studies of children with ASD (Fountain et al. 2012; Pickles et al. 2014) and is consistent with studies of children with language impairment (Conti-Ramsden et al. 2012; Law et al. 2008) and the ‘tracking hypothesis’. The ‘tracking hypothesis’ suggests despite children with LI having lower mean scores than children with TD, on average they track in parallel, with stable growth in language ability over time (Law et al. 2008). The current study presents mean scores for each group and did not subgroup children with ASD based on language ability. However, it has been argued that children with ASD and co-occuring LI may present with comparable language phenotypes to children with LI alone and children with ASD who have age appropriate language skills may present with phenotypes comparable to those with typical language development (Tager-Flusberg 2006, 2015). If phentotypes are reported to be similar and language impairment is co-morbid to ASD rather than integral, it is feasible language trajectories may also be similar and that a diagnosis of ASD in itself may be less relevant to language change over time.

Mean receptive and expressive language standard scores were lower for girls with ASD compared with all other groups in this study and female gender was a significant predictor of receptive language outcome at 7 years. However, the small sample of four girls in our study limits any specific conclusions we can make about language development in girls with ASD.

A strength of this study is that we were able to report on receptive and expressive language separately and compare each of these domains across several time points. Here we found a diagnosis of ASD did not predict a greater difference in receptive relative to expressive language. Furthermore, this study did not find a significant mean difference between the two language domains for children with ASD at any of the three time points measured. This finding is in contrast to some other studies that have found slower growth in receptive language relative to expressive language (Kover et al. 2013) and expressive language advantage over receptive language for a substantial group of children with ASD (Ellis Weismer et al. 2010; Hudry et al. 2010; Luyster et al. 2008; Mitchell et al. 2006). However, our results are consistent with a meta-analytic review investigating receptive-expressive discrepancy in ASD (Kwok et al. 2015). This review (n = 74 studies) examined receptive and expressive language performance in children with ASD and found no evidence that expressive language was stronger than receptive language in children with ASD. Furthermore, the child’s developmental stage, cognitive abilities, vocabulary, overall language ability and type of tool used to assess language and diagnose ASD were not found to be significant contributors to relative receptive-expressive discrepancy (Kwok et al. 2015). Most of these studies have not included comparison groups which has limited our understanding of whether language profiles are specific to ASD.

Several factors predicted later language ability in the whole cohort. Receptive and expressive language ability and nonverbal IQ at 4 years were found to be important predictors of language at 7 years. This is convergent with previous research using clinical samples of children with ASD that have highlighted the importance of early language and IQ (Ellis Weismer and Kover 2015; Fountain et al. 2012; Luyster et al. 2007; Paul et al. 2008; Thurm et al. 2007; Wodka et al. 2013; Yoder et al. 2015). In longitudinal studies of children without ASD earlier language ability has been found to be a predictor of later language (Bishop and Edmundson 1987; Hayiou-Thomas et al. 2014), with expressive syntax and narrative retell the strongest predictors in one study of children with language impairment (IQ > 70) from 7 to 11 years of age (Botting et al. 2001). The same study found IQ was an important predictor of later language outcome (Botting et al. 2001).

Socio-economic disadvantage (specifically being in the most disadvantaged quintile), and gender were predictors of receptive but not expressive language in the current study. This finding is consistent with other studies of children with ASD that have found caregiver education discriminated higher vs low verbal outcomes (Anderson et al. 2007; Ellis Weismer and Kover 2015). Furthermore, socioeconomic characteristics (more educated, non-minority mothers) predicted which children with ASD were more likely to make the most rapid progress in communication functioning in another study (Fountain et al. 2012). Low maternal education and low family income have been identified as a risk factor of receptive vocabulary at 7 years in another large cohort longitudinal population-based study, although expressive vocabulary was not measured (Taylor et al. 2013), and low maternal education and socioeconomic status predicted adverse receptive and expressive language outcomes at 4 years in another study that used data from the same cohort as the current study (Reilly et al. 2010). Samples of children with language impairment have also been found to contain disproportionate numbers of children with socio-economic disadvanatage (Roy and Chiat 2013; Toppelberg and Shapiro 2000).

This study did not enable us to identify the mechanisms by which socio-economic disadvantage may impact later language outcomes, however a range of factors may be at play including a possible link between socioeconomic disadvantage and parental language ability that could be inherited, family stress, access to interventions and level of language stimulation and learning opportunities in the child’s home and community environment. There is some evidence, for example, that parents in higher socioeconomic groups expose their children to larger vocabularies compared with those in lower socioeconomic groups and better outcomes have been found to be a function of parent interaction style and language input in typically developing children (Hart and Risely 1995; Hoff 2003).

It is unclear why socioeconomic disadvantage predicted lower receptive but not expressive language at 7 years in this study. We might hypothesise that receptive language development is more sensitive to the language environment than expressive language. It is also possible different factors influence receptive and expressive language outcomes in dissociable ways. For example, a study of 129 children with ASD found while ASD symptom severity predicted growth in both receptive and expressive language, cognition only predicted growth in expressive language (Ellis Weismer and Kover 2015). Moreover, another study found joint attention, intentional communication and verbal responses from parents predicted outcomes in both language domains. However, consonant inventory only predicted expressive language growth and early receptive vocabulary and autism severity predicted only receptive language growth (Yoder et al. 2015).

Diagnosis of ASD and parent-reported social ability at 4–5 years of age were not predictors of expressive or receptive language at 7 years once factors such as IQ and baseline language ability were taken into account. The methodology used in this study was unique in being able to test a categorical diagnosis of ASD as a predictor of language outcomes in a large cohort of children with and without ASD. Direct comparison to the findings of previous studies that have examined the influence of ASD symptoms on later language (e.g. Ellis Weismer and Kover 2015; Wodka et al. 2013; Yoder et al. 2015) was not possible. Our sample of children with ASD was small (and therefore may be too small to observe an effect) and we were not able to classify children into different levels of severity of ASD symptoms. Furthermore, children in our study with ASD may have had milder social communication symptoms than children from clinically-derived samples. Results should therefore be viewed with caution and await replication.

Our finding that social ability at 4 years (social functioning, peer relationship problems and pro-social behaviour) did not predict language at 7 years is consistent with the findings of some studies of children with ASD (Sigman and McGovern 2005; Thurm et al. 2015) but not others (Baghdadli et al. 2012; Ellis Weismer and Kover 2015; Magiati et al. 2011; Wodka et al. 2013). Variable findings across studies may relate to different ages being studied, the different tools used to measure social ability, or the different types of social abilities captured by the tool. It is possible some social abilities have more influence over language than others and that there are more sensitive ages and language levels where social ability has greater influence on language. For example, social ability may have more impact when children are in the early stages of learning language or for children who have more significantly delayed language.

This study did not collect detailed information on interventions children had received and is therefore not able to investigate whether intervention type or dosage predicted language outcomes. Of studies that have investigated the impact of intervention on language there have been mixed findings with some reporting intervention was not a significant predictor (e.g, Ellis Weismer and Kover 2015) and others reporting intensity of speech therapy (along with age and non-verbal IQ) predicted acquisition of speech (Mazurek et al. 2012) and number of hours of speech/language therapy (along with motor imitation) predicted later language ability (Stone and Yoder 2001).

Recently studies have started to explore neural substrates that may contribute to the heterogeneity in language trajectories in ASD. In one study, the best predictor of later language outcomes for children with ASD was a combination of behavioural measures (standardised tests of adaptive communication, language and autism symptoms) and speech-related neuroimaging biomarkers (specifically, superior temporal cortex activity as measured using fMRI; Lombardo et al. 2015). The afore-mentioned findings are yet to be replicated and our understanding of the brain mechanisms underlying language ability and outcomes in ASD is still in its infancy. Understanding the neurobiological underpinnings of language will be important to delineate ASD subtypes, to assist clinicians with applying appropriate language interventions and for better predicting language outcomes.

Limitations

The analysis in this study was limited to children who could complete the CELF, with qualitative information provided about three children with ASD who were not able to complete the CELF at two time points due to limited language and/or cognitive ability and floor effects on tools (i.e. children were not able to achieve basal scores). Therefore, the main findings are only relevant to children who had verbal language and an IQ > 70.

Standardised tools such as the CELF are not designed to assess children with very limited verbal ability and being a large scale community-based study meant there were limitations to the customisation of assessments for children with different verbal abilities. However, best practice recommendations have been developed for the assessment of minimally-verbal children with ASD and appropriate assessment tools (formal and informal) are available (Kasari et al. 2013). Children in the study with minimal verbal ability made progress in their communication skills, however it was not possible to assess their rate of development. To date evidence points to children with ASD who had relatively lower expressive language ability at baseline experiencing a slower rate of language growth compared to children with higher language development (Anderson et al. 2007; Baghdadli et al. 2012; Ellis Weismer and Kover 2015; Tek et al. 2014). The exclusion of children who were minimally verbal may skew our findings and limits their application to assessable children.

Our sample size of children with ASD (n = 27, including four female) is small which reduces statistical power. The substantial difference in sample size between the children with ASD and those without ASD also needs to be considered when interpreting our findings. Given this was a community-based study, the children in the study were not diagnosed with ASD uniformly using the same methods and we relied on parent report of ASD, with verification through interview. Different versions of the CELF were used at 4 years compared to 5 and 7 years, however mean scores, standard deviations and ranges for the whole sample were very similar for each version of the CELF (4 years: M 99.62, SD 15.1, 50–140; 5 years: M 100.6, SD 13.9, 55–144; 7 years: M 98.8, SD 13.6, 45–136). While it is possible our subgroups may respond differently to different versions of the CELF the change in version should impact all groups equally. Lastly, our language impairment group included children who had expressive language disorder, receptive language disorder, or combined receptive/expressive language disorder. Inclusion criteria for the LI group were based on prior research and recommendations (Bishop et al. 2017; Reilly et al. 2014), however, combining language subgroups rather than examining language domains separately (i.e. children with expressive compared with receptive language disorder) may have impacted our findings around group differences.

Clinical Implications

Our findings contribute new knowledge to our current understanding of language development in children with ASD. The findings provide preliminary information that in verbal children with ASD aged 4–7 years with intelligence within the normal range, most children will develop language at a similar pace to typically developing children and a diagnosis of ASD in itself does not necessarily mean a child will fall further behind their peers in language development. Rather, other factors may be more relevant to later language ability such as nonverbal IQ and earlier language level than a diagnosis of ASD. This is important information for clinicians to consider when parents ask them about their child’s likely language outcome. It may also have implications for the types of factors to consider and prioritise when providing language interventions.

The age at which this study investigated predictors (4 years) of later outcomes is consistent with the mean age children are typically diagnosed with ASD in Australia before 7 years of age (Bent et al. 2015) and the factors found to be predictive of language outcomes should be readily available to clinicians who diagnose children with ASD, as part of recommended best practice assessment.

This study was not able to provide clinicians with useful prognostic information on language development in children with ASD who have intellectual disability and/or those children who could not complete formal testing on the CELF. Predictors of language outcomes have, however, been investigated in studies specifically designed for minimally verbal children with ASD (e.g., Yoder et al. 2015) and change in language over time for these children is also being examined (Rose et al. 2016). Such studies are highly valuable and this area of research needs to be a research priority given the poor outcomes for children who do not develop verbal language. Further research is also needed to bring together the multiple factors that interact and impact language outcomes so more personalised prognostic information can be provided to families.