Abstract
Effective biomarkers are urgently needed to facilitate early diagnosis of autism spectrum disorder (ASD), permitting early intervention, and consequently improving prognosis. In this study, we evaluate the usefulness of nine biomarkers and their association (combination) in predicting ASD onset and development. Data were analyzed using multiple independent mathematical and statistical approaches to verify the suitability of obtained results as predictive parameters. All biomarkers tested appeared useful in predicting ASD, particularly vitamin E, glutathione-S-transferase, and dopamine. Combining biomarkers into profiles improved the accuracy of ASD prediction but still failed to distinguish between participants with severe versus mild or moderate ASD. Library-based identification was effective in predicting the occurrence of disease. Due to the small sample size and wide participant age variation in this study, we conclude that the use of multi-parametric biomarker profiles directly related to autism phenotype may help predict the disease occurrence more accurately, but studies using larger, more age-homogeneous populations are needed to corroborate our findings.
Introduction
Autism spectrum disorder (ASD) is a complex neuro-behavioral syndrome usually described as a heterogeneous group of neurodevelopmental disorders. It is believed to affect about 1 in 68 children in the US population alone (Christensen et al. 2016) and 1 in 160 children worldwide (Elsabbagh et al. 2012). Standardized ASD diagnostic criteria were published and recently reviewed in the World Health Organization’s International Classification of Diseases (ICD-11 will be published in 2018) and the American Psychiatric Association’s Diagnostic and Statistical Manual, fifth edition (DSM-5) (APA 2013). Central to these criteria are impaired social development, communication deficits, and pathological lack of flexibility or “insistence on sameness” (Volkmar and Reichow 2013). Current diagnostic practices are based on phenotypic characterizations that rely on standardized scoring systems. These diagnostic methodologies have been vital for advancing clinical practice and research, but fall short of enabling early diagnosis and preclinical disease prediction. ASD-associated deficits are commonly recognized in children during their first 12 to 24 months of life (Zwaigenbaum et al. 2015), but a reliable diagnosis is often made at 3 years of age or later (Woolfendena et al. 2012; Steffenburg et al. 2018; Sharma et al. 2018).
It is widely accepted that early diagnosis provides valuable opportunities for primary intervention and better prognosis in ASD (Boyd et al. 2010; Debodinance et al. 2017). It would therefore be particularly advantageous to be able to predict ASD before the onset of prodromal signs and symptoms, a goal that is not currently attainable in the absence of suitable biomarkers (Sices et al. 2017). Biomarkers are measurable outputs that indicate the presence of a disease or an outcome, including biochemical analytes and imaging data. In ASD diagnosis, biochemical analytes are typically measured in body fluids (e.g., blood, urine, saliva, and cerebrospinal fluid); they are easy to measure, cost-effective, and do not often require invasive procedures (Mayeux 2004; Nunes et al. 2015; Beversdorf and Missouri Autism Summit Consortium 2016). Recent years have witnessed an increasing interest in the search for suitable biomarkers for the early diagnosis of ASD (Daniels and Mandell 2014; Uddin et al. 2017; Prata et al. 2017).
It is noteworthy that ASD is caused by the combined action of various genetic, epigenetic, and environmental factors, rather than by a single mutation or a single, simple pathogenetic mechanism (Volkmar and Reichow 2013; Beversdorf and Missouri Autism Summit Consortium 2016). Consequently, even a single, non-polymorphic, well-defined phenotype might arise from several different sets of underlying mechanisms, so that a treatment that is effective and safe in one patient is not necessarily so in another (Volkmar and Reichow 2013). On the one hand, the remarkable genetic heterogeneity of ASD may hamper the search for a broader, more general collection of ASD biomarkers (Geschwind and Levitt 2007; Khramova et al. 2017). On the other hand, class-specific biomarkers may guide a better understanding of the underlying mechanisms of ASD, thus providing a tool for tailoring therapeutic strategies to specific classes of ASD patients (Loth et al. 2016).
In the present study, we reappraised and re-analyzed previously published data (Alabdali et al. 2014a, b) using a different mathematical approach, to highlight important new insights into ASD biomarkers. Unlike in our previously reported investigations, none of the participants needed to be excluded as a possible outlier in the current study. We used principal component analysis (PCA) to verify the authenticity of the classification of participants based on selected biomarkers and used multiple statistical tests to verify the obtained results. More importantly, we evaluated the effect of using multiple biomarkers simultaneously on the accuracy of predicting disease occurrence, an approach previously suggested as a way to improve prediction accuracy (Gupta et al. 2013; Abruzzo et al. 2015).
Methods
Participants
Participants enrolled in the present study were previously described (Alabdali et al. 2014a, b). Briefly, 58 male autistic patients ranging in age from 3 to 12 years (mean 7.0 ± 2.34 SD) were recruited through the Autism Research and Treatment Centre, Faculty of Medicine, King Saud University, Riyadh, Saudi Arabia. Patients enrolled in the study were diagnosed with ASD according to the fourth edition of the Diagnostic and Statistical Manual of Mental Disorders and further updates (APA 2000; Sharma et al. 2018; Galiana-Simal et al. 2018). Thirty-two age- and gender-matched control participants (mean age 7.2 ± 2.14 SD) were recruited from children attending the Well Baby Clinic at King Khalid University Hospital for routine check-ups. Control subjects did not show any signs or symptoms of infectious diseases or neuropsychiatric disorders. All participants had normal erythrocyte sedimentation rates and urine analysis results. The Ethical Committee of the Faculty of Medicine, King Saud University approved the present study. Participants’ parents or legal guardians signed informed consent forms before any samples were collected. The experimental design of the whole research study was consistent with the principles of the Declaration of Helsinki (General Assembly of the World Medical Association 2014).
Measures of Disease Severity Among Autistic Patients
Disease severity was measured using the Childhood Autism Rating Scale (CARS) and the Social Responsiveness Scale (SRS) (Chen et al. 2018). To obtain a CARS score, each child was rated on a scale ranging from 1 (normal) to 4 (severely abnormal) with respect to each of 15 criteria (relating to others; imitation; emotional response; body use; object use; adaptation to change; visual response; listening response; taste, smell, and touch responses; fear and nervousness; verbal communication; non-verbal communication; activity level; level and reliability of intellectual responses; and general impressions). A final score was obtained by computing the sum of the 15 individual scores, resulting in a combined score that could range from 15 to 60. Scores below 30 were considered non-autistic, scores of 30–36.5 were considered mild to moderate autism, and scores greater than 36.5 were considered severe autism (Mick 2005). SRS scores were generated from the results of a questionnaire, with scores ranging from 60 to 75 considered mild to moderate, and scores of 76 or greater considered severe autism (Constantino et al. 2003). Patients with a history of epileptic seizures, obsessive-compulsive disorder, fragile X syndrome, or any psychiatric or neurologic disorder other than autism were excluded from the study.
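The scoring rules above can be sketched in a few lines of Python. This is only an illustration of the cut-offs quoted in the text: the function names `cars_category` and `srs_category` are hypothetical, and because the text does not say how SRS scores below 60 were labelled, the sketch returns `None` for them.

```python
def cars_category(item_ratings):
    """Sum 15 CARS item ratings and apply the cut-offs quoted in the text."""
    if len(item_ratings) != 15:
        raise ValueError("CARS requires exactly 15 item ratings")
    if any(not 1 <= r <= 4 for r in item_ratings):
        raise ValueError("each item rating must lie between 1 and 4")
    total = sum(item_ratings)
    if total < 30:
        category = "non-autistic"
    elif total <= 36.5:
        category = "mild to moderate"
    else:
        category = "severe"
    return total, category


def srs_category(score):
    """Apply the SRS cut-offs quoted in the text (assumption: None below 60)."""
    if score >= 76:
        return "severe"
    if score >= 60:
        return "mild to moderate"
    return None  # the text does not define a label for this range


# Hypothetical ratings, not study data.
print(cars_category([2] * 15))
print(cars_category([3] * 15))
print(srs_category(80))
```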
Biomarker Data Collection
Blood samples were treated as previously described (Alabdali et al. 2014a, b). Briefly, whole blood specimens were collected by venipuncture using heparin as an anticoagulant. Plasma and red blood cells were separated by centrifugation and stored at −80 °C until used. Biomarkers were selected to represent various physiological processes with established links to ASD. Serotonin, gamma-aminobutyric acid (GABA), and dopamine are related to brain neurochemistry; the hormone oxytocin has been shown to improve social interactions in ASD patients (Yatawara et al. 2016); interferon-gamma-inducible protein-16 (IFI16) is associated with neuroinflammation and ASD (Alabdali et al. 2014b); and glutathione-S-transferase (GST), vitamin E, mercury, and lead are markers of xenobiotic toxicity and of its scavenging by detoxification and antioxidant enzyme complexes, both of which have also been associated with ASD (Alabdali et al. 2014b). All analytes, except for lead and mercury, were measured in plasma. Lead and mercury were measured in red blood cells. Experimental procedures used to measure these analytes have been described elsewhere (Alabdali et al. 2014a, b). Raw data are shown in Tables 1 and 2.
Assessing the Accuracy of Prediction
Two methods were employed to evaluate the accuracy of biomarker-based predictions of binary clinical outcomes (e.g., autism versus healthy control or having severe versus mild/moderate disease). One method relies on calculating the area under a ROC curve (AUC). Receiver operating characteristic (ROC) curves are generated by graphing biomarker sensitivity on the vertical axis and specificity subtracted from one (1 − specificity) on the horizontal axis for all possible biomarker values. The aim is to graphically illustrate the trade-off between sensitivity and specificity at all possible cut-off values of a continuous biomarker. A biomarker with perfect sensitivity and specificity is one that yields an AUC of 1.0, while a useless biomarker yields an AUC of 0.5. An AUC of 0.5 indicates that the predictions made using the biomarker are equivalent to chance or random guessing. AUC values below 0.5 indicate that the predictions made using the biomarker are more often false than true (Perlis 2011). The second method is library-based identification, which relies on comparing subjects of unknown classification to a library of subjects of known classification. Therefore, a library must be constructed with subjects organized into units of unique classifications. Each of the libraries used in the current study contained two units, one for autistic and the other for healthy control participants. Unknown participants were then submitted for identification by determining the library unit to which the unknown subject is most similar. Similarity can be determined using various coefficients. In the present study, pairwise similarities were calculated using Canberra distances (Eq. (1)), and matching to a library unit was accomplished using the K-nearest neighbor algorithm. Using this algorithm, a user-defined number of top matches is determined for each unknown, and the unknown is simply assigned to the unit containing the largest number of those top matches.
The number of top matches that fall in the assigned unit becomes a score that can be used as a measure of confidence in the identification process. In the present study, identification was based on the five most similar library entries, giving rise to scores ranging from 0 to 5.
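The library-based identification described above can be sketched as follows. This is a minimal illustration and not the software used in the study: the function names `canberra` and `identify` and the toy library are hypothetical, and the distance takes the mean-standardized form described in the Discussion (absolute difference divided by the absolute sum, averaged over variables).

```python
def canberra(x, y):
    """Mean over variables of |x_i - y_i| / |x_i + y_i|."""
    terms = []
    for a, b in zip(x, y):
        denom = abs(a + b)
        # Assumption: a variable whose values sum to zero contributes nothing.
        terms.append(0.0 if denom == 0 else abs(a - b) / denom)
    return sum(terms) / len(terms)


def identify(unknown, library, k=5):
    """Assign `unknown` to the library unit holding most of its k nearest entries.

    `library` is a list of (unit_label, values) pairs. Returns the assigned
    label and the number of top matches found in that unit (the score).
    """
    ranked = sorted(library, key=lambda entry: canberra(unknown, entry[1]))
    counts = {}
    for label, _ in ranked[:k]:
        counts[label] = counts.get(label, 0) + 1
    assigned = max(counts, key=counts.get)
    return assigned, counts[assigned]


# Hypothetical two-variable library, not study data.
library = [
    ("control", [10.0, 5.0]), ("control", [11.0, 5.5]), ("control", [9.5, 4.8]),
    ("autistic", [4.0, 9.0]), ("autistic", [4.5, 8.5]), ("autistic", [3.8, 9.4]),
]
print(identify([4.2, 9.1], library, k=5))
```

With only three entries per unit, the largest attainable score in this toy library is 3; a library with at least five entries per unit is needed for the score to reach 5.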
Designing Biomarker Profiles
In the present study, data for each of the nine investigated variables (biomarkers) were available for some but not all participants (Tables 1 and 2). To maximize the use of participants and variables, five biomarker profiles were constructed. Profile 1 contained all variables and only those participants with no missing data for any of the nine variables (10 controls, six autistics). Similarly, profile 2 contained eight variables (25 controls, nine autistics), profile 3 contained seven variables (25 controls, 20 autistics), profile 4 contained six variables (25 controls, 21 autistics), and profile 5 contained five variables (30 controls, 40 autistics).
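The trade-off between the number of variables and the number of usable participants can be illustrated with a complete-case filter. The sketch below is an assumption about how such profiles could be assembled; the helper `complete_cases`, the record layout, the measured values, and the assignment of variables to profiles are all hypothetical.

```python
def complete_cases(participants, variables):
    """Keep participants that have an observed value for every listed variable."""
    return [
        p for p in participants
        if all(p["values"].get(v) is not None for v in variables)
    ]


# Hypothetical toy records; None marks a missing measurement.
participants = [
    {"id": "c1", "group": "control", "values": {"GST": 0.9, "vitamin_E": 11.0, "lead": 4.1}},
    {"id": "c2", "group": "control", "values": {"GST": 1.0, "vitamin_E": 10.5, "lead": None}},
    {"id": "a1", "group": "autistic", "values": {"GST": 0.5, "vitamin_E": 7.2, "lead": 6.3}},
    {"id": "a2", "group": "autistic", "values": {"GST": 0.6, "vitamin_E": None, "lead": None}},
]

# Hypothetical profiles: dropping a variable admits more participants.
profiles = {
    "three variables": ["GST", "vitamin_E", "lead"],
    "two variables": ["GST", "vitamin_E"],
    "one variable": ["GST"],
}
sizes = {
    name: len(complete_cases(participants, variables))
    for name, variables in profiles.items()
}
print(sizes)
```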
Statistical Analysis
Data were expressed as means ± SD (standard deviations). Statistical analysis of quantitative data was performed using a nonparametric test. An ANOVA with a two-tailed t test was used to determine the significance of differences observed in biomarker values between autistic and control participants. A p value of < 0.05 was considered significant.
PCA and multidimensional scaling (MDS) were performed using Bionumerics version 6.6 (Applied Maths, Austin, TX) or IBM SPSS version 22 as previously described (El-Ansary et al. 2016). Briefly, the inputs into PCA and MDS were a covariance matrix and a similarity matrix, respectively. Similarity matrices were constructed from all possible pairwise similarities calculated using Canberra distances (Eq. (1)). PCA reduces the number of variables by condensing correlated variables. Therefore, at least some of the variables must be correlated for the analysis to be meaningful. The presence of correlated variables was tested by Bartlett’s test of sphericity (Bartlett 1937), with a p value threshold of < 0.001. The Kaiser-Meyer-Olkin (KMO) measure was used to test the adequacy of the sample sizes (Kaiser 1974; Tomlinson et al. 2013). The number of statistically significant components in PCA was determined by parallel analysis (Monte Carlo simulation) using Brian O’Connor’s syntax for SPSS (O’Connor 2000).
$$D_{XY} = \frac{1}{n}\sum_{i=1}^{n}\frac{\lvert X_i - Y_i\rvert}{\lvert X_i + Y_i\rvert} \qquad (1)$$
where “D” is the Canberra distance metric, “n” is the number of variables, “i” is the ith variable, and “X” and “Y” are the two participants.
Hierarchical clustering was performed using Bionumerics version 6.6 as previously described (El-Ansary et al. 2016). Briefly, pairwise similarities were calculated using Canberra distances, and dendrograms were constructed using the unweighted pair group method with arithmetic mean algorithm. A two-tailed t test was used to determine the significance of differences observed in biomarker values between autistic and control participants. A p value of < 0.05 was considered significant. The t test was performed using GraphPad Prism version 6 (GraphPad Software, Inc., La Jolla, CA). Correlation was estimated by the Spearman correlation coefficient, and a p value was assigned based on permutation analysis. Correlation analyses were performed using GraphPad Prism version 6. For analyses involving computation of a Z-score, Z-scores were calculated according to Eq. (2) using GraphPad software:
$$Z = \frac{X - \mu}{\sigma} \qquad (2)$$
where Z is the Z-score, X is the observed value, μ is the mean, and σ is the standard deviation.
Results
The Accuracy of Disease Prediction Using Individual Biomarkers
Consistent with previously published results (Alabdali et al. 2014a, b), all nine biomarkers significantly differed between autistic and control groups (Fig. 1). Individual biomarkers were evaluated for their accuracy in predicting the occurrence of disease and disease severity using the AUC method. Most autistic participants had impaired CARS and SRS scores, but some ended up with a normal score using one of the scoring methods. Also, a few participants either had a missing score or were too young to be scored by SRS. For this reason, ROC curves were generated separately for each of the autistic participants with impaired CARS and those with impaired SRS scores. In both groups—henceforth referred to as CARS and SRS groups—all nine biomarkers effectively predicted the occurrence of autism, with AUC values falling between 0.64 and 0.96. Vitamin E was associated with the largest AUC (0.94), followed by dopamine, serotonin, and GST (all > 0.8) in the CARS group, while GST had the largest AUC (0.96), followed by vitamin E, mercury, and dopamine in the SRS group. GABA, mercury, and IFI16 were the only biomarkers able to predict the occurrence of severe autism—as determined by SRS scores—with AUC values ranging from 0.66 to 0.78. None of the tested biomarkers was able to predict the level of CARS impairment (Table 3).
Combining Biomarkers (Variables) into Profiles Improves Disease Prediction
Next, we asked whether grouping the nine biomarkers into profiles could enhance their predictive power. Five profiles were designed as described in the Methods section; the most complex consisted of nine variables, and the simplest of five. Employing PCA and MDS in testing these five profiles revealed clear segregation of autistic and control participants, with complete segregation achieved using profiles of higher complexity (those with larger numbers of variables). Before moving forward with further analyses, we sought to verify several aspects of the PCA analyses. We used Bartlett’s sphericity test to confirm the presence of correlated variables and showed that the absence of correlations in our datasets was extremely unlikely (p values < 0.0001). To assess the adequacy of sample sizes, we employed the KMO measure of sampling adequacy, which gave values of around 0.7. The obtained values were consistent with samples of sufficient size for the analyses to be meaningful (Kaiser 1974; Tomlinson et al. 2013). In terms of the significance of principal components, Monte Carlo simulation demonstrated that the first component (PC1) in the analysis of each of the five biomarker profiles was the only significant component (Fig. 2). PC1 was the principal component responsible for most of the segregation between the autistic and control groups. We then examined the contribution of individual variables to the segregation of autistic and control participants by comparing their contribution to the principal component responsible for most of this segregation, in this case, the first principal component. We found that the markers responsible for most of the separation between the two groups (e.g., dopamine, serotonin, GST, and vitamin E) were the same ones that had shown relatively large AUCs. Conversely, markers with small AUCs (e.g., oxytocin and IFI16) did not contribute nearly as much in separating autistic and control subjects in PCA analysis (Fig. 3).
To further confirm the authenticity of the segregation between autistic and control participants, we wanted to use a clustering method that differed in principle from PCA and MDS. For this purpose, we used hierarchical clustering, which produced consistent results, further confirming the genuineness of the segregation between autistic and control participants based on our biomarker profiles (Fig. 4). We also compared the AUCs obtained using profiles to those obtained using individual biomarkers. Variables were combined by using either the coordinates of PC1 from PCA or the sum of Z-scores as input in ROC curve analyses. When variables were combined into profiles using PC1 coordinates, we found that complex profiles had AUCs of one (perfect sensitivity and specificity), while simpler profiles had slightly smaller AUCs. In all cases, combining markers led to increased AUCs in both CARS and SRS groups. In our experience, using the sum of Z-scores did not perform as well as individual biomarkers or profiles combined using PC1 coordinates (Table 3).
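Combining variables through PC1 coordinates amounts to projecting each participant's centered values onto the leading eigenvector of the covariance matrix, giving one number per participant. The sketch below illustrates this with plain power iteration; it is not the Bionumerics or SPSS procedure used in the study, and the function name `pc1_scores`, the starting vector, and the toy values are assumptions.

```python
def pc1_scores(rows, iterations=500):
    """Project each row onto the first principal component of the covariance matrix."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    # Sample covariance matrix (the text states PCA was run on a covariance matrix).
    cov = [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(p)]
        for a in range(p)
    ]
    # Power iteration for the leading eigenvector. The uneven starting vector
    # is an arbitrary choice; a start orthogonal to PC1 would fail to converge to it.
    v = [1.0 / (j + 1) for j in range(p)]
    for _ in range(iterations):
        w = [sum(cov[a][b] * v[b] for b in range(p)) for a in range(p)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return [sum(c[j] * v[j] for j in range(p)) for c in centered]


# Hypothetical two-variable data, not study data.
controls = [[10.0, 5.0], [11.0, 5.5], [9.5, 4.8]]
autistic = [[4.0, 9.0], [4.5, 8.5], [3.8, 9.4]]
scores = pc1_scores(controls + autistic)
print([round(s, 2) for s in scores])
```

The resulting scores can then serve as a single continuous predictor in a ROC analysis. The sign of a principal component is arbitrary, so only the separation between groups is meaningful, not which group scores positive.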
Our results suggested that complex profiles were better in distinguishing autistic participants from healthy controls and that was shown using mathematically different approaches. Next, we wanted to rule out possible confounding factors to confirm our findings. Although complex profiles outperformed simpler ones in distinguishing autistic from healthy controls, the former were tested on a smaller number of participants (complex profile n = 16–46, simple profile n = 47–71). Consequently, PCA and MDS plots depicting the results of low-complexity profiles contained larger numbers of data points than the ones depicting the results of high-complexity profiles, creating higher density plots for the former compared to the latter (Fig. 3). Higher plot densities could have contributed to the partial overlap between autistic and control groups seen with simple profiles by simply providing more opportunities for overlap due to random chance alone. Thus, further analyses were performed to interrogate this notion.
Additional Testing Confirms that Higher Complexity Profiles Yield Better Separation of Autistic and Control Groups
To investigate whether profile complexity was the principal underpinning of the observed separation between autistic and control subjects, two tests were performed. First, all five profiles were tested using the same number of participants. To do so, we used the small group of participants (six autistics, ten controls) for whom we had a complete dataset covering all variables. Second, we used group-specific means as surrogates for missing data points. In other words, variable means within a group (either autistic or control) were used to substitute for missing data points of the corresponding group. The latter approach enabled the use of a larger number of participants (58 autistic, 32 control) compared to the former. Both tests confirmed that profiles of higher complexity enabled better distinction between autistic and control subjects than simpler profiles. This was demonstrated by tighter group clustering and wider inter-group distances in PCA and MDS plots using the 16 participants with no missing data points (Fig. 5). Using the larger group of participants, for whom missing data were replaced by the corresponding group means, better group separation was evident using profile 1 (nine variables) compared to profile 5 (five variables), as demonstrated by PCA, MDS, and hierarchical clustering (Fig. 6). Profiles 2, 3, and 4 were also tested, showing results that supported the same conclusion (data not shown).
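The second test, substituting group-specific means for missing data points, can be sketched as below. The function name `impute_group_means`, the record layout, and the values are hypothetical, and the sketch assumes every variable has at least one observed value within each group.

```python
def impute_group_means(participants):
    """Replace None values with the mean of the same variable within the same group."""
    by_group = {}
    for p in participants:
        by_group.setdefault(p["group"], []).append(p["values"])

    means = {}
    for group, rows in by_group.items():
        means[group] = []
        for j in range(len(rows[0])):
            observed = [r[j] for r in rows if r[j] is not None]
            # Assumption: each variable is observed at least once per group.
            means[group].append(sum(observed) / len(observed))

    return [
        {
            "group": p["group"],
            "values": [
                means[p["group"]][j] if v is None else v
                for j, v in enumerate(p["values"])
            ],
        }
        for p in participants
    ]


# Hypothetical toy records; None marks a missing measurement.
records = [
    {"group": "control", "values": [10.0, 5.0]},
    {"group": "control", "values": [12.0, None]},
    {"group": "autistic", "values": [4.0, 9.0]},
    {"group": "autistic", "values": [None, 7.0]},
]
filled = impute_group_means(records)
print(filled)
```

Because the fill value depends on the known group label, this approach applies only to participants whose classification is already known.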
The next question we wanted to answer is whether our biomarker profiles can be used to predict the occurrence of disease within the population of participants included in the current study. Library-based identification was employed to answer this question.
High-Complexity Biomarker Profiles Predict the Occurrence of Disease with 100% Specificity and Sensitivity
Library-based identification was used to compare the sensitivity and specificity of autistic patients’ identification, within the available sample size, using the five biomarker profiles. Only observed data were used in this test (i.e., group means were not used to fill in for missing data). We showed that high-complexity profiles (profiles 1, 2, and 3) resulted in perfect identification of both autistic and control participants, while the rate of correct identification (RCI) ranged from 83 to 96% using simpler profiles. These results stimulated our interest in testing profiles with fewer than five variables, which we did by modifying profile 5 to generate new profiles consisting of all possible combinations of one, two, three, and four variables. Identification was attempted using each of these profiles, and RCI was averaged over profiles composed of the same number of variables. The results obtained showed a progressive decline in RCI as the number of variables decreased, underscoring the superiority of biomarker profiles over individual markers and that of high-complexity profiles over simple ones (Fig. 7). For diagnostic purposes, it would be useful to have some measure of confidence each time an identification is made, in addition to the predetermined sensitivity and specificity. Using K-nearest neighbor in library-based identification generates a score, which we thought might be suitable to serve as this measure of confidence. To test this possibility, we compared the ranges and averages of the scores associated with correct identifications to those associated with incorrect identifications. We found that the average scores associated with incorrect identifications were consistently lower than those associated with correct identifications and that scores of four or greater were largely associated with correct identifications (Fig. 7).
Taken together, our results suggest that the use of our biomarker profiles for diagnostic purposes may lead to the development of a novel diagnostic tool for the laboratory diagnosis of ASD. Given the heterogeneity of disease manifestations and their direct implications for treatment, prognosis, and patient’s quality of life, it would be advantageous to develop laboratory methods that can accurately predict various ASD-associated clinical pictures. Therefore, we wanted to explore the utility of our biomarker profiles in differentiating different levels of disease severity.
The Biomarker Profiles Investigated in the Present Study Were Not Able to Predict Disease Severity
In addition to assisting with the initial diagnosis of ASD, having reliable biomarkers to help quantitate disease severity would likely inform treatment decisions, facilitate follow-up, and improve prognosis. Therefore, we wanted to determine whether any of the biomarkers investigated in the current study correlated with either CARS or SRS scores. Neither scoring system correlated with any of the biomarkers studied here, as demonstrated by Spearman correlation (Fig. 8) and multiple regression analysis (data not shown). Also, hierarchical clustering, PCA, and MDS analysis did not show discernible segregation between autistic participants with different disease severity (Fig. 9). Taken together, our data suggest that predicting disease severity, at least based on CARS and SRS scores, using the markers we studied is unlikely to be successful.
Discussion
In the present study, we examined the potential of nine analytes in distinguishing autistic patients from healthy controls and in distinguishing between severe and mild to moderate impairment of the CARS and SRS scores. The data have been previously analyzed, but in previous analyses, participants with markedly different observed data from the mean were considered outliers and were therefore eliminated from further analyses. It is conceivable that biomarker data may differ widely among autistic patients simply because ASD consists of a diverse group of neurodevelopmental conditions with dramatically different presentations. However, this was not true for all of the nine biomarkers we tested. For example, the variance in the healthy control group was ten times that of autistic participants for serotonin, but the variance for lead was more than three times higher in the autistic group compared to controls. Regardless of the amount of variance, none of the data points stood out as an outlier in dot plots (data not shown). Also, the variance in healthy versus autistic subjects may vary widely in different populations and between different markers. Taken together, we could not develop a convincing rationale for identifying and excluding potential outliers. Therefore, all participants were included in this study, which might explain the lower AUC values obtained in this study compared to previously published work (Alabdali et al. 2014a, b).
Our data show that any of the nine biomarkers tested is likely useful in predicting the occurrence of ASD, with vitamin E and GST being the most useful in predicting both CARS and SRS impairments. We also found dopamine, serotonin, and mercury to be good predictors of the occurrence of ASD. Predicting the severity of CARS and SRS impairments was more challenging: GABA was the most promising predictor of the severity of SRS impairment, and no useful predictors of CARS impairment were found. It would have been interesting to test the effect of combining GABA with additional biomarkers on prediction accuracy, but we did not have enough participants to test this possibility. We speculate that the biomarkers used in this study, and other biomarkers, might be more useful in predicting the level of impairment in individual components than in overall CARS and SRS scores. Additional studies involving larger numbers of participants are needed, however, to test this hypothesis.
We have demonstrated that combining multiple variables into profiles augmented prediction accuracy and that increased profile complexity is generally associated with high accuracy. In the current study, we combined multiple variables using three methods. In the first method, we replaced observed values of individual variables by the coordinates of the eigenvector (or principal component) that explained the most variance and was responsible for most of the segregation between groups. This gave us a single value for each participant that was computed from the multiple variables included in each analysis. The advantage of this method is the ability to combine variables in a way that is focused on the portion of data variance that is most relevant to the segregation of the groups under study. The caveat, however, is the possible loss of information, which is an inherent disadvantage of data reduction techniques, including PCA and MDS. The second method was taken from the work of Abruzzo et al. (2015), which involved computing a Z-score for individual variables and combining them by taking the sum of Z-scores (Alessandro Ghezzo, personal communications). Z-scores describe the relationship between the values of a dataset and the mean. Specifically, a Z-score of zero indicates that the corresponding value is equal to the mean, while Z-scores greater than zero represent the number of standard deviations the corresponding value is above the mean, and those lower than zero (i.e., negative Z-scores) indicate the number of standard deviations the corresponding value is below the mean. Since the mean of the dataset directly affects the Z-score (see Eq. (2)), input data should contain equal numbers of all groups. Having more participants in one group than another will immediately skew the Z-scores of all variables for which group means are unequal. For the autistic and control groups, means are unequal for all variables tested in this study.
The first two methods were used to generate input for ROC curve analyses. In the third method, which was used in library-based identification, we combined variables using a similarity coefficient. The coefficient we used here, the Canberra metric, was selected because it resulted in the best group separation when compared to other coefficients, such as Pearson correlation, ranked correlation, cosine coefficient, Gower coefficient, and Bray-Curtis coefficient (data not shown). The Canberra metric computes the distance between a pair of participants by taking, for each variable, the absolute difference between the two participants and dividing it by the corresponding absolute sum, which standardizes all variables. These standardized differences are then summed over all variables and divided by the number of variables to obtain a mean summarized distance (Eq. (1)).
Using PC1 to calculate AUCs, or a similarity coefficient in library-based identification, led us to conclude that profiles were superior to individual markers in regard to prediction accuracy. This conclusion is in agreement with that of a previous study, in which six biomarkers were combined using the sum of Z-scores. That study showed that prediction accuracy increased when the six variables were combined (Abruzzo et al. 2015). The advantage of using the sum of Z-scores to combine variables was not shown in our study. In fact, doing so in our study lowered prediction accuracy, as demonstrated by the AUCs. It is noteworthy that most of our datasets contained unequal numbers of participants in each of the two groups being compared. This alone may offer some explanation since this can easily alter the mean and, thus, the Z-scores, as described above. We conclude that the discrepancy between the study by Abruzzo et al. (2015) and ours may be attributable, at least in part, to the imbalanced groups in our datasets. A clear advantage of the use of similarity measures and eigenvectors over the sum of Z-scores is that computing the sum of individual Z-scores may conceivably result in cancelation of group-specific features, while this is not the case with the other two methods.
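The two concerns about the sum of Z-scores, sensitivity to group imbalance and possible cancelation, can be illustrated with toy numbers. The sketch below is an assumption-laden illustration: the helper names `z_scores` and `summed_z` are hypothetical, the values are invented, and the population standard deviation is used because the text does not state which estimator was applied.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Z = (X - mu) / sigma for each value (population SD is an assumption)."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]


def summed_z(rows):
    """Sum each participant's Z-scores across variables (columns of `rows`)."""
    z_columns = [z_scores(list(column)) for column in zip(*rows)]
    return [sum(z) for z in zip(*z_columns)]


# Imbalance: the same two group values, pooled in different proportions.
balanced = z_scores([10.0, 10.0, 10.0, 4.0, 4.0, 4.0])    # 3 controls, 3 autistic
imbalanced = z_scores([10.0, 10.0, 10.0, 10.0, 10.0, 4.0])  # 5 controls, 1 autistic
print(balanced[0], balanced[-1])      # control and autistic Z when balanced
print(imbalanced[0], imbalanced[-1])  # same raw values, different Z when imbalanced

# Cancelation: variable 1 is higher in controls, variable 2 higher in autistics.
rows = [[10.0, 5.0]] * 3 + [[4.0, 9.0]] * 3
print(summed_z(rows))  # opposite-signed Z-scores cancel within each participant
```

In the balanced case the two groups sit one standard deviation either side of the pooled mean. In the imbalanced case the pooled mean moves toward the larger group, so identical raw values receive different Z-scores. In the cancelation case every participant's sum is zero even though both variables separate the groups perfectly.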
We also compared the accuracy of predicting ASD occurrence using ROC curves versus library-based identification. ROC curves are widely used in studies addressing the utility of various biomarkers in clinical practice. One of the greatest advantages of ROC curves is the ability to optimize a cutoff value while taking into account sensitivity, specificity, and clinical considerations specific to each disease. Raising a cutoff value increases specificity, but often at the expense of sensitivity (Akobeng 2007; Hajian-Tilaki 2013). The appropriate trade-off between sensitivity and specificity varies with the severity of the illness in question, its treatability, and the consequences of delaying treatment. High sensitivity might be crucial for illnesses known to have devastating consequences if left untreated; in such cases, the benefit of early detection may outweigh the harm of reduced specificity. Conversely, a decision to use a harsh treatment may require a high level of certainty (that is, high specificity) that such treatment is justified.
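The cutoff trade-off described above can be made concrete with a short sketch. It assumes that higher scores indicate disease and that a participant is called positive when the score is at or above the cutoff; the function name `sensitivity_specificity`, the scores, and the labels are hypothetical and do not come from the study.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    scores: Sequence[float], labels: Sequence[int], cutoff: float
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) at a given cutoff.

    labels: 1 for a case, 0 for a control.
    Assumption: a score >= cutoff is called positive.
    """
    tp = fn = tn = fp = 0
    for score, label in zip(scores, labels):
        positive = score >= cutoff
        if label == 1:
            tp += positive
            fn += not positive
        else:
            fp += positive
            tn += not positive
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case and one control")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical marker values: four cases followed by four controls.
scores = [0.9, 0.8, 0.6, 0.4, 0.7, 0.5, 0.3, 0.2]
labels = [1, 1, 1, 1, 0, 0, 0, 0]
for cutoff in (0.5, 0.75):
    print(cutoff, sensitivity_specificity(scores, labels, cutoff))
```

With these made-up values, a cutoff of 0.5 gives a sensitivity of 0.75 and a specificity of 0.5, whereas raising the cutoff to 0.75 gives a sensitivity of 0.5 and a specificity of 1.0. Sweeping the cutoff over all observed scores and plotting sensitivity against 1 minus specificity would trace out the ROC curve.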
Conclusion
Although we find our results compelling and encouraging of further investigation, we acknowledge the limitations imposed by the small number of participants. Larger studies are warranted to verify our findings and to move the proposed diagnostic tool into clinical practice.
References
Abruzzo PM, Ghezzo A, Bolotta A, Ferreri C, Minguzzi R, Vignini A, Visconti P, Marini M (2015) Perspective biological markers for autism spectrum disorders: advantages of the use of receiver operating characteristic curves in evaluating marker sensitivity and specificity. Dis Markers 2015:329607–329615. https://doi.org/10.1155/2015/329607
Akobeng AK (2007) Understanding diagnostic tests 3: receiver operating characteristic curves. Acta Paediatr 96:644–647
Alabdali A, Al-Ayadhi L, El-Ansary A (2014a) A key role for an impaired detoxification mechanism in the etiology and severity of autism spectrum disorders. Behav Brain Funct 10:14. https://doi.org/10.1186/1744-9081-10-14
Alabdali A, Al-Ayadhi L, El-Ansary A (2014b) Association of social and cognitive impairment and biomarkers in autism spectrum disorders. J Neuroinflammation 11:4. https://doi.org/10.1186/1742-2094-11-4
APA—American Psychiatric Association (2000) Diagnostic and statistical manual of mental disorders: DSM-IV-TR. American Psychiatric Association, Washington, DC
APA—American Psychiatric Association (2013) Diagnostic and statistical manual of mental disorders: DSM-5. American Psychiatric Association Publishing, Arlington
Bartlett M (1937) Properties of sufficiency and statistical tests. P Roy Soc Lond A Mat Phys Sci 160:268–282
Beversdorf DQ, Missouri Autism Summit Consortium (2016) Phenotyping, etiological factors, and biomarkers: toward precision medicine in autism spectrum disorders. J Dev Behav Pediatr 37:659–673
Boyd B, Odom S, Humphreys B, Sam A (2010) Infants and toddlers with autism spectrum disorder: early identification and early intervention. J Early Interv 32:75–98
Chen KL, Lin CH, Yu TY, Huang CY, Chen YD (2018) Differences between the childhood autism rating scale and the social responsiveness scale in assessing symptoms of children with autistic spectrum disorder. J Autism Dev Disord, in press. https://doi.org/10.1007/s10803-018-3585-y
Christensen DL, Baio J, Van Naarden BK, Bilder D, Charles J, Constantino JN, Daniels J, Durkin MS, Fitzgerald RT, Kurzius-Spencer M, Lee LC, Pettygrove S, Robinson C, Schulz E, Wells C, Wingate MS, Zahorodny W, Yeargin-Allsopp M (2016) Centers for Disease Control and Prevention (CDC). Prevalence and characteristics of autism spectrum disorder among children aged 8 years—autism and developmental disabilities monitoring network, 11 sites, United States, 2012. MMWR Surveill Summ 65(3):1–23
Constantino JN, Davis SA, Todd RD, Schindler MK, Gross MM, Brophy SL, Metzger LM, Shoushtari CS, Splinter R, Reich W (2003) Validation of a brief quantitative measure of autistic traits: comparison of the social responsiveness scale with the autism diagnostic interview-revised. J Autism Dev Disord 33:427–433
Daniels AM, Mandell DS (2014) Explaining differences in age at autism spectrum disorder diagnosis: a critical review. Autism 18:583–597
Debodinance E, Maljaars J, Noens I (2017) Interventions for toddlers with autism spectrum disorder: a meta-analysis of single-subject experimental studies. Res Autism Spectr Disord 36:79–92
El-Ansary A, Hassan WM, Qasem H, Das UN (2016) Identification of biomarkers of impaired sensory profiles among autistic patients. PLoS One 11:e0164153. https://doi.org/10.1371/journal.pone.0164153
Elsabbagh M, Divan G, Koh YJ, Kim YS, Kauchali S, Marcín C, Montiel-Nava C, Patel V, Paula CS, Wang C, Yasamy MT, Fombonne E (2012) Global prevalence of autism and other pervasive developmental disorders. Autism Res 5:160–179
Galiana-Simal A, Muñoz-Martinez V, Calero-Bueno P, Vela-Romero M, Beato-Fernandez L (2018) Towards a future molecular diagnosis of autism: recent advances in biomarkers research from saliva samples. Int J Dev Neurosci 67:1–5
General Assembly of the World Medical Association (2014) World Medical Association Declaration of Helsinki: ethical principles for medical research involving human subjects. J Am Coll Dent 81(3):14–18
Geschwind DH, Levitt P (2007) Autism spectrum disorders: developmental disconnection syndromes. Curr Opin Neurobiol 17:103–111
Gupta VB, Sundaram R, Martins RN (2013) Multiplex biomarkers in blood. Alzheimers Res Ther 5:31. https://doi.org/10.1186/alzrt185
Hajian-Tilaki K (2013) Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian J Intern Med 4:627–635
Kaiser HF (1974) A note on the equamax criterion. Multivar Behav Res 9:501–503
Khramova TV, Kaysheva AL, Ivanov YD, Pleshakova TO, Iourov IY, Vorsanova SG, Yurov YB, Schetkin AA, Archakov AI (2017) Serologic markers of autism spectrum disorder. J Mol Neurosci 62(3–4):420–429
Loth E, Spooren W, Ham LM, Isaac MB, Auriche-Benichou C, Banaschewski T, Baron-Cohen S, Broich K, Bölte S, Bourgeron T, Charman T, Collier D, de Andres-Trelles F, Durston S, Ecker C, Elferink A, Haberkamp M, Hemmings R, Johnson MH, Jones EJ, Khwaja OS, Lenton S, Mason L, Mantua V, Meyer-Lindenberg A, Lombardo MV, O'Dwyer L, Okamoto K, Pandina GJ, Pani L, Persico AM, Simonoff E, Tauscher-Wisniewski S, Llinares-Garcia J, Vamvakas S, Williams S, Buitelaar JK, Murphy DG (2016) Identification and validation of biomarkers for autism spectrum disorders. Nat Rev Drug Discov 15:70–73
Mayeux R (2004) Biomarkers: potential uses and limitations. NeuroRx 1:182–188
Mick KA (2005) Diagnosing autism: comparison of the childhood autism rating scale (CARS) and the autism diagnostic observation schedule (ADOS). Wichita State University, Wichita
Nunes LA, Mussavira S, Bindhu OS (2015) Clinical and diagnostic utility of saliva as a non-invasive diagnostic fluid: a systematic review. Biochem Med (Zagreb) 25:177–192
O’Connor BP (2000) SPSS and SAS programs for determining the number of components using parallel analysis and Velicer’s MAP test. Behav Res Methods Instrum Comput 32:396–402
Perlis RH (2011) Translating biomarkers to clinical practice. Mol Psychiatry 16:1076–1087
Prata J, Santos SG, Almeida MI, Coelho R, Barbosa MA (2017) Bridging autism spectrum disorders and schizophrenia through inflammation and biomarkers—pre-clinical and clinical investigations. J Neuroinflammation 14(1):179
Sharma SR, Gonda X, Tarazi FI (2018) Autism spectrum disorder classification, diagnosis and therapy. Pharmacol Ther, in press. https://doi.org/10.1016/j.pharmthera.2018.05.007
Sices L, Pawlowski K, Farfel L, Phillips D, Howe Y, Cochran DM, Choueiri R, Forbes PW, Brewster SJ, Frazier JA, Neumeyer A, Bridgemohan C (2017) Feasibility of conducting autism biomarker research in the clinical setting. J Dev Behav Pediatr 38(7):483–492
Steffenburg H, Steffenburg S, Gillberg C, Billstedt E (2018) Children with autism spectrum disorders and selective mutism. Neuropsychiatr Dis Treat 14:1163–1169
Tomlinson A, Hair M, McFadyen A (2013) Statistical approaches to assessing single and multiple outcome measures in dry eye therapy and diagnosis. Ocul Surf 11:267–284
Uddin LQ, Dajani DR, Voorhies W, Bednarz H, Kana RK (2017) Progress and roadblocks in the search for brain-based biomarkers of autism and attention-deficit/hyperactivity disorder. Transl Psychiatry 7(8):e1218
Volkmar FR, Reichow B (2013) Autism in DSM-5: progress and challenges. Mol Autism 4:13. https://doi.org/10.1186/2040-2392-4-13
Woolfendena S, Sarkozya V, Ridley G, Williams K (2012) A systematic review of the diagnostic stability of autism spectrum disorder. Res Autism Spectr Disord 6:345–354
Yatawara CJ, Einfeld SL, Hickie IB, Davenport TA, Guastella AJ (2016) The effect of oxytocin nasal spray on social interaction deficits observed in young children with autism: a randomized clinical crossover trial. Mol Psychiatry 21:1225–1231
Zwaigenbaum L, Bauman ML, Stone WL, Yirmiya N, Estes A, Hansen RL, McPartland JC, Natowicz MR, Choueiri R, Fein D, Kasari C, Pierce K, Buie T, Carter A, Davis PA, Granpeesheh D, Mailloux Z, Newschaffer C, Robins D, Roley SS, Wagner S, Wetherby A (2015) Early identification of autism spectrum disorder: recommendations for practice and research. Pediatrics 136(Suppl 1):S10–S40. https://doi.org/10.1542/peds.2014-3667C
Acknowledgments
The authors would like to thank Dr. Shiao Y. Wang for his valuable assistance with data analysis.
Funding
This research was supported by a grant from the Research Centre for Female Scientific and Medical Colleges at King Saud University.
Ethics declarations
Conflict of Interest
The authors declare no potential conflicts of interest with respect to the authorship and/or publication of this article.
Ethical Approval
All procedures performed were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Cite this article
Hassan, W.M., Al-Ayadhi, L., Bjørklund, G. et al. The Use of Multi-parametric Biomarker Profiles May Increase the Accuracy of ASD Prediction. J Mol Neurosci 66, 85–101 (2018). https://doi.org/10.1007/s12031-018-1136-9