Use of Machine Learning to Identify Children with Autism and Their Motor Abnormalities

Crippa, Alessandro; Salvatore, Christian; Perego, Paolo; Forti, Sara; Nobile, Maria; Molteni, Massimo; Castiglioni, Isabella

doi:10.1007/s10803-015-2379-8

Use of Machine Learning to Identify Children with Autism and Their Motor Abnormalities

Original Paper
Published: 05 February 2015

Volume 45, pages 2146–2156, (2015)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Autism and Developmental Disorders Aims and scope Submit manuscript

Use of Machine Learning to Identify Children with Autism and Their Motor Abnormalities

Download PDF

Alessandro Crippa^1,2,
Christian Salvatore²,
Paolo Perego³,
Sara Forti¹,
Maria Nobile^1,4,
Massimo Molteni¹ &
…
Isabella Castiglioni²

4330 Accesses
126 Citations
7 Altmetric
Explore all metrics

Abstract

In the present work, we have undertaken a proof-of-concept study to determine whether a simple upper-limb movement could be useful to accurately classify low-functioning children with autism spectrum disorder (ASD) aged 2–4. To answer this question, we developed a supervised machine-learning method to correctly discriminate 15 preschool children with ASD from 15 typically developing children by means of kinematic analysis of a simple reach-to-drop task. Our method reached a maximum classification accuracy of 96.7 % with seven features related to the goal-oriented part of the movement. These preliminary findings offer insight into a possible motor signature of ASD that may be potentially useful in identifying a well-defined subset of patients, reducing the clinical heterogeneity within the broad behavioral phenotype.

Novel AI driven approach to classify infant motor functions

Article Open access 10 May 2021

Whole-Body Movement during Videogame Play Distinguishes Youth with Autism from Youth with Typical Development

Article Open access 27 December 2019

Early Diagnose of Autism Spectrum Disorder Using Machine Learning Based on Simple Upper Limb Movements

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Autism spectrum disorder (ASD) is a highly heterogeneous neurodevelopmental disorder with multiple causes, courses, and a wide range in symptom severity (Amaral et al. 2008). Although the core features of ASD are persistent deficits in social communication and interaction and the presence of restricted, repetitive patterns of behavior, interests, or activities (DSM V, American Psychiatric Association 2013), it is of great importance not to ignore the motor impairments associated with ASD as they are highly prevalent, at 79 %, and can have a significant impact on quality of life and social development (Lai et al. 2014). Motor abnormalities in ASD may occur very early in development (Teitelbaum et al. 1998, Brian et al. 2008) and be apparent over time (Fournier et al. 2010; Van Waelvelde et al. 2010) being a pervasive feature of the disorder. Recent studies have also provided evidence for the specificity of motor impairments identified in high-functioning children with ASD compared to children with attention deficit/hyperactivity (ADHD) (Izawa et al. 2012; Ament et al. 2014) and to typically developing children matched by nonverbal IQ and receptive language (Whyatt and Craig 2013). Overall, these findings suggest that motor abnormalities could be a consistent marker of ASD (Dowd et al. 2012). A number of different motor deficits have been reported in ASD, including anomalies in walking patterns (e.g., Rinehart and McGinley 2010; Nobile et al. 2011), hand movements such as reaching (e.g., Mari et al. 2003; Glazebrook et al. 2006; Forti et al. 2011), and eye-hand coordination (e.g., Glazebrook et al. 2009; Crippa et al. 2013). The severity of motor deficits correlates with the degree of social withdrawal and the severity of symptoms (Freitag et al. 2007). Motor control has even been speculated to be crucial for communication and social interaction (Leary and Hill 1996). Indeed, Minshew et al. (2004) proposed that studies on motor function could have significant potential in elucidating the neurobiological basis and even improving the diagnostic definition of ASD.

Currently, the gold standard for the diagnosis of ASD has been formalized with the clinical judgment of symptoms and with semistructured, play-based behavioral observations (Lord et al. 2000) and standardized interviews or questionnaires (e.g., Lord et al. 1994). However, recent studies have started to explore the predictive value of neurobiological as well as behavioral measures in ASD in order to identify a well-defined phenotype of individuals and—possibly—to enable a computer-aided diagnosis perspective. These studies typically implement pattern classification methods that are based on machine-learning algorithms to predict or classify individuals of different groups by maximizing the distance between groups of datasets. Machine learning commonly refers to all procedures that train a computer algorithm to identify a complex pattern of data (i.e., “features”) that can then be used to predict group membership of new subjects (e.g., patients vs. controls). Machine-learning techniques based, for example, on support vector machines (SVMs; Vapnik 1995) require a well-characterized dataset in the training phase in order to extract the classification algorithm that best separates the groups (i.e., the “hyperplane” or “decision function”). In the testing phase, the classification algorithm can be used to predict the class membership of a participant not involved in the training procedure (e.g., whether a new child has ASD). Pattern classification methods can also identify complex patterns of anomalies not efficiently recognized by other univariate statistical methods. Thus, the use of pattern recognition methods to predict group membership should not be considered merely in a potentially “diagnostic” perspective but also as a useful tool used to develop objective measures for each individual from a set of sample data. Most of the studies have applied pattern classification methods to neuroanatomical data measured by structural magnetic resonance (MRI; Ecker et al. 2010a, b) or by diffusion tensor imaging (Lange et al. 2010; Ingalhalikar et al. 2011; Deshpande et al. 2013), although Oller et al. (2010) analysis of data regarding automated vocal analysis produced promising results.

In the present work, we have undertaken a proof-of-concept study to determine whether a simple upper-limb movement could be useful to accurately classify low-functioning children with ASD who are between the ages of 2 and 4. In order to answer this question, we developed a supervised machine-learning method to identify preschool children with ASD and correctly discriminate them from typically developing children by means of kinematic analysis of a simple reach, grasp and drop task. We decided to analyze this simple motor task because the motor system can be more easily probed in low-functioning autistic children than systems that underlie complex cognitive functions. In addition to the potential predictive value of our machine-learning method in exploring the clinical relevance of simple upper-limb movement measures in ASD, we could identify a limited set of kinematic characteristics that even suggests the hypothesis of a motor signature of autism.

Methods

Participants

Fifteen preschool-aged children with autism (ASD) were compared to fifteen typically developing (TD) children who were matched by mental age. IQ and mental age were assessed in our institute by using the Griffiths Mental Development Scales (Griffiths 1970) as a part of the routine clinical practice with low-functioning children. A poor score on the Griffiths scales at 1 and/or 2 years has been demonstrated to be a good predictor of impairment at school age (Barnett et al. 2004). All participants had normal or corrected-to-normal vision and were drug-naïve.

The participants in the ASD group were recruited at our institute over an 18-month period. All participants in the clinical group had been previously diagnosed according with the criteria described in the Diagnostic and Statistical Manual of Mental Disorders-IV TR (American Psychiatric Association 2000) by a medical doctor specialized in child neuropsychiatry with expertise in autism. The diagnoses were then confirmed independently by a child psychologist through direct observation and discussion with each child’s parents. Seven children had been administered the Autism Diagnostic Observation Schedule (ADOS; Lord et al. 2000). The participants in the control group were recruited by local pediatricians and from kindergartens to be mentally age-matched to the clinical sample from the normally developing population. We decided to include, as a comparison group, typically developing children matched by mental age, following the assumption that mental age usually predict ability to understand task instructions, use appropriate strategies and inhibit inappropriate responses (Jarrold and Brock 2004). The TD children had no previous history of social/communicative disorders, developmental abnormalities, or medical disorders with central nervous system implications. All of the participants’ legal guardians gave their informed written consent prior to the children’s participation. The research was approved by the ethics board of our institute in accordance with the Declaration of Helsinki.

Procedure

The participants sat in front of a table of variable height, which was adjusted to the base of the children’s trunk. The experimenter sat at the opposite side of the table, and one parent was present in the room. All trials started with the children’s hands resting at a set position 20 cm away from the ball support. The experimental task consisted of grasping a rubber ball (6-cm diameter) that was placed over a support (see Fig. 1a); that is, a reach-to-grasp movement before they dropped it in a hole (7-cm diameter). The hole (see Fig. 1b) was located inside a see-through square box (21 cm high, 20 cm wide; see Fig. 1) and was large enough not to require fine movements. Ten trials per participant were conducted: five consecutive trials on the left side (and left hand) and five consecutive trials on the right side (and right hand). The order of trial blocks was counterbalanced between participants. The experimenter performed the task first in order to overtly illustrate the task demand (i.e., reach for the ball, grasp it and drop it in the hole) without any verbal cue. Practice trials, the number of which varied individually, were given to participants before recording in order to verify the children’s understanding of the task. The participants were allowed to interrupt the experiment at will in order to rest. The experimental task was simple and interesting enough to ensure the full motivation and compliance of all participants across groups.

Apparatus

An optoelectronic system (The SMART D from BTS Bioengineering^® Garbagnate Milanese, Italy) was used to acquire the kinematics data. Three-dimensional kinematic data were collected by eight infrared-motion analysis cameras at 60 Hz (spatial accuracy <0.2 mm), located four per side at 2.5 m from the participants. Passive markers (1 cm) were attached to the ulnar and radial surfaces of the participants’ wrists and to the hand dorsum on the fourth and fifth metacarpals (see Fig. 1). Moreover, two markers were placed on the ball and four on the box edges under the goal area. All raw data were first preprocessed with Matlab (Mathworks^® Natick, MA, USA); a fifth-order Butterworth, 8-Hz low-pass filter was applied, and movement segmentation and parameters estimation were computed with self-written software.

The overall movement was divided into two sub-movements: Sub-movement 1—the movement necessary to reach the ball and place it on its support; Sub-movement 2—the movement to transport the ball from its support to the target box hole where the ball was to be dropped. For each of these sub-movements, statistics pertaining to a set of dependent measures was collected: (a) total movement duration (TD), (b) number of movement units^{Footnote 1} (MU), (c) peak velocity (PV), (d) time of PV from sub-movement onset (tPV), (e) peak acceleration (PA), (f) time of PA (tPA), (g) peak deceleration (PD), and (h) time of peak deceleration (tPD). Moreover, final movement accuracy was evaluated by the wrist inclination at the time of the ball drop (delta_WA), calculated as the angle between the palm and the vertical axis of the coordinate system (more precisely, the difference between the WA at the end of the transport phase and the WA at the time of peak deceleration). These 17 kinematic measures were used as input features for the pattern classification procedure.

Data Analysis

After checking that the assumptions were not violated, an analysis of covariance (ANCOVA) was carried out to compare the two groups of children on all kinematic measures with Group (ASD vs. TD) as a between-participant factor, and with IQ and chronological age as between-participant covariates. The alpha level was set to .05 for all data analyses. Effect sizes for ANCOVA are reported using partial eta squared (η ²_p ).

The Machine-Learning Method

A pattern classification method based on a machine-learning algorithm was used to classify ASD versus TD by maximizing the distance between the two groups of datasets. A validated supervised machine-learning method (Salvatore et al. 2013) was used. The method involves two different steps: (1) feature selection, the process of selecting a subset of relevant features to be used for classification, and (2) classification, the process of using the selected features to separate the two considered groups of subjects (ASD vs. TD).

Feature Selection

In order to understand which of the collected kinematic features were more discriminative for the ASD versus TD comparison, feature selection was implemented by using a Fisher discriminant ratio (FDR)-based technique (Padilla et al. 2012).

By this technique, for each subject, the collected features and the “label” associated to that subject on a clinical diagnosis basis (i.e., ASD or TD) were considered to calculate a score (FDR score) for each feature.

Specifically, for the feature i, the FDR score was calculated using the following formula:

$$ FDR_{i} = \frac{{\left( {\mu_{i - ASD} - \mu_{i - TD} } \right)^{2} }}{{\sigma_{i - ASD}^{2} + \sigma_{i - TD}^{2} }} $$

where $ \mu_{i - ASD} $ and $ \mu_{i - TD} $ are the mean value of the feature i calculated across the whole ASD and TD datasets, respectively. $ \sigma_{i - ASD}^{2} $ and $ \sigma_{i - TD}^{2} $ are the variance of the feature i calculated across the whole ASD and TD datasets, respectively.

Ranked features were then sorted in a decreasing order, from the most to the least discriminative, according to their FDR score.

Classification Algorithm

Classification of ASD and TD subjects was performed using a Support Vector Machine (SVM) approach (Schölkopf et al. 2000; Vapnik 1995, 1998; Vapnik and Chapelle 1999, López et al. 2011), already optimized and validated in a clinical setting (Salvatore et al. 2013).

The aim of the considered SVM is to generate a model able to (1) learn from the selected features of labeled subjects how to discriminate subjects of different groups (binary labeled training datasets), and (2) correctly classify, by means of the same selected features, new unlabeled subjects as belonging to one of the two groups (ASD or TD).

The learning process of the classifier consists of a training phase in which the selected features of the ASD and TD subjects are two training datasets associated to the ASD and TD labels, respectively.

Mathematically, if we have training data consisting of a vector $ x_{i} \in R^{N} , i = 1, \ldots ,N $ and the associated binary label $ y_{i} \in \left\{ { \pm 1} \right\} $ (e.g., +1 for ASD, −1 for TD), then SVM uses the principle of structural risk minimization to design an optimal hyperplane (OH) that maximizes the distance between the two training groups and that separates them. The lower the distance of a training subject from the OH, the more important that training subject to define the OH. Thus, the distance identifies the “weight” of that training subject in the definition of OH.

The OH can then be used as model to classify new subjects, i.e., subjects for which the label is unknown.

Mathematically, the model used for the identification of the binary label y′ of a new subject x, as a result of the classification of that new subject, is given by the following function:

$$ y^{{\prime }} \left( x \right) = \mathop \sum \limits_{i = 1}^{N} a_{i} \cdot y_{i} \cdot k\left( {x,x_{i} } \right) + b $$

$ a_{i} $ being the weight of the training subject $ x_{i} $, $ y_{i} $ being the binary label of the training subject $ i $, $ k(x,x_{i} ) $ being a linear kernel function, $ b $ being a threshold parameter called bias, and $ N $ being the number of training subjects. We chose to employ a linear kernel because it represents the more general form of a decision function and because it ensures better computational efficiency.

In this study, the whole machine-learning method was implemented on the Matlab platform (Matlab version R2013b, The MathWorks, Natick, MA). In particular, we used functions of the biolearning toolbox of Matlab to implement the classification algorithm.

Performance of the Classification Algorithm

Performance of the classification algorithm was assessed by using a cross-validation strategy. In general, cross validation involves splitting the original dataset into two complementary subsets: a training set and a testing set. The training set is a set of data associated to a label and used to perform the training of the classifier (as already described in the previous section); the testing set is a set of data not associated to a label and used to perform the validation of the classifier. By considering different partitions of the data, multiple rounds of cross-validation can then be performed.

In a particular case of cross-validation, called leave-one-out (LOO) cross-validation, the testing set is solely composed of one sample of the original dataset and the training set is made up of the remaining samples of the original dataset (N − 1). Therefore, if we want to test all N samples in the original dataset, then it is sufficient that the number of rounds to be performed equals the number N of samples in the original dataset. LOO is a widely used validation approach in literature because it has been proven able to return an almost unbiased estimate of the probability of error (e.g., Vapnik 1998; Chapelle et al. 1999).

In this study, validation of the classifier for the ASD versus TD comparison was performed by using an LOO cross-validation strategy for a number i of selected features running from one to the whole number of features (i.e., 17). A schematic description of the whole procedure is shown in Fig. 2.

In order to quantify the performance of the proposed classification algorithm, the accuracy, specificity, and sensitivity rates were computed. Accuracy of classification measures the rate of correctly classified samples in both positive (ASD) and negative (TD) classes. Specificity and sensitivity measure the rate of correctly classified samples in the positive (ASD) and in the negative (TD) class, respectively.

Mathematically, the accuracy, specificity and sensitivity of the classifier when the first i selected features are used, were computed as follows:

$$ Accuracy_{i} = \frac{{N^{CC} }}{N} $$

$$ Specificity_{i} = \frac{{N_{TD}^{CC} }}{{N_{TD}^{CC} + N_{TD}^{IC} }} $$

$$ Sensitivity_{i} = \frac{{N_{ASD}^{CC} }}{{N_{ASD}^{CC} + N_{ASD}^{IC} }} $$

where N is the total number of classified subjects; $ N^{CC} $ is the total number of correctly classified (CC) subjects, $ N_{TD}^{CC} $ is the number of TD samples that were CC as belonging to the TD gr (true negatives), $ N_{TD}^{IC} $ is the number of TD samples that were incorrectly classified (IC) as belonging to the ASD class (false positives); $ N_{ASD}^{CC} $ is the number of ASD samples that were CC as belonging to the ASD class (true positives), $ N_{ASD}^{IC} $ is the number of ASD samples that were IC as belonging to the TD class (false negatives).

We then studied the dependency of accuracy, specificity, and sensitivity on the number i of selected features.

The maximum values reached for accuracy, specificity, and sensitivity, referred to as maximum accuracy, specificity, and sensitivity, allowed the definition of the most discriminative features.

Overall mean accuracy, specificity, and sensitivity rates were calculated as mean values of accuracy, specificity, and sensitivity as follows:

$$ Overall\,mean\,accuracy = \frac{1}{F} \cdot \sum\limits_{i = 1}^{F} {Accuracy_{i} } $$

$$ Overall\,mean\,specificity = \frac{1}{F} \cdot \sum\limits_{i = 1}^{F} {Specificity_{i} } $$

$$ Overall\,mean\,sensitivity = \frac{1}{F} \cdot \sum\limits_{i = 1}^{F} {Sensitivity_{i} } $$

where $ F $ is the whole number of features (17).

Results

Data on the demographic, cognitive, and clinical characteristics of the participants are summarized in Table 1.

Table 1 Demographics of the participants

Full size table

The validity of mental age matching was confirmed (p > 0.05). Gender was also balanced between groups, as there were 3 girls in the ASD group and 2 girls in the healthy control group (χ²(1) = .240; p > 0.05). As expected, IQ and chronological age were not balanced across groups (both p < 0.001). Table 2 shows kinematic feature values of the two groups of children included in the study (ASD vs. TD) and the results of ANCOVA calculated on all kinematic measures. We found several significant group differences based on the kinematic variables even after having controlled for between-participant differences in IQ and chronological age.

Table 2 Kinematic data were initially analyzed through an ANCOVA with Group (ASD vs. TD) as a between-participant factor, and with IQ and chronological age as covariates

Full size table

The Machine-Learning Method

Classification Algorithm

In Fig. 3, the optimal hyper-plane separating ASD from TD participants is shown as a representative example of the training phase of the machine-learning method.

Performance of the Classification Algorithm

In Table 3, the accuracy, specificity, and sensitivity of the machine-learning method for the comparison of ASD versus TD are reported.

Table 3 Accuracy, specificity and densitivity rates of SVM using LOO validation

Full size table

The machine-learning method was able to successfully classify participants by diagnosis. The classification accuracy reached a maximum accuracy of 96.7 % (specificity 93.8 % and sensitivity 100 %) by using seven features selected by the Fisher discriminant ratio-based technique. Overall mean accuracy, specificity, and sensitivity rates were also calculated over a number of selected features ranging from one to 17 (the whole number of features). The overall mean classification accuracy (specificity/sensitivity) was 84.9 % (mean specificity 89.1 % and mean sensitivity 82.2 %).

In Fig. 4, the dependence of the metrics on the number of considered features is shown. The resulting data are shown for a number of features ranging from one to 17. As expected, accuracy, specificity, and sensitivity rates increase with the number of selected features, reaching their maximum values when considering seven selected features.

Besides calculating the accuracy of the SVM method, we were particularly interested in identifying which kinematic features contributed toward the classification. Our analysis showed that seven of 17 features were sufficient to classify autism with a 96.7 % accuracy rate. All of these seven kinematic features are related to the second part of the movement, sub-movement 2 (i.e., the movement to transport the ball from a support to the target hole in which the ball was to be dropped): (1) total duration; (2) delta wrist angle; (3) number of movement units; (4) time of peak deceleration; (5) peak acceleration; (6) time of peak velocity; and (7) peak velocity. Finally, the most discriminative features between the two groups when considering all of the N rounds (30) of the LOO cross-validation strategy are reported here in descending order: Total Duration sub movement 2, Delta Wrist Angle, Movement Units sub movement 2, time of Peak Deceleration sub movement 2, Peak Acceleration sub movement 2, time of Peak Velocity sub movement 2, Peak Velocity sub movement 2, Peak Velocity sub movement 1, time of Peak Acceleration sub movement 1, Peak Acceleration sub movement 1, time of Peak Acceleration sub movement 2, Peak Deceleration sub movement 2, time of Peak Velocity sub movement 1, Movement Units sub movement 1, time of Peak Deceleration sub movement 1, Peak Deceleration sub movement 1, Total Duration sub movement 1.

Discussion

Autism spectrum disorder is currently diagnosed on the basis of symptoms as qualitatively judged by clinicians and by means of semistructured observations (ADOS) and standardized interviews or questionnaires (ADI-R). Given this gold standard for the diagnosis of ASD, the use of pattern recognition methods to predict group membership has recently attracted strong attention, not only from a computer-aided diagnosis perspective, but also as suitable tool to define objective, quantitative measures of the disorder. Previous works have investigated the predictive value of neurobiological and behavioral measures in patients with ASD. The purpose of the present study was to explore the ability of the kinematic analysis of a simple upper-limb movement to correctly discriminate young low-functioning children with ASD from typically developing children. To achieve this goal, we applied our validated supervised machine-learning procedure (Salvatore et al. 2013) to the kinematic analysis of a simple reach, grasp, and drop task performed by preschool children with ASD in comparison to their mental-age-matched, typically developing peers.

The SVM algorithm reached a good mean individual classification in the comparisons between children with ASD and healthy controls (overall mean accuracy = 84.9 %, with overall mean specificity = 89.1 % and overall mean sensitivity = 82.2 %), with a maximum accuracy of 96.7 % (with maximum specificity of 93.8 % and maximum sensitivity of 100 %). The classification accuracy that was achieved in this study is consistent with previous SVM applications to MRI data (Ecker et al. 2010a, b) and to diffusion tensor imaging (DTI) data (Ingalhalikar et al. 2011; Deshpande et al. 2013) or with quadratic discriminant function application on diffusion tensor asymmetries (Lange et al. 2010). Our results are also consistent with the findings of Oller et al. (2010), who derived algorithms that were based on linear discriminant analysis by using an automated analysis of the acoustic characteristics of babble and early language to discriminate typical from language disordered development, such as autism or language delay. Thus, the present findings clearly show the feasibility and the applicability of our SVM method in correctly classifying preschool children with ASD on the basis of a motor task. Indeed, an autism diagnosis is particularly difficult in young, low-functioning children with autism, even using the gold standard diagnostic procedure. Our motor measure might have potential clinical application in such cases, thus providing useful information for clinicians to support a diagnostic decision. A point of relevance of our work, in fact, is that we decided to study the predictive value of a simple reach, grasp, and drop task, because the motor system can be more easily evaluated (i.e., even in young low-functioning children with ASD) than other more complex systems (e.g., cognitive functions). Indeed, because of the easiness and self-explanatory nature of the task, all participants were able to fully understand the experimental demand and to complete the movement successfully. Furthermore, kinematics analysis provides a constraint-free, non-intrusive environment for a challenging clinical population such as ASD in comparison with a magnetic resonance examination that is mostly used in previous pattern-recognition applications. Lastly, kinematic analysis is also a more convenient and less expensive technology than MRI to implement in a clinical setting equipped with an optoelectronic system to acquire kinematic data. Indeed, the task can be easily administered by any professional who works with children. Testing sessions last 15 min, and data analysis can be performed by a trained bioengineer in approximately 30 min for each subject.

Using feature selection, we also found the best classification accuracy of 96.7 % with seven features which had the highest discriminative ability between the groups. All of these seven kinematic features are related to the second part of the movement—sub-movement 2—in which the child transported the ball from a support to the target hole where the ball was to be dropped. This suggests that goal-oriented movements may be critical in separating children with ASD from typically developing children. More specifically, the top three features within the seven kinematic characteristics of sub-movement 2—time duration, movement units, and wrist angle—indicate respectively slower and more fragmented movements in children with ASD with inappropriate hand inclination for ball-drops during the final phases of hand transport. Thus, our results extend previous investigations in ASD that report the difficulty of translating intention into a motor chain leading to the action goal (Cattaneo et al. 2007; Fabbri-Destro et al. 2009; Forti et al. 2011). These findings demonstrate that a limited set of kinematic characteristics could reliably identify children with ASD in order to describe a well-defined phenotype of individuals within a complex and highly heterogeneous disorder, even suggesting a possible motor signature of autism related to disrupted planning movement sequences.

Despite our promising results, some methodological limitations of the present exploratory study should be considered. The main limitation is related to the small sample sizes of participant groups; the present findings, therefore, need to be replicated in a larger sample in order to validate the present SVM method by using a data set upon which it has not trained. Another potential limitation of this study is that our SVM classification is highly specific to the sample employed in training the classifier (i.e., preschool children with ASD). Future studies involving females with ASD, children with high-functioning autism, and adult patients are needed to generalize our findings to the heterogeneous spectrum of the disorder. Although we found that our significant between-groups differences were not dependent on IQ and chronological age, it could be worthwhile in future studies to train the computer algorithm with data from age-matched typically developing participants as well. Unfortunately, we did not collect ADOS scores from the entire clinical sample; thus, we could not perform a correlation analysis between our significant findings and the clinical characteristics of children with ASD. Future extensions of this work should also include other neurodevelopmental conditions (e.g., intellectual disability, developmental delays without intellectual disability, or developmental coordination disorders) in order to verify the classifier specificity to ASD, rather than a neurodevelopmental disorder in general. Indeed, some studies have recently indicated the specificity of motor difficulties in older high-functioning children with ASD compared to children with ADHD (Izawa et al. 2012; Ament et al. 2014) and to healthy children matched by nonverbal IQ and receptive language (Whyatt and Craig 2013). Finally, it should be noted that the predictive values of classification methods are restrained by the base rate of neurodevelopmental disorder in the population (Bishop 2010; Heneghan 2010; Yerys and Pennington 2011). Therefore, caution is needed when comparing classification-based accuracy values to the conventional diagnostic measures.

Nevertheless, although the present results should be considered preliminary, this study represents a “proof-of-concept” that kinematic analysis of simple upper-limb movement can reliably identify preschool-aged, low-functioning children with ASD. The significant predictive value of our SVM classification approach might be valuable to support the clinical practice of diagnosing ASD, thus encouraging a computer-aided diagnosis perspective. Moreover, our findings offer insight on a possible motor signature of autism that is potentially useful to identify a well-defined subset of patients, thus reducing the clinical heterogeneity within the broad behavioral phenotype. This may guide further exploration of neuropathology of the disorder with neuroimaging techniques or genetic analysis.

Notes

A movement unit is defined as an acceleration phase followed by a deceleration phase higher than 10 mm/s, starting from the moment at which the increase or decrease in cumulative velocity is over 20 mm/s (Von Hofsten 1991; Thelen et al. 1996).

References

Amaral, D. G., Schumann, C. M., & Nordahl, C. W. (2008). Neuroanatomy of autism. Trends of Neurosciences, 31(3), 137–145.
Article Google Scholar
Ament, K., Mejia, A., Buhlman, R., Erklin, S., Caffo, B., Mostofsky, S., et al. (2014). Evidence for specificity of motor impairments in catching and balance in children with autism. Journal of Autism and Developmental Disorders. doi:10.1007/s10803-014-2229-0.
American Psychiatric Association. (2000). Diagnostic and statistical manual of mental disorders (4th ed., text rev.). Washington, DC: American Psychiatric Association.
American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (5th ed.). Arlington, VA: American Psychiatric Association.
Google Scholar
Barnett, A. L., Guzzetta, A., Mercuri, E., Henderson, S. E., Haataja, L., Cowan, F., & Dubowitz, L. (2004). Can the Griffiths scales predict neuromotor and perceptual-motor impairment in term infants with neonatal encephalopathy? Archives of Disease in Childhood, 89, 637–643.
Article PubMed Central PubMed Google Scholar
Bishop, D. V. M. (2010). The difference between Po0.05 and a screening test. http://deevybee.blogspot.com/2010/07/difference-between-p-05-and-screening.html. Accessed June 30, 2014.
Brian, J., Bryson, S. E., Garon, N., Roberts, W., Smith, I. M., Szatmari, P., & Zwaigenbaum, L. (2008). Clinical assessment of autism in high-risk 18-month-olds. Autism, 12(5), 433–456.
Article PubMed Google Scholar
Cattaneo, L., Fabbri-Destro, M., Boria, S., Pieraccini, C., Monti, A., Cossu, G., & Rizzolatti, G. (2007). Impairment of actions chains in autism and its possible role in intention understanding. Proceedings of National Academy of Science of United States of America, 104(45), 17825–17830.
Article Google Scholar
Chapelle, O., Haffner, P., & Vapnik, V. N. (1999). Support vector machines for histogram-based image classification. IEEE Transactions on Neural Networks, 10(5), 1055–1064.
Article PubMed Google Scholar
Crippa, A., Forti, S., Perego, P., & Molteni, M. (2013). Eye-hand coordination in children with high functioning autism and Asperger’s disorder using a gap-overlap paradigm. Journal of Autism and Developmental Disorders, 43(4), 841–850.
Article PubMed Google Scholar
Deshpande, G., Libero, L. E., Sreenivasan, K. R., Deshpande, H. D., & Kana, R. K. (2013). Identification of neural connectivity signatures of autism using machine learning. Frontiers in Human Neuroscience, 7, 670.
Article PubMed Central PubMed Google Scholar
Dowd, A. M., McGinley, J. L., Taffe, J. R., & Rinehart, N. J. (2012). Do planning and visual integration difficulties underpin motor dysfunction in autism? A kinematic study of young children with autism. Journal of Autism and Developmental Disorders, 42(8), 1539–1548.
Article PubMed Google Scholar
Ecker, C., Marquand, A., Mourão-Miranda, J., Johnston, P., Daly, E. M., Brammer, M. J., et al. (2010a). Describing the brain in autism in five dimensions—Magnetic resonance imaging-assisted diagnosis of autism spectrum disorder using a multiparameter classification approach. Journal of Neuroscience, 30(32), 10612–10623.
Article PubMed Google Scholar
Ecker, C., Rocha-Rego, V., Johnston, P., Mourao-Miranda, J., Marquand, A., Daly, E. M., et al. (2010b). Investigating the predictive value of whole-brain structural MR scans in autism: A pattern classification approach. Neuroimage, 49(1), 44–56.
Article PubMed Google Scholar
Fabbri-Destro, M., Cattaneo, L., Boria, S., & Rizzolatti, G. (2009). Planning actions in autism. Experimental Brain Research, 192(3), 521–525.
Article PubMed Google Scholar
Forti, S., Valli, A., Perego, P., Nobile, M., Crippa, A., & Molteni, M. (2011). Motor planning and control in autism. A kinematic analysis of preschool children. Research in Autism Spectrum Disorders, 5(2), 834–842.
Article Google Scholar
Fournier, K. A., Hass, C. J., Naik, S. K., Lodha, N., & Cauraugh, J. H. (2010). Motor coordination in autism spectrum disorders: A synthesis and meta-analysis. Journal of Autism and Developmental Disorder, 40, 1227–1240.
Article Google Scholar
Freitag, C. M., Kleser, C., Schneider, M., & von Gontard, A. (2007). Quantitative assessment of neuromotor function in adolescents with high functioning autism and Asperger syndrome. Journal of Autism and Developmental Disorders, 37(5), 948–959.
Article PubMed Google Scholar
Glazebrook, C. M., Elliott, D., & Lyons, J. (2006). A kinematic analysis of how young adults with and without autism plan and control goal-directed movements. Motor Control, 10(3), 244–264.
PubMed Google Scholar
Glazebrook, C., Gonzalez, D., Hansen, S., & Elliott, D. (2009). The role of vision for online control of manual aiming movements in persons with autism spectrum disorders. Autism, 13(4), 411–433.
Article PubMed Google Scholar
Griffiths, R. (1970). The ability of young children. A study in mental measurement. London: University of London Press.
Google Scholar
Heneghan, C. (2010). Why autism can’t be diagnosed with brain scans: Using brain scans to detect autism would be a huge waste of money, says Carl Heneghan. http://www.guardian.co.uk/science/blog/2010/aug/12/autism-brainscan-statistic. Accessed June 30, 2014.
Ingalhalikar, M., Parker, D., Bloy, L., Roberts, T. P., & Verma, R. (2011). Diffusion based abnormality markers of pathology: Toward learned diagnostic prediction of ASD. Neuroimage, 57(3), 918–927.
Article PubMed Central PubMed Google Scholar
Izawa, J., Pekny, S. E., Marko, M. K., Haswell, C. C., Shadmehr, R., & Mostofsky, S. H. (2012). Motor learning relies on integrated sensory inputs in ADHD, but over-selectively on proprioception in autism spectrum conditions. Autism Research, 5(2), 124–136.
Article PubMed Central PubMed Google Scholar
Jarrold, C., & Brock, J. (2004). To match or not to match? Methodological issues in autism-related research. Journal of Autism and Developmental Disorders, 34(1), 81–86.
Article PubMed Google Scholar
Lai, M. C., Lombardo, M. V., & Baron-Cohen, S. (2014). Autism. The Lancet, 383(9920), 896–910.
Article Google Scholar
Lange, N., Dubray, M. B., Lee, J. E., Froimowitz, M. P., Froehlich, A., Adluru, N., et al. (2010). Atypical diffusion tensor hemispheric asymmetry in autism. Autism Research, 3(6), 350–358.
Article PubMed Central PubMed Google Scholar
Leary, M. R., & Hill, D. A. (1996). Moving on: Autism and movement disturbance. Mental Retardation, 34(1), 39–53.
PubMed Google Scholar
López, M., Ramírez, J., Górriz, J. M., Álvarez, I., Salas-Gonzalez, D., Segovia, F., et al. (2011). Principal component analysis-based techniques and supervised classification schemes for the early detection of Alzheimer’s disease. Neurocomputing, 74, 1260–1271.
Article Google Scholar
Lord, C., Risi, S., Lambrecht, L., Cook, E. H, Jr, Leventhal, B. L., DiLavore, P. C., et al. (2000). The autism diagnostic observation schedule-generic: A standard measure of social and communication deficits associated with the spectrum of autism. Journal of Autism and Developmental Disorders, 30(3), 205–223.
Article PubMed Google Scholar
Lord, C., Rutter, M., & Le Couteur, A. (1994). Autism diagnostic interview-revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. Journal of Autism and Developmental Disorders, 24(5), 659–685.
Article PubMed Google Scholar
Mari, M., Castiello, U., Marks, D., Marraffa, C., & Prior, M. (2003). The reach-to-grasp movement in children with autism spectrum disorder. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, 358(1430), 393–403.
Article PubMed Central PubMed Google Scholar
Minshew, N. J., Sung, K., Jones, B. L., & Furman, J. M. (2004). Underdevelopment of the postural control system in autism. Neurology, 63(11), 2056–2061.
Article PubMed Google Scholar
Nobile, M., Perego, P., Piccinini, L., Mani, E., Rossi, A., Bellina, M., & Molteni, M. (2011). Further evidence of complex motor dysfunction in drug naive children with autism using automatic motion analysis of gait. Autism, 15(3), 263–283.
Article PubMed Google Scholar
Oller, D. K., Niyogi, P., Gray, S., Richards, J. A., Gilkerson, J., Xu, D., et al. (2010). Automated vocal analysis of naturalistic recordings from children with autism, language delay, and typical development. Proceedings of National Academy of Science of United States of America, 107(30), 13354–13359.
Article Google Scholar
Padilla, P., Lopez, M., Gorriz, J. M., Ramirez, J., Salas-Gonzalez, D., & Alvarez, I. (2012). NMF–SVM based CAD tool applied to functional brain images for the diagnosis of Alzheimer’s disease. IEEE Transactions on Medical Imaging, 31(2), 207–216.
Article PubMed Google Scholar
Rinehart, N., & McGinley, J. (2010). Is motor dysfunction core to autism spectrum disorder? Developmental Medicine & Child Neurology, 52(8), 697.
Article Google Scholar
Salvatore, C., Cerasa, A., Castiglioni, I., Gallivanone, F., Augimeri, A., Lopez, M., et al. (2013). Machine learning on brain MRI data for differential diagnosis of Parkinson’s disease and progressive supranuclear palsy. Journal of Neuroscience Methods, 222, 230–237.
Article PubMed Google Scholar
Schölkopf, B., Smola, A. J., Williamson, R. C., & Bartlett, P. L. (2000). New support vector algorithms. Neural Computation, 12(5), 1207–1245.
Article PubMed Google Scholar
Teitelbaum, P., Teitelbaum, O., Nye, J., Fryman, J., & Maurer, R. G. (1998). Movement analysis in infancy may be useful for early diagnosis of autism. Proceedings of the National Academy of Science of the United States of America, 95, 13982–13987.
Article Google Scholar
Thelen, E., Corbetta, D., & Spencer, J. P. (1996). Development of reaching during the first year: Role of movement speed. Journal of Experimental Psychology: Human Perception and Performance, 22(5), 1059–1076.
PubMed Google Scholar
Van Waelvelde, H., Oostra, A., Dewitte, G., Van Den Broeck, C., & Jongmans, M. J. (2010). Stability of motor problems in young children with or at risk of autism spectrum disorders, ADHD, and or developmental coordination disorder. Developmental Medicine & Child Neurolology, 52(8), 174–178.
Article Google Scholar
Vapnik, V. N. (1995). The nature of statistical learning theory. New York, NY: Springer.
Book Google Scholar
Vapnik, V. N. (1998). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988–999.
Article Google Scholar
Vapnik, V. N., & Chapelle, O. (1999). Bounds on error expectation for support vector machines. Neural Computation, 12(9), 2013–2036.
Article Google Scholar
Von Hofsten, C. (1991). Structuring of early reaching movements: A longitudinal study. Journal of Motor Behavior, 23(4), 280–292.
Article Google Scholar
Whyatt, C. P., & Craig, C. M. (2013). Sensory-motor problems in autism. Frontiers in Integrative Neuroscience, 7(51), 1–12.
Google Scholar
Yerys, B. E., & Pennington, B. F. (2011). How do we establish a biological marker for a behaviorally defined disorder? Autism as a test case. Autism Research, 4(4), 239–241.
Article PubMed Google Scholar

Download references

Acknowledgments

This research has been partially funded by the FP6-NEST Adventure activities Specific Targeted Research Project: ‘‘TACT’’ (Thought in ACTion) to Dr. Molteni and by the Fund for Research of the Italian Ministry of University and Research, within a framework agreement between Lombardy Region and National Research Council of Italy (No. 17125, 27/09/2012). The funding sources had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. We acknowledge the work of Silvia Borini, Cristina Motta, Elisa Mani, and Laura Villa in the diagnostic evaluation of participants with autism and Giuseppe Aceti, Maura Mariani, Claudio Marcolini, Mariangela Perego, Barbara Urbani, and Angela Valli for their help in recruiting participants. We also thank Silvia Colonna and Maddalena Mauri for helping editing the last version of manuscript and the anonymous reviewers for their comments. Lastly, we are especially grateful to all the families of the children who took part in this study.

Author information

Authors and Affiliations

Child Psychopathology Unit, Scientific Institute, IRCCS Eugenio Medea, Via Don Luigi Monza 20, 23842, Bosisio Parini, Lecco, Italy
Alessandro Crippa, Sara Forti, Maria Nobile & Massimo Molteni
Institute of Molecular Imaging and Physiology, National Research Council, Via F.lli Cervi 93, 20090, Segrate, Milan, Italy
Alessandro Crippa, Christian Salvatore & Isabella Castiglioni
Bioengineering Lab, Scientific Institute, IRCCS Eugenio Medea, Via Don Luigi Monza 20, 23842, Bosisio Parini, Lecco, Italy
Paolo Perego
Department of Clinical Neurosciences, Hermanas Hospitalarias, FoRiPsi, Albese con Cassano, Italy
Maria Nobile

Authors

Alessandro Crippa
View author publications
You can also search for this author in PubMed Google Scholar
Christian Salvatore
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Perego
View author publications
You can also search for this author in PubMed Google Scholar
Sara Forti
View author publications
You can also search for this author in PubMed Google Scholar
Maria Nobile
View author publications
You can also search for this author in PubMed Google Scholar
Massimo Molteni
View author publications
You can also search for this author in PubMed Google Scholar
Isabella Castiglioni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alessandro Crippa.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Crippa, A., Salvatore, C., Perego, P. et al. Use of Machine Learning to Identify Children with Autism and Their Motor Abnormalities. J Autism Dev Disord 45, 2146–2156 (2015). https://doi.org/10.1007/s10803-015-2379-8

Download citation

Published: 05 February 2015
Issue Date: July 2015
DOI: https://doi.org/10.1007/s10803-015-2379-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Use of Machine Learning to Identify Children with Autism and Their Motor Abnormalities

Abstract

Similar content being viewed by others

Novel AI driven approach to classify infant motor functions

Whole-Body Movement during Videogame Play Distinguishes Youth with Autism from Youth with Typical Development

Early Diagnose of Autism Spectrum Disorder Using Machine Learning Based on Simple Upper Limb Movements

Introduction