Introduction

Autosomal recessive cerebellar ataxias (ARCAs) are heterogeneous and complex inherited neurodegenerative disorders that affect the cerebellum, the spinal cord and the peripheral nerves. Molecular diagnosis of the ARCAs is challenging due to both phenotypic and genetic heterogeneity [1]. It was formerly a step by step approach with serial sequencing of several genes by the Sanger technique. The advent of affordable next generation sequencing (NGS) technologies allows now sequencing of exomes or large gene panels for diagnosis of rare neurological diseases such as ARCAs [2]. However, massive amount of data with multiple rare variants in several genes in a single patient increases the complexity of the analysis. A collaborative cross-talk between molecular geneticists and clinicians is even more necessary than before for NGS diagnosis validation in clinical practice. Age of onset, ataxia progression rate and careful clinical examination combined with laboratory, morphological and neurophysiological investigations [including vitamin E, alpha-fetoprotein (AFP), albumin, cholestanol, brain MRI, nerve conduction studies (NCS)] are useful for reaching a diagnostic conclusion. A new classification of ARCAs also has been established comprising 3 groups: ARCA with pure sensory neuropathy, ARCA with sensorimotor axonal neuropathy and ARCA without neuropathy. An algorithm for the diagnosis of ARCAs based on these items has been proposed by our group according to personal experience and literature data [3] (Fig. 1). We conducted a blind study to validate this clinical practice-based algorithm on a series of patients with molecular diagnosis of ARCA. These patients were part of a cohort of patients consecutively investigated with targeted capture sequencing of a panel of 57 ataxia genes.

Fig. 1
figure 1

Algorithm for the diagnosis of autosomal recessive cerebellar ataxias (simplified and adapted [3]). The combination of natural history, associated clinical signs and paraclinical data (such as brain MRI, nerve conduction studies and several biomarkers) leads to one or a few diagnosis. yr years, EMG electroneuromyography

Patients and methods

Between 2010 and 2012, 145 unrelated index patients were recruited in 12 tertiary centers for movement disorders: 130 in France and 15 in Algeria. Inclusion criteria for the NGS analysis were the combination of: (1) progressive cerebellar ataxia; (2) age at onset before 60 years; (3) molecular analysis negative for Friedreich ataxia and other investigations depending on clinical assessment; (4) recessive inheritance or sporadic cases. Written informed consent was obtained from all participants and local ethics committee approved the study. Hundred and forty-five consecutive patients were analyzed by a targeted exon-capture strategy coupled with multiplexing and high-throughput sequencing of 57 genes causing ataxia when mutated (listed in supplementary file-A). Library preparation, targeted capture and sequencing were realized as previously reported [4]. NGS analysis is detailed in supplementary file-B [5].

Hundred and thirty-four patients were presenting ataxia starting before 40 years and 11 patients had late onset ataxia, starting after 40 years. Our cohort was mainly comprised of sporadic cases: 85 patients (59 %) had neither familial history of ataxia nor consanguinity. Fifty-four patients (37 %) had a recessive pedigree (2 or more affected in the kindred or isolated case with parental consanguinity). A molecular diagnosis was made in 27/145 patients (19 %) with mutations in ARCA genes. Among the 27 patients with ARCA molecular diagnosis, 4 were excluded due to the lack of available data. Molecular data of these 4 patients and clinical data of 118 patients without diagnosis are summarized in supplementary file C [6] and D, respectively.

We selected the remaining 23 ARCA patients with an established molecular diagnosis to assess the validity of the clinical practice-based algorithm. Two movement disorders specialists (MA, CT) performed independently a blind analysis based on the clinical (age at onset, current age, current disability based on scale of assessment and rating of ataxia (SARA) [7] score and/or spinocerebellar degeneration functional score (SDFS) [1], exhaustive clinical examination abnormalities including ocular motor signs, movements disorders, pyramidal signs, mental retardation) and paraclinical (biomarkers—especially vitamin E, AFP, albumin, cholestanol-, brain MRI, NCS findings) but molecular data: each patient had to be categorized in one of the three ARCA groups as described above and ranking of the two most probable disease causing genes was achieved. Inter-rater agreement was assessed by computing a weighted Kappa coefficient (K) with ‘“squared”‘ weights. All mutated genes affecting the 23 patients were ordered according to the data of the literature as followed on a ordinal scale to take account of the degree of disagreement [group without neuropathy: (1) SYNE1 (2) ANO10, (3) ADCK3; group with pure sensory neuronopathy: (1) POLG, (2) TTPA; group with axonal sensorimotor neuropathy (1) APTX, (2) SETX, (3) CYP27A1, (4) SACS]. Genes that showed larger number of associated phenotypic differences were separated by greater distance on the ordinal scale. Confidence interval was calculated using the adjusted bootstrap percentile (BCa) method based on 10,000 replicates. A “z” test was performed to assess if the classification which produced the Kappa statistic is significantly better than a random result (K = 0). A p value <0.05 was considered statistically significant. Analyses were performed using R software version 3.1.0 (R Project for Statistical Computing) with the “irr” package.

Results

Twenty-three patients were investigated with the clinical practice-based algorithm. The age of onset ranged from 1 to 47 years (median 16). Genetic analysis identified two pathogenic mutations in ANO10 (6 patients), in SETX (4), in SYNE1 and ADCK3 (3 each), in SACS and APTX (2 each) and in TTPA, CYP27A1, POLG (1 each) (Table 1). The correct ARCA group was found in all patients by the two assessors. The gene ranked first by the first assessor (MA) was correct in 14/23 cases (61 %) and 18/23 cases (78 %) for the second assessor (CT). The most frequent error (11 errors/14) was misdiagnosis within the ARCA group without neuropathy probably due to the closely overlapping phenotypes of pure cerebellar ataxias (especially ARCA1, ARCA3 and to some extent ARCA2), with a wide range of age at onset. Five errors were shared by both experts. Considering the two most probable genes according to the assessors, the correct diagnosis was identified in 18/23 (78 %) and 21/23 (91 %), respectively. There was a high inter-rater agreement [K = 0.85 (0.55–0.98) p < 0.001] on the first gene ranked by the assessors confirming the algorithm’s reproducibility.

Table 1 Gene ranking based on the clinical practice-based algorithm

Discussion

We report on the validation of a clinical practice-based algorithm in a series of patients, based on a blind analysis of clinical and paraclinical data. Given the high percent of correct diagnosis in our study (the assessors were able to find the good molecular diagnosis in 2/3 and 3/4 of cases, respectively), it is expected that this algorithm will be useful in clinical practice for neurologists and geneticists. Moreover, the algorithm allowed us to identify nine distinct entities, including entities that belong to the three different groups of the new classification of ARCAs [3]. One hundred forty-five patients suspected with ARCA were included in the study which is a high number since Friedreich ataxia was previously excluded. Therefore, the evaluation of the 23 patients with a molecularly confirmed diagnosis is relevant.

Confirmation of diagnosis by NGS can be straightforward in presence of clear-cut mutations, even with few clinical data. However, NGS data analysis frequently reveals several variants of unknown significance especially missense mutations in one or more ARCA-causing genes. In these difficult cases, proper phenotyping of ARCA patients, including precise clinical examination, biochemical investigations, brain imaging and NCS, is still necessary for guidance of genetic analysis and interpretation of the NGS data: these data have to be in agreement with an already described phenotype in order to confirm the variant’s pathogenicity. Similarly, appropriate knowledge of the several entities and of their description in the literature is recommended for the best management of such scarce diseases.

Despite the relevance of the algorithm, it is not infallible since few errors were made. ANO10 and SYNE1 mutations are responsible for close phenotypes [8, 9] with pure cerebellar ataxia and slow progression and may be difficult to distinguish without genetic analysis especially because there is no biomarker. Some errors were therefore done during the blinded assessment regarding these entities. Such algorithm is limited by the presence of atypical phenotypes associated with mutations in known genes. For instance, most patients with autosomal recessive spastic ataxia of Charlevoix Saguenay (ARSACS) experience their first symptoms before 5 years of age, but in very few patients the first signs may occur after 20 or 30 years. Atypical phenotypes have also been reported regarding peripheral neuropathy which was lacking in 1/11 patient in a recent series of ARSACS [10]. Lack of neuropathy is also a rare feature in ataxia with oculomotor apraxia type 2 (AOA2), representing only 2.5 % of cases in a series of 90 patients [11]. Only one patient with ADCK3 confirmed mutations presented a mild axonal neuropathy (out of 34 patients described in the litterature [1218]). In the same way, few patients with AOA2 or ataxia telangiectasia presented with the very unusual lack of elevated AFP serum level [11, 19]. However, the overall clinical and paraclinical assessment mostly lead to only a few possible diagnoses, confirming that the phenotype in one subtype of ARCA is relatively homogeneous. Few clinical findings (such as oculomotor apraxia, vertical supranuclear gaze palsy, spastic paraplegia, telangiectasia) as well as reliable biomarkers (such as vitamin E, AFP and albumin serum level) may also be suggestive of one or few diseases. The good results of our blinded assessment support this statement.

Herein, the percentage of positive diagnosis (19 %) is similar to previous studies on NGS in cerebellar ataxias [2] but remains low for several reasons including selection of patients with onset before 60 years of age (whereas ARCAs mostly occur before 30), exclusion of patients with Friedreich ataxia, absence of important ataxia genes in the gene panel, such as WFS1 and SPG7, and the fact that many genes have not been identified yet. It is also possible that a few sporadic cases have in fact polyglutamine SCA due to marked anticipation, particularly for SCA2 and SCA7. Polyglutamine expansions should therefore also be tested, along with Friedreich ataxia expansions.

Web resources

UCSC Genome Browser: http://genome.ucsc.edu/index.html

Ensembl Genome Browser: http://www.ensembl.org/index.html

Exome Variant Server (EVS), NHLBI GO Exome Sequencing Project (ESP), Seattle, WA: http://evs.gs.washington.edu/EVS (June, 2013)

http://www.lbgi.fr/VaRank/.