Introduction

Pompe disease (PD), or glycogen storage disease type II (GSDII; MIM #232300), is an autosomal recessive disorder caused by a deficiency of the lysosomal enzyme acid α-glucosidase (GAA; EC 3.2.1.3), which catalyzes the hydrolysis of α-1,4 and α-1,6 links of glycogen. GAA deficiency leads to the accumulation of glycogen in lysosomes especially within skeletal, cardiac, and vascular smooth muscle cells. The clinical spectrum ranges from a rapidly fatal infantile-onset form to a slowly progressive adult form (Hirschhorn and Reuser 2001; van der Ploeg and Reuser 2008). The classic infantile-onset form presents in the first months of life with generalized hypotonia and muscle weakness, severe hypertrophic cardiomyopathy, failure to thrive, and respiratory insufficiency. Untreated patients usually die in the first year of life due to progressive cardiorespiratory failure (Kishnani et al. 2006). The late-onset (also called non-classic) form presents a more variable spectrum of clinical features, and onset of symptoms ranges from the first years of life to late adulthood (Chan et al. 2017). According to Güngör and Reuser (2013), the “childhood” Pompe disease includes patients with onset of symptoms ≤ 16 years without persisting and progressive cardiac hypertrophy, whereas the “adult” Pompe disease corresponds to patients with onset of symptoms > 16 years. The late-onset form is characterized by progressive proximal and axial muscle weakness leading to loss of motor function (Hagemans et al. 2006). Diaphragmatic weakness and respiratory insufficiency are frequent and prominent. The clinical phenotype usually correlates with mutation severity and subsequent residual enzymatic activity (Laforêt et al. 2000; Hirschhorn and Reuser 2001; Herzog et al. 2012).

The incidence of the disease varies from 1/14,000 to 1/600,000 depending on geographic origin, diagnostic criteria, and methods (Martiniuk et al. 1998; Leslie and Bailey 2007). The diagnosis of PD is established by evidencing a marked decrease of acid alpha-glucosidase activity either in lymphocytes/leucocytes, cultured fibroblasts, muscle biopsy, or dried blood spots, and it is confirmed by the determination of the causative mutations (Pompe Disease Diagnostic Working et al. 2008; Reuser et al. 2011). The GAA gene (HGNC: 4065) is located on 17q25. It contains 20 exons, and the start codon is at nucleotide 33 in exon 2 (Hoefsloot et al. 1990). The May 2016 edition of the Pompe Disease Mutation Database (http://www.pompecenter.nl) provides a list of 558 sequence variants spread along the entire gene, among which 374 are pathogenic. Many mutations are private, and only few mutations are common in specific populations, such as the c.-32-13T>G mutation in late-onset forms of PD in Caucasians.

In France, a clinical network was set up among neuromuscular referral centers to facilitate diagnosis and follow-up of PD. Patient samples were sent to two main laboratories (Paris and Lyon) performing enzymatic and genetic analyses. Furthermore, a PD national registry has been created in 2004 and qualified in 2008, with the aim of prospectively gathering clinical, functional, and biological data of all French PD patients (Laforêt et al. 2013). Therefore, the aims of this study were as follows: (1) to characterize the GAA mutations present in late-onset French PD patients, (2) to establish the epidemiology of this clinical form in France, (3) to identify potential genotype-phenotype correlations.

Material and methods

Patients and databases

The study complies with national data protection requirements (Commission Nationale Informatique et Libertés, CNIL) and with the Code of Ethics of the World Medical Association (Declaration of Helsinki). The French Pompe registry has been qualified since 2008 by Comité National des Registres-Maladies Rares (CNR-MR) supported by Institut National de la Santé et de la Recherche Médicale (INSERM) and Institut de Veille Sanitaire (InVS).

Inclusion criteria were as follows: (1) confirmed diagnosis of PD, established with at least two different methods (either enzymatic test on lympho/leukocytes and confirmation on fibroblasts, or enzymatic test on lympho/leukocytes and confirmation by GAA gene sequencing, or enzymatic test on dried-blood spots and confirmation by GAA gene sequencing); (2) disease onset ≥ 2 years of age. Some patients had incomplete genetic (i.e., molecular analysis not performed, one unidentified allele, absence of informed consent) or clinical (age at onset, first symptoms) data. For the epidemiologic study, only patients diagnosed after January 1, 2001, were included (Fig. S1).

Genetic analyses

DNA amplification and sequencing

Genomic DNA was extracted from patient peripheral blood samples or cultured fibroblasts according to well-established procedures. PCR reactions were carried out to amplify the coding exons of the GAA gene and their corresponding intron-exon boundaries, using specific pairs of primers located in the flanking intronic regions. Each amplicon was purified and sequenced in both directions using the same primers. Sequencing was performed using the ABI PRISM Big Dye Terminator Kit (Applied Biosystems®) following standard recommendations. A specific PCR protocol was performed to detect the common deletion c.2481+102_2646+31del (delta exon 18) using primers located on both sides of the deletion.

Mutation analysis and in silico prediction of the identified variants

Sequence variations were compared with the reference sequence NM_000152.4. In silico prediction of the newly identified variants was performed using the Alamut® software (http://www.interactivebiosoftware.com). The severity of each mutation was classified as very severe, potentially less severe, less severe, and potentially mild according to the Pompe Disease Mutation Database ranking system (Kroos et al. 2008).

Epidemiology of Pompe disease

The French PD patients included in this study were collected using information from the following: (1) the neuromuscular disease referral centers and their large clinical network, (2) the two main diagnosis laboratories (Paris and Lyon), (3) the French Pompe registry. Incidence rate and proportion of newborns that will be affected by late-onset Pompe disease were estimated. The incidence rate describes how quickly a disease occurs in a population and is calculated by the ratio between the number of new late-onset PD diagnoses during specified time intervals (2001–2005, 2006–2010, 2011–2015, and for the whole period 2001–2015) and the average population during these time intervals. In order to estimate the proportion of newborns that will be affected by late-onset Pompe disease, we also assessed the number of diagnosed patients divided by the average number of newborns in their year of birth. The population data were extrapolated from Institut National de la Statistique et des Etudes Economiques (INSEE).

Clinical data

The clinical data regarding onset of the disease were obtained from the French Pompe registry or from clinical records. The onset of the disease was defined as the occurrence of muscular weakness (difficulty in running, climbing stairs, rising from floor, etc.) or signs of respiratory muscle weakness (orthopnea, respiratory insufficiency). HyperCKemia (2.5-fold the upper normal limit), muscle fatigue, and exercise intolerance were not considered as first symptoms.

Statistical analysis

Demographic descriptions are given in number and/or percentage (%) for qualitative variables, in mean and standard deviation (± SD) for quantitative variables (or in median in case of non-Gaussian distribution). Quantitative variables were compared using t test (two groups) or ANOVA with Bonferroni’s multiple comparison test (more than two groups) in the GraphPad Prism® software. The statistical significance was set at p < 0.05.

Results

Patient population

PD diagnosis was established in 246 patients including 209 index cases and 37 relatives during the period 1972–2015. Sex ratio was 1:1 (130 females, 116 males). Consanguinity was reported in two families. Mean age at diagnosis was 43.1 ± 17.5 years. Sixteen patients were diagnosed at a presymptomatic stage (5 relatives and 11 index cases, the latest usually referred for hyperCKemia or aspecific symptoms such as fatigue or myalgia). Mean age at disease onset was 34.6 ± 20.1 years, ranging from 2 to 70 years. Of patients, 87.4% had an age at onset > 16 years (adult Pompe disease) and 12.6% an age at onset ≤ 16 years (childhood Pompe disease including 11% for the 4–16 years and 1.6% for the 2–3 years). The age at onset was not significantly different between males and females (mean age at disease onset 35.6 ± 20.4 versus 33.7 ± 19.9 years). First symptoms were usually muscular weakness and rarely respiratory problems. The mean diagnostic delay (years from disease onset to diagnosis) was 8.7 ± 9.3 years ranging from 0 to 46 years (Table S1).

GAA gene analysis

GAA gene sequencing was performed in 204 patients, and causative mutations were found in 185. In the 19 remaining patients, only one mutation was identified, usually due to the incomplete DNA analysis performed in the first years of the study. We identified 83 different pathogenic or probably pathogenic variants, spread all over the sequence except exons 16 and 20 (Fig. S2). They included 42 missense mutations (one affecting the start codon), 15 frameshift mutations leading to a premature stop codon, 14 splice mutations, 8 nonsense mutations, 3 small in-frame deletions and one large deletion. The distribution of mutations among patient alleles is shown in Table S2. Splice mutations (including the.c-32-13T>G) represented 53% of alleles, missense mutations 23%, deletions 9%, frameshift mutations 8%, nonsense mutations 5%, and other mutations or complex alleles 2%. Five patients including two siblings had complex alleles with two alterations in cis (Table S3).

The common c.-32-13T>G mutation was detected in 89% of index cases on at least one allele, and it was present on both alleles in 3 patients (allele frequency = 48.1%). Other recurrent mutations included the large deletion c.2481+102_2646+31del (7.5%), the missense mutations c.1927G>A (3.7%) and c.655G>A (2.7%), and the small deletion c.525del (2.4%). Other mutations had an allele frequency ≤ 2% (Table 1).

Table 1 Recurrent mutations in late-onset PD patients in France (left column) and age at onset of patients carrying these mutations in association with the common c.-32-13T>G mutation (right column)

Among the 83 different variants identified, 28 (34%) were novel (Table 2). These included 13 mutations leading to a truncated mRNA and/or protein (9 frameshift mutations and 4 nonsense mutations), 3 splice mutations, 10 novel missense mutations, and 2 deletions of one amino acid. The novel variants were defined as probably disease-causing mutations using the Alamut® software.

Table 2 Novel mutations identified in the French late-onset PD patients

Epidemiology of Pompe disease

The distribution of newly diagnosed PD patients per year is presented in Fig. 1. The number of diagnoses increased from 2.6/year (period before 2001) to 10.6/year (2001–2010) and 12.8/year (2011–2015). Considering the 2001–2015 period, we estimated the incidence of late-onset PD in France to 17.76 patients per 100,000,000 inhabitants per year. We could also estimate the proportion of newborns that will be affected by late-onset PD to 1.43/100,000 newborns or 1/69,927 newborns (Table S1).

Fig. 1
figure 1

Number of late-onset PD patients diagnosed per year in France

Genotype-phenotype correlations

Clinical data and complete genotype identification were available for 149 patients. Patients with the c.-32-13T>G mutation (n = 131) presented a later onset compared to the 18 patients non-carrying the c.-32-13T>G mutation (mean age at onset 36.2 ± 13.9 years (ranging from 2 to 64) versus 24.7 ± 14.8 years (ranging from 3 to 51); p < 0.01) (Fig. 2a). Patients with the c.-32-13T>G mutation on one allele were grouped according to the severity of the second mutation. No correlation could be found with the age at onset, but all the patients presenting an early-onset had a very severe mutation on the second allele (Fig. 2b). Patients carrying identical GAA genotype (c.-32-13T>G on one allele and another recurrent mutation on the second allele) had a highly variable age at onset (Table 1, right column). In childhood PD patients (onset ≤ 16 years), the c.-32-13T>G mutation was less frequent (65%; 15/23 patients) than in adult PD patients (92%; 166/180; Chi-squared test, p < 0.001). Moreover, very severe mutations were detected in 14/23 childhood PD patients (Table 3). Concerning the five patients with three pathogenic or probably pathogenic variations in the GAA gene, their age at onset was heterogeneous, between 17 and 50 years (Table S3).

Fig. 2
figure 2

a Age at disease onset in patients with the c.-32-13T>G mutation versus patients non-carrying the c.-32-13T>G mutation. Horizontal lines represent mean values (unpaired t test p = 0.0014). b Age at disease onset in patients carrying the c.-32-13T>G mutation on one allele grouped according to the second mutation severity. Horizontal lines represent mean values. No significant differences were observed between groups even if a trend toward earlier onset is observed in patients with severe mutations. The severity is given according to the ranking system used in the Pompe Disease Mutation Database (Kroos et al. 2008). VS very severe, PLS potentially less severe, LS less severe, PM potentially mild

Table 3 Patients with childhood Pompe disease

Discussion

This study described molecular features and epidemiology of late-onset PD and investigated potential correlations between genotype and age at disease onset in the largest population ever reported. Homogeneity of the data and accuracy of the results were guaranteed by the French Pompe registry, which collected data on natural history and follow-up of all patients (whether treated with enzyme replacement therapy or not) in France. In this cohort, the common c.-32-13T>G mutation was present in 89% of index cases as described in the Caucasian population (Huie et al. 1994; Kroos et al. 1995; Montalvo et al. 2006; Herzog et al. 2012). This mutation leads to impaired splicing of exon 2. Since a low amount of normally spliced mRNA is produced, the GAA activity is not completely deficient, and patients usually develop a late-onset form of PD (Raben et al. 1996; Dardis et al. 2014). The carrier frequency of the c.-32-13T>G mutation was estimated at 1/154 in the general unaffected Dutch population (Ausems et al. 1999) and at 1/184 in the Genome Aggregation Database (gnomAD, Broad Institute; Lek et al. 2016). According to these estimates, Rairikar et al. (2017) calculated that 16 to 40% of all adult PD patients should theoretically have this mutation in homozygosity. However, few homozygous patients have been reported in the literature and they were 3 in our series, suggesting that homozygotes for the c.-32-13T>G mutation are usually asymptomatic or have a milder phenotype that could be underdiagnosed.

Two other common mutations were identified in our population: c.2481+102_2646+31del (delta exon 18) and c.525del. Their frequency was lower than in the Netherlands (Ausems et al. 2001) but comparable to non-Dutch populations (Hirschhorn and Huie 1999; Herzog et al. 2012). Two missense mutations, c.1927G>A (p.Gly643Arg) and c.655G>A (p.Gly219Arg), were more frequent in France than in other countries. Numerous other GAA mutations were found, confirming the high heterogeneity of mutations in PD. Among the 83 mutations identified, 55 (66%) were private. To note, France is an ethnically complex country as it has been settled by several European peoples and has several overseas territories spread all over the world. This genetic background could contribute to the high number of GAA variations. Interestingly, five patients had three mutations including at least one missense mutation (two of them were novel). It was not possible to clearly determine if the presence of two alterations in cis was necessary for the disease onset.

In our study, 28 mutations were novel. The nonsense and frameshift mutations (n = 13) created a premature termination codon (all located upstream of exon 20) leading to a truncated GAA mRNA with or without mRNA decay (absent or truncated protein). The splice mutations (n = 3) directly affected a donor or acceptor site, leading to aberrant splicing. The pathogenicity of the novel missense mutations and single amino-acid deletions (n = 12) was assessed using different criteria. In silico prediction studies argued in favor of their pathogenicity. They affected highly conserved residues among species and/or were in the nearby of the active site. Moreover, these gene alterations were absent from large population datasets of genetic variation (gnomAD), or at very low allele frequency (< 1/10.000).

The number of PD diagnoses increased in France over the last 40 years, reflecting the improvement of diagnostic techniques and greater awareness on PD clinical symptoms. As our data came from the two main French laboratories involved in PD diagnosis and the French Pompe registry, we could assume that this cohort of 246 patients represented almost the entire population of late-onset PD in France. Therefore, the frequency of late-onset PD was estimated around 1/69,927 newborns. This result was extrapolated from clinical data of adult patients, thus presenting signs or symptoms of the disease, and not from population-based studies. We think that this estimate is realistic, because it encompasses all the diagnosed patients in France during a long observational period (15 years). However, it has some obvious limitations: (1) it does not include potential asymptomatic or presymptomatic patients; (2) it does not consider immigration, old diagnoses, and deaths; (3) we used the average number of newborns as denominator to calculate the incidence of a late-onset disease. By comparison, other studies performed either with enzymatic newborn screening or genetic screening reported a frequency of 1/26,466 in Taiwan (Chien et al. 2011) and 1/57,000 in The Netherlands (Ausems et al. 1999), respectively. These epidemiologic studies also have limitations. For example, the genetic screening may overestimate the number of patients that will become symptomatic (as discussed above, the most frequent mutation in Caucasian patients, c.-32-13 T>G, is rarely seen in homozygous state in symptomatic patients, and it may be associated with a normal phenotype). Moreover, these studies were performed in more restricted populations and not in a large nationwide study covering a long observational period.

The mean age at diagnosis in our cohort (43 years) was higher compared to other published data (Hagemans et al. 2005; Winkel et al. 2005; Palmer et al. 2007; Remiche et al. 2014). The mean diagnostic delay in the last 20 years in France remained stable, around 9 years and was quite similar to other registries: 6 to 11 years (Winkel et al. 2005; Palmer et al. 2007; Kishnani et al. 2013; Remiche et al. 2014). This delay confirms that PD is a challenging diagnosis due to the high variability of symptoms and age at onset in late-onset PD, to the presence of atypical phenotypes and possible atypical histopathological features. Furthermore, late-onset PD is a very slow-evolving disease that can manifest with minimal signs in early adolescence before disease onset. Moreover, biases in the identification and annotation in medical records of the very first symptoms can be partially responsible for this long diagnostic delay. Clearly, this delay remains a big challenge in PD, since enzyme replacement therapy (ERT) is available and probably offers a better outcome when started earlier (Strothotte et al. 2010; van der Ploeg et al. 2010; Chien et al. 2011; Angelini et al. 2012). The detection of presymptomatic patients (7.8% of total diagnosis in the last 5 years in our study) could be facilitated in the future due to the increased knowledge on PD and development of miniaturized tests (dried blood spots) for the measurement of GAA activity. This will certainly improve the diagnostic work-up and clinical follow-up of patients (Echaniz-Laguna et al. 2015). Moreover, newborn screening (NBS) which is already performed in some countries (Taiwan, USA, Italy…) could also facilitate diagnosis and early treatment. However, some drawbacks need to be considered such as difficulties to predict phenotype and age at disease onset and false positives due to pseudodeficiency variants (Bodamer et al. 2017).

In our cohort, there was no straightforward relationship between genotype and age at disease onset. The c.-32-13T>G mutation was generally associated with onset in adulthood, but it could also be observed in late-onset PD patients presenting clinical onset in childhood (15 patients in our series with age at disease onset ranging from 2 to 16 years). Individuals with the same genotype presented first symptoms either in adulthood or childhood. This correlated with previous reports describing families with late-onset PD siblings and substantially variable phenotypes (Wens et al. 2013). This was probably due either to genetic factors (GAA polymorphisms, ACE, or ACTN3 variants) or environmental disease-modifying factors (De Filippi et al. 2014; Kroos et al. 2007; Kroos et al. 2012).

Conclusion

We reported molecular and epidemiologic data from the largest cohort of late-onset PD patients included in a nationwide study covering more than 40 years of this challenging diagnosis. We demonstrated high clinical and molecular variability in late-onset PD and estimated its frequency around 1/69,927 in France. PD diagnosis still remains difficult, as PD is a rare disease, with a heterogeneous and non-specific phenotype. An increased awareness of clinicians is needed to decrease the delay between onset and diagnosis.