Introduction

Amyotrophic lateral sclerosis (ALS) is a progressive adult-onset disease that primarily affects the upper and lower motor neurons, as well as the frontotemporal and other regions of the brain [1, 2]. It is characterized by increasing muscular atrophy, loss of strength, paralysis, and death, particularly due to respiratory failure [3, 4]. The etiology of ALS still largely remains unknown; however, a few genetic and environmental factors seem to be implicated in its development [5,6,7,8,9].

ALS has two forms; a familial and a sporadic one, both of which are caused by the accumulation of abnormal proteinaceous aggregations in motor neurons and glial cells [10]. Excitotoxicity, oxidative stress, mitochondrial dysfunction, neuroinflammation, altered energy metabolism, and RNA misprocessing are the main pathological factors responsible for the impaired homeostasis which creates these inclusions of the abnormal proteins [11]. CNS is susceptible to oxidative stress [12]. This is the consequent of the fact that the neuronal membrane presents a high abundance of polyunsaturated fatty acids as well as oxygen, and a relatively low component of antioxidants [13]. Therefore, oxidative stress is considered to be the most commonly identified contributor to ALS-associated degeneration [14].

The familial form of ALS accounts for 10% of the ALS cases [15] and is inherited following dominant traits, frequently with high penetration [16]. Advances In technology have permitted broad DNA sequencing in sporadic ALS and revealed that genetic polymorphisms in establishing genes are not rare; for example, SOD1 mutations occur in 2% of sporadic ALS [16].

Apart from genetics, exogenous and environmental factors, such as environmental toxicants, smoking, infections, and traumatic brain injury among others, have been incriminated for possible contribution to ALS development as well [17]. However, it should be mentioned that neither genetic architecture nor environmental factors seem to be sufficient for the development of ALS on their own [5, 18].

ERCC6L2 (excision repair cross-complementing rodent repair deficiency complementation group 6-like 2) is a gene located in chromosome 9, implicated in DNA repair and mitochondrial function [19]. The ERCC6L2 gene product belongs to the snf2 family of helicase-related proteins [19]. Snf2 is the catalytic subunit of the chromatin remodeling complex [20]. The ERCC6L2 protein is involved in transcription regulation, DNA repair, DNA translation, and chromatin unwinding [21]. It consists of 712 amino acids [19] and has the characteristics of an early DNA damage response protein that traffics to mitochondria and the nucleus after genotoxic stress in a reactive oxygen species (ROS)-dependent fashion [19]. Its parallel increase with ROS indicates that this protein can be associated with alterations in cellular homeostasis [19], as well as oxidative stress, which holds crucial pathological significance in neurodegenerative diseases.

Recent results from genome-wide gene-environment interaction analysis of pesticide exposure and risk of PD support that SNPs across the ERCC6L2 gene (namely rs67383717 and rs591486) and exposure to pesticides seem to have a joint effect on the development of PD [22]. Considering the similarities of the two diseases (ALS and PD) in terms of causative risk factors, as discussed above [23], as well as the impact of ERCC6L2 and especially of the rs591486 SNP on PD (due to the strong association that was found), we deemed that it would be useful to study whether there is an association between ALS susceptibility and the rs591486 ERCC6L2 gene variant. Therefore, the aim of this study is to examine the effect of the ERCC6L2 rs591486 to ALS development.

Methods

Participants

ALS was diagnosed by a specialist neurologist following the El Escorial criteria [24]. Only patients with definite ALS were included. The study protocol was approved by the local ethics committees and written informed consent was obtained from all participants included in the study.

DNA isolation and genotyping procedure

Peripheral blood leucocytes were used for DNA extraction with the method of salting out [25, 26]. Using TaqMan allele specific discrimination assays on an ABI PRISM 7900 Sequence Detection System, tag SNPs were genotyped, and with the SDS software (Applied Biosystems, Foster City, CA, USA), they were analyzed. The personnel that performed the experimental work, was unaware of information regarding the participants [25]. Ten percent of the samples (randomly selected) was regenotyped, without revealing any conflict. The genotype call rate was 99.03%.

Statistical analysis

The exact test, with a p value of ≤ 0.05, was indicative for deviation from the Hardy-Weinberg equilibrium (HWE) [27]. The study’s statistical power [28] was calculated using the CaTS (http://www.sph.umich.edu/csg/abecasis/cats) Power Calculator for Genetic Studies (Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA) [29].

Using binary logistic regression models (univariate and after adjustment for age and sex), odds ratios (ORs) and 95% confidence intervals (CIs) were calculated in order for possible associations between rs591486 and ALS risk to be estimated. Analysis was performed with the SNPStats software (http://bioinfo.iconcologia.net/SNPstats/) [30], assuming the co-dominant (genotypic), dominant, recessive, over-dominant, and additive models of inheritance. We further performed subgroups analyses based on ALS site of onset. More precisely, we evaluated the effect of the rs591486 on patients with ALS with bulbar onset and on patients with limb onset by comparing them with controls.

The effect of the SNP genotype on the age of ALS onset, compared to the homozygosity for the wild allele, was evaluated using Cox proportional hazard regression models [31]. Subgroup cox analyses based on site of onset was also performed, as previously described.

A value smaller than 0.05 was set as the statistically significant threshold. With SPSS version 17.0 for Windows (SPSS Inc., Chicago, IL, USA) statistical analysis was carried out.

Results

In total, 310 individuals were recruited for this study; 155 patients with definite ALS [77 (49.7%) female, mean age ± standard deviation (SD) = 63.74 ± 11.30 years], and 155 matched for age and sex healthy controls. The majority of the ALS patients mentioned alcohol consumption (67.1%) or smoking (68.4%). The most common site of onset was the lower limbs (34.8%), followed by the bulbar type (32.3%).The main demographic characteristics of the ALS cohort participants are displayed in Table 1.

Table 1 Demographic characteristics of the ALS cohort

Fischer’s exact test revealed that rs591486 was in HWE in both ALS cases and healthy controls group, with p values equal to 0.26 and 0.078 respectively. Based on the power analysis, our study had a power of 80.0% to detect an association of a variant with a genetic relative risk of 1.51, assuming the multiplicative model, MAF equal to 48%, and type I error level of 0.05.

The present study included data from 310 individuals; 155 healthy controls and 155 individuals with ALS. Analysis performed to assess the genotypic frequencies for ERCC6L2 rs591486 showed that genotypes G/G, G/A, and A/A were found in 48 (31%), 66 (43%), and 40 (26%) of the healthy controls. Concerning the individuals with ALS, the results for G/G, G/A, and A/A were 31 (20%), 84 (55%), and 38 (25%), as well. The allelic frequencies in healthy individuals were 53% (162 individuals) for the G allele and 47% (146 individuals) for the A allele, whereas the results for ALS patients were 48% (146 individuals) for the G allele, and 52% (160 individuals) for the A allele. Allele and genotype frequencies in ALS cases and in healthy controls are shown in Supplementary Table 1.

According to univariate single locus logistic regression analysis, ERCC6L2 rs591486 was significantly associated with ALS, particularly in dominant and over-dominant mode. For the dominant mode specifically, ERCC6L2 rs591486 was associated with an increased risk of ALS development [odds ratio, OR (95% confidence interval, CI) 1.78 (1.06–3.00), p = 0.028]. Additionally, heterozygosity (G/A) of the SNP studied in over-dominant mode was found to significantly contribute to ALS [OR (95%CI) = 1.62 (1.03–2.55), p = 0.035]. A non-statistically significant trend for association was also found in the co-dominant mode for the G/A genotype [OR (95%CI) = 1.97 (1.13–3.43), p = 0.052]. The statistically significant results were maintained after adjustment for age and sex in both dominant [OR (95%CI) = 2.15 (1.04–4.46), p = 0.037] and over-dominant mode [OR (95%CI) = 1.91 (1.01–3.60), p = 0.043]. ORs, CIs, and p values for all modes for the comparison between the whole ALS sample and the controls are presented in Table 2.

Table 2 Single locus analysis (crude and adjusted for age and sex) for association between ERCC6L2 rs591486 and ALS, in co-dominant, dominant, recessive, over-dominant, and log-additive modes

Subgroup adjusted for age and sex analysis, based on the ALS site of onset, revealed an association between ERCC6L2 rs591486 and ALS with limb onset in dominant [OR (95%CI) = 3.90 (1.46–10.38), p = 0.0038], log-additive [OR (95%CI) = 1.78 (1.04–3.06), p = 0.033], and co-dominant [p = 0.015, with OR (95%CI) = 4.06 (1.46–11.30) for the G/A genotype and OR (95%CI) = 3.57 (1.13–11.27) for the A/A genotype] modes. ORs, CIs, and p values for all modes for the comparison between the ALS subgroups based on the site of onset and the controls are presented in Table 3.

Table 3 Single locus analysis (crude and adjusted for age and sex) for association between ERCC6L2 rs591486 and ALS based on site of disease onset (bulbar or limbs), in co-dominant, dominant, recessive, over-dominant, and log-additive modes

Cox proportional hazard regression analyses (Tables 4 and 5) showed that the rs591486 had a significant effect on the age of onset of ALS with limb onset. In particular, individuals carrying the G/A genotype of the rs591486 had a significantly younger age of onset of ALS with limb onset compared to the wildtype G/G genotype [hazard ratio (95% CI) 2.180 (1.091–4.357), p = 0.027, and mean age of limb onset ALS onset: 60.20 years for G/A and 66.73 years for G/G carriers].

Table 4 Cox regression analysis for single locus association of the ERCC6L2 rs591486 with age at onset of ALS (crude and adjusted for sex)
Table 5 Cox regression analysis for single locus association of the ERCC6L2 rs591486 with age at onset of ALS (crude and adjusted for sex) based on site of onset (bulbar of limb)

Discussion

In the current study, we performed a single locus analysis in order to detect a possible association between rs591486 ERCC6L2 and ALS risk in a Greek cohort. We found that rs591486 was associated with an increased risk of ALS. Moreover, we found that rs591486 ERCC6L2 influences the age of onset of ALS with limb onset. Therefore, our results provide preliminary indication for a potential link between ERCC6L2 genetic variability and the development of ALS.

ERCC6L2 is a helicase implicated in DNA repair and mitochondrial function [19]. It is an early DNA damage response protein that traffics to mitochondria and the nucleus after genotoxic stress, and also its presence is related to an increase of ROS [19]. Cellular augmentation of ROS significantly contributes to processes that damage tissues via oxidation, through mechanisms such as apoptosis and autophagy [32]. Consequently, ERCC6L2 could be implicated in changes in cellular homeostasis causing oxidative stress. Since CNS is rather susceptible to oxidation due to the conformation of its key proteins, the aforementioned change could possibly lead to the accumulation of abnormal proteins and, consequently, to cell death [33, 34].

Rs591486 is present within intron 2 of the ERCC6L2 gene. Therefore, our results should be interpreted with caution as rs591486 may or may not affect the function of the ERCC6L2 gene itself. It could also represent a regulatory element for a different gene, or even be in LD with a distant SNP. Therefore, a tagging SNP selection approach could have possibly revealed the effect of the entire ERCC6L2 gene on ALS and not only of variants in high proximity to rs591486. Moreover, a supportive functional analysis could have let us extract the strongest conclusions regarding the role of the rs591486 on the ERCC6L2 and ALS.

The lowest p value for HWE in the control group compared to the case group might be related to the small sample size of the study [35]. It is possible that this difference in p values between the two groups would have been eliminated, along with an increase in the study’s power, if additional participants had been recruited. [36].

Genetic variability seems to play a major role in neurodegenerative diseases, as the coexistence of polymorphisms with exogenous factors can affect their development, and variants have been shown to modify the risk of ALS through candidate gene association studies (CGASs) and genome-wide association studies (GWASs) [15, 37, 38]. For instance, SNPs of around 30 genes appear to modify the effect of environmental exposures in ALS etiology [5]. Additionally, SNPs of ERCC6L2 such as rs591486 have been found to have a joint effect with exposure to pesticides in the development of PD [22]. Concerning the aforementioned ERCC6L2, through its suggested contribution to oxidative stress, it may also be involved in the pathology of ALS.

Quite a few studies regarding neurodegeneration and ALS genetic architecture published so far have primarily focused on genes and their protein products that are implicated in oxidative stress. The main protein studied in ALS is the SOD1. SOD1 is an antioxidant enzyme that decomposes the harmful superoxide radical into O2 and H2O2, which is less harmful to cells [39]. SOD1 mutations occur in 2% of sporadic ALS [16] and in 20% of the familial cases [40]. Conditions in which neuronal cells are not able to cope with mitochondrial and oxidative stress seem to contribute or lead to neurodegenerative diseases including ALS, at least in cases with the most highly linked to SOD1 genetic variant [41].

Other ALS-linked genes that code proteins which induce oxidative stress or impair protein degradation machinery are RNA-binding proteins such as TDP-43, FUS, and hnRNPA1. They participate in different facets of the oxidation process, such as unfolding, aggregation, and proteolytic cleavage [42]. Regarding ALS genetics, impairment in autophagy adaptors such as optineurin, ubiquilin 2, and p62 results in disturbances in protein quality control, and therefore to oxidative stress [16]. Our study has concluded that ERCC6L2 can be possibly added to the list of genetic risk factors of ALS. However, due to the fact that our findings are preliminary, further research towards its contribution to oxidative stress and its interaction with other oxidative agents is strongly recommended.

Our study has several strong points that must also be noted. It is characterized by the selection of the gene on a biological basis and by the sample homogeneity [43], as all the participants in each cohort were recruited from the same hospital and had a Greek origin. However, there are a few limitations in the current study that must also be recognized. Firstly, as a case-control study, it carries the inherent limitations of its type, and secondly, our results would have presented more robustness, if additional clinical outcome measures [44,45,46] and environmental covariates [5, 47], such as pesticide exposure [48, 49], had been included in regression models. Finally, we included patients without screening for known causative fALS mutations [16], such as mutations of SOD1, TARDBP, and FUS genes and the C9orf72 mutation [50].

In conclusion, despite the fact that the exact mechanism of ALS development is yet to be discovered, the current study provides preliminary indication regarding the potential role of the ERCC6L2 rs591486 as a possible genetic risk factor for the development of ALS. Our findings should be replicated in other ethnicities and experimental models to study the exact role of ERCC6L2 rs591486 in oxidative stress, which appears to be a main contributor in the pathophysiology of ALS.