Introduction

Primary osteoarthritis (OA) is a multifactorial and polygenic disorder with a complex etiology that affects the structure and composition of articular tissues. It is characterized by progressive loss of articular cartilage, presence of osteophytes and subchondral bone sclerosis, resulting in pain and disability. Overweight, age, and gender are the main factors involved in OA development [1], but more detailed studies on OA pathogenesis acknowledge genetic factors and inflammatory processes as critical elements for its development [2]. In addition to these factors, an imbalance in cartilage homeostasis is favored by pro-inflammatory molecules, such as cytokines, lipid mediators, reactive oxygen species (ROS) and reactive nitrogen species (RNS), triggering an oxidative stress (OS) state [3]. Recent studies have suggested that OS plays a critical role in the development of many pathological conditions, such as OA, due to oxidation of proteins of the articular cartilage such as type II collagen [4,5,6]. Current studies on proline recycling by prolidase enzyme involved in collagen synthesis, show that in this process significantly favors ROS production, which promote OA development [7,8,9,10,11,12].

On the other hand, production of advanced glycation end products (AGEs) is an unavoidable process in vivo, and their accumulation in different tissues is importantly related to aging and systemic OS [13]. AGEs are the result of the non-enzymatic, post-translational addition of reducing sugars to proteins or apolipoproteins [6]. Accumulation of AGEs in the joints of OA patients leads to cartilage stiffness and fragility. AGEs bind the receptor for advanced glycation end products (RAGE) to trigger the intracellular pro-inflammatory response and promote hypertrophy development in chondrocytes [14].

As well as AGEs, the pro-inflammatory mediator interleukin-6 (IL-6) plays a critical role in very early stages of OA with increasing serum levels as the disease progresses [15]. In addition to IL-6, a direct correlation between serum levels of tumor necrosis factor-α (TNF-α) and IL-1β, and the OA radiographic grade has been observed. These three cytokines are the main mediators in cartilage degradation associated with OA. A recent study showed that IL-1β decreases expression of type II collagen and aggrecan, and increases production of matrix metalloproteinases-1 and -3 (MMPs). Both IL-1β and the cartilage mechanic charge promote nitric oxide (NO) production by positively regulating nitric oxide synthase 2 (NOS2), which induces chondrocyte apoptosis [16].

Overweight is linked to adipose tissue accumulation in different parts of the body. Adipose tissue is a metabolically active endocrine organ that secretes IL-1β, TNFα, and adipokines, such as adiponectin, leptin, and resistin. These inflammatory mediators have been shown to be related to articular cartilage degeneration and OA development [17]. However, the role of adiponectin has not been fully elucidated, as there are inconsistencies regarding its protecting, or rather, damaging role in maintaining articular cartilage homeostasis [18].

On the other hand, the antioxidant system plays a critical role in the homeostasis of several tissues, such as cartilage. Paraoxonase-1 (PON-1) is a Ca2+-dependent esterase with antioxidant properties that is associated with high density lipoproteins (HDL). However, PON-1 enzyme activity is reduced in high OS diseases, such as heart disease, dyslipoproteinemias, inflammatory processes, lung cancer, type 2 diabetes mellitus, and as recently shown, OA [19,20,21].

Another group of enzymes with antioxidant activity are carbonic anhydrases (CAs), which regulate pH inside and outside the cell, catalyzing the reversible conversion of HCO3 and H+ ions into water and CO2. Since CAs are involved in multiple body processes, their inhibition is associated with a series of disorders, such as macular edema, congestive heart failure-induced edema, glaucoma, obesity, cancer, and osteoporosis [22]. Derived from the avascularity in the articular cartilage, oxygen and glucose concentrations are low, so oxygen and nutrients are basically supplied by subchondral bone and synovial fluid diffusion [23, 24]. When chondrocytes suffer from oxygen deficit, glycolysis is the main mechanism for energy sustenance. However, lactic acid is generated as a subproduct of this process, contributing to acidification [25]. In this sense, significant changes in pH are critical for the articular cartilage and subchondral bone calcification processes through mineralization and formation of hydroxyapatite [Ca10(PO4)6(OH)2] and precipitation of calcium carbonate (CaCO3), where CAs play an essential role [26].

Changes in the expression of the aforementioned proteins may be due to the presence of genetic polymorphisms in their respective genes, thus affecting OA development. Several studies suggest a significant association of single-nucleotide polymorphisms (SNPs) in OA development [27, 28]. Nevertheless, the main limitations of association studies on complex diseases are limitation in the control of variables, especially environmental factors with potentially confusing effect; and sample size, which can significantly influence the statistic power to detect small effects. Moreover, SNP distribution is generally affected by the ethnicity of populations. On the other hand, studies related to analyzing gene–gene or SNP–SNP interactions, better known as epistasis, are usually conducted with logistic regression models, linkage disequilibrium (LD), and Hardy–Weinberg equilibrium (HWE), all of which have their own limitations. Logistic regression models require a very large “N” to detect statistical significance; also, polymorphisms that are poorly represented or of low frequency are difficult to detect; and the interactions between the polymorphisms can only be evaluated by pairs. With respect to linkage disequilibrium (LD), also evaluates by pairs and not in group. If two or more polymorphisms are very close to each other, they can generate false positives without an interaction taking place. Finally, with respect to the HWE, it can be affected when a polymorphism is not in equilibrium in complex diseases (such as OA), but this measure of disequilibrium can be given by other causes such as gene drift (when a trait is randomly fixed and inherited more), by positive selection, or selective pressure.

In this sense, the multifactor dimensionality reduction (MDR) approach is a non-parametric method that detects and characterizes non-linear interactions between discrete attributes (for instance, SNP, smoking, gender, etc.), which are predictive of a discrete result (for example, cases and controls). Furthermore, MDR combines attribute selection and construction, and classification with cross-validation to offer a powerful approach for modeling interactions [29, 30].

According to the information described above, the objective of this study was to identify and evaluate epistasis between multiple SNPs related to OS in Mexican knee OA patients.

Materials and methods

Subjects

Ninety-two unrelated patients with primary knee OA from the Outpatient Department of the “Luis Guillermo Ibarra Ibarra” National Rehabilitation Institute (INRLGII) were included in this case-control study. The knee OA was diagnosed under the guidelines of the American College of Rheumatology [31]; the clinical exam and the X-ray evaluation were performed by a rheumatologist and a radiologist, respectively. OA severity was evaluated using the Kellgren–Lawrence radiological scale (K–L) [32]. Patients with K–L ≥ 2 were included due to from this radiological grade the presence of osteophytes is confirmed and there is possible narrowing of the joint space; however, in K–L < 2 grade, the presence of osteophytes is still doubtful or null. Patients with other etiologies causing knee diseases, such as inflammatory arthritis (rheumatoid arthritis or any other autoimmune disease), post-traumatic arthritis, post-septic arthritis, poliomyelitis, or skeletal dysplasia, were excluded. Additionally, 147 healthy employees from the Departments of Human Communications and Human Resources, as well as janitorial staff of the INRLGII, with no signs, symptoms or family history of OA, were selected as the control group.

All the procedures with the participants of this study were conducted under the ethical standards of the Institutional Research and Ethics Committee of the INRLGII (CONBIOETICA-09-CEI-031-20171207) and under the Helsinki Declaration (1964). This study was approved by the ethics committee of the INRLGII and is derived from a main study with registration INR-18/13. All the participants signed an informed consent letter before the study, their minimum age is 40 years old, and they all are from the same geographic region (Mexico City and bordering states).

SNPs selection and genotyping

Nine candidate SNPs of six genes involved in OS development were selected. The search strategy was as follows: (a) information of each SNP was obtained from the public databases Hap Map (http://www.hapmap.ncbi.nlm.nih.gov/) and the National Center for Biotechnology Information dbSNP database (http://www.ncbi.nlm.nih.gov/snp); (b) the previously selected SNPs have been evaluated in OA or other pathologies [33,34,35,36,37,38,39]; and (c) the minor allele frequency (MAF) in the Mexican population must be > 10% (Table 1). Considering that the ancestral structure of the Mexican population is primarily composed of Amerindian, European, and African populations, a panel of eight ancestry-informative markers (AIMs) was included to distinguish each of these three populations with a value of δ > 0.44 to adjust all the analysis [40].

Table 1 SNPs evaluated in this study

Peripheral blood of all participants was obtained through venipuncture in order to isolate genomic DNA using a commercial method (QIAmp 96 DNA Blood Kit, Qiagen, Hilden, Germany). Isolated DNA was adjusted with molecular grade water at 50 ng/µl and maintained at − 80 °C until it was used. Genotyping was conducted with the OpenArray technology on a QuantStudio 12K flex System equipment (Thermo Fisher Scientific). A mixture of 2.5 ng/µl of DNA and 2.5 µl of TaqMan OpenArray Genotyping Master Mix (Thermo Fisher Scientific) was prepared and loaded into 384-well trays. Mixes were loaded onto genotyping OpenArray plates previously loaded with the genotyping primers and probes, using the AccuFill System (Thermo Fisher Scientific). Amplification was carried out following the manufacturer’s protocol. Results were analyzed using the TaqMan Genotyper software v1.2.

Statistical analysis

The data were expressed as mean ± standard deviation (SD). The P-values were calculated with Student’s t test (continuous variables) or Fisher’s exact test (categorical variables), when appropriate. HWE was calculated using the chi-squared test and SNPs not in HWE in the control group were excluded. Gene and allele frequencies among cases and controls were calculated using the chi-squared test or Fisher’s exact test, when appropriate. Associations of each SNP with OA risk were calculated with logistic regression models adjusted by age, gender, body-mass index (BMI), and ancestry. All statistical analyses were performed with the STATA v14.0 statistical package (StataCorp, Texas, USA), with a significance level of α = 0.05. The Bonferroni’s test was used to determine the significance level to correct multiple test errors, in which taking into account the nine selected SNPs, an adjusted P-value < 0.005 (α/number of loci) was considered statistically significant. The STRUCTURE software v2.3.4 (Pritchard Lab, Stanford University, USA) was used to evaluate the effect of population stratification on the associations found for each population k (k = 3) with the genotypes of the eight AIMs selected. The interactions between the nine SNPs selected in OA were evaluated with the MDR software v3.0.2, according to the algorithm proposed by Ritchie et al. [29]. Additionally, the OR-value of the high and low risk genotypes was calculated from the best model obtained, following the method described by Chung et al. [41].

Results

Characteristics of the study population

The characteristics of the study population are shown in Table 2. Cases were older than controls (47.2 ± 12.5 years old vs. 40.9 ± 12.0, respectively, P = 0.0001), and the group of women was more representative than the group of men, both in cases and controls (P = 0.005). BMI shows significant differences between both study groups, being higher in OA patients than in controls (29.0 ± 4.19 vs. 24.8 ± 4.38, P < 0.0001). The place of birth of the participants was similar among both study groups (P = 0.40), with the majority originating from Mexico City.

Table 2 Characteristics of the study population

Gene and allele frequency of oxidative stress SNPs

Table 3 shows the genotype and allele distribution in cases and controls with OR-values adjusted by age, gender, BMI, and ancestry; there were no significant differences in their distribution in both study groups. The genotype distributions were all in HWE (P > 0.05), indicating that the study subjects were representative of the study field.

Table 3 Genetic and allelic frequencies of SNPs studied in OA patients and controls

Interquartile range of the three main components of the study groups

There was no significant difference in the genetic structure between the study groups (P > 0.05). The interquartile range of the three main ancestral components of the Mexican population was: Amerindian component, k1 = 0.34 ± 0.08, P = 0.116; European component, k2 = 0.3 3 ± 0.16, P = 0.128; African component, k3 = 0.34 ± 0.08, P = 0.141.

Evaluation of gene–gene interactions: MDR

Table 4 summarizes the results of the exhaustive MDR analysis, which analyzed all possible combinations of the studied polymorphisms. According to the MDR analysis, the best model includes the ADIPOQ (rs1501299) and PON1 (rs662) polymorphisms. This model had a balanced accuracy test of 0.6171, a cross-validation consistency of 10/10 and an interaction P-value = 0.0107. Figure 1 shows a dendogram and interaction map of the studied polymorphisms, based on entropy measures among individual variables. (A) Dendrogram shows strong or weak interactions between polymorphisms of OS in knee OA. (B) A strong interaction effect was observed between ADIPOQ (rs1501299) and PON1 (rs662) polymorphisms, and between ADIPOQ (rs1501299) and AGER (rs1800624) polymorphisms, with information gain values of 9.84% and 7.08%, respectively.

Table 4 Results of MDR analysis
Fig. 1
figure 1

a Dendrogram of interactions between oxidative stress polymorphisms in knee osteoarthritis patients. The colors used depict the degree of synergy, ranging from red (highest information gain) to blue (highest information redundancy). b Interaction map for knee osteoarthritis risk. The interaction model describes the percentage of the entropy (information gain) that is explained by each factor or 2-way interaction. Values inside nodes indicate information gain of individual attributes or main effects, whereas values between nodes show information gain of pairwise combinations of attributes or interaction effects. Positive entropy (plotted in red or orange) indicates interaction, which can be interpreted as a synergistic or nonadditive relationship; while negative entropy (plotted in yellow-green or green) indicates independence or additivity (redundancy). (Color figure online)

Moreover, our model allowed us to identify interactions between two loci in high risk genotypes of the ADIPOQ (rs1501299) and PON1 (rs662) polymorphisms, and the most representative were [GG+CC], [GT+CC] and [TT+CT], respectively; and low risk genotypes [GT+CT] and [TT+TT], respectively (Fig. 2, left). Interestingly, when these two SNPs were merged as a single variable and the data reanalyzed it clearly came out as the best model for OA risk prediction (Fig. 2, right). According to this model, Table 5 shows the OR-values generated between ADIPOQ (rs1501299) and PON1 (rs662) polymorphisms.

Fig. 2
figure 2

Distribution of high-risk and low-risk genotypes in the best two-locus model. The distribution shows high-risk (dark shading) and low-risk (light shading) genotypes associated with knee osteoarthritis (KOA) in the two-locus interaction detected by MDR analysis. The percentage of KOA subjects (left black bar in boxes) and control subjects (right hatched bar in boxes) is shown for each two-locus genotype combination. Boxes were labeled as high-risk if the ratio of the percentage of cases to controls met or exceeded the threshold of 1.0. Boxes were labeled as low-risk if the threshold was not exceeded. Based on the pattern of high-risk and low-risk genotypes, this two-locus model is evidence of gene–gene interaction. Summary for ADIPOQ (rs1501299) and PON1 (rs662) SNPs combinations associated with risk to OA as (left) independent variables and as (right) a single merged variable

Table 5 OR-values according to the best model between ADIPOQ_rs1501299 and PON1_rs662 polymorphisms

Discussion

In this study, we evaluated the epistasis by MDR of genetic polymorphisms related to OS in knee OA patients. By definition, OS is a perturbation in the balance of production of ROS and antioxidant defenses resulting in macromolecular damage and interruption of signaling and redox control [3]. At the articular tissue level, ROS play a major role in the inflammatory intracellular signaling mechanisms of the synovial membrane and adjacent tissues [4]. Even though the environment and lifestyle seem to be key factors for the onset and progression of metabolic diseases, some of them, such as obesity and type 2 diabetes are subject to genetic susceptibility, with the subsequent risk of developing OA [6]. Despite the fact that we could not establish a significant association of the genetic polymorphisms of OS in OA development, our findings show significant evidence of interaction between variants rs1501299 (ADIPOQ) and rs662 (PON1).

Adiponectin is a 30-kDa hormone that is synthesized in the adipose tissue and carries out several biological functions in the organism; its receptors (AdipoR1 and AdipoR2) are expressed on the surface of human chondrocytes, suggesting an important role in OA etiology [17]. Even though its dual role (protector or risk factor) in articular cartilage maintenance is speculated upon, it was recently shown to be related to adiponectin activity and expression of articular cartilage damage markers, with production of pro-inflammatory and catabolic factors in OA [42]. Recently, Zhan et al. [36] evaluated polymorphisms rs2241766 and rs1501299 of the ADIPOPQ gene in knee OA patients with no significant association found. Nevertheless, when they stratified patients by radiological grades 2, 3, and 4, they identified significant risk association with rs1501299 variant only. In our study, we analyzed the same variants and our results were similar. Jiang et al. [43] evaluated variants rs182052, rs2082940, and rs6773957 of ADIPOQ, and only variant rs182052 was associated with a potential risk of developing knee OA in a Chinese population. The association results might be consistent, but to our knowledge, there is not enough evidence of the contribution of the ADIPOQ gene variants in the serum levels of adiponectin that could directly influence OA development, as is the case with heart disease. A correlation between the presence of certain genotype or allele of variant rs1501299 and the serum levels of adiponectin has been observed in this pathology, with the subsequent risk of suffering a heart attack. In other words, higher serum levels of ADIPOQ turn out to be protective [44].

PON-1 is a 45-kDa glycoprotein primarily synthesized in the liver, with an antioxidant activity modulated by age and OS. With aging, OS increases and the enzymatic activity of PON-1 decreases, which correlates with an increase in HDL oxidation susceptibility [21]. Soran et al. [20] and Ertürk et al. [45] observed that OS favors lipid peroxidation while decreasing the enzymatic activity of PON-1 in OA patients. Moreover, several polymorphisms that might affect the risk of development of disease have been identified in the PON1 gene; the most widely studied among them, rs854560 (L55M) and rs662 (R192Q), are located in the coding region. One study revealed that these two polymorphisms dramatically affect PON-1 activity and concentration in the serum of patients with a cardiovascular disease [46]. As far as we know, there is no scientific evidence evaluating the influence of polymorphisms rs854560 and rs662 on PON-1 activity in OA patients, but there is for other rheumatic pathologies. Tanimoto et al. [47] evaluated PON-1 serum activity subject to variant rs662 in rheumatoid arthritis patients, and they found it to be reduced in the group of patients. On the other hand, Xu et al. [48] evaluated variants rs662 and rs854560 in patients with ankylosing spondylitis, and like the previous report, they found a lower PON-1 activity in patients compared to the control group. The data presented above suggest that serum levels of adiponectin and paraoxonase-1 in OA patients are regulated by the presence of polymorphisms in their respective genes.

Regarding the MDR analysis, we found significant evidence of interaction between variants rs1501299 of ADIPOQ and rs662 of PON1 in OA patients. This interaction is interesting, as both adiponectin and paraoxonase-1 are actively involved in OA development, suggesting a multi-loci effect of genes ADIPOQ and PON1, i.e., these two genes may need each other in order to have an effect on OA development. This approach can be supported by the fact that the genotype analysis allowed us to identify low- and high-risk genotypes only for these two polymorphisms. Nevertheless, the calculated OR-values clearly show a risk interaction. However, we must highlight that our study was exclusively focused on one population, and the number of polymorphisms that were evaluated did not encompass the majority of those related to OS. Likewise, the design of our study does not allow us to know if there is a significant difference between the interaction of both polymorphisms and the different degrees of OA; so, the sample size should be increased to have representativeness in each of the radiological degrees. The interaction strength or intensity of any polymorphism can be lost if the genes are examined individually without considering the potential interactions with other genes, particularly those in related pathways. Therefore, further studies analyzing other pathways involved in OA development, as well as other populations, are needed.

In conclusion, we analyze polymorphisms related to OS in Mexican knee OA patients and found no significant associations, but the effect of the interaction between polymorphisms ADIPOQ rs1501299 and PON1 rs662 seems to play an important role in the OA pathogenesis. When we evaluate the polymorphisms individually, we observe that the effect is relative small for it to be detected as statistically significant with our sample size; however, when the polymorphisms are analyzed in a combined manner by the MDR method, a significant interaction can be identified since the combined effect of both polymorphisms is stronger; one of the virtues of the MDR method is that it does not require large sample sizes to detect significant interactions. Finally, the epistasis analysis may provide an excellent tool for identifying individuals at high risk for developing OA (and others complex diseases, such as diabetes mellitus, cancer, and rheumatoid arthritis, among others.) which can serve as a therapeutic target.