Introduction

Alzheimer’s disease (AD) stands as a prevalent neurodegenerative condition, particularly affecting the elderly, aged sixty years and above [1]. The intricate mechanisms underlying its pathophysiology and etiology remain incompletely elucidated [2]. In addition to neuronal loss, various contributing factors have been identified in AD onset [3]. Notably, neuroinflammation [3] and dementia [4] emerge consequent to the dysregulation of the nucleotide-binding oligomerization domain (NOD)-Like Receptor Pyrin Domain-Containing 3 (NLRP3) protein. NLRP3, a tripartite protein belonging to the NALP family, is renowned for its involvement in apoptosis and inflammation [5]. Structurally, NLRP3 comprises an amino-terminal pyrin domain, a central nucleotide-binding and oligomerization domain (NOD; also referred to as the NACHT domain), and a C-terminal leucine-rich repeat domain [6]. Research indicates that inflammation induced by viral or bacterial infections plays a significant role in the pro-inflammatory activation of immune cells resident in the central nervous system (CNS), such as astrocytes and microglia, leading to neuronal death [7, 8]. Ever since the discovery of NLRP3 as a prominent inflammasome linked to various age-related inflammatory conditions like gout, multiple sclerosis, diabetes, ulcerative colitis, and AD [9, 10], NLRP3 has garnered significant attention as a potential therapeutic target. This attention stems from its influence on activated astrocytes and microglia, which are known to exhibit enrichment in genes associated with AD [11].

The human NLRP3 gene spans approximately 32.9 kilobases (kb) in length, positioned on chromosome 1q44 and comprising nine exons. Variations in single-nucleotide polymorphisms within the NLRP3 gene can potentially alter its functionality, thereby augmenting the activation of inflammasomes and the production of IL-1β [12, 13]. Notably, next-generation sequencing (NGS) technologies provide extensive insights into sequence variability, facilitating comprehensive analysis of genetic alterations. Exploring the impact of extensive sequence variation data within biological systems can be challenging and resource-intensive [14]. Conversely, in silico analysis methods present an accessible, cost-effective, and efficient approach for evaluating the potential consequences of amino acid alterations. Leveraging protein structure and sequence information, these techniques offer predictive insights into phenotypic changes resulting from genomic variations. Specifically, non-synonymous single-nucleotide polymorphisms (nsSNPs) introduce modifications to protein sequences, potentially influencing their functionality. Therefore, conducting initial in-silico analyses can augment our capacity to identify nsSNPs for experimental investigations. These analyses involve assessing the potential influence of an amino acid variant on protein structure and functionality [15, 16].

Numerous genotyping analyses have been undertaken previously to identify diverse single nucleotide polymorphisms (SNPs) within NLRP3, which have been associated with various inflammatory and autoimmune disorders. These disorders include multiple sclerosis, rheumatoid arthritis, psoriatic arthritis, celiac disease, myasthenia gravis, and ulcerative colitis, among others [17, 18]. Several studies have indicated a correlation between NLRP3 polymorphic sites and the susceptibility to certain autoimmune disorders. However, conflicting evidence exists, with some studies proposing a protective effect of NLRP3 gene polymorphisms at these sites against disease onset [19, 20]. Nevertheless, there is a dearth of research investigating the structural and functional implications of these SNPs. In this study, we utilized in silico methodologies to pinpoint pathogenic SNPs within the NLRP3 gene, aiming to prioritize SNPs for subsequent genetic mapping investigations. Various computational techniques and tools were employed to fulfill the objectives of this study. Leveraging the in-silico approach offers significant advantages in identifying candidate SNPs, as it is both accessible and cost-effective, thereby facilitating future genetic studies.

Materials and Methods

To provide a comprehensive analysis of nsSNPs in the NLRP3 gene and their implications for Alzheimer’s disease, we employed a series of bioinformatics tools and methodologies. The following tools were used in the present study to ensure a thorough evaluation of the genetic variants and their potential impacts on protein function and disease association (Table 1).

Table 1 Summary of Methodological Steps and Tools

Sequence Retrieval and Mining of nsSNPs in the NLRP3 Gene

The UniProt database (UniProt ID: Q96P20) offers essential information regarding the human NLRP3 fasta sequence. Given the pivotal role of NLRP3 inflammasome activation in Alzheimer’s disease (AD) pathogenesis, exploring missense single nucleotide variations (nsSNVs) within the NLRP3 gene is imperative. The NLRP3/caspase-1/IL-1β axis represents a crucial pathological pathway associated with AD, emphasizing the significance of investigating genetic anomalies [21]. These nsSNVs can profoundly influence both the structure and function of the protein, potentially impacting disease progression. To initiate a comprehensive molecular analysis, we retrieved nsSNPs from the SNP database (https://www.ncbi.nlm.nih.gov/snp/), laying the groundwork for further investigation [22].

Identification and Prediction of Effects of SNPs

Several computational tools were sequentially employed to assess the functional and structural impacts of deleterious SNPs within the NLRP3 gene as outlined by Wanarase et al., [22].

SIFT (Sorting Intolerant from Tolerant) tool, available at https://sift.bii.a-star.edu.sg/www/SIFT_dbSNP.html, evaluates the phenotypic effects of amino acid substitutions by considering protein evolution and function. Scores range from 0 to 1, where values below 0.05 indicate damaging substitutions [23]. The Align GVGD web server (http://agvgd.hci.utah.edu/) predicts the deleteriousness of missense substitutions based on protein multiple sequence alignments and biophysical characteristics of amino acids. Variants are classified on a spectrum, ranging from C0 to C65, with C65 being the most likely to impact protein function [22]. PolyPhen-2 (Polymorphism Phenotyping V2), accessible at http://genetics.bwh.harvard.edu/pph2/, predicts the structural and functional consequences of amino acid substitutions using physical and comparative considerations. Variants with a PSIC score above 0.85 are considered probably damaging [22, 24]. The PANTHER cSNP classification system (http://pantherdb.org/tools/csnpScoreForm.jsp) assesses the evolutionary conservation and molecular functions of proteins, providing position-specific evolutionary conservation (PSEC) scores. Input queries include the protein sequence, amino acid variants, and the human organism [22].

Prediction of Solvent Accessibility

Understanding interaction interfaces and active sites within folded proteins is crucial for unraveling molecular behavior. Changes in these regions, such as amino acid substitutions, can disrupt binding affinity, especially in enzymes where catalytic activity is essential. NetSurf-2.0 is a valuable tool for analyzing surface composition, solvent accessibility, structural disorder, and other factors affecting amino acid residues. Utilizing protein sequences in FASTA format, NetSurf-2.0 (http://www.cbs.dtu.dk/services/NetSurfP/) employs deep neural networks trained on solved protein structures for accurate estimations [25, 26].

Homology Modeling and Functional Analysis

For structural assessment, SWISS-MODEL employs a hierarchical approach encompassing protein modeling, structural refinement, and functional annotation. By utilizing appropriate templates sourced from the Protein Data Bank (PDB), the server generates the top five models. The TIMP1 FASTA sequence serves as input for this comprehensive analysis [27].

The SWISS-MODEL web server (https://swissmodel.expasy.org/) leverages the updated UniProtKB proteome to conduct homology modeling of protein structures. Using the FASTA sequence as the input query, SWISS-MODEL provides critical metrics including the Ramachandran plot, Z Score, comparison plots, global and local quality scores, and molprobity score, offering a thorough evaluation of the modeled protein [28].

To assess the geometry of each residue and the overall structure, PROCHECK analysis (https://servicesn.mbi.ucla.edu/PROCHECK/) was employed [29]. Additionally, the ERRAT server (https://servicesn.mbi.ucla.edu/ERRAT/) was utilized to validate the overall model quality by evaluating the statistical relationship of non-bonded interactions between various types of atoms based on typical atomic interactions. Predicted models in PDB format serve as the basis for input queries in both PROCHECK and ERRAT analyses.

Prediction of Disease Association

Performing disease association predictions and structural analyses of nsSNPs is crucial for identifying genetic risk factors, and understanding the molecular mechanisms of diseases. In the present study, we have employed SNP& GO (https://snps.biofold.org/snps-and-go/snps-and-go.html), a SVM-based classifier that predicts the disease-associated variations using the PANTHER, GO and PhD-SNp terms corresponding to the query protein. The output is predicted as neutral variation (Probability <0.5) or as disease associated variation (Probability>0.5) [30].

PhD-SNP (https://snps.biofold.org/phd-snp/phd-snp.html) is a SVM classifier designed to predict if a nonsynonymous single-point mutation is disease-associated or neutral using protein sequence data. This tool employs BLAST software to compare mutations against the UniRef90 database, considering various mutation characteristics. The output is a score between 0 and 1, with scores below 0.5 indicating a likely disease association [31].

Assessing nsSNP Impact on Protein Aggregation

Protein aggregation is a hallmark of Alzheimer’s disease, making it essential to study the impact of nsSNPs on this process. The Solubility based on Disorder and Aggregation (SODA) tool (http://protein.bio.unipd.it/soda/) is designed to evaluate how mutations affect protein aggregation, disorder, and secondary structure. By inputting protein sequences or PDB files, SODA employs methods like PASTA 2.0, ESpritz-NMR, and Fells to predict variations, including insertions, deletions, substitutions, and duplications. It provides a final score indicating the solubility difference between the wild-type and mutant proteins, offering insights critical for Alzheimer’s research [32].

Analyzing Non-Covalent Interactions

Mutations can alter the non-covalent interactions within a protein, impacting its stability, function, and overall structure. Understanding these changes is critical for comprehending the molecular mechanisms underlying diseases like Alzheimer’s. The Arpeggio server is a valuable tool for analyzing these interactions by calculating the number of interatomic contacts in a protein structure. This server supports PDB format files and identifies about 15 types of interatomic interactions. Using OpenBabel and SMARTS queries, Arpeggio assigns atom types and identifies nearest-neighbor atoms within a 5 Å radius, generating a Structural Interaction Fingerprint (SIFt) for each pairwise contact [33].

Construction of Protein-Protein Interaction Network (PPIN)

The STRING (https://string-db.org/) database was used to predict the functional association of NLRP3 with other proteins. The NLRP3 protein was mapped against Homo sapiens (NCBI taxonomy ID: 9606) and a PPIN was constructed at the highest confidence (confidence score: 0.900) by including known interactions (from curated databases and experimentally determined), predicted interaction (gene neighbourhood, gene fusions and gene co-occurrence), and others (text mining, co-expression, and protein homology) [34]. The predicted PPIN was further subjected to clustering to identify functional proteins majorly interacting with NLRP3.

Protein-Protein Docking

The NEK7 was identified as a major functional protein interacting with NLRP3. In order to perform protein-protein docking the 3D structure of NEK7 (PSB ID: 5DE3) was retrieved from the PDB database. The modelled structure of wild-type NLRP3 and the cleaned and prepared structure of NEK7 were subjected to global protein-protein docking using the HDOCK server [35]. Following docking the docking scores were calculated by a knowledge-based iterative scoring function wherein a more negative docking score indicated a more possible binding model. The tool states that a docking score of −200 or better and a confidence score above 0.7 suggest empirically defined docking between the two molecules. The docked structures of the wild-type and the mutant type were further visualised for the molecular interactions among the proteins.

Molecular Docking of a Memantine with the NLRP3 Protein and Its Mutant Forms

Docking analysis was conducted using the PyRx tool (https://pyrx.sourceforge.io/) to evaluate the binding affinity of the selected ligand with both the NLRP3 wild type and R262L mutated protein structures. The mutated protein structure and the wild type were individually assessed against Memantine (PubChem CID ID: 4054) using AutoDock Vina 2.0 integrated into PyRx.

In this analysis, the modeled proteins served as the macromolecules, while Memantine was designated as the flexible entity for docking. A grid box was generated with the center dimensions of X:184.614, Y:201.012, and Z:134.0352 for all the structures. This investigation aimed to elucidate Memantine’s binding mechanisms and potential interactions with the NLRP3 protein and its mutant variant, offering valuable insights into Memantine’s efficacy. The docking results were quantified based on binding affinity, representing the energy associated with ligand binding [26]. Throughout the docking process, the ligands maintained nine degrees of flexibility, with the binding pose exhibiting zero Root Mean Square Deviation (RMSD), deemed optimal for maximal binding efficacy. Additionally, interactions were visualized using BIOVIA Discovery Studio and the protein structures (wild and mutant) were superimposed to understand the structural alignment (https://discover.3ds.com/discovery-studio-visualizer-download) [36].

Molecular Dynamic Simulation

The R262L mutation stands out as it may significantly impact the NLRP3. The Molecular Dynamics Simulation (MDS) using GROMACS software was employed to evaluate the stability of the R262L and wild-type NLRP3 models. Subsequently, MDS was conducted using the GROMACS package with the GROMOS96 54a7 force field. Solvation was achieved using the SPCE water model, and a salt concentration of 0.15 M with NA+ and Cl- ions was maintained, while the complex’s pH was set at 7. The equilibration process consisted of two phases: first, a 10-ns v-rescale algorithm heating period gradually increased the temperature to approximately 300 K in the NVT ensemble. This was followed by a 5-ns restrained phase to allow solvent settling, succeeded by a 10-ns NPT equilibration phase with gradually removed restraints. Once equilibrated, the systems underwent a 100 ns preparation period for graphing and MMPBSA analysis for the last 50 ns, with pressure maintained at 1 bar and temperature at 300 K [36].

MDS assessed variations in Root Mean Square Deviation (RMSD), Root Mean Square Fluctuation (RMSF), Radius of Gyration (Rg), Solvent Accessible Surface Area (SASA), and hydrogen bond count of the complex, providing a comprehensive evaluation of its stability.

Results

Retrieval of SNPs

The dbSNP database was used to retrieve the SNPs for the human NLRP3 gene. It contained 13825 SNPs in total, out of which 7 inframe deletions, 2 inframe indel, 6 inframe insertions, 2 initiator codon variants, 12178 introns, 893 missense (nsSNPs), 225 non-coding transcript variants and 512 synonymous. For this study, only nsSNPs of NLRP3 were selected, which contributed to only 6.45% of all SNPs known in the human NLRP3 gene (Fig. 1).

Fig. 1
figure 1

Prediction of SNPs associated with the human NLRP3 gene. The figure shows the presence of mutations in the NLRP3 genes retrieved from the dbSNP database. A total of 13825 SNPs were retrieved of which, 0.05% were inframe deletions, 0.014% were inframe indels, 0.04% were inframe insertions, 0.014% were initiator codon variants, 88% were introns, 6.45% were missense, 1.62% were non-coding transcripts and 3.7% were synonymous

Identification and Prediction of Effects of SNPs

In the initial screening of 893 non-synonymous single nucleotide polymorphisms (nsSNPs), SIFT analysis categorized 86 as deleterious and 116 as tolerated, with the remaining nsSNPs remaining unclassified. To refine these results, Align GVGD, PolyPhen-2, and PANTHER methodologies were employed to filter the SIFT predictions and assess the potential consequences of the nsSNPs.

Align GVGD analysis revealed insights into the potential functional impact of the nsSNPs. Among the 86 deleterious nsSNPs identified by SIFT, 42 (48.84%) were classified as having the highest likelihood of functional impact (Class 65), indicating a higher probability of inducing functional changes. Additionally, 40 nsSNPs (46.51%) were classified into other classes (C0, C15, C25, C35, and C55), suggesting a lower likelihood of functional impact. The remaining mutations were excluded from further analysis due to their association with unknown amino acids. Subsequently, only the 42 nsSNPs classified as having the highest likelihood of functional impact (Class C65) were subjected to further scrutiny (Table 2). PolyPhen-2 analysis identified 4 nsSNPs as possibly damaging, 26 as probably damaging, and 12 as benign. PANTHER analysis indicated that 6 nsSNPs were possibly damaging, 20 were probably damaging, and 16 were probably benign (Table 2).

Table 2 Identification of deleterious nsSNPs of NLRP3 through various prediction tools

Through a comprehensive analysis, 18 non-synonymous single nucleotide polymorphisms (nsSNPs) were identified as significant candidates. These nsSNPs were predicted to be deleterious by SIFT, categorized as Class C65 in Align-GVGD analysis, and deemed probably damaging according to predictions from PolyPhen-2 and PANTHER. The selection of these 18 nsSNPs was made meticulously, considering the robustness of the analyses and the consistency of outcomes across various prediction techniques (Table 3).

Table 3 Identification of damaging and deleterious nsSNPs using various prediction tools

Prediction of Changes in Protein Stability

The 18 chosen nsSNPs underwent analysis using the MUpro and I-Mutant web servers to assess the impact of point mutations on protein stability, as indicated by the change in free energy value. For MUpro analysis, the range of DDG values for the nsSNPs varies from −1.92 kcal/mol to −0.1738 kcal/mol. Among these mutations, all are predicted to decrease stability (Table 4). In the case of I-Mutant analysis, some mutations are predicted to increase stability while others are predicted to decrease stability. Specifically, 2 mutations are predicted to increase stability, while 16 mutations are predicted to decrease stability (Table 4). Additionally, there are 16 mutations that are predicted to decrease stability by both MUpro and I-Mutant tools, suggesting a consensus in their assessment of these mutations and were subjected to further screening.

Table 4 Prediction of the effect of the deleterious nsNSPs on the stability of NLRP3

Protein stability is a crucial determinant of protein function. Stable proteins are more likely to maintain their correct folding and functional conformation, which is essential for their normal biological activity. Conversely, decreased stability can lead to misfolding, aggregation, and dysfunctional interactions. These effects are particularly relevant in the study of disease-associated proteins, where even minor perturbations in stability can have significant pathological consequences. In our study, we focused on nsSNPs that decrease the stability of the NLRP3 protein rather than those that increase it. NLRP3 is a central component of the inflammasome complex, which plays a critical role in the inflammatory response. Decreased stability in the context of NLRP3 does not necessarily equate to a loss of function. Instead, it can result in aberrant activation and heightened inflammatory responses as they may form aggregates leading to excessive and chronic inflammation.

Prediction of Solvent Accessibility

The analysis conducted through NetSurfP aimed to predict the solvent accessibility of various mutations within the NLRP3 gene (Fig. 2, Table 5). These predictions provide valuable insights into the potential functional implications of each mutation, shedding light on their respective roles within the protein structure.

Fig. 2
figure 2

Solvent accessibility and stability prediction by NetSurfP

Table 5 Identification of evolutionary conservation in the damaging nsSNPs of NLRP3

In Table 4, the classification of each mutation based on solvent accessibility is outlined, along with corresponding RSA (Relative Surface Accessibility) and ASA (Absolute Surface Accessibility) values. Notably, the mutations exhibit a mixture of buried and exposed classifications, suggesting diverse interactions with the protein environment. Among the mutations classified as “Buried,” including D305H, T954M, T350M, L307P, G303D, D305G, C261W, F525C, Y443H, C998S, G303C, and L362Q, the RSA values range from 0.015 to 0.223. These mutations are likely situated within the core of the protein, with limited exposure to the surrounding solvent environment. The corresponding ASA values further support this inference, ranging from 2.152 to 32.141 (Table 5). Conversely, mutations classified as “Exposed,” such as T897M, R262P, R262L, and T659I, exhibit higher RSA values ranging from 0.28 to 0.339. These mutations are positioned on the protein surface, allowing for greater interaction with solvent molecules. As a result, their ASA values are comparatively higher, ranging from 38.841 to 64.733 (Table 5).

Validation of Protein Models for Structural and Functional Analysis

The three-dimensional structural analysis of the human NLRP3 protein was conducted to investigate the impact of deleterious nsSNPs on both wild-type and mutant-type structures. Notably, mutations such as T897M, R262P, R262L, and T659I were found to induce destabilizing effects in the NLRP3 protein. The SWISS-MODEL platform was utilized to generate three-dimensional models of both the wild-type and mutant-type NLRP3 proteins, employing the 7pzc.1.A template from the UniProtKB proteome as the most suitable alignment. Through this process, detailed structural insights into the conformational changes induced by the mutations were obtained. Analysis of the Ramachandran plot revealed that approximately 86% of the residues in each structure occupied the most favoured region, indicating the overall high quality and reliability of the modelled protein structures. This analysis provides valuable information regarding the structural stability and integrity of the NLRP3 protein under the influence of deleterious nsSNPs (Table 6, Fig. 3).

Table 6 Predicted Model Evaluation Scores
Fig. 3
figure 3

Modeled structures of (a) Wild type (b) R262L (c) R262P (d) T659I (e) T897M mutants of NLRP3

Prediction of Disease Association and Aggregation Propensity

The nsSNPs can alter the amino acid sequence of proteins, potentially affecting their structure and function. Understanding whether these changes are associated with diseases helps in identifying genetic risk factors and mechanisms underlying various conditions. In the present study, the disease association was predicted with SNP&GO and PhD-SNP servers wherein a probability score greater than 0.5 were predicted to be disease-associated variation (Table 7). The rs180177442 (R262P, R262L) was predicted to be disease-associated across both servers.

Table 7 Predicted disease association, and aggregation propensity, for nsSNPs in NLRP3

The aggregation propensity for the wild-type NLRP3 protein is −5.798, serving as a baseline for evaluating the effects of various mutations. The SODA tool predicts protein aggregation propensity, with more negative values indicating a higher tendency to aggregate. The R262L mutation exhibits a significantly negative aggregation value of −19.634, indicating a much higher propensity to aggregate and reduced solubility compared to the wild type (Table 7). Consequently, the R262L mutation was prioritized for further analysis.

Analyzing Non-Covalent Interactions

The non-covalent interaction analysis of wild-type and R262L mutated NLRP3 reveals subtle but significant differences that could explain the mutant’s behavior. The R262L mutant shows a slight decrease in hydrogen bonds and weak hydrogen interactions compared to the wild type, indicating minor alterations in stability and local interactions. The decrease in ionic interactions further suggests potential disruptions in electrostatic stability. Interestingly, the R262L mutant has an increase in aromatic contacts, hydrophobic contacts, carbonyl interactions, polar contacts, and weak polar contacts, which suggests a compensatory mechanism to maintain structural integrity (Table 8). The increase in hydrophobic contacts, in particular, aligns with previous reports indicating that changes in hydrophobic interactions significantly affect protein stability. These changes contribute to the mutant’s higher aggregation propensity and lower solubility, reinforcing its association with disease.

Table 8 Non-covalent interactions in wild and R262L mutated model of NLRP3

Protein-Protein Interaction Network for NLRP3

The STRING analysis for NLRP3 reveals its crucial role in innate immunity and inflammation, particularly through its involvement in the NLRP3 inflammasome. As a sensor, NLRP3 interacts with several key proteins such as PYCARD, NEK7, AIM2, and CASP1, facilitating the formation and activation of the inflammasome complex. This complex subsequently activates caspase-1, leading to the maturation and secretion of pro-inflammatory cytokines IL-1β and IL-18, which are central to the inflammatory response. The interactions with proteins like MAVS and TXNIP further underscore the role of NLRP3 in mediating inflammatory pathways and responding to pathogenic threats (Fig. 4).

Fig. 4
figure 4

Protein-protein interaction network for NLRP3. The results show (a) the PPIN generated with STRING at the highest confidence of 0.900 and the highlighted regions (b, c) represent the major clusters in the network

Protein-Protein Docking

STRING identified two major clusters, with NLRP3 showing significant interaction with NEK7. NEK7 is essential for the activation of the NLRP3 inflammasome, a multi-protein complex that plays a pivotal role in the inflammatory response associated with Alzheimer’s disease. NEK7 interacts with NLRP3, facilitating its activation, which leads to the production of pro-inflammatory cytokines. This inflammation is a key feature of Alzheimer’s disease pathology, contributing to neuronal damage and disease progression. Due to its critical role in inflammasome activation, NEK7 was docked with wild type and R262L mutated model of NLRP3 (Fig. 5).

Fig. 5
figure 5

Protein-protein docking of wild and mutated model of NLRP3 with NEK7

The docking results between NLRP3 and NEK7 reveal significant differences between the wild-type and R262L mutant. The docking score (DS) for the wild-type is −281.76 with a confidence score (CS) of 0.933, indicating a strong binding model. For the R262L mutant, the DS is −295.11 with a CS of 0.948, suggesting an even stronger interaction. Visualization of the interactions shows that the wild-type NLRP3 forms contacts with 32 residues in NEK7, establishing 2 salt bridges, 5 hydrogen bonds, and 229 non-bonded contacts. The hydrogen bonds are precise, with distances ranging from 2.23 to 3.13 Å. In contrast, the R262L mutant forms contacts with 30 residues in NLRP3 and 298 residues in NEK7, with 4 salt bridges, 6 hydrogen bonds, and 210 non-bonded contacts (Fig. 5). The hydrogen bonds in the R262L mutant interaction are generally slightly shorter, indicating stronger interactions, with distances between 1.78 and 3.28 Å. This increase in hydrogen bonds and salt bridges in the R262L mutant suggests a more robust and possibly more pathogenic interaction with NEK7, highlighting the mutant’s potential to exacerbate inflammatory responses, which is critical in the context of Alzheimer’s disease. These results underscore the R262L mutant’s enhanced propensity to aggregate and interact more strongly with NEK7, reflecting its potential for increased inflammatory activity.

Molecular Docking

Following molecular docking with memantine, disparities in binding affinity and interaction residues were noted between the wild-type and mutated NLRP3 models. Relative to the wild type, the R262L mutant displayed a slightly elevated binding affinity (−6.6 kcal/mol versus −6.5 kcal/mol) (Table 9). Upon examination of interaction residues, TRP A:416 consistently engaged with memantine in both wild-type and mutant models (Fig. 6), underscoring its pivotal role in drug binding. However, the mutants exhibited additional or altered interactions compared to the wild type. Notably, the R262L mutant formed a conventional hydrogen bond with SER A:345 (Fig. 6). These variances in interaction residues and bond types suggest structural modifications induced by mutations, potentially influencing drug-binding efficacy.

Table 9 Molecular docking of memantine with NLRP3 (wild and mutated models)
Fig. 6
figure 6

Molecular visualization of molecular docking of NLRP3 with Memantine. a 2D visualization of Wild-type (b) 3D visualization of Wild-type (c) 2D visualization of R262L mutant (d) 3D visualization of R262L mutant. Purple color indicates hydrogen bond donors and green color indicates hydrogen bond acceptors

The wild type and the mutated models were superimposed to have a better understanding on the structural alignment. The structures had a 100% overlapping (Topology score: 1.00), RMSD values of 0.06, with a 1.00 fragment score and 21.17 Z-score (Fig. 7).

Fig. 7
figure 7

Alignment of wild-type NLRP3 with R262L mutant. a Cartoon representation (b) Backbone representation of NLRP3 with R262L mutant (Green: Wild type NLRP3; Brown: R262L mutant), c Pairwise sequence alignment of NLRP3 with R262L mutant

Molecular Dynamic Simulation

Molecular dynamics simulations were carried out separately for 100 ns on two systems that are the wild NLRP3 and the R262L mutated model. The stability of the of the complexes was investigated with the RMSD values. The Root Mean Square Deviation (RMSD) graph provides a quantitative measure of the structural stability of the wild-type NLRP3 protein versus the R262L mutant over a 100 ns molecular dynamics simulation. RMSD values (Fig. 8), plotted over time, reflect the average distance between backbone atoms of the protein at a given time point and the initial structure, thus serving as an indicator of conformational changes and stability.

Fig. 8
figure 8

RMSD plots for wild type and R262Lmutated model of NLRP3

In this graph, the wild-type NLRP3, represented by the orange line, maintains relatively low RMSD values throughout the simulation, fluctuating around 0.4 to 0.5 nm. This indicates that the wild-type protein remains close to its original conformation with only minor deviations, demonstrating a stable structure over time. Such stability is crucial for maintaining the protein’s functional integrity and proper interaction with other molecules, such as NEK7, in the inflammasome complex. Conversely, the R262L mutant, shown by the black line, exhibits consistently higher RMSD values, fluctuating around 0.5 to 0.7 nm. This suggests that the mutant protein undergoes greater conformational changes and deviates more significantly from its initial structure. The increased RMSD values indicate that the R262L mutation induces structural instability, potentially leading to alterations in the protein’s functional domains and interaction surfaces (Fig. 8).

The Root Mean Square Fluctuation (RMSF) graph illustrates the flexibility of individual residues in the wild-type and R262L mutant NLRP3 proteins over the course of the simulation. RMSF measures the average deviation of each residue from its mean position, providing insight into the dynamic behavior and flexibility of specific regions within the protein. In this graph (Fig. 9), the wild-type NLRP3, represented by the orange line, shows relatively consistent fluctuations across most residues, with peaks indicating regions of higher flexibility. The R262L mutant, depicted by the black line, displays generally higher RMSF values compared to the wild type, particularly in the regions around residues 200, 400, and 900. This indicates that these regions in the mutant protein are more flexible and experience greater deviations from their mean positions. The increased flexibility in the R262L mutant can be attributed to the structural instability introduced by the mutation, which disrupts the local environment and weakens stabilizing interactions.

Fig. 9
figure 9

RMSF plots for wild type and R262L mutated model of NLRP3

The Radius of Gyration (Rg) graph provides insight into the overall compactness and structural integrity of the wild-type NLRP3 and the R262L mutant proteins over the course of the simulation. Rg measures the distribution of a protein’s mass around its center of mass, with lower values indicating a more compact structure. In this graph (Fig. 10), the wild-type NLRP3, represented by the orange line, displays relatively consistent Rg values around 4.0 nm throughout the simulation, with slight fluctuations. This stability in Rg values indicates that the wild-type protein maintains a relatively compact and stable structure, which is crucial for its proper function and interactions with other proteins. Conversely, the R262L mutant, depicted by the black line, shows slightly higher Rg values, fluctuating around 4.1 nm. These higher Rg values suggest that the mutant protein adopts a less compact structure compared to the wild type. The fluctuations in the Rg values for the R262L mutant also appear more pronounced, indicating greater variability in the protein’s overall shape and compactness during the simulation.

Fig. 10
figure 10

Rg plots for wild type and R262L mutated model of NLRP3

The Solvent Accessible Surface Area (SASA) graph measures the surface area of the protein that is accessible to solvent molecules, providing insight into the protein’s exposure to the surrounding environment. SASA is an important parameter that reflects the folding state, stability, and potential interaction sites of a protein. Higher SASA values indicate more surface area is exposed to solvent, suggesting a more unfolded or extended conformation, while lower values indicate a more compact and folded structure.

The wild-type NLRP3, represented by the orange line, shows a gradual decrease in SASA values over the simulation time, stabilizing around 800 nm². This decrease indicates that the wild-type protein becomes more compact as the simulation progresses, reducing its solvent-exposed surface area (Fig. 11). Such behavior is typical of a stable protein that folds correctly and maintains a compact conformation, essential for its proper biological function. The R262L mutant, depicted by the black line, also shows a decrease in SASA values, but with more fluctuations and generally slightly higher values than the wild type. The R262L mutant stabilizes around 830 nm², which is higher than the wild type, suggesting that the mutant protein retains more exposed surface area throughout the simulation. The increased SASA values and greater fluctuations indicate that the R262L mutant adopts a less compact and more dynamic conformation compared to the wild type. The higher SASA values for the R262L mutant suggest that this mutation causes the protein to adopt a more open structure, potentially exposing more hydrophobic regions to the solvent. This exposure can lead to increased interactions with solvent molecules, promoting aggregation and reducing solubility, as observed in the experimental data. The increased solvent exposure also implies that the mutant protein may be less stable, with a greater tendency to unfold or misfold.

Fig. 11
figure 11

Solvent accessible surface area plot for wild type and R262L mutated model of NLRP3

Discussion

NLRP3 is recognized for its involvement in sensing various molecular patterns, leading to the assembly of the inflammasome complex, activation of caspase-1, and subsequent release of the pro-inflammatory cytokine IL-1β, thereby orchestrating innate immune responses. The NLRP3 inflammasome is implicated in AD, as its activation within microglial cells by amyloid-beta (Aβ) peptides initiate neuroinflammatory processes [37]. It is widely acknowledged that polymorphisms play a significant role in modulating gene expression, potentially influencing inter-individual variations in disease susceptibility and severity [38]. SNPs are the most common type of genetic variation, occurring at a frequency of more than 1% at specific genomic positions [39]. These variations can occur within coding regions, non-coding regions, or intergenic regions of the human genome. Among SNPs, the nsSNPs stand out as they entail a single amino acid substitution within a gene’s coding region, potentially leading to significant alterations in the structural and functional characteristics of the resultant protein [40]. Consequently, nsSNPs have garnered considerable attention in recent research endeavours. Therefore, investigating the relationship between genetic mutations and their phenotypic consequences holds promise in elucidating the underlying mechanisms contributing to AD.

Numerous research endeavours have aimed to anticipate the functional implications of nsSNPs, particularly discerning whether they bear disease relevance or remain neutral, by scrutinizing various attributes of these polymorphisms. Certain attributes rely solely on sequence information, such as the specific residues encountered at the SNP locus. Single-base alterations occurring within protein-coding regions of DNA can induce modifications in amino acids, thereby potentially impacting protein structure and function [39].

Focusing on mitigating neuroinflammation as a therapeutic approach for progressive neurological disorders represents a logical strategy for neuroprotection. However, to effectively implement such strategies, it is imperative to pinpoint the cellular and molecular pathways pivotal in disease-related neuroinflammation [41]. Research conducted by Martinez et al. underscores the necessity of NLRP3 in the manifestation of inflammatory symptoms and neurodegeneration in a mouse model of Parkinson’s disease. This study identified several SNPs concentrated in the highly conserved NLRP3 NACHT domain. Among these variants, one synonymous SNP, rs7525979, was found to be significantly associated with a reduced risk of Parkinson’s disease development [42]. In our study, our focus centered on nsSNPs within the human NLRP3 gene. Analysis conducted through the dbSNP database revealed a total of 13,825 SNPs associated with the NLRP3 gene, with nsSNPs (missense) and synonymous SNPs accounting for 6.4% and 3.7% of the total mutations, respectively. Intriguingly, a substantial proportion (approximately 88%) of these mutations were found within intronic regions, representing non-coding segments of the gene. Further scrutiny was directed towards the 893 nsSNPs identified in our investigation.

Characterizing these nsSNPs based on their impact on protein structure was imperative, prompting the utilization of various bioinformatics tools. Our methodology incorporated diverse parameters describing pathogenicity, providing molecular insights into the effects of these mutations. Given the complexity of predicting the pathogenic effects of SNPs, employing a single method proved insufficient. Therefore, we employed a combination of multiple methods to cross-validate and enhance the robustness of our predictions. Specifically, our study utilized a suite of in silico prediction algorithms, including SIFT, Align GVGD, PolyPhen-2, and PANTHER.

The mutations including rs121908153 (D305H), rs139814109 (T840M, T897M, T954M), rs151344629 (T350M), rs180177431 (L307P), rs180177433 (T438I), rs180177441 (G303D), rs180177442 (R262P, R262L), rs180177447 (D305G), rs180177475 (C261W), rs180177478 (F525C), rs180177499 (Y443H), rs199517145 (C998S), rs201504984 (G303C), rs201593863 (L362Q), and rs201875324 (T659I), were predicted to be deleterious and probably damaging by all prediction software. Except for the mutation T840M and T438I, all the other mutations were predicted to have a destabilizing effect on the protein stability. A decline in protein stability can precipitate structural alterations that impede the protein’s capacity to execute its intended functions efficiently. This destabilization may induce structural modifications, such as misfolding or unfolding, compromising the protein’s functional efficacy. Notably, among the four nsSNPs analyzed—rs180177442 (R262L and R262P), rs201875324 (T659I), and rs139814109 (T897M)—the prediction indicated that they were exposed in the protein. The term “buried” signifies that the amino acid resides deep within the protein’s core, shielded from the surrounding solvent. Conversely, an “exposed” residue is positioned on the protein’s surface, directly interacting with the solvent environment. Buried residues serve a pivotal role in upholding the structural stability and integrity of proteins [43].

Mutations affecting buried residues can disrupt the intricate network of interactions within the protein core, resulting in destabilization of its structure. Buried residues often coincide with critical functional sites within proteins, such as catalytic sites or binding interfaces. Mutations in these residues can interfere with essential sites, thus altering the protein’s function. Given their pivotal role in protein folding and stability, buried residues are typically more evolutionarily conserved. Consequently, mutations in buried residues are more likely to exert a significant impact on both protein structure and function [44]. To gain insights into their three-dimensional structures, these mutated proteins were modeled using 7pzc.1 as the template. Subsequently, molecular docking studies were conducted with Memantine to elucidate potential interactions.

Using SNP&GO and PhD-SNP servers, the nsSNPs R262L and R262P were identified as disease-associated, with R262L showing a higher aggregation propensity (−19.634) and reduced solubility. The non-covalent interaction analysis highlighted that the R262L mutant had slight decreases in hydrogen bonds and ionic interactions but increases in hydrophobic and polar contacts, suggesting enhanced aggregation and disease association. STRING analysis revealed NLRP3’s crucial role in inflammation through interactions with proteins like NEK7, which is vital for inflammasome activation. Docking studies showed stronger interactions between R262L NLRP3 and NEK7 compared to the wild type, with the mutant forming more hydrogen bonds and salt bridges, indicating a potentially more pathogenic interaction. These findings underscore the R262L mutation’s enhanced inflammatory response potential, relevant to Alzheimer’s disease progression.

Memantine is a medication prescribed for alleviating symptoms associated with AD and dementia. Classified as an NMDA receptor antagonist, it operates by mitigating aberrant brain activity. While it does not offer a cure for AD, memantine demonstrates efficacy in enhancing cognitive functions, and memory retention and decelerating the advancement of the disease [45]. The molecular docking analysis of memantine with wild type and mutated NLRP3 models revealed variations in binding affinity and interaction residues. Despite minor fluctuations in binding affinity among mutants compared to the wild type, interaction residues differed significantly. These findings suggest structural alterations induced by mutations, potentially influencing drug-protein interactions. Specifically, the R262L mutation stands out as it may significantly impact the NLRP3-memantine binding interface, highlighting its importance for further investigation in understanding drug efficacy and potential therapeutic implications. The R262L mutant model was subjected to molecular dynamics simulations to understand the structural stability and flexibility of the wild and mutated structures over time.

The higher RMSD values for the R262L mutant are indicative of its decreased structural integrity. This instability can lead to improper folding or increased flexibility, which may expose hydrophobic regions and promote aggregation. The structural changes observed in the R262L mutant are consistent with its predicted higher aggregation propensity and reduced solubility, factors that contribute to its pathogenicity. The higher fluctuations observed in the R262L mutant suggest that the mutation induces localized structural instability, making certain regions of the protein more dynamic and less stable. Furthermore, the increased structural fluctuations in the R262L mutant could disrupt its ability to interact effectively with NEK7 and other inflammasome components. Moreover, the regions with increased RMSF in the R262L mutant may correlate with areas critical for the protein’s role in inflammasome activation and the inflammatory response. The heightened fluctuations in these regions could disrupt the precise molecular interactions required for proper inflammasome assembly and function, potentially leading to aberrant inflammatory signaling. This is particularly relevant in the context of Alzheimer’s disease, where dysregulated inflammation is a key pathological feature. The increased Rg values for the R262L mutant reflect a structural impact caused by the mutation, leading to a more expanded and less stable conformation. This expansion could result from the disruption of key interactions within the protein’s core, leading to a loosening of its tertiary structure. The higher Rg values and increased fluctuations suggest that the R262L mutant is less able to maintain a tightly packed conformation and further may potentially contribute to aggregation due to the exposure of hydrophobic residues. The changes in SASA values for the R262L mutant highlight its structural impact on NLRP3. The mutation appears to disrupt the protein’s ability to maintain a tightly packed conformation, leading to an increase in solvent-exposed surface area.

The R262L mutant (rs108177442) (Alleles: G > A/G > C/G > T) identified in the present study is reported to be present in 1 in 264690 people (Allele frequency: 0.000004) as per the data available in dbSNP (accessed on 22nd July 2024). The R262L coding variant is associated with Familial cold autoinflammatory syndrome 1 (ClinVar Accession: RCV000084231.1) and is considered to be likely-pathogenic in cases of Chronic infantile neurological, cutaneous and articular syndrome (ClinVar Accession: RCV001028056.1) and CIAS1 mutations associated with phagocytic cell-mediated autoinflammatory disorders ClinVar Accession: RCV000788842.1).

Previously, Zhang et al. examined a Chinese population and found that variants rs10754558 and rs10925019 within NLRP3 are linked to ulcerative colitis development [46]. Conversely, Varghese et al. conducted a study among Caucasians and concluded that there is no association between NLRP3 polymorphisms and inflammatory bowel disease [47]. Yang et al. proposed that the rs4353135 polymorphism may contribute to the pathophysiology of juvenile idiopathic arthritis in a Taiwanese population [48]. Additionally, Dehghan et al. emphasized the potential critical role of NLRP3 SNPs in regulating inflammation underlying cardiovascular disease [49]. Several limitations need to be acknowledged in our study. While we have identified high-risk nsSNPs within the conserved domain that exhibit a destabilizing effect on the protein, their functional associations were not characterized in this investigation. Further clinical and genetic studies are warranted to explore the mechanisms underlying the correlation between NLRP3 polymorphisms and the onset of AD.

Conclusion

Detection of deleterious variants is a prominent area of research, with bioinformatics tools offering insights into the impact of nsSNPs or missense mutations on protein stability and structure. Our analysis of 893 nsSNPs revealed 18 deleterious variants associated with the NLRP3 gene. Notably, rs180177442 (R262L and R262P), rs201875324 (T659I), and rs139814109 (T897M) were classified as high-risk variants due to their significant deleterious effects and association with reduced protein stability. These findings suggest potential targets for proteomic studies, diagnostics, and therapeutic interventions in AD. Further investigation through in vivo experiments is warranted to elucidate the implications of these nsSNPs in AD pathology.