Introduction

Endoplasmic reticulum (ER) is a basic sub-cellular compartment responsible for the critical process involving folding and maturation of the proteome, primarily transmembrane and secreted proteins in eukaryotic cells. ER protein homeostasis is highly sensitive to changes in intracellular and extracellular stimuli, involving a complex network of regulators including chaperones, ensuring high degrees of fidelity and efficiency. Nevertheless, alteration in the protein-folding environment causes accumulation of misfolded proteins profoundly inducing ER stress, which subsequently activates the Unfolded Protein Response (UPR) constituted by three signaling pathways mediated by downstream ER stress sensors IRE1 (the inositol-requiring ER-to-nucleus signal kinase 1), PERK (protein kinase-like ER kinase), and ATF6 (Activating Transcription Factor6) [1, 2]. This UPR pathway leads to a series of downstream events like decreasing translation and increasing transcription of ER chaperones to restore normal cellular function and survival [3]. However in chronic ER stress, protective remodeling does not happen due to prolonged UPR activation ultimately leading to pro-apoptotic signals that eventually eliminate the stressed cells [4].

The major role of UPR is to re-establish homeostasis either by increasing folding capacity via expression of folding chaperones or by the downregulation of ER protein client load via inhibiting protein translation and promoting the degradation of misfolding proteins. Accumulating experimental and substantial clinical evidence suggests that prolonged ER stress crucially impact the pathogenesis of various human diseases, including diabetes, cardiovascular diseases, inflammatory diseases, neurodegenerative disorders, and cancer. Currently, cellular ER stress along with oxidative stress in many disease conditions is treated with the administration of antioxidants and chemical chaperones. However, their protective effects are not sustainable, hence altering the molecular machinery of the cell will provide ways to understand the underlying metabolic changes. Interestingly, transcriptionally reprogramming of ER proteostasis network would augment ER protein-folding capacity by modulating the UPR downstream signaling pathways and therefore prevent protein aggregation and misfolding [5,6,7,8,9].

Heat shock protein 90 (Hsp90) is a evolutionarily conserved, ubiquitous family of ATP-dependent molecular chaperone that plays essential role in stabilizing protein folding, to modulating cellular signaling by regulating many client protein substrates. In eukaryotes, HSP90 exists as four isoforms: cytosolic HSP90α and HSP90β, ER-resident GRP94, and tumor necrosis factor receptor-associated protein 1 (TRAP1) which localizes to the mitochondria [10]. GRP94, the ER-resident member of the hsp90 family, governs the maintenance and activation of many essential proteins, particularly membrane-resident and secreted protein clients, namely MHC class II, integrins, and Toll-Like receptors. Coordinated up-regulation of GRP94 with many other ER folding components was first discovered while using pharmacological treatments to induce UPR [11, 12]. There are four ways to acknowledge the contribution of GRP94 in ER quality control which are (a) chaperoning the protein folding, (b) interacting with the ER protein-folding machinery, (c) storing calcium, and (d) assisting the targeting of misfolded proteins to ER-associated degradation (ERAD) [13,14,15].

The most important activity of GRP94 subsists as molecular chaperone that directs folding and/or assembly of secreted and membrane proteins. GRP94 protein contains three domains: C-terminal domain with ATP-binding site thought to regulate HSP90 homodimerization, charged middle linker region, and an N-terminal domain that contains an ATP-binding pocket (with ATPase activity) which regulates the turnover of folded client proteins (Fig. 1). Both the C- and N-terminal ATP-binding sites have been greatly explored as target for inhibitors; however, the majority of HSP90 inhibitors have concentrated only on the N-terminal ATP-binding site [16, 17]. Molecular chaperones require ATP hydrolysis to favor the native folding of their substrates, to avoid aggregation, and revert protein misfolding. GRP94, an ER chaperone displays selective ligand binding affinity to NECA (5ʹ-N-ethylcarboxamido adenosine), a broad-spectrum adenosine A2 receptor agonist which is more potent than adenosine and a well-known activator of GRP94. Several structural studies have demarcated the interaction of NECA with the N-terminal region of GRP94. A number of studies indicate that scaffolds generated from core hopping of known agonist with further pharmacophore modeling and QSAR validations will provide similar scaffold and ligands which would serve as potential agonist in activating the target [23]. Therefore, we decipher that NECA would probably increase the chaperone activity of GRP94.

Fig. 1
figure 1

GRP94 protein with three distinct domains showing the Adenosine binding N-terminal domain (69-337 region)

A number of small-molecule pharmacological chaperones have emerged as a novel therapeutic approach to treat metabolic disorders, such as type 2 diabetes and neurodegenerative diseases. Small-molecule kinase inhibitors and ATP mimetics have shown to activate the IRE1 arm of the UPR signaling pathway and reduce ER stress-associated pathogenesis. Hence, identification of small molecules that enhance the ER chaperoning activity and preferentially activation of the UPR-associated transcriptional program is needed to ameliorate protein misfolding and aggregation. This strategy of small molecule-dependent ER programming would restore the protein-folding ability of GRP94 thereby preventing ER stress-induced disease pathogenesis [18, 19]. With this insight, we have employed Scaffold hopping, pharmacophore modeling, QSAR-based virtual screening, molecular docking, and dynamic simulations to screen small molecules as potential agonists for GRP94 using NECA scaffold.

Materials and Methods

Protein and Ligand Preparation

The crystal structure of GRP94 was downloaded from the Protein Data Bank (PDB ID: 4NH9, https://www.rcsb.org/) with resolution 2.77 Å [20]. All the water molecules were removed, hydrogen atoms were added, and the H-bond assignment was optimized with exhaustive sampling using Glide module of Schrödinger Maestro Suite 2018-4 version 11.8, https://www.schrodinger.com/products/maestro). Finally, the protein structure was energy minimized using the OPLS3e force field to reach the converged root mean square deviation (RMSD) of 0.30 Å. The grid box was defined using “Receptor Grid Generation” by assigning a common constituency point from where a cubic grid box was extended to touch the bounty of 20 Å in size. Ligprep module of Schrodinger Maestro was used to prepare the 3D co-ordinates of the retrieved ligand structures from the selected ZINC and Selleckchem compound databases [21]. The ligands were geometrically minimized using the OPLS3e force field with the Truncated Newton Conjugate Gradient of RMSD below 0.01 kJ/Ǻ. Eventually, the minimized conformer was filtered through a relative energy window of 10 kcal/mol and a minimum atom deviation of 1.00 Å. The ionization states were created for ligands at a physiological pH of 7.2 ± 0.2 and all other variables remained default. The tautomers of the ligands were generated, optimized, and partial atomic charges were computed using the OPLS3e force field [22]. Then the prepared ligands structures were segregated and selected by removing the repetitive compounds manually.

Core Hopping of NECA

Core Hopping v2.1 module of Schrodinger Maestro 11.8 software was used to modify the known agonist by performing fragment-based replacement using ligand- and isosteric-based constraints. The first step was to define the points at which the cores were attached to the scaffold using Define Combinations mode. Next step was to subject the scaffold for optimization process for altering the cores. The altered scaffolds were selected based on the template of NECA and were considered as the initial structure for further analysis. The selected scaffolds of NECA with new analogs were further subjected for Pharmacophore modeling, 3D-QSAR, virtual screening, and molecular dynamics simulations. This parameter is based on the principle of bio-isosteres by replacing functional groups in the template ligand, for targeting and reducing the toxic side effects as well as improving the curative effects [23, 24].

Pharmacophore Generation and Modeling

Pharmacophore models were generated by ligand-based modeling through PHASE module 11.8 of Schrödinger Maestro 2018. Phase allows specification of matching chemical molecules and marginalization for pharmacophore features. The common pharmacophoric features were used for generating the hypothesis. The box size was adjusted to 2 Å with default scoring parameters and the maximum number of sites was set to 5 and minimum to 3. The top ranking hypothesis was selected for further screening. The common integral pharmacological features involved were hydrogen bond acceptor (A), hydrogen bond donor (D), hydrophobic group (H), and aromatic ring (R). After the pharmacophore generation, the scoring function was used to analyze the hypothesis. The selected alignment was computed using the default values and survival score. The feature definitions were utilized to map the positions of pharmacophore sites and were represented internally by a set of SMARTS patterns [25]. To encompass the space occupied by the aligned training set molecules, the grid was generated. Each model contains five or more partial least square (PLS) factors that tend to suit it [26].

3D-QSAR Validation

An atom-based 3D-QSAR model was built from the alignment on the simplest pharmacophore model through PHASE 11.8. A PLS analysis, that correlated the estimated activities with the observed activities, was used to evaluate the predictive ability of the best-scoring hypotheses [27]. QSAR models were important to elucidate the structural information of the ligand’s active conformation. QSAR was mainly based on known biological activities for structural and functional ligands for the pharmacophore model [28]. The statistical parameters such as R2 (coefficient of determination), SD (standard deviation of regression), and Q2 (cross-validated coefficient) were calculated to assess the general significance of the model [29].

Virtual Screening and Docking

Virtual screening was performed in ZincPharmer (http://zincpharmer.csb.pitt.edu/) and Selleckchem Natural Compound library with 2726 compounds (https://www.selleckchem. com/screening/natural-product-library.html) by exploiting the best pharmacophore model to efficiently search the datasets for fixed conformers [30, 31]. Low-energy 3D conformers with the best bond lengths and angles were generated for each two-dimensional structure. In this study, ADHR_2 was the best hypothesis based on which query search was performed to identify hits matching the pharmacophoric features of the hypothesis using the search for match option in PHASE. The default constraints included a maximum of 0.7 Root Mean Square Deviation (RMSD), obeying 10 rotatable bond cutoffs, with a molecular weight range of 180–500 Da. The obtained database hits were ranked depending on the degree of consistency with the pharmacophore model. A minimum fit value of > 3, the lowest limit to qualify as a hit compound, was applied to decrease the number of hits. Molecules with good fitness scores were selected for further docking studies.

Structure-based docking studies were carried out to investigate the intermolecular interactions between the ligand and the targeted protein using Glide 11.8, Schrodinger, 2018. In the first step, the high-throughput virtual screening mode of Glide was employed and 10% of the top-scoring ligands were further subjected to Glide SP docking. Again, 10% of the top-scoring leads from Glide SP were retained and all the ligands were subjected further to Glide XP docking. Low-energy conformations of all the compounds were docked into the active site of GRP94 using Grid-based Ligand Docking with Energetics [32] in extra precision mode without applying any constraint. The best-docked structure was identified using Glide score function, Glide energy, and Glide E model energy. The top ten lead compounds were selected based on their docking score for further studies.

Lipinski’s Rule and ADMET Prediction

Qikprop v.11.8, module was applied to screen the lead compounds for their ADMET properties. ADMET gives the specific range for analyzing the drug-like properties, such as aqueous solubility (QPlog S), brain/blood partition coefficient (QP log BB), (QP log Po/w), predicted apparent MDCK cell permeability (QPMDCK), and human oral absorption [33]. The drug-like behavior of the compounds was predicted through the analysis of pharmacokinetic profile of the compounds. Physically significant descriptors and pharmaceutically relevant properties of all the test compounds such as molecular weight, log P, H-bond donors, and H-bond acceptors were analyzed in accordance with Lipinski’s rule of five [34]. Finally the selected compounds were filtered through Lipinski’s rule to ensure the properties of pharmacokinetics.

SwissADME online tool (http://www.swissadme.ch/) brings together the most relevant computational methodologies to give a worldwide assessment of small-molecule pharmacokinetics profiles [35]. The pharmacological properties of the top lead molecules were assessed based on different parameters and the bioavailability radar was generated.

Molecular Dynamics Simulations

The best-docked complex of GRP94 target protein acquired from the virtual screening result was explored for the conformational stability using molecular dynamic (MD) simulations. MD simulations were carried out using the Desmond module of Schrodinger suite of 2018-4 for 100 ns with the force field OPLS-2005. The generated MD trajectory files were examined in order to determine the equilibrium of dynamics over the period of time. The structure was generated and solvated using the TIP4P model by orthorhombic water box parameters as well as neutralizing with counter ions from each edge of the box [36]. Before the MD simulations, the complex charge was electrically neutralized as well as the system was minimized with their energies by series of heating and equilibrium processes. The selected complexes were energy minimized by the protocol of steepest descent minimization for the maximum extent followed by a cascade of molecular simulations. The above mentioned steps were taken into account at temperature of 300 K with a NVT ensemble by Nose–Hoover thermostats, whereas the pressure was maintained using Berendsen thermostats method [37]. Eventually the salt concentration of the system was set in a specified state as 0.15-M Na+/Cl for the neutralizing components in a dynamics system. The process of simulated protein–ligand complexes were examined to assess the root mean square deviations (RMSD) for the MD trajectory frames of the selected simulated structure [38]. Further, RMSD variations were calculated and the Root mean square fluctuation (RMSF) was analyzed to study the structural changes of both the GRP94 protein and ligand complexes.

Results

In the present study, novel GRP94 agonists were designed and identified by means of core hopping approach, the structure of NECA (5ʹ-N-ethylcarboxamidoadenosine), the known agonist for GRP94 was used as the lead compound. The newly designed scaffolds of GRP94 from core hopping analysis [39] were subjected to pharmacophore hypothesis testing, 3D-QSAR model validations, and high-throughput virtual screening using Selleckchem and ZINCpharmer database to effectively identify small molecules for enhancing GRP94 activity. The screened compounds were validated as effective drug molecules based on ADMET prediction and the top lead compounds were taken over for further analysis. Molecular dynamics simulations of the representative ZINC and Selleckchem compounds with GRP94 protein were also performed to study the binding stability of the protein–ligand complexes.

Core Hopping Approach

The scaffold structure of NECA was divided into three parts as Core A, Core B-linkers, and Core C. The linkers and core of NECA were modified to produce various novel scaffolds. After optimization, 37 novel ligand-based scaffolds were generated by Core hopping. The newly constructed scaffolds obtained by modifying the linkers were visually inspected and four new scaffolds were selected for further analysis (Fig. 2). These scaffolds were then subjected for pharmocophore modeling, QSAR analysis, and virtual screening.

Fig. 2
figure 2

Scaffolds generated by Core hopping of NECA

Pharmacophore Model Generation

The pharmacophore models were generated using a set of pharmacophore features in order to get suitable sites for all the new scaffolds. In total, 392 common pharmacophore hypotheses were generated using different combinations of 13 variants. Among the 13 variants, five pharmacophore models ADHH_1, ADHHR_3, ADHR_2, DHHR_1, and DHHR_4 were selected for further analysis (Table 1). The scoring procedure provides a ranking of the different hypotheses, allowing us to make rational choices. The scoring algorithm includes the alignment of site points and vectors, volume overlap, number of ligands matched, selectivity, relative conformational energy, and activity [40]. A total of five hypotheses survived the scoring processes were utilized to build an atom-based QSAR model. In the present study, the pharmacophore site of the selected scaffold was defined by a set of four pharmacophoric features: hydrogen bond acceptor (A), hydrogen bond donor (D), hydrophobic groups (H), and an aromatic ring (R). Hypothesis ADHR_2 was selected as the best hypothesis based on highest survival score of 4.594 and volume of around 0.640 (Fig. 3).

Table 1 Pharmacophore hypothesis generated for the NECA scaffolds
Fig. 3
figure 3

A Structure-based Pharmacophore Mapping of ADHR_2 Hypothesis, B Common Pharmacophore Hypothesis aligned with all the Variants, and C Atom-based 3D-QSAR model Visualized for the selected Hypothesis ADHR_2

Evaluation of 3D-QSAR

The 3D-QSAR model prediction and validation were performed for the five selected pharmacophore hypothesis [38]. Regression values for the QSAR model were analyzed with the increasing number of PLS factors as the statistical significance and predictive ability of the model also incrementally increased up to 7. ADHR_2 has the most significant R2 value of 0.3170, predictive power Q2 value of 0.1039, Pearson R value of 0.3483, variance (SD) of 1.2098 indicative of its best fit, and high cross-validated coefficient of correlation (Table 2). In addition, ADHR_2 has P-value of 8.73e−018 indicative of the goodness of the model (Fig. 3).

Table 2 Atom-based 3D-QSAR validation for NECA scaffolds

Virtual Screening

Structure-based docking studies were carried out to investigate the intermolecular interaction between the ligands and the targeted protein, GRP94 using Glide [41]. ADHR_2 hypothesis was subjected for virtual screening against ZINC and Selleckchem databases to obtain ligands that match the pharmacophoric features of the model. A total of 2, 26, 248 compounds from ZincPharmer database and 833 compounds from Selleckchem were retrieved as hits. These hit compounds were docked onto GRP94 protein using Glide XP module of Schrodinger. The top ten ligands from ZINC database (Table 3) interacted with the following GRP94 active site residues ASN107, LYS114, ASP149, ASN162, GLY196, PHE199, and TYR200 either by hydrogen bonding or hydrophobic interactions (Fig. 4). Further, the potential lead molecules such as ZINC39737222, ZINC92952357, ZINC67650204, ZINC72457930, and ZINC15669361 had their docking score range from − 12.271 to − 9.025 which was considerably much higher than the known agonist, NECA (ZINC3995401).

Table 3 Docking and Binding energy of top hit compounds from ZINC database with 4NH9
Fig. 4
figure 4figure 4

2D and 3D Interactions of GRP94 and Top hit compounds from ZINCpharmer: A ZINC39737222, B ZINC92952357, C ZINC67650204, D ZINC72457930, and E ZINC15669361

Similarly, the top hits from Selleckchem database (Table 4) exhibited more interactions with GRP94 active site residues particularly, ASN107, ASP110, LYS114, ASP149, GLU158, LYS161, ASN162, GLY196, PHE199, and TYR200 either by hydrogen bonding or hydrophobic interactions (Fig. 5). In addition, the potential lead molecules such as 1,2,3,4,6-O-Pentagalloylglucose, Epimedin A, Narcissoside, Eriocitrin, and Secoisolariciresinol diglucoside had their docking score ranging from − 14.027 to − 13.096 and Glide energy from − 75.043 to − 61.039 which was considerably significant than the known agonist, NECA (ZINC3995401).

Table 4 Docking and binding energy of top hit compounds from Selleckchem database with 4NH9
Fig. 5
figure 5figure 5

2D and 3D Interactions of GRP94 and Top hit compounds from Selleckchem A Epimedin A, B 1,2,3,4,6-O-Pentagalloylglucose, C Narcissoside, D Eriocitrin, and E Secoisolariciresinol diglucoside

ADMET Analysis

Lipinski’s rule of five was based on the physicochemical properties of drugs and candidate drugs in clinical trials to evaluate drug likeness. Qikprop module of Schrödinger was utilized to examine the drug likeness of compounds. The percentage of the human oral absorption of selected lead compounds was found to be 52% to 100%. For selected lead compounds ZINC39737222, ZINC92952357, ZINC67650204, ZINC72457930, and ZINC15669361, the parameters like water solubility (QPlogS), blood barrier (Qplog BB), and cell permeability (QPPCaco) were within the permissible range as depicted in Table 5. Human oral absorption was 100% for ZINC39737222, 70.531% for ZINC92952357, 58.50% for ZINC67650204 as well as ZINC72457930, and 90.153% for ZINC15669361 indicating their efficacy as potential drug molecules with good therapeutic efficiency.

Table 5 Lipinski’s rule and ADMET properties of top hit compounds from ZINC database

The different parameters of drug-likeness analysis of the lead compounds were visualized using the SwissADME bioavailability radar (Figs. 6 and 7) [42]. Narcissoside and Eriocitrin have no pharmacokinetics interference as well as are non-inhibitors of CYP enzymes which attributes to their excellent drug-like properties with strong solubility and no excretion issues. The drug-likeness characteristics (Table 6) of Narcissoside are appropriate since they follow the Lipinski RO5 with minimal violation, zero alerts for Pan-Assay Interference Compounds (PAINS), and a bioavailability score of 0.17. Although, the potential bioactive compound Epimedin A exhibits negligible solubility, violate drug-likeness parameters because of its higher molecular weight. The drug-likeness analysis of ZINC top lead compounds using SwissADME indicated ZNC92952357 and ZINC67650204 with bioavailability score of 0.55 which signifies good pharmacokinetic properties and significant cellular permeability.

Fig. 6
figure 6

SwissADME bioavailability radar showing drug likeness of different bioactive molecules—pink areas represent pharmacokinetic property, like lipophilicity, molecular weight, solubility, and flexibility. A ZINC39737222, B ZINC92952357, C ZINC67650204, D ZINC72457930, and E ZINC15669361

Fig. 7
figure 7

SwissADME bioavailability radar showing drug likeness of different bioactive molecules—pink areas represent pharmacokinetic property, like lipophilicity, molecular weight, solubility, and flexibility. A Epimedin A, B 1,2,3,4,6-O-Pentagalloylglucose, C Narcissoside, D Eriocitrin, and E Secoisolariciresinol diglucoside, F NECA

Table 6 Drug-likeness characteristics of top hit compounds from Selleckchem database

Molecular Dynamics Simulations of ligand–Protein Complexes

In order to study the dynamic behavior and the binding stability of the ligand–protein complex under suitable environmental conditions, such as temperature and pressure, MD simulations were performed for 100 ns using Desmond module of Schrodinger Biosuite. In the trajectory analysis of the backbone, RMSD for all the top lead molecules with GRP94 were stable throughout the simulation run with minimum conformational changes.

Analysis of RMSD and RMSF Plot of ZINC Ligands

The RMSD versus the simulation time was considered as a significant criterion to evaluate the stability of the dynamic behavior. The final RMSD (Root Mean Square Deviation) values of the MD simulations for the selected complexes were indicated as a time-dependent function. The RMSD trajectory analysis of the known agonist NECA (ZINC3995401) as shown in Fig. 8A displayed considerable deviations for the whole simulation time without attaining the stable state. The RMSD of ZINC72457930 complex initially showed conformational fluctuations but after 20 ns it reached the RMSD value of 2.0 Å and showed a stable interaction throughout the whole simulation process without significant deviations. Further, the trajectories of ZINC39737222 and ZINC67650204 were for the most part similar within the range of 2.0 and the dynamics of the complexes was typically more stable without much deviations. The RMSD of ZINC15669361 ligand complex displayed fluctuations initially rather attained stability after 40 ns and exhibited stability throughout the runtime of 100 ns. However, the trajectory of ZINC92952357 complex showed initial deviations from 1.0 Å and reached the stable conformation after 10 ns.

Fig. 8
figure 8

MD simulations of the top hit ZINC compounds for 100 ns with RMSD, RMSF, and Interactions

RMSF (Root Mean Square Fluctuation) plot was analyzed to evaluate the residue wise fluctuation differences in all the selected protein–ligand complex systems as well as to identify the changes in the ATP-binding region for the entire simulation period (Fig. 8B). During the entire simulations, all the protein–ligand complexes exhibited uniform fluctuations in most of the residues except significant displacement at the residues PHE199, TYR200 close to nearly 6 Å indicating considerable mobility of the residues at those regions. The overall analyses revealed that all the lead molecules that complex with GRP94 confer a similar fluctuating pattern. The interactions of ZINC39737222 revealed that residues ASP149, GLU158, ASN162, and PHE195 fluctuate at the range of 1.5 Å. Furthermore, ligands ZINC67650204, ZINC15669361, and ZINC92952357 displayed significant interactions with the following residues: ASP149, GLU158, ASN162, PHE199 as well as LYS161, ASN107, and MET154 indicating novel interactions at specifically different residues. All the five compounds exhibited similar range of fluctuations. However, ZINC72457930 showed fluctuations at majority of the amino acid residues indicating the low stability of protein–ligand interactions: THR38, TYR39, ASP34, GLU201, and THR229.

Analysis of RMSD and RMSF Plot of Selleckchem Compounds

The RMSD of O-Pentagalloylglucose-GRP94 complex initially exhibited conformational fluctuations but after 40 ns it reached the stable conformation with RMSD value of 3.0 Å throughout the whole simulation process without significant deviations. However, the trajectories of Eriocitrin and Secoisolariciresinol diglucoside were for the most part similar within the range of 2.4 Å and the dynamics of the complexes was typically more stable without much deviations. The RMSD of Epimedin A ligand complex displayed initial deviations from 1.0 Å rather attained stability after 20 ns with an RMSD of 3.0 Å and exhibited stability throughout the runtime of 100 ns. Conversely, the trajectory of Narcissoside complex showed considerable fluctuations throughout the runtime and finally reached stability only after 85 ns (Fig. 9A).

Fig. 9
figure 9

MD simulations of the top hit Selleckchem compounds for 100 ns with RMSD, RMSF, and Interactions

Protein–Ligand Interactions

Different bonded interactions like hydrogen, hydrophobic, ionic, and water bridges are important for understanding the intermolecular recognition between the protein–ligand complexes. Among them, hydrogen bond interactions are very crucial in determining the interactive residues in the protein–ligand contacts. The histogram of all the GRP94-ligand complexes during the dynamic simulations are depicted in Fig. 9C. The results revealed that the hydrogen-bonded interactions were more prominently seen in all the five protein–ligand complexes, indicating significant maintenance of the hydrogen bonds throughout the simulation time. All the lead molecules showed stable interaction with active site residues, ASP149, GLU158, ASN162, PHE199, LYS161, and ASP107. However, Epimedin A, Narcissoside, and Eriocitrin ligands exhibited significantly higher bonded interactions than the known agonist, NECA indicating that these molecules would serve as potential agonists of GRP94 protein. However, further in vitro validations are required to confirm the theoretical predictions. Moreover, the interactions of key residues pronounced in docking studies were also retained at 100-ns MD simulations suggesting that the identified novel lead molecules would have better agonist activity than the known agonist NECA (Fig. 10).

Fig. 10
figure 10

Binding interactions and MD simulations of the known agonist NECA (ZINC3995401) with RMSD and RMSF

Discussion

Disruptions in the protein-folding machinery due to protein overload leads to accumulation of misfolded proteins in the ER which activates the UPR signaling pathway to attenuate protein synthesis and maintain protein homeostasis. Within the lumen of the ER, protein chaperones assist in the folding of newly synthesized polypeptides and prevent aggregation of unfolded or misfolded proteins. Quality control exists in the ER to avoid accumulation of unfolded or misfolded proteins. Abnormal protein conformations are a major cause for disturbed cellular homeostasis; therefore, perturbations in the ER are thought to be the origin of many diseases and developmental abnormalities. Grp94, an HSP90-like protein in the ER lumen that chaperones secreted and membrane proteins, has emerged as an important target in a variety of diseases, including cancer, glaucoma, and neurodegenerative disorders, as well as viral and parasitic infections [43]. As the ER workload increases, GRP94 is transcriptionally co-regulated with other chaperones to increase the efficiency of folding and reduce the chance of misfolded proteins by preferential interactions with Grp94-dependent clients [44, 45]. Hence, therapeutic targeting of GRP94, a major ER chaperone, exploiting small molecules would enhance the efficacy of GRP94 to restore ER homeostasis and mediate cellular protection.

There is growing interest in developing new drugs that modulate Hsp90 chaperone activity. While Hsp90 inhibition has gained major attention as novel anti-cancer target, the therapeutic relevance of Hsp90 activation in other disease areas remains unexplored. Zhao et al. 2010 reported the unexpected identification of tamoxifen as an effective small-molecule activator of Hsp90 ATPase activity [46]. Natural products remain as a valuable source of small molecules that serve as potential leads for drug development. A literature survey on nature-derived ER modulators by Correia da Silva et al., 2022 listed a number of molecules notably, fisetin, withaferin, cephalostatin, and basiliolides as ER stress inducers, while hydroxytyrosol and berberine are reported to ameliorate ER stress. Several small molecules have so far been reported to modulate the different arms of the UPR and therefore exhibit promising beneficial effects in diverse human diseases. Salburinal, AA147, Apigenin, and Baicalein are different modulators that target the UPR-transducer pathways [47, 48]. Recent studies indicate that Curcumin, well-known polyphenol from turmeric, exerts antioxidant defense activity against ROS-induced myogenic cell damage via selective expression of the ER stress-inducible chaperone GRP94 [49]. GRP94 induced by curcumin maintains calcium homeostasis and protects the cell from apoptosis, demonstrating that curcumin inhibits UPR induction by inhibiting transcription factor AP-1. Consequently, this study aimed at enhancing the chaperoning activity of GRP94 through small molecule-dependent ER reprogramming to reduce the protein overload and misfolding associated with ER stress.

In this study, the selected molecule 4NH9 (Correlation between chemotype-dependent binding conformations of HSP90 alpha/beta and isoform selectivity) displays chaperone activity at the N-terminal regions from the residues 69–337. In addition, the NECA ligand within the 3D structure of 4NH9 binds the following actives sites ASN149, TYR200, ASN107, and ASN162 with docking score of -6.513 kcal/mol. Huck et al., 2019 reported the structural analysis of GRP94 complex with NECA revealing the interaction sites as MET85, ASN162, LEU163, THR165, ALA167, THR171, GLY196, VAL197, PHE199, and TYR200 which is adjacent to the central adenine-binding cavity of the N-terminal domain of GRP94. A four-point pharmacophore model ADHR_2 was generated and 3D-QSAR analysis validated the hypothesis ADHR_2 as the best fit model with high R2 value and least P-value indicative of the goodness of fit. High-throughput virtual screening of the hypothesis against Selleckchem and ZINC databases retrieved a total of 833 and 2,26,248 compounds. Using docking score, binding energy, and ADMET predictions, the following compounds ZINC39737222, ZINC92952357, ZINC67650204, ZINC72457930, and ZINC15669361 from ZINC database as well as 1,2,3,4,6-O-Pentagalloylglucose, Epimedin A, Narcissoside, Eriocitrin, and Secoisolariciresinol diglucoside from Selleckchem library were identified as potential lead molecules. These compounds exhibited various bonded interactions with either of the active site residues in the N-terminal region of GRP94. All the lead compounds were within the acceptable range defined for human consumption, thereby indicating their potential as drug-like molecules. MD simulations revealed the RMSD trajectories of all complexes were stable with minor deviations within the range of 0.3 nm.

Protein–ligand interactions exhibited by the selected compounds ZINC92952357, ZINC67650204, and ZINC72457930 indicated more stable interactions with the N-terminal domain of GRP94 with Glide docking score from − 10.535 to − 10.454 9 kcal/mol and binding energy from − 42.559 to − 32.239 kcal/mol. Further analysis showed that these ZINC compounds are mainly piperidine and pyrazole derivatives that are known for their efficacy to differentially regulate ER stress and thus represent potential therapeutic agents to treat ER stress-related renal and neurodegenerative disorders [50]. Similarly, natural compounds such as Narcissoside, Eriocitrin, Epimedin A, and Secoisolariciresinol diglucoside showed better interactions with the ER chaperone with Glide docking score from − 13.520 to − 13.096 and glide energy from − 66.270 to − 61.039. Many natural occurring compounds including polyphenols, alkaloids, and flavonoids, target ROS and ER stress to modulate the disease pathogenesis. Narcissoside, known as narcissin, is a flavonoid derivative present in a number of medicinal plants displays several pharmacological properties. It exhibits neuroprotective potential by preventing the ROS-induced oxidative stress and apoptosis [51]. Eriocitrin, a major flavonoid in citrus plants, has displayed suppressive effect on oxidative stress and lipid-lowering effect in rats [52]. Secoisolariciresinol diglucoside, a lignan found mainly in flaxseed, has significantly regulated the lipid metabolic disorders through the ER stress-Ca2+-mitochondrial-associated pathway [53]. However, the mechanism of regulation of these natural compounds on ER chaperones as well as UPR pathway are yet to be elucidated in wet lab studies.

Conclusion

In the present study, novel GRP94 agonists were designed by means of core hopping approach and the newly designed scaffolds of NECA were subjected to pharmacophore hypothesis testing, 3D-QSAR model validations, and high-throughput virtual screening on ZINC and Selleckchem databases to effectively identify small molecules for enhancing GRP94 activity. The screened compounds were validated as effective drug molecules based on ADMET prediction and the top lead compounds were subjected to MD simulations to study the binding stability of the protein–ligand complex. The study yielded 10 top-docked compounds, which were then filtered further using post-docking analysis (top 5). Significantly, Epimedin A, Narcissoside, Eriocitrin, and ZINC92952357, ZINC67650204, and ZINC72457930 satisfy all the parameters investigated, such as docking score, binding energy, ADMET properties, and dynamic simulations. The present study deciphers that core hopping coupled with atom-based 3D-QSAR model and docking studies helped in designing analogs with better activity that may serve as potent small-molecule agonists for GRP94 to alleviate ER stress-associated disease pathogenesis. However, further experimental validations are required to substantiate the efficacy of these small molecules.