Introduction

The Zika flavivirus (ZIKV) is an arthropod-borne virus and closely related to West Nile, Dengue, and Yellow Fever viruses [1]. ZIKV was first discovered in 1947 in the Zika forest of Uganda and isolated from multiple Aedes species, which serves as the primary epidemic vector and is also associated with ZIKV transmission to humans [2]. There were many cases of ZIKV reported in countries such as Tanzania, Uganda, Gabon, Egypt, Indonesia, and India [3]. Recent investigations suggest a higher risk of infection in microcephaly during pregnancy, several neurological complications such as Guillain-Barré syndrome, meningoencephalitis, acute respiratory distress syndrome, hepatic dysfunction, hemorrhagic complications, multi-organ dysfunction syndrome, and death [2, 4, 5].

The single-stranded positive-sense RNA genome of ZIKV translates into a long polyprotein in infected cells cytoplasm. The ZIKV polyprotein consists of three structural proteins [precursor membrane (prM) protein, envelope (E) protein, and capsid (C) protein] form the virus particle and seven non-structural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5) perform essential functions in polyprotein processing, genome replication, and manipulation of host responses for viral advantage [6]. As an essential and imperative component of viral replication and forming membrane-bound complexes with other viral proteins, the NS3-Hel protein can be a most attractive antiviral target [7]. The multifunctional protein NS3 helicase possesses 5′-terminal RNA triphosphatase and nucleotide 5′-triphosphatase activities [8]. Fang et al. reported three crysal structures of ZIKV NS3 helicase comprising one in apo form and two in complex with ADP and Mn2+ [9]. The NS3-Hel comprises three domains with equal sizes, and apparent clefts locate between the adjacent domains (Fig. 1). The mechanism of flavivirus ATP hydrolysis in DENV NS3 helicase is illustrated in four states. The first state denotes the substrate complex using ATP analog, second state mimics the catalytic transition state using ADP, third state represents a form of both ADP and phosphate bound complex and fourth state represents a ADP bound product complex [10]. In the ATP bound substrate complex and ADP bound product complex of ZIKV NS3 and DENV NS3, the reactant water play essential role by forming hydrogen bonds with two conserved residues [9]. Domain 1 at residues 182–327 and domain 2 at residues 328–480 comprises the tandem α/β RecA-like folds characteristic of SF1 and SF2 helicases [11]. Domain 1 is associated with the classical motifs I/P-loop (or Walker A), Ia, II (or Walker B), and III, whereas the motifs IV, IVa, V, and VI are involved in domain 2. These helicase motifs are typically associated with ATP binding and/or hydrolysis (motifs I, II, and VI), inter-domain communication, and RNA binding (motifs Ia, IV, and V) and line a cleft at the interface of domains 1 and 2 [12]. The C-terminus of NS3- Hel contains an NTP-dependent RNA helicase domain, and it is mainly responsible for the hydrolysis of NTPs and the unwinding of the RNA [13, 14]. Thus, NS3-Hel becomes a promising target for antiviral drug development programs.

Fig. 1
figure 1

The three domains of NS3-Hel protein binding pocket used for virtual screening

Yuan et al. validated the ZIKV NS2B-NS3 protease inhibitor activity in novobiocin and lopinavir-ritonavir [15]. There are various repurposed drugs like chloroquine, emricasan, niclosamide, mycophenolic acid, mefloquine, bortezomib, suramin, sofosbuvir, nitazoxanide, and nitazoxanide, which inhibit the ZIKV replication but the molecular mechanism of inhibition is still unknown [16, 17]. The compounds CHEMBL619 and ZINC720 reported as hits for NTPase site of NS3 helicase protein [18].

To our knowledge, no potential and safe antiviral drugs or vaccines are available for the ZIKV. Therefore, a vaccine against the virus is urgently needed, and development is probably some years away [19]. Hence, alternative treatment options are extremely required, both for prophylaxis of the infection to prevent and post-infection therapy. Molecular modeling and computational approaches are precious tools in developing potential inhibitors of ZIKV [20]. The structure-based drug design (SBDD) approach identifies leads to a target protein [21]. SBDD explores new insights on the nature of the active site and the ligand–protein interactions. This approach reduces the time and cost.

In this research investigation, we identified top inhibitors against ZIKV through various computational methods. In this study, we performed virtual screening to screen possible ZIKV inhibitors from the ZINC compound database. To evaluate drug-likeness, we completed the pharmacokinetics and toxicity parameters calculations of selected inhibitors. We also executed density functional theory calculations for top-scored compounds. We hope the results obtained from this research investigation would help search for novel potential inhibitors against ZIKV by identifying pharmacophoric and structural features involved in the binding process.

Materials and methods

Protein preparation

The three-dimensional crystal structure of Zika virus NS3 helicase at 1.8 Å resolutions with PDB-ID: 5JMT was retrieved from the Protein Data Bank (http://www.rscb.org/pdb) [22, 23]. This resolution of the protein suggests its good quality as a resolution of 2.0 Å is the recommended maximum for the best structural protein for molecular modeling [24]. Restrained minimization was done through the OPLS-2005 force field and 0.3 Å as an RMSD constraint after optimizing hydrogen bonds.

Active site identification

The catalytic site of the Zika virus NS3 helicase was identified using the SiteMap tool of Maestro. It recognizes potential active pocket by joining together “sitepoints” that most probably subsidize to tie protein–protein or protein–ligand binding [25, 26].

Receptor grid generation

After preparing the protein and identifying the catalytic binding site, the receptor grid was generated for the protein by the Grid Generation panel of the Glide module. The receptor grid was generated to establish the active site of NS3 helicase protein prior to docking at the centroid of the predicted active sites. This technique generates two cubical boxes having a common centroid to organize the calculations: a larger enclosing and a smaller binding box [27].

Ligand preparation and virtual screening

Virtual screening is a computational approach used in the drug discovery process to get small drug-like molecules that are most likely to bind the receptor or target molecule [28]. In this research work, the selected dataset consists of 0.5 million compound libraries of the ZINC database in SDF format [29]. The complete virtual screening was performed against the above-said databases by the Glide tool of Maestro. In this study, all selected dataset was screened for High throughput virtual screening (HTVS), Standard precision (SP) docking, and Extra precision (XP) docking [30]. Based on their Glide Gscore, drug likeness properties, the top five scoring compounds were selected for further investigation.

Binding energy calculation

The molecular mechanics-generalized born surface area (MM-GBSA) method quantitatively measures the binding strength between the receptor and ligands [31]. Prime MM-GBSA panel of Maestro calculates the ligand binding as well as ligand strain energies of ligand–protein complexes. For this calculation, the top-scored ligand–protein complexes were selected with default parameters.

Molecular dynamics simulations

MD simulation study was done through the Desmond tool of Maestro for protein–ligand complexes [32]. The top five compounds were considered for MD simulations based on their binding interactions (Glide Gscore), binding energy, pharmacokinetic, and toxicity parameters. The selected protein–ligand complex was pre-processed to refine side chains, add missing atoms, and minimize strain before MD simulation through the protein preparation wizard. The prepared complex was imported into the solvation tab of system builder panel. In this panel, a solvated model generates by selecting POPC (300 K) as a membrane model, SPC as a solvent model with an orthorhombic box shape. In the ions tab, the system neutralized by adding the required number of ions and 0.15 M as the salt concentration of Na + and Cl− ions to simulate the physiological conditions. All periodic boundary conditions were employed. All bad contacts were removed by energy minimization with the help of the hybrid method steepest decent and the limited-memory Broyden–Fletcher–Goldfarb–Shanno (LBFGS) algorithms [33]. Further, a minimized solvated model was imported into the molecular dynamics tab as “out.cms” file and simulation was carried out for 100 ns by keeping a 4.8 ps trajectory recording. The force field OPLS_2005 was selected simulation calculations. The model was relaxed before the production system run because it makes a series of predefined minimizations and MD executions.

Pharmacokinetic and toxicity parameters calculations

QikProp tool is used for calculation of pharmacokinetic properties of ligands as molecular weight (MW), molecular volume, hydrophobicity, hydrophilicity, polar surface area (PSA), number of the rotatable bond, donor and acceptor hydrogen bonds, etc. [34]. PSA is a very useful parameter for the prediction of drug transport properties. QPlogPo/w is the octanol/water partition coefficient which evaluates the lipophilicity of the compounds. QPlogS and QPlogBB predict the aqueous solubility and brain/blood partition coefficient of compounds, respectively. OSIRIS property explorer was used for the toxicity prediction of inhibitors of NS3 protein.

Prediction of bioavailability and synthetic feasibility

In silico bioavailability and synthetic feasibility of the top-scored compounds were evaluated through the Swiss ADME tool. Consideration of synthetic practicability produces a number between 1- for simply synthesized compounds and 10- for compounds that are challenging to synthesize [35].

Density functional theory calculations

Single point energy calculations using density functional theory were performed using Jaguar to explain ligand-bound protein at an electronic level. The chemical structures of ZINC01033978 and ZINC00114948 were optimized using hybrid functional B3LYP parameters with 6-31G** basis set. Various properties such as electrostatic potential, average local ionization energy, gas-phase energy, canonical orbital, etc. were evaluated.

Results and discussion

Identification of active sites

For generating the active catalytic site, a comprehensive search was done by SiteMap for searching hydrogen-bonded, hydrophobic, and van der Waals regions in protein pockets. It results in five possible binding sites depicted in Table 1.

Table 1 Structural features of the active sites predicted by SiteMap

Based on exposure, enclosure, and covered by other features, scoring was done. The druggability of each binding site had given in terms of the sitescore. In all possible binding sites, site_1 has a maximum of 1.081 sitescore along with 1346.618 Å as volume. The sitescore of other site_2, site_3, site_4, and site_5 was 1.046, 0.830, 0.940, and 0.756. Due to excellent don/acc, volume, and Dscore value, site_1 can mechanistically justify the binding pattern and answer for the protein–ligand interactions. Hence, site_1 was considered an appropriate binding site to perform virtual screening based on sitescore and other structural features. The active site was made up of 42 amino acid residues, namely Pro292, Pro542, Val366, Ser601, Asp602, Asp540, Lys389, Val543, Glu489, Arg388, Leu442, Ser365, Pro364, Val363, Thr409, Cys429, Asp410, Leu430, Lys389, Arg598, Hid486, Arg617, Ser608, Phe609, Val599, Ala605, Leu541, Lys431, Pro432, Ser293, Ala264, Asp291, Thr290, Glu489, Met536, Leu493, Thr267, Phe289, Glu392, Met414, Gly539, and Hid484. The binding pocket of site_1 diagram has given in supplementary data (Fig. S1).

Virtual screening and binding interaction analysis of complexes

A dataset of small molecules of the ZINC database was used for the virtual screening of NS3 helicase protein (5JMT). Active sites with the best site scores (top-ranked potential receptor binding cavity) had been taken as a prerequisite for receptor grid generation. The executed virtual screening approach was using the hierarchical model of elimination technique, i.e. HTVS followed by SP and further the XP docking. The number of compounds employed for HTVS, SP and XP docking were 50,000, 5000 and 500, respectively. After the hierarchal model of screening, the top five inhibitors were considered for further studies. The GlideGscore and Prime MM-GBSA binding energy parameters score of all top-scored inhibitors is shown in Table 2.

Table 2 Results of MMGBSA binding energies and docking score of top five inhibitors

The GlideGscore of ZINC01033978 was − 7.55 kcal/mol with two hydrogen bonds with the sidechain Ser601 and Asp602 amino acid residues (Table 3). This suggests that ZINC01033978 possesses greater affinity for NS3 helicase protein. A hydrophobic interaction was also observed between NS3 helicase protein and ZINC01033978 by Pro292, Val543, Pro542, Leu442, Val366, Pro364, Val363, Cys429, and Leu430 amino acid residues (Fig. 2). Similar to ZINC01033978, the ZINC00114948 compound interacts by hydrogen bonding with Asp540 and Lys389, which belongs to an essential cleft for viral replication (Fig. 3) [13].

Table 3 Top five screened inhibitors with their number of interacting hydrogen bonds, interacting residues, and other observed interactions
Fig. 2
figure 2

NS3 helicase protein and ZINC01033978 binding interaction diagram

Fig. 3
figure 3

NS3 helicase protein and ZINC00114948 binding interaction diagram

ZINC01033978 is a triazole derivative characterized by an indane ring with a ketone group, and a fluorophenyl substituted group at positions 3 and 5. The two nitrogen of triazole ring involved in hydrogen bond interaction with Ser601 and Asp602 residues. These amino acid residues interact with the RNA motifs of NS3 protein [12]. Triazole nucleus play significant role in medicinal chemistry due to its capability of forming a hydrogen bond, which improves their solubility and ability to favorable interact with bimolecular targets. The 1,2,3-triazoles are highly stable to metabolic degradation as compared to other heterocyclic compounds because they have three adjacent nitrogen atoms [36, 37].

Compound ZINC01047185 interacts with sidechain residues Asp540 and Lys389 and backbone residue Asp540 to make hydrogen bonds (Supplementary Fig. S2). The residues Leu541, Pro542, Val599, Phe609, Ala605, Val366, Leu442, Leu430, and Pro432 show hydrophobic interactions with compound ZINC01047185. The other compound, ZINC12340356, forms five hydrogen bonds with the sidechain residues Ser293, Asp291, Arg598, Hid486, and Arg388 (Supplementary Fig. S3). The π-π stacking has also been observed between NS3 helicase protein and benzene ring of ZINC12340356 by Arg598 residue. Same as the compound, ZINC01034603 interacts by hydrogen bonding with Asp291, Arg598, and Lys389, and Val543 amino acid residues. In addition, the pyrazole ring was involved in π-π stacking interaction with NS3 helicase protein by Arg598 residue (Supplementary Fig. S4). Epigallocatechin-3-Gallate binds with RNA binding cavity by forming hydrogen bond with Leu430 [38]. This indicates similar binding of screened compounds.

Molecular dynamics simulations analysis

MD simulation enables the atomic-level characterization of numerous biomolecular processes, such as analyzing the stability of protein–ligand interactions associated with activation and deactivation of various molecular pathways. The stability analysis of all top-scored inhibitors was carried out by the Desmond tool.

The simulation study of complexes was performed for 10,0000 ps after the system minimization with 2000 iterations. The observed root mean square deviation (RMSD) and root mean square fluctuation (RMSF) is shown in Table 4.

Table 4 Observed RMSD and RMSF values of top five screened inhibitors

For the ZINC01033978 compound, the RMSD indicates good stability in the NS3 Hel binding site, suggesting this molecule may represent a potential NS3 Hel inhibitor. The amino acid residues Val366, Ser601, Asp602, and Ser365 were involved in hydrogen bond interactions, and Pro292, Lys389, Met414, Cys429, Leu430, Leu442, His486, Pro542 residues form the hydrophobic interactions with ZINC01033978 (Fig. 4). The hydrophobic interactions involved a hydrophobic amino acid and an aromatic or aliphatic group on the ZINC01033978. The residue Glu392 was found to be involved in polar or ionic interactions between two oppositely charged atoms within 3.7 Å.

Fig. 4
figure 4

The protein RMSD, RMSF, and protein–ligand contacts diagram of complex 5JMT with ZINC01033978

Similarly, the compound ZINC00114948 was found to be able to form hydrogen bonds with Asp540, Lys389 and Ser608 (Fig. 5). Therefore, the residue Asp540 plays an essential role in the binding mechanism due to hydrogen bonds forming with sidechain and backbone residue. Moreover, ZINC00114948 interacts with Leu442, Pro432 and Ala605 residues through hydrophobic interactions suggesting this compound is also an attractive lead molecule towards ZIKV NS3 Hel inhibitors. These results are consistent with molecular docking studies. Therefore, docking results are found validated through MD simulation calculations.

Fig. 5
figure 5

The protein RMSD, RMSF, and protein–ligand contacts diagram of complex 5JMT with ZINC00114948

As shown by MD simulation of compounds ZINC01047185, ZINC12340356 and ZINC01034603, the system was found to be perfectly equilibrated and acceptable for small, globular proteins (Supplementary Figs. S5, S6, S7). The hydrophobic interactions enhance the binding affinity; hence, the residue Pro542, Cys429, Pro432, Leu442 and Ala605 consist of the NS3-Hel binding pockets hydrophobic region.

Pharmacokinetic, toxicity parameters and bioavailability analysis

Lipinski’s rule of five was one factor used to assure the drug-like (oral) pharmacokinetics profile of the ligands. The various physicochemical and significant therapeutic descriptors were calculated through the Qikprop tool and depicted in Tables 5 and 6, respectively. All top-scored inhibitors show no violation of Lipinski’s rule of five and have been found in an acceptable range of all pharmacokinetic parameters.

Table 5 Principle pharmacokinetic parameters of top five screened inhibitors
Table 6 Therapeutic significant parameters of top five screened inhibitors

The toxicity parameters of all top-scored inhibitors were evaluated through OSIRIS property explorer. None of the toxic profiles has been found in most selected inhibitors except ZINC00114948 and ZINC01047185, which shows a high reproductive effect. All evaluated parameters are listed in Table 7.

Table 7 Toxicity parameters of the top five screened inhibitors

The boiled egg diagram (Fig. 6) expresses brain penetration and human intestinal absorption (HIA). The yellow region (yolk) and the white region (albumin) denote the area where there is a high chance of brain penetration and human intestinal absorption, respectively. Only inhibitor ZINC00114948 falls within the yellow region to imply a high probability of brain penetration and being absorbed by the gastrointestinal tract. In contrast, compounds ZINC01033978, ZINC01047185, ZINC12340356, and ZINC01034603 fall in the white region signify their high chances of being absorbed by the gastrointestinal tract only. We also generated the bioavailability radar chart through the SwissADME tool, which gives a quick look at the drug-likeness of top scored inhibitors [38, 39]. The predicted bioavailability score is shown in Table 8.

Fig. 6
figure 6

The EGG-BOILED model for top scored compound

Table 8 Synthetic accessibility and bioavailability score of top five screened inhibitors

In bioavailability radar charts, the pink area represents the most favorable area for each property. FLEX, LIPO, SIZE, and POLAR all refer to rotatable bonds, logP, molecular weight, and polar surface area, respectively. The inhibitor ZINC01033978 was the best of the other selected inhibitors as it falls within the acceptable range for all parameters. The bioavailability radar charts are shown in supplementary data (Fig. S8).

Estimation of synthetic feasibility

According to Ertl and Schuffenhauer, “the synthetic accessibility score is in the range of 1 for simply synthesizable and 10 for challenging to synthesize” [40]. All top-scored inhibitors show a synthetic feasibility score of around 3.0, indicating that all are easily synthesizable (Table 8). Furthermore, we forward to plan future synthesis with the best plausible route.

DFT calculations

Two top-scored compounds were chosen for energy calculations. The electronic interactions of the compounds play a vital role in biological effects. Therefore, the position of LUMO–HOMO is responsible for the electron transfer in reaction. Properties like vibrational frequencies, HOMO energy, LUMO energy, ESP, and interaction strength were calculated using Jaguar (Table 9) (Figs. 7, 8).

Table 9 Summary of calculated electronic properties of hit molecules
Fig. 7
figure 7

DFT results for ZINC01033978: A HOMO map B LUMO map C Interaction strength map D Electrostatic potential map

Fig. 8
figure 8

DFT results for ZINC00114948: A HOMO map B LUMO map C Interaction strength map D Electrostatic potential map

Conclusion

In conclusion, a structure-based virtual screening and molecular dynamics simulation study were carried out to discover novel, potential NS3 helicase protein inhibitors. Due to the most imperative viral replication component, targeting NS3-Hel protein is quite an attractive approach to treat ZIKV. The active site of the NS3 helicase protein consists of 42 amino acid residues. The top five inhibitors were selected based on GlideGscore. The highest scoring inhibitor ZINC01033978 (7.55 kcal/mol) exhibits three hydrogen bonds with amino acid residues Ser601, Val366, and Asp602. The docking analysis suggests that the NS3-Hel protein binding pocket is consists of hydrophobic (Phe609, Val599, Leu541, Ala605, Leu430, Leu442, Pro432, Pro292, Val543, Pro542, and Pro364) part, H-bond (Asp540, Lys289, Ser601, Asp602, Val366, Arg598, Asp291, Lys389, Ser293, and Hid486) part and π-π stacking residues (Arg598). The RMSD of NS3-Hel protein with inhibitors was found in the range of 1.65 to 2.50 Å, indicating the simulation has equilibrated, and changes are perfectly acceptable for small, globular proteins. Hence, both docking and molecular dynamics simulation investigations revealed that the top-scored inhibitors have a strong binding affinity towards the NS3 helicase protein. Thus, screened compounds as potential inhibitors against the NS3-Hel ZIKV. The pharmacokinetic analysis revealed that top-scored inhibitors have no violation of Lipinski’s rule of five, confirms drug-likeness ability. The toxicity parameters are also found in the acceptable range except for the reproductive effect. All top-scored inhibitors are easily synthesizable, having the least synthetic accessibility score. Experimentally, the need to validate the molecular modeling results reported here with in vitro and/or in vivo inhibition evaluation is acknowledged. Still, due to lack of funding, work is limited. Therefore, investigated inhibitors could be provided the lead to the targeting NS3-Hel protein for the ZIKV treatment.