Introduction

Lymphatic Filariasis (LF) also known as elephantiasis and Onchocerciasis (river blindness) are common Neglected Tropical Diseases (NTD), which are caused by some parasitic nematode worms (Sightsavers 2013). LF is caused by filarial worms like Wuchereria bancrofti, Brugia timori and Brugia malayi, which are been transmitted by mosquitoes, while Onchocerca volvulus is the causative agent for onchocerciasis, which is transmitted from one person to another by blood-feeding black flies (Bakowski et al. 2019). Elephantiasis alone is responsible for not less than 2.8 million disabilities globally (Jacobs et al. 2019). The global program intended to eliminate these filarial diseases started far back through the Mass Drug Administration (MDA) of anti-filarial such as ivermectin, albendazole, and diethylcarbamazine, either as a dual (annual to bi-annual) or as triple-drug (once every 3 years) treatment (Jacobs et al. 2019; Carter et al. 2020). However, it became unlikely that the MDA regimen will be adequate to eliminate these filarial diseases in all endemic areas, majorly due to their inability to kill the macrofilariae (Lakshmi et al. 2010). Given the current scenario, therefore, a macrofilaricidal agent is required to kill worms to reduce both diseases’ elimination time frames (Sashidhara et al. 2014).

Fortunately, one unique characteristic of these filarial worms is their symbiotic co-existence with a known bacterium referred to as Wolbachia (Slatko et al. 2010). In the search for new anti-filarial drugs, some researchers have chosen the option of targeting Wolbachia, which past research has shown that its elimination from the host filarial nematodes leads to antifilarial effects with the reduction of adult worm’s lifespan (Bouchery et al. 2013; McGillan 2017). Although the anti-bacteria drug, doxycycline has been used clinically for the treatment of filarial diseases over the years, the treatment method is not efficient enough for use in mass administration including requirements for long treatment periods (4–6 weeks) as well as contraindications in pregnancy and children (McGillan 2017). Therefore, advances in the development of new anti-Wolbachia agents with short treatment periods and reduced complications are necessary.

Some compounds of the pyrazolopyrimidine class were earlier reported to show a variety of bioactivities such as anti-viral agents, anti-malarial, anti-depressants, anti-tuberculosis, and kinase inhibitors (McGillan et al. 2021; Ugbe et al. 2022a). However, certain side effects have been associated with most of the drugs in this class such as hypnotic and/or anxiolytic effects. To further explore the anti-filarial effect of the pyrazolopyrimidine compounds, McGillan (2017) synthesized several pyrazolopyrimidine derivatives and reported their inhibitory activities against Wolbachia infected insect cells (Aedes albopictus, C6/36). Notable targets of Wolbachia pipientis include Oxidoreductase α-DsbA1 (PDB ID: 3F4R), OTU deubiquitinase (6W9O), thiol-disulfide exchange protein alpha-DsbA2 (6EEZ), and Cytoplasmic incompatibility factor CidA (7ESX) amongst others.

Computer-aided drug design plays a crucial role in the discovery of new drug molecules in pharmaceutical design, drug metabolism, and medicinal chemistry. It saves time, and cost and tends to be highly effective for the evaluation of a large virtual database of chemical compounds (Adeniji et al. 2020). Molecular docking simulation computer-aided screening method which probes the binding of ligands in the active sites of the protein target using a valid docking tool (Ibrahim et al. 2020). Pharmacokinetics analysis on the other hand is important in the pre-clinical study of new drug compounds to ascertain how such drug compounds affect the living organism when administered. Some of the most important pharmacokinetic properties to be determined during pre-clinical testing include Absorption, Distribution, Metabolism, Excretion, and Toxicity (ADMET) (Lawal et al. 2021; Ibrahim et al. 2021). Physico-chemical properties such as molecular weight, Topological Polar Surface Area (TPSA), lipophilicity, hydrogen bond donors, and hydrogen bond acceptors amongst others are necessary to predict a drug’s likelihood of being orally bioavailable (Lipinski et al. 2001). This work focuses on the virtual molecular docking screening of a series of Fifty-two (52) pyrazolopyrimidine derivatives against Four (4) Wolbachia targets, 3-D QSAR modeling, Molecular Dynamics (MD) simulation, and prediction of pharmacokinetic properties of some selected analogs, to find a more potent drug molecule which would be suitable for the treatment of filarial diseases.

Materials and methods

Data acquisition

A series of Fifty-two (52) pyrazolopyrimidine derivatives with reported bioactivities (EC50 in nM) against Wolbachia-infected insect cells (Aedes albopictus, C6/36), were sourced from the literature (McGillan 2017). The various bioactivity (EC50) values were separately converted to pEC50 using Eq. (1) (Ugbe et al. 2022a). The molecular structures of the various pyrazolopyrimidine derivatives were shown in Online Resource 1.

$${pEC}_{50}={-log}_{10}\left({EC}_{50} \times {10}^{-9} \right)$$
(1)

Ligand preparation

The molecular structures of all the compounds were drawn using the ChemDraw Ultra, saved as MDL molfile format, and thereafter imported separately onto the Spartan ’14 Graphical User Interface while enabling the auto conversion of 2-D models to 3-D. The imported molecules were initially subjected to energy minimization and then saved in Spartan file format. The resulting structures were then fully optimized first by using Molecular Mechanics Force Field (MMFF) and thereafter Density Functional Theory (DFT) with Becke’s three-parameter read-Yang-Parr hybrid (B3LYP) option and utilizing the 6-31G basis set. The optimized structures were then saved as PDB and SD file formats for subsequent use in molecular docking and 3-D QSAR studies respectively (Wang et al. 2020; Ugbe et al. 2021).

Preparation of the protein receptors

The crystal structures of Four (4) Wolbachia target proteins (PDB codes: 3F4R, 6EEZ, 6W9O, and 7ESX) were retrieved from the RCSB Protein Data Bank in PDB file format, and then prepared separately using the Molegro virtual docker by eliminating water molecules, cofactors and co-crystallized ligands contained within the protein structures (Ugbe et al. 2022b). The various receptors used in the virtual docking screening were described in Table 1.

Table 1 Description of enzymes used for the docking screening

Molecular docking-based screening

Molecular docking investigation was performed separately between the Four (4) different receptors of Wolbachia pipientis and all 52 compounds, including the reference drug (Doxycycline) using the Auto Dock Vina of PyRx v software tool (Ugbe et al. 2021). The screening was conducted to ascertain the most active pyrazolopyrimidine compounds against the various protein targets. PyRx calculates the binding affinities of the receptor-ligand interactions which are necessary to describe how fit the molecules bind to the target protein. A more negative binding affinity will indicate a greater chance of the potential drug molecule to initiate protein biochemical action/reaction (Kumar et al. 2016).

Evaluation of pharmacokinetic properties

Predicting pharmacokinetics properties plays a critical role in the early stage of drug discovery. This is because only molecules which demonstrate good ADMET and drug-likeness properties reach the pre-clinical research phase (Ugbe et al. 2021). Therefore, Four (4) pyrazolopyrimidine analogs (14, 23, 26, and 29) having the highest binding scores with 6EEZ, 7ESX, 6W9O, and 3F4R respectively were subjected to drug-likeness and ADMET tests using two online web servers; http://www.swissadme.ch/index.php and http://biosig.unimelb.edu.au/pkcsm respectively. Lipinski’s rule of five (RO5) also called the Pfizer rule is a well-established provision for determining the oral bioavailability of a given compound (Lipinski et al. 2001; Lawal et al. 2021). Consequently, these analogs were subjected to the RO5 criterion to ascertain their oral bioavailability.

Molecular dynamics simulation and MM/GBSA calculation

Molecular dynamics (MD) simulation of 7ESX_23 complex was performed using the combined approach of Chemistry at Harvard Macromolecular Mechanics (CHARMM) force field, Nano-scale Molecular Dynamics (NAMD), and Visual Molecular Dynamics (VMD). The CHARMM-GUI, an established web-based platform that utilizes the CHARMM force field, was used to generate the input files for the simulation by NAMD (Lee et al. 2016). The periodic boundary condition was utilized while fitting the system into a cubic water box for solvation. The protein was solvated and neutralized explicitly in an aqueous solution of 0.10 M concentration of potassium chloride salt (Edache et al. 2022). To stabilize the complex structure and to ensure steric clashes will not result, energy minimization was performed. The resulting system of ions and solvent was then equilibrated to stabilize the system at a temperature chosen for the simulation (310.15 K) at a constant number of particles, volume, and temperature (NVT ensemble), and to stabilize the pressure by keeping the number of particles, pressure, and temperature (NPT ensemble) constant using 100ps time frame (Muniba 2019). MD was then performed on the resulting system for 1ns (500,000 steps), while the results were visualized using VMD and the Biovia discovery studio, all on an HP laptop computer; Processor (Intel(R) Core(TM) i5-4210U CPU @ 1.70 GHz 2.40 GHz), Installed RAM (8.00 GB), System type (64-bit operating system, x64-based processor), Edition (Windows 10 Home Single Language), Version 21H2. A similar procedure was described elsewhere (Edache et al. 2022). Additionally, MolAICal software was used to compute the ligand-binding affinity by Molecular Mechanics Generalized Born Surface Area (MM/GBSA) method based on the resulting MD log files obtained with NAMD (Bai et al. 2020). MM/GBSA is estimated using Eqs. (2)–(4) (Bai et al. 2020).

$${\varDelta G}_{bind}=\varDelta H-T\varDelta S\approx {\varDelta E}_{MM}+{\varDelta G}_{sol}-T\varDelta S$$
(2)
$${\varDelta E}_{MM}={\varDelta E}_{internal}+{\varDelta E}_{ele}+{\varDelta E}_{vdw}$$
(3)
$${\varDelta G}_{sol}={\varDelta G}_{SA}+{\varDelta G}_{GB}$$
(4)

Where, ∆EMM and −T∆S represent respectively the gas phase MM energy and conformational entropy. ∆EMM contains electrostatic ∆Eele, van der Waals energy ∆Evdw and ∆Einternal of bond, angle, and dihedral energies. ∆Gsol is the solvation free energy equal to the sum of the nonelectrostatic solvation component ∆GSA and electrostatic solvation energy ∆GGB.

3 – D QSAR modeling

The alignment of molecular structures plays a critical role in 3D-QSAR modeling (Al-Attraqchi and Mordi 2022) as it strongly determines the predictive accuracy and statistical quality of any given 3D-QSAR model (ElMchichi et al. 2020). Different alignment methods have been reported previously such as atom-based, docking-based, pharmacophore-based, and co-crystallized conformer-based alignments amongst others (Zhang et al. 2020; Al-Attraqchi and Mordi 2022). In this study, the atom-based alignment was adopted using the Open3DAlign (O3A) tool. The atom-based method attempts to match the atoms of the various structures to be aligned with those of the template structure, based on the atom’s properties such as the partial charge.

The aligned structures were used for building the 3-D QSAR model using the Open3DQSAR software (Zhang et al. 2020). The Comparative Molecular Field Analysis (CoMFA) which is concerned with steric and electrostatic fields’ contributions was studied (ElMchichi et al. 2020). A dataset of 52 compounds was divided into a training set and a test set of 36 and 16 molecules respectively, i.e. percentage ratio of 70:30. The steric and electrostatic Molecular Interaction Fields (MIFs) analysis was carried out on the aligned compounds placed within a 3-D cubic lattice of grid size 1.5 Å and a 5.0 Å out gap (Tosco and Balle 2011). Variables pretreatment was carried out as follows; energy cut-off (30.0 kJ/mol), elimination of variables having constant or near-constant values, and standard deviation cut-off (level = 2.0) (Al-Attraqchi and Mordi 2022). The Un-informative Variable Elimination-Partial Least Square (UVE-PLS) was used to build the statistical model and for generating the steric and electrostatic contour plots (Edache et al. 2022). The resulting model was then cross-validated using the Leave-One-Out (LOO), Leave-Two-Out (LTO), and Leave-Many-Out (LMO). The steric and electrostatic contour maps were visualized on Maestro v. 12.3.

Results and discussion

Virtual docking screening

The results (binding affinities) of the docking simulation conducted between the Four (4) receptors of Wolbachia pipientis and the various pyrazolopyrimidine derivatives, as well as the reference drug (Doxycycline), were reported in Table 2.

Table 2 Summary of binding affinities of interactions between pyrazolopyrimidine derivatives and different Wolbachia pipientis receptors used for the target fishing

. It can be observed from Table 2 that no particular ligand best interacted with all the studied receptors combined. That is, a ligand may bind very strongly with a given receptor but shows a weak interaction with another receptor. However, Four (4) ligand-protein interaction pairs with the greatest negative binding scores were identified in the order; compound 23 with 7ESX (-10.2 kcal/mol)> 14 with 6EEZ (− 9.0 kcal/mol)> 29 with 3F4R (− 8.0 kcal/mol) > 26 with 7ESX (− 7.7 kcal/mol). Also, no ligand-protein interaction pair involving the reference drug (Doxycycline) was identified that could compare with the identified interaction pairs, except doxycycline_7ESX complex with a binding score of − 7.7 kcal/mol equal to that of 26_7ESX complex. Therefore, the virtual screening was effective and subsequent discussion shall be based on these more active molecules (Table 3).

Table 3 Molecular structures of some selected pyrazolopyrimidine analogs

The pharmacological interactions between the receptors’ amino acid residues and the selected compounds (14, 23, 26, and 29) as well as the reference drug (Doxycycline) were summarized in Table 4, while the 2D and 3D views of the binding interactions as adapted from the Discovery Studio Visualizer were shown in Figs. 1, 2, 3, 4, and 5. This was to provide insight into the mode of binding of these ligands with the active sites of the various target proteins.

Table 4 Predicted binding interaction profiles of 14, 23, 26, 29, and Doxycycline with the receptors

These compounds were said to interact very adequately with the respective target receptors as shown by the presence of hydrogen bonding (H-bond), hydrophobic interactions, and in some cases electrostatic interactions. (Table 4). However, more interactions were visible from the binding profile of compound 23 with 7ESX, involving a total of Four (4) conventional H-bonds, One (1) π-donor H-bond, One (1) π-anion electrostatic interaction, and up to Eight (8) hydrophobic interactions. Four groups can be identified in the molecular structure of compounds 23 as pyridine, pyrimidine, pyrazole, and benzoate groups, all interacting significantly with the receptor’s amino acid residues. The carbonyl group (C = O) oxygen of the benzoate group formed 2 H-bonds with LYS-232 at interaction distances of 2.68 and 2.91 Å. The remaining 2 conventional H-bonds were formed by GLU-188 with the pyridine group and the linker amine group at 2.01 Å and 3.05 Å respectively. Also, the π-donor H-bond was between ASN-77 and the pyrazole π-system at 2.96 Å. Visible were the π-anion interactions between the π-electrons systems of GLU-191 and the benzoate group at 4.06Å. Several hydrophobic interactions were formed including π- π T shaped with PHE-228 (5.22 Å), π-sigma with ARG-74 (3.57 Å), π-alkyl with TRP-37 at 5.39 Å, LEU-75 at 5.44 Å, and ARG-74 at 4.30 Å and 5.49Å, and alkyl interactions with ARG-36 and LEU-75 at distances of 4.95 Å and 4.64 Å respectively. It is important to note that no unfavorable interaction was seen in the 23_7ESX binding interaction profile (Fig. 1). The complex involving the reference drug, doxycycline_7ESX on the other hand showed more H-bonding interactions than 23_7ESX, consisting of a total of Six (6) conventional H-bonds and Three (3) Carbon-H-bonds. Only One (1) hydrophobic interaction was however visible. More so, an unfavorable donor-donor clash with VAL-250 was formed (Fig. 5). Therefore, compound 23 exhibited stronger and safer binding interactions with the Cytoplasmic incompatibility factor CidA than the reference drug (doxycycline)

Fig. 1
figure 1

Binding interaction between 23 and Cytoplasmic incompatibility factor CidA (PDB: 7ESX)

Fig. 2
figure 2

Binding interaction between 14 and Alpha-DsbA2 (PDB: 6EEZ)

Fig. 3
figure 3

Binding interaction between 29 and Alpha-DsbA1 (PDB: 3F4R)

Fig. 4
figure 4

Binding interaction between 26 and OTU deubiquitinase (PDB: 6W9O)

Fig. 5
figure 5

Binding interaction between doxycycline and Cytoplasmic incompatibility factor CidA (PDB: 7ESX)

Evaluation of pharmacokinetic properties

Drug-likeness analysis and ADMET study were conducted on the Four (4) compounds (14, 23, 26, and 29) to ascertain their oral bioavailability. The results of both investigations were presented in Tables 5 and 6 respectively, while Fig. 6 shows their Boiled Egg’s representation.

Table 5 Predicted drug-likeness properties of some selected pyrazolopyrimidine derivatives
Table 6 Predicted ADMET properties of some selected pyrazolopyrimidine derivatives
Fig. 6
figure 6

The boiled-egg representation of compounds 14, 23, 26, and 29

Lipinski’s RO5 for oral-bioavailability has been widely applied in the discovery of new drug molecules (Ugbe et al. 2022b). It asserts that a drug molecule may likely not be orally bio-available when it has Hydrogen Bond Donors (HBD) of greater than 5, Hydrogen Bond Acceptors (HBA) > 10, Molecular Weight (MW) > 500, and lipophilicity (MLOGP > 4.15 or WLOGP > 5) (Lipinski et al. 2001). Whenever a molecule passed at least three of the four provisions of the RO5, it is said to comply with Lipinski’s rule for oral bioavailability (Lawal et al. 2021). Table 5 showed that all the tested pyrazolopyrimidine derivatives passed the drug-likeness test (Lipinski RO5) by showing no violation. The reported values of Topological Polar Surface Area (TPSA) for the molecules were less than 140 Å2. Also, the values of the synthetic accessibility (SA) scores of these compounds were less than 5.00 (easy portion on a scale of 1–10), suggesting easy laboratory synthesis. The predicted values of the estimated water solubility (Log S) are in the range of − 4 > Log S > − 6, indicating these molecules are moderately soluble. The compounds were equally estimated to be free from pains and brenk alerts.

The estimated ADMET properties reported in Table 6, showed a very high Human Intestinal Absorption (HIA) (greater than 90%) for all tested compounds. Skin permeability is a key factor in transdermal drug delivery development. Values of skin permeation constant LogKp > − 2.50 indicates poor skin permeability. As a result, the various compounds tested showed LogKp values < − 2.50, connoting good skin permeability. Drug molecule penetration through the Blood-Brain Barrier (BBB) and Central Nervous System (CNS) comes with certain criteria. To enable a drug molecule penetrates the BBB and CNS readily, the logarithmic ratio of brain to plasma drug concentration (logBB) must be > 0.3 and the blood-brain permeability-surface area product (logPS) be > − 2 respectively. Consequently, only 14 with logBB of 0.325 readily penetrate the BBB as also indicated by its location within the boiled egg’s yolk shown in Fig. 6, while the various compounds are non-CNS permeable. Also, 23, 26, and 29 were located in the Boiled Egg’s white, an indication that they were predicted to be passively absorbed by the gastrointestinal tract.

Furthermore, some group of enzymes called cytochrome P450 enzymes are important in the body to facilitate drug metabolism and to help in their excretion. The two major isoforms enhancing drug metabolism, CYP-34 A and CYP-2D6 were tested. The tested molecules are not substrates and inhibitors of CYP2D6 but are both substrates and inhibitors of CYP3A4, an indication of a well-moderated metabolic process. Figure 6 showed that only compound 23 was predicted not to be effluated from the central nervous system by P-glycoprotein. P-glycoprotein acts as a biological barrier by extruding toxins and xenobiotics, including drugs out of cells. The extent of drug removal from the body is determined by the drug’s total clearance. The range of values of total clearance for all the tested molecules is good. Additionally, all the compounds except 14 showed no AMES toxicity, implying that they are non-mutagenic and cannot act as carcinogens. Also available in Table 5 is the Maximum Recommended Tolerated Dose (MRTD) predicted for the various molecules. MRTD value of ≤ 0.477 log (mg/kg/day) is considered low, while a value > 0.477 log (mg/kg/day) is considered high. The overall drug-likeness and ADMET properties of the selected compounds showed good pharmacokinetic profiles, except compound 14 which showed positive AMES toxicity. Therefore, these molecules could be considered potential drug candidates for the treatment of filarial diseases.

Molecular dynamics simulation

To analyze the dynamics of the protein-ligand interaction, MD simulation was performed on the best protein-ligand interaction pair (23_7ESX complex) for 1ns (1000 ps) of chemical time (500,000 iterations). The results of this simulation as plots of Root-Mean-Square Deviation (RMSD), Root-Mean-Square Fluctuation (RMSF), Solvent Accessible Surface Area (SASA), and Radius of gyration (Rg) versus the time in ps were presented in Figs. 7, 8 and 9, and 10 respectively.

Fig. 7
figure 7

The plot of RMSD versus time for MD simulation of 23 with 7ESX

Fig. 8
figure 8

The plot of RMSF versus time for MD simulation of 23 with 7ESX

Fig. 9
figure 9

The plot of SASA versus time for MD simulation of 23 with 7ESX

Fig. 10
figure 10

The plot of radius of gyration versus time for MD simulation of 23 with 7ESX

The average RMSD value was estimated as 1.6801 Å which showed that the protein-ligand complex deviated only a little from its original conformation during the trajectory. The deviation was maximum during the first 100ps of the simulation, after which it drops and tends to remain slightly unstable until a further drastic drop in the RMSD at 1000 ps, an indication that the system was fast attaining stability and nearing equilibrium (Edache et al. 2020). RMSF is more like a calculation of the flexibility or the extent of movement of individual residue during a simulation. As seen from Fig. 8, the RMSF tends to drop as the simulation nears 1000ps, a further indication that the system was fast attaining stability. The SASA is simply the surface area that is in contact with the solvent in which the complex resides. From Fig. 9, it can be observed that the SASA only fluctuates slightly between 10.50 Å2 and 11.6 Å2 during the trajectory, an indication of stability (Edache et al. 2022). The Rg is the measure of the degree of compactness of a protein during the trajectory. Decreasing Rg indicates reducing residues’ flexibilities and more stability for the protein. Throughout the trajectory, the Rg varies between 27.283 Å and 28.365 Å which is equivalent to a difference of approximately 1.0 Å for the complex studied, connoting slight changes in the protein compactness as the simulation progresses, and therefore means the stability of the complex. Furthermore, it will not be complete without inspecting the simulated complex for possible protein-ligand interactions. As a result, the simulated complex was visualized using the Biovia discovery studio and the resulting binding interaction of 23 with the active site of 7ESX is presented in Fig. 11.

Fig. 11
figure 11

Binding interaction between 23 and Cytoplasmic incompatibility factor CidA (PDB: 7ESX) after MD simulation

The binding interaction pattern of the simulated complex (Fig. 11) deviated significantly from that of the non-simulated complex (Fig. 1) as several interactions majorly the hydrophobic interactions, electrostatic, and π-donor H-bond were lost. However, a significant number of important interactions were visible including Two (2) conventional H-bonding with SER-187 and ASN-77 at interaction distances of 2.32 Å and 1.94 Å respectively, Two (2) carbon H-bonding with ASN-77 and LEU-75 at 2.96 Å and 2.75 Å respectively. Others are hydrophobic interactions with ARG-36 and ARG-74 at 4.12 Å and 4.66 Å respectively. Additionally, no unfavorable steric bumps or clashes were visible. Furthermore, the result of binding free energy (MM/GBSA) computed for 23_7ESX by MolAICal is shown in Table 7.

Table 7 Binding free energy parameters of 23_7ESX complex

The negative value of the estimated binding free energy (MM/GBSA) of the complex (− 60.6552 kcal/mol) indicates the favorability of the ligand-protein binding. Also, Vander Waals energy (− 50.0611 kcal/mol) contributed most to the binding free energy of the complex, connoting that Vander Waal/hydrophobic interactions played a crucial role in the binding process (Xu et al. 2019). It can therefore be inferred that compound 23 binds readily with the Cytoplasmic incompatibility factor CidA even within a dynamically perturbed system, and hence could be considered as a potential drug candidate for the treatment of filariasis.

3 – D QSAR modeling

Molecular structural alignment represents a key factor in ascertaining the predictive strength of a built 3-D QSAR model. Figure 12 (a–b) shows the molecular structure of the alignment template (compound 30) and the aligned structures as obtained from the super-imposition of the remaining 51 molecules on the template. The UVEPLS approach was used to develop the model. Some significant statistical parameters calculated for the model were presented in Table 8. Reported in Table 9 were the experimental pEC50, predicted pEC50, and their residuals together with their O3A scores. Additionally, a plot showing the correlation between predicted and experimental activities for both training and test sets was obtained and presented in Fig. 13. Also, the CoMFA model equation was summarized graphically as 3D contour maps as shown in Figs. 14 (a–b) and 15 (a–b).

Fig. 12
figure 12

Molecular alignment of structures for the QSAR modeling a Alignment template (compound 30 with the highest O3A_Score of 9057.78); b All structures aligned

Fig. 13
figure 13

Correlation between predicted and experimental pEC50 for training and test sets

Fig. 14
figure 14

Steric field contour maps of compound 23 a Blue contours represent regions of favorable steric bulk; b Red contours showing regions of unfavorable steric bulk

Fig. 15
figure 15

Electrostatic field contour maps of compound 23 a Green contours showing regions of unfavorable high electron density or favorable to electron-withdrawing groups; b Yellow contours represent regions favored by high electron density or unfavorable to electron-withdrawing substituents

The alignment process involves an early step that provided all the 52 compounds the opportunity of being chosen as the alignment template based on the compound with the highest Open3DAlign (O3A) score. The O3A scores of the various compounds were included in Table 8. Compound 30 had the highest O3A score of 9057.78 and hence was selected as the template upon which the remaining structures were superimposed. The model’s statistical parameters were computed for Five (5) Principal Components (PC) amongst which the fifth PC (PC = 5) performed relatively better with R2 value of 0.9425, SDEC = 0.1446, and F-test = 98.282. The statistical parameters available in Table 9 were those associated with PC 5. The predictive strength of the regression models on new datasets of compounds can be estimated by cross-validation (Grohmann and Schindler 2008). A cross-validated coefficient of correlation (Q2) ≥ 0.50 indicates a good QSAR model. Here, three (3) types of Q2 were calculated; Leave-one-out (LOO), Leave-two-out (LTO), and Leave-many-out (LMO), together with their associated Standard Error of Prediction (SDEP). Only Q2LOO (0.5019) passed this criterion and was reported alone.

Table 8 Statistical parameters of the built model
Table 9 Observed pEC50, predicted pEC50, residuals, and Open3DAlign scores of the various pyrazolopyrimidine derivatives

A linear correlation between the CoMFA descriptors (independent variables) and the activity values (dependent variables) was established by the PLS analysis method. The lower residual values between the predicted and observed activity values (Table 8) shows a strong predictive strength of the model. This was supported by the clustering of points along the lines of best fit in the plots of predicted pEC50 versus the experimental pEC50 (Fig. 13). This observation was supported by the conformation of the model to the Golbraikh and Tropsha criteria (Table 5) (Roy et al. 2016). The CoMFA QSAR equation is summarized graphically as a 3D contour map, which shows the regions within the molecules’ 3-D structural space where steric and electrostatic fields are associated with extreme values. The underlying principle behind CoMFA is that variations in the shape and strength of non-covalent interaction fields surrounding the molecules, such as steric or electrostatic fields can be related to changes in binding affinities (Kakarla et al. 2016). Therefore, molecular fields are key factors in binding affinity. The steric and electrostatic field contributions were 50.93% and 49.07% respectively (Table 9).

From the steric field contour maps available in Fig. 14 (a–b), the red contours represent regions of unfavorable steric bulk, while the blue contours show regions of favorable steric bulk. Regions in which steric bulk may reduce activity or affinity of the compound include positions 3 and 4 on the pyridine group, position 5 on the pyrimidine group, and position 2 on the benzoate group (Fig. 14b). For example, substituting the methyl group on position 5 of the pyrimidine group with a more bulky group like ethyl, isopropyl or tert-butyl could reduce the activity or binding affinity of the compound. On the other hand, more steric bulk favorable regions were identified (Fig. 14a), which include position 6 in the pyrimidine group, position 6 in the pyridine group, and position 2 in the benzoate group. This implies that the introduction of bulky substituent groups at these positions will improve the inhibitory activity of the molecule. From the electrostatic field contour maps available in Fig. 15(a–b), yellow contours represent regions favored by high electron density or unfavorable to electron-withdrawing substituents, while the green contours represent regions of unfavorable high electron density or favorable to electron-withdrawing groups. Five (5) regions in which the introduction of electron-withdrawing groups could reduce the inhibitory activity or binding affinity include all positions in the pyridine group, positions 5 and 6 in the pyrimidine group, position 2 in the pyrazole group, and the carbonyl group of the benzoate moiety (Fig. 15b). Also, regions of unfavorable high electron density were visible around the benzene ring system of the benzoate group and between the linker amine group and the pyrazole hetero atom. These regions need not be too electron-dense, hence electron-withdrawing groups will keep these regions at a low electron density which in turn will enhance the molecule’s inhibitory activity or binding affinity. In general, contour map analysis serves as a guide to designing new molecules with improved potency by adhering to the information encoded in the contour maps.

Conclusion

In this study, a molecular docking-based virtual screening, pharmacokinetics analysis, molecular dynamic simulation, and 3-D QSAR modeling were performed on the pyrazolopyrimidine derivatives. The molecular docking screening was effective as the Five (5) best protein-ligand interaction pairs were identified and ranked as 23_7ESX (– 10.2 kcal/mol) > 14_6EEZ (– 9.0 kcal/mol) > 29_3F4R (– 8.0 kcal/mol) > 26_6W9O (– 7.7 kcal/mol) ≈ doxycycline_7ESX (– 7.7 kcal/mol). The selected analogs (14, 23, 26, and 29) all obeyed Lipinski’s RO5 for oral bio-availability and showed excellent ADMET properties except 14, with positive AMES toxicity. Results of the MD simulation showed the stability of the 23_7ESX complex, exhibiting a favorable ligand-protein binding process with an estimated ∆G binding (MM/GBSA) of – 60.6552 kcal/mol. The 3 – D QSAR (CoMFA) model was developed and found to satisfy the requirement for validation tests with R2 value of 0.9425, Q2LOO = 0.5019, SDEC = 0.1446, and F test = 98.282. The anti-Wolbachia activities of the various compounds were well predicted by the model. The analysis of the steric and electrostatic contour maps could provide a useful guide for the future design of more active analogs. Special emphasis on compound 23 because it appears to be consistent with the various employed validation protocols, being that it possessed the highest binding score, showed excellent pharmacokinetic properties, and binds pharmacologically well with the target protein (7ESX). Therefore, 23 could be considered as a potential filarial drug candidate, and/or template for the design of more prominent Wolbachia inhibitors.