Introduction

G protein coupled receptors (GPCRs), as a large family of transmembrane proteins, have key functions in receiving extracellular signals and physiological function of the body [1]. Therefore, they are important targets in a wide spectrum of diseases [2]. In terms of therapeutic relevance, approximately 500 members of this family have been identified, although only a few of them were structurally characterized [3].

Dopamine receptors, as one group of the GPCRs family, are best known for their regulating roles in essential functions including cognitive processes and emotional status [4]. Adverse activation of dopaminergic pathways takes part in neuropathological disorders such as schizophrenia, Parkinson’s disease, bipolar disorder, attention deficit hyperactivity disorder (ADHD), Huntington’s disease, and Tourette’s syndrome [5]. Among dopamine receptors, D4 is mostly expressed in GABAergic neurons of the cerebral cortex, hippocampus, and substantia nigra [6]. Therefore, D4 receptor plays an important role in cognitive tasks and memory performance and is one of the major goals in treating schizophrenia [7]. Some efforts were based on synthesis of selective ligands against D4 receptor. However, most of these compounds indicated weak affinities toward D4 receptor [8, 9]. ML398 has been presented as a selective D4 antagonist in recent studies [10]. Moreover, some selective ligands exhibited remarkable binding affinity to 5HT1AR ,5HT2AR, and 5HT2BR [11, 12]. A SAR study of selective D4 ligands reported an important arene-cation interaction between the ligands arene moiety and the residue Arg-186. This interaction was found to be critical in designing potent D4 ligands [13]. Also a ligand-based QSAR study for the antagonists of human D4 was reported in the literature [14]. According to these efforts, it is of great importance to get knowledge about D4R three dimensional (3D) structure. Obtaining 3D structure of transmembrane proteins is a difficult task as they are embedded in the lipid membrane and extracting adequate amount for X-ray crystallography is hard to achieve [15]. In addition, lipid membrane can affect protein conformation so that obtaining a true conformation by empirical method is so challenging [16]. Despite their importance, transmembrane protein structures constitute only 1 % of the Protein Data Bank [17]. Recently, 3D structures of some GPCRs, such as human dopamine D3 receptor (D3R), have been reported by experimental methods [18]. Since dopamine receptors are to some extent similar with regard to their sequences, human D3R structure can be utilized as the main template for homology modeling of other dopamine receptors such as D4R.

More knowledge about the structural features of human D4R could lead to progress in the area of designing potent and selective compounds against this target. A reasonable approach to reach a 3D conformation of the receptor for further binding mode analysis is molecular modeling methods [19]. Homology modeling is a technique able to identify and refine models of the receptor. It can also provide valuable information for further structure based virtual screening studies [20, 21].

In this study, a predictive 3D structure of D4R was proposed by means of homology modeling technique. MD studies on the whole system (protein in DPPC membrane) were also carried out to equilibrate and optimize the protein in physiological conditions [22, 23]. Subsequently, the resulted frames were evaluated for their ability to distinguish between experimentally known ligands and decoys of the receptor using docking approach. The frame with a higher predictive ability was finally proposed as a model for the natural conformation of the receptor at the binding site. Results of this context could possibly lead to structure-based drug design and targeting of this receptor in future studies [24].

Materials and methods

Homology modeling studies

The primary sequence of human dopamine receptor type 4(DRD4, access number P21917) was retrieved from UniProt database at www.expasy.org [25]. The FASTA sequence was subjected to Basic Local Alignment Search Toolbox (BLAST) using BLOSUM62 scoring matrix at NCBI database [26]. Recursive runs of BLAST were repeated for transmembrane regions of the receptor to find more related templates for these regions. Finally, four templates [PDB code: 3pbl (humandopamine D3 receptor in complex with Eticlopride), 3UON(human M2 muscarinic acetylcholine receptor bound to an antagonist), 4GRV(the neurotensin receptor NTS1 in complex with neurotensin), and 4IB4 (the chimeric protein of 5-HT2B-BRIL in complex with ergotamine)] were obtained based on their similarity toward D4 receptor sequence. Subsequently, a multiple alignment file has been generated between the sequence of the receptor and the templates using ClustalX2 software [27]. Some modifications were done in the alignment file by manual insertion and removing of the gap penalties.

For prediction of transmembrane helices, different methods including HMMTOP [28], TMHMM [29], DAS [30], SOSUI [31], TMpred [32], PolyPhobius [33], predict protein [34], APSSP [35], ExPASY [25], and TOPCONS [36] were employed. The final alignment file was used for 3D generation of D4 receptor using modeler 9v12 software [37]. About 100 different structures of the receptor were modeled with regards to energy refinement as implemented in modeller9v12 [38]. An in-house application (Modelface) was used, as interface to Modeller software, for generation and running of the python scripts needed for the experiments. A conservative disulfide bridge was introduced in all models between the residues 108 (extra cellular loop 2) and 185 (extra cellular loop 3). Ranking of the resulted models was done based on DOPE score values generated by modeler software. Afterward, Bioinf, Tasser, and Falc servers were utilized to refine the models and enhance the accuracy of low conserved loop regions [3941]. The model with the highest DOPE score value was submitted to Procheck server for further verification [42].

MD simulation

MD simulation study was performed on the final model obtained from homology modeling studies. The protein PDB structure was embedded in a DPPC membrane at appropriate orientation and location of TM regions. Before embedding the PDB structure, all coordinates were translated to proper locations based on the data of TM regions. Different algorithms have been presented to equilibrate a receptor inside lipid membrane. A very useful and convenient method with less consumption of time during equilibrium is called InflatGRO [43]. Therefore, to equilibrate the resulted system of lipid-protein, InflateGRO perl script was used [43]. At first, lipids and protein were placed on an expanded grid and then shrinking process was done consecutively to reach the favorable density area per lipid for DPPC membranes (62.9–64 Å2) [44]. GridMAT-MD_v2.0 perl script was used to calculate area per lipid of the resulted system [45]. The system was then solvated with spc216 water model and a concentration of 0.15 M NaCl was inserted to simulate the physiological conditions. Energy minimization followed by two short runs of MD (1 ns) was done on the whole system to stabilize it for main run of simulation. Steepest descent minimization integrator with maximum 50,000 cycles and maximum 1000.0 kJ (mol nm)−1 force were used during minimization. NVT and NPT ensembles with restrains on protein backbone were used for two runs of MD. The main run of MD was subsequently carried out and the system and trajectories were subjected to further studies. Leap-frog integrator with LINCS constraint algorithm and particle mesh Ewald coulomb type were set for MD. The more accurate Nose-Hoover thermostat together with Parrinello-Rahman pressure coupling method was used during the main run of MD. Energy and RMSD plot calculations and visualizations of the structures were done using Gromacs and VMD softwares, respectively [46]. All MD experiments were done using GROMOS force field implemented in Gromacs 4.5.3.[47].

Docking studies

Q-site finder was used in order to find the potential binding sites of the receptor [48]. Subsequently, different frames of the receptor were extracted after clustering MD trajectories using principal component analysis. For this purpose, a vector of backbone variation for all residues within a frame was generated by means of VMD software. The resulted matrix for all frames was subjected to singular value decomposition (SVD) algorithm as implemented in MATLAB software (Mathworks Inc). The clusters were obtained based on the plot of the first two PCs with maximum eigenvalues. Two frames inside each cluster were selected for further docking studies. In docking simulation study, 121 small molecule structures with experimental biological activities were obtained from EMBL database as SMILES [49]. Based on their IC50 values, 31 active structures and 90 inactive compounds were considered as ligands and decoys, respectively.

3D generation and energy minimization of the structures were accomplished using Openbabel 2.3.2 [50]. Docking simulations was performed using Autodock4.2 and grid box was centered on Cα atom of Asp115 with 60×60×60 dimensions. Parameters of genetic algorithm were set to ten runs, 2,500,000 energy evaluations and 150 population size.

The docking scores of ligands and decoys were analyzed by receiver operating characteristic (ROC) curves [51]. In addition, all visual inspection of protein ligand complexes was done using VMD software [46].

Results and discussion

To achieve an optimized 3D structure for D4 receptor, homology modeling was used in order to predict the 3D structure of receptor using known similar proteins. The templates used in modeling of the receptor are listed in Table 1. 3PBL (D3) was used as the main template due to its higher identity toward human dopamine D4 receptor (31.54 %). Helices orientations and diversity in loop portion are the main structural difference between D2-like receptors. Length of the sequence in human D4 (467) differs from that of D3 (400). It is also worth noting that 3PBL, 3UON, and 4GRV are nearly 50 % identical to each other in terms of their primary sequence. As described earlier, the homologue templates were found by means of basic local alignment search tool (BLAST) at NCBI and the threading based method as implemented in Tasser server. As displayed in Fig. 1, the sequences were aligned in such a way to cover the maximum area of the sequence for dopamine receptor. The alignment procedure was the key stone in homology modeling of the receptor. Since there was not any favored template covering the first extracellular loop (residue 1–37), Falc server was used to model and refine this part of the protein. Using refinement step led to dramatic enhancement of the final model. Different techniques could be applied for quality validation of the models. In this study, the primary ranking of the generated models was based on DOPE score according to the energy of the final 3D structures. Moreover, the final model needed to be subjected for further validations [42]. Ramachandran plot, as the output of PROCHECK software, assessed the final quality of protein by indicating residues located in disallowed regions and percentage of those in appropriate zone [42]. As displayed in Fig. 2, there are few residues located in disallowed regions and more than 96 % of residues are found in favored or allowed regions of the plot. It is therefore clear that the resulted model meets the required quality criteria for further studies.

Table 1 Percent identity matrix of sequences created by Clustalx2.1
Fig. 1
figure 1

The final alignment used in homology modeling step

Fig. 2
figure 2

Ramachandran plot of human dopamine D4 structure

Natural environment of body and cell membrane would cause conformational change in target protein. One useful method for transformational prediction, which makes a 3D model adapt with its physiological environment, is molecular dynamic simulation. TM regions of the receptor must be embedded in lipid bilayer at proper orientation before running simulation studies of a membrane protein. For this purpose, different methods described in the methodology section were aimed to predict transmembrane (TM) regions of the receptor. As displayed in Table 2, all methods converged reasonably in predicting TM residues of the receptor.

Table 2 Prediction of transmembrane domains of human dopamine D4 receptor

GPCRs and other membrane bound proteins are inserted into phospholipid membranes following a very specific geometry and according to their particular chemistry called “amphipathic character” (polar heads of phospholipids are in contact with water and hydrophobic aminoacids of membrane bound proteins in close contact with the hydrophobic tail of phospholipids). For this purpose the receptor was translated with care at correct orientation inside the membrane. As depicted in Fig. 3, the obtained geometry was in accord with the data of Table 2. The system was solvated in such a way not to permit penetration of water molecules inside hydrophobic TM regions of the receptor. To evaluate the conformational stability of the predicted protein during MD simulation, root mean square deviation (RMSD) plot was calculated on the protein backbone. As shown in Fig. 4, the simulation was going on 50 ns from the starting point. The energy plot revealed a stable status for the resulted system during simulation. In this study, heat map analysis of Cα RMSD was used for showing conformational changes within the system during simulation. The results of heatmap for all trajectories are displayed in Fig. 5. The most conformational fluctuations occurred in residues 1–32 and 240–350 which are located in the first extracellular and third cytoplasmic domains, respectively. In residues 1–32, the N-terminal of the protein is unrestrained so its conformation was prone to variations. In case of cytoplasmic domain (ICL3, residues 240–350), its nature as a long sequence, led to extra flexibility for this region and more conformational fluctuations were therefore anticipated.

Fig. 3
figure 3

The correct orientation of the receptor within lipid bilayer based on the data displayed in Table 2

Fig. 4
figure 4

RMSD of Cα during 50 ns MD simulation (left), Energy plot of MD simulation (right)

Fig. 5
figure 5

Heatmap analysis of trajectories during MD simulation

The conserved disulfide bridge between the two residues 108 in EL2 and 185 in EL3 was also deterministic in molecular dynamic simulation (Fig. 6). A similar disulfide bridge was previously reported in the models of human dopamine D2 receptor as well as in the crystal structure of D3 [18, 52]. However, in D4, disulfide bridge is located between ECL2 (108) and ECL3 (185) but in case of D3 receptor the location is at ECL2 (Cys 181) and helix 3 (Cys103) [18].

Fig. 6
figure 6

Presence of ionic lock and disulfide bridge in D4 model as two structural features of the receptor

A common feature of most class A GPCRs is the presence of ionic lock between “D[E]RY” motif and a negatively charged residue at the front cytoplasmic domain [18, 53]. Since our model was to represent the receptor at antagonist state, it was of great importance to monitor the formation of the aforementioned salt bridge between cytoplasmic region of helices III (Arg133) and IV (Glu389) (Fig. 6). The nature of ionic lock across NH chain of Arg133 and OE of Glu389 was investigated using the distance calculations for these two residues. As seen in Fig. 7, the ionic lock was present at steady state of the simulation.

Fig. 7
figure 7

Distance calculations between Arg133N and Glu389O. Ionic lock could be observed at steady states of the simulation

Docking studies were used as a tool for further validation of the experiments and to find a useful model for future studies of drug design. To avoid possible errors caused by random selection of the representative frames, we used an unsupervised classification approach. Two representative frames were thereafter selected within each cluster of PCA for docking purpose (Fig. 8).

Fig. 8
figure 8

Plot of PC1 and PC2 for trajectories of simulation

According to the data in Fig. 8, PC1, and PC2 were bearing 86.05 and 3.86 % of the data based on their eigenvalues, respectively.

A primary docking of known ligands with antagonist activity together with a series of decoys was performed on representative frames and the resulted dlgs were subjected for calculation of ROC values. The area under the curve for all representative frames are displayed in Table 3. The highest values of ROC were observed in frames 1638 (0.767), 2677 (0.722), and 5337 (0.775), suggesting reasonable predictive ability for these models of human dopamine D4 receptor. Since all used structures were previously evaluated for antagonist activity, the frames were therefore representing the receptor at antagonist state. Although the described frames with reasonable results of ROC values were extracted from different clusters, all of them were bearing the conserved ionic lock moiety needed for antagonist state of the receptor. In an attempt to improve the docking protocol, the procedure was repeated on the described frames using a more time consuming method (run = 100, evaluation = 25,000,000, population size = 300). As displayed in Table 3 and Fig. 9, the modified docking protocol led to a significant increase in ROC value in case of frame 1638 (0.839). Tuning of docking parameters was however futile in the case of the other two frames used in this experiment. In addition to analysis of binding energies for the ligands and receptor complexes, the binding mode of some docked structures was investigated by visual inspection. The proposed binding mode of CHEMBL332154; the selective D4 antagonist [54]; possessed the lowest docking energy, as displayed in Fig. 10. The three main residues which are crucial in the interaction of the ligand and human dopamine D4 receptor are Trp407, Asp115, and Ser122. As displayed in Fig. 10, residue Trp407 could take role in π-π and cation-π interactions with the ligand. In addition, Asp115 and Ser122 reinforced the contacts by formation of ionic bond and hydrogen bond interactions, respectively. Superposition of the best frame of D4 with the crystal structure of D3 at TM regions has revealed and RMSD value of 2.42. The superposed structures are provided in the Supplementary information section in order to avoid lengthening of the paper. All described observations demonstrated that the final optimized model is representative of D4 receptor at antagonist state.

Table 3 Docking results for the representative frames of human dopamine D4 receptor
Fig. 9
figure 9

ROC curve of frame 1638 with an improved docking protocol (run = 100, evaluation = 25,000,000, population size = 300)

Fig. 10
figure 10

Docking result for the interaction of human dopamine D4 receptor with CHEMBL332154

Conclusions

A 3D model for human dopamine D4 receptor was presented based on homology modeling, docking and molecular dynamic simulation studies. Verification of model using ramachandran plot showed that most of these residues are in favored or allowed regions of the plot. The presence of ionic lock as well as a conserved disulfide bridge in our proposed model is also additive to its advantages as a proper 3D structure. The further validation of the model by docking studies showed that this model could mimic the antagonistic state of the receptor with a reasonable ability to distinguish between ligands and decoys (ROC value = 0.839). It could be concluded that the presented 3D model of D4 receptor could donate an appropriate insight for designing new ligands in future studies.