Introduction

Background

Molecular dynamics (MD) simulations are valuable techniques used in chemistry, physics and material science. MD tools enable the natural evolution of molecular systems to be studied over time, allowing the prediction of static and dynamic properties directly from the underlying interactions. MD simulations are frequently used to explore conformational aspects of proteins and nucleic acids, providing valuable information concerning the fluctuations of these systems. Additionally, they also offer a significant scope in protein–ligand interactions, speeding up and lowering the costs of the drug development pipeline by aiding the identification and optimization of lead compounds [1].

In this context, MD can be coupled to molecular docking simulations (which provide a faster initial approach to explore the vast ligand conformational space) for the post-screening of large compound libraries, enabling the analyses of the different target conformations, the refinement of the final complexes structures and the calculation of more accurate ligand binding energies, critical issues in the rational drug-design process. By treating both ligands and proteins as flexible entities, MD simulations are proper tools for studying the induced fitting of the target around the docked ligand, also allowing the assessment of the role of water molecules in these interactions through the calculation of binding free energies [2].

The GROMACS package developed by Lindahl et al. [3] is probably the most common suite used to perform MD simulations with protein–ligand complexes, although the software is still not able to construct a topology for non-amino acid molecules. For building the topology of the ligands, the researcher must use another program such as ACPYPE (AnteChamber PYthon Parser interfacE), a tool based on ANTECHAMBER employed to calculate partial charges, generate topologies and parameters in different formats for non-amino acid compounds. A test with > 17,000 small molecules from the Protein Data Bank attested the robustness of the method, ascribing to ACPYPE an efficiency of > 76% [4].

Although GROMACS may be considered a very robust package, being a command line-based software, it requires some basic knowledge of the UNIX shell. This particular aspect, associated with the not so straightforward handling of its output files, may represent an extra hurdle for pure chemists, pharmacists and biologists, then preventing these researchers from using GROMACS more efficiently. To overcome these limitations, several graphical user interfaces (GUIs) have been designed for GROMACS in the last decade [5,6,7,8], but none is specific for complex protein–ligand parameterization. Herein, we report the development of LiGRO, aiming to merge the protein/ligand parameterization and MD tools in a single, easy-to-use, python-based GUI. Its design takes advantage of the extensive set of commands of the Linux platform, which, when organized into so-called “shell scripts,” provide the user with a set of options to manipulate and interact with the system. The development of the GUI involved the use of python program language [9] and the Tk GUI toolkit [10], as python is one of the most popular languages for scientific computing, and, when associated with proper scientific libraries, can produce high-level interactivity tools [11].

Methods

Implementation

LiGRO for GROMACS, a Linux-based software, was written in python programming language, version 2.7 while Tkinter was used for displaying the program graphics. The software is installed using the “LiGRO_Install.py” tool, which will install all the necessary dependencies for the appropriate program operation. GROMACS, ANTECHAMBER (AmberTools [12], ACPYPE, PLIP [13] and python libraries (Numpy [14], PANDAS [15] and Matplotlib [16]) are installed automatically using a user-friendly graphical interface.

The LiGRO workflow (Fig. 1) illustrates how python and Tkinter (low system libraries responsible for providing graphical interactivity to the user) were linked to the LiGRO script—the program layer that controls MD functions. All files created by the LiGRO program are saved into a temporary directory, and, subsequently, the user can save these files into a selected directory. ACPYPE is an ANTECHAMBER-dependent program, and, together with pdb2gmx, can be used to build complex topologies. Integration of LiGRO and ACPYPE allows the user to choose between AMBER03, AMBER94, AMBER96, AMBER99, AMBER99SB, AMBER99SB-ILDN, AMBERGS and OPLS-AA / L force fields for MD simulations.

Fig. 1
figure 1

LiGRO workflow

The LiGRO script for simulations (Fig. 2) includes nine programs from the GROMACS package (pdb2gmx, editconf, genbox/solvate, grompp, genion, mdrun, genrestr, make_ndx and trjconv). By using this script, it is possible to run a complete MD simulation (minimization, NVT, NPT and MD simulation) with an easy-to-use input from the researcher. The user just needs to provide some parameters: select a protein PDB file and a Ligand MOL2 file; select a water model and a box type; set the salt concentration to neutralize the system (or input the desired amount of sodium/chlorine ions); and parameterize the MDP files (simulation time, number of steps and temperature). The LiGRO GUI allows the user to impute two non-amino acid structures, one cofactor and one ligand. LiGRO will consider only the structure imputed as the ‘ligand’ for trajectory analysis.

Fig. 2
figure 2

LiGRO graphical user interface (GUI)

The user can check if the protein is properly placed in the simulation box with the Pymol molecular viewer (Fig. 3). The script also accounts for long-range electrostatic interactions by setting the Coulomb energies to particle mesh Ewald (PME) to stabilize the trajectory during the simulation [17]. Finally, the user can save the .tpr file to run in a GROMACS cluster.

Fig. 3
figure 3

Molecular dynamics (MD) simulation of Leishmania major pteridine reductase 1 in complex with NADP and biopterin (PDB-ID:2BF7)

In the linear interaction energy (LIE) binding energy calculation module (Fig. 4), it is possible to calculate the binding affinity prediction using the LIE method, which provides values that can be used to select compounds with a better protein-binding interaction profile [18]. The GROMACS LIE requires preliminary calculations of the Lennard-Jones (LJ) and Coulomb non-bonded energies in two simulations: one with the molecule of interest coupled to its receptor and one with the solvated molecule; in LiGRO all these operations can be performed through a single button click.

Fig. 4
figure 4

LiGRO linear interaction energy (LIE) calculation module

The analysis module of LiGRO generates several plots (Fig. 5) necessary for the evaluation of the simulation outputs. Three GROMACS package programs compose this script: rmsdist, msd, h_bond, energy and gyrate. With these programs, it is possible to generate the RMSD, MSD, radius of gyration and hydrogen bond number plots, while the user can visualize these graphics through a simple button click. The short-range electrostatic contributions (LJ and Coulomb) can be calculated by choosing the LJSR-CoulSR IE in the analysis module. In addition, LiGRO provides the minimum, maximum and average values for these analyses using the Numpy python library. The integration with protein–ligand interaction profiler (PLIP) confers to LiGRO a complete tool to describe a protein-ligand profiler.

Fig. 5
figure 5

LiGRO analysis of a root mean square deviation (RMSD) C-alpha plot

LiGRO also enables the analyses of protein-ligand interaction profiles through the course of the simulation by using the PLIP program. This software provides the ability to monitor ligand interactions, including hydrogen/halogen bonds, hydrophobic contacts, π-stacking/π-cation interactions and salt/water bridges, at any time during the simulation.

Conclusions

LiGRO provides a novel user-friendly graphical interface for protein-ligand MD simulations using GROMACS. This program eliminates the need for users to write command-lines and to manually construct complex topology files. For the ligand setting, sLiGRO uses ACPYPE—a tool-based on the ANTECHAMBER software that generates automatic topologies, parameters and partial charges for small molecules for use in a variety of MD programs. By using LiGRO, the user needs to enter only a few inputs to generate a complete protein-ligand MD simulation (complex energy minimization and NVT/NPT equilibration). LiGRO automatically unwrap the trajectory output, leaving it ready for the trajectory analysis.

With LiGRO, it is less likely for the user to encounter parameterization errors or to break the MD simulation. Currently, LiGRO is the only GUI capable of running protein-ligand simulations in GROMACS that is fully compatible with all the available versions. The LiGRO script follows the Bevan laboratory tutorial Protein-Ligand Complex [19] as its workflow. Users must remember that the default settings are not always the best choice for a MD simulation since these simulations are often complex operations that require more parameters to obtain better results. Researchers may freely download and modify the software, with the source code being available under the terms of the GPLv3 license from http://www.ufrgs.br/lasomfarmacia/ligro/.