Modeling and fitting protein-protein complexes to predict change of binding energy

Dourado, Daniel F.A.R.; Flores, Samuel Coulbourn

doi:10.1038/srep25406

Modeling and fitting protein-protein complexes to predict change of binding energy

Article
Open access
Published: 13 May 2016

Volume 6, article number 25406, (2016)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Modeling and fitting protein-protein complexes to predict change of binding energy

Download PDF

Daniel F.A.R. Dourado¹ &
Samuel Coulbourn Flores¹

4302 Accesses
9 Citations
1 Altmetric
Explore all metrics

Abstract

It is possible to accurately and economically predict change in protein-protein interaction energy upon mutation (ΔΔG), when a high-resolution structure of the complex is available. This is of growing usefulness for design of high-affinity or otherwise modified binding proteins for therapeutic, diagnostic, industrial, and basic science applications. Recently the field has begun to pursue ΔΔG prediction for homology modeled complexes, but so far this has worked mostly for cases of high sequence identity. If the interacting proteins have been crystallized in free (uncomplexed) form, in a majority of cases it is possible to find a structurally similar complex which can be used as the basis for template-based modeling. We describe how to use MMB to create such models, and then use them to predict ΔΔG, using a dataset consisting of free target structures, co-crystallized template complexes with sequence identify with respect to the targets as low as 44%, and experimental ΔΔG measurements. We obtain similar results by fitting to a low-resolution Cryo-EM density map. Results suggest that other structural constraints may lead to a similar outcome, making the method even more broadly applicable.

Modeling Binding Affinity of Pathological Mutations for Computational Protein Design

Methods for Detecting Protein Binding Interfaces

Prediction of Protein–Protein Binding Affinities from Unbound Protein Structures

Introduction

Modeling Protein-Protein Interactions (PPIs)¹ is fundamentally important in biology as it probes normal as well as diseased protein function. For example, such models explain the role of Parkinson’s-disease associated mutations in Parkin^1,2,3. PPIs are also important in the development of therapeutic and diagnostic biologics (monoclonal antibodies, or mAbs, and alternative scaffolds)⁴.

Biologics have a growing and economically substantial field of application. However raising antibodies or finding an alternative scaffold to bind a given target is difficult and time consuming. Even when starting with a scaffold that binds reasonably, affinity maturation requires a substantial experimental effort, and maintaining specificity can be a challenge⁵. Likewise protein engineering often creates many simultaneous mutations, with possible immunogenicity and solubility issues, and no insight as to which substitutions are responsible for the main effect⁶. Thus there is demand for an economical computational method which will suggest a relatively small number of substitutions which have high likelihood of improving binding.

Computational methods have made significant progress for cases where a crystallographic complex is available of the potential biologic bound to its target (we will refer to these as bound structures). Some are Molecular Dynamics (MD) based methods^7,8,9,10, which typically are associated with a high computational cost. So, the applicability of such methods to large complexes or to a substantial number of mutations, which is required the case for protein-protein affinity maturation protocols, can be quite limited. On the other hand, Knowledge Based (KB) methods, which empirically combine several energetic terms including implicit solvent^{11,12,13,14,15,16}, are fast but most perform little or no structural optimization and cannot model the backbone rearrangements induced by mutation. KB methods have also been combined with sequence analysis¹⁷, and interface structure alignments¹⁸ but this requires evolutionary information which is not available for all complexes (e.g. many biologics), and further has only been demonstrated for homology models based on high sequence identity (only 4% of their dataset had sequence identity below 50%)¹⁷. Recently, we described Zone Equilibration of Mutants (ZEMu)¹, validated with 1254 mutants (1–15 simultaneous mutations) of 65 different complexes, which offers both accuracy and economy. ZEMu is implemented in MacroMoleculeBuilder (MMB)^19,20, a multiscale internal-coordinate modeling code in which flexibility and an all-atom force field can be limited to regions of interest^1,21. The method significantly improves the existing FoldX potential¹³, and arguably shows promise to improve others^17,18 which perform limited structural minimization.

There are limited options for computing ΔΔG for the case in which the interacting proteins have only been crystallized in the free form. For many such structures, low resolution density maps of their complex are available²². The recent explosion in Cryo Electron Microscopy, brought about by the direct electron detector²³, promises a rich source of new structural data, notably of complexes which are hard to crystallize. In addition, solution scattering produces many low-resolution density maps²⁴, and Free-Electron Lasers promise to eventually reach comparable single-molecule resolution²⁵.

Alternatively, for most structures available in the free form (referred to as targets)²⁶, it is possible to find a structurally related template which can be used to build a template-based model of the complex²⁷. Template modeling uses a structural alignment, which can be done accurately even at low sequence identity^27,28.

This realization has led to considerable interest in template-based docking²⁹. Specific cases in which the free structure is related to one of the proteins in the template complex include antibody-bound IGF-I (complex exists of the related IGF-II bound to an antibody)³⁰, human Chorionic Somatomammotropin (hCS)³¹ vs. human Growth Hormone (GH) Receptor (complex exists of GH vs. GH Receptor), FcγRI vs. IgG1 (complexes have long existed of FcγRII and FcγRIII vs. IgG, while for FcγRI vs. IgG1 a mutated complex was recently solved)^32,33,34. In this study we implement a fast and simple to use internal-coordinate template-based docking protocol in MMB^19,20, that works even in the range of ~40% sequence identity for homologous proteins (quite near the twilight zone)^27,35 and extend ZEMu¹ to predict ΔΔG for thus-modeled and fitted complexes.

Results

The template-based docking protocol introduced here results in good ΔΔG precision for homologous templates, those which (in this work) have sequence identities (vs. the targets) in the range of 44% to 51%. It is naturally more precise for self-templates, meaning those which have high sequence identity (>93%) to their targets. In this work when we provide RMSD (Root Mean Square Deviation) we refer in all cases to the discrepancy in backbone 3D atomic structure of modeled vs. template complexes. When we provide RMSE (Root Mean Square Error) and correlation, we are comparing experimental to computed ΔΔG.

Double-free models are made by docking two targets onto the template complex, while single-free models are made by docking one target onto the template (see Methods). For the double-free models based on self-templates the Root Mean Square Deviations (RMSDs) range from 0.71 to 3.42 Å (Table 1). The exception is the TGF-βType II Receptor/TGF-β3 complex for which the RMSD is 22.84 Å (Table 1). If we analyze the structures of the model and template in detail (Fig. S1) we can observe that distal region of the co-crystal chain A (TGF-β 3) is poorly resolved. In fact, if we omit template chain A residues 40 to 80 the RMSD decreases from 21.7826 Å to just 0.8819 Å, which is in line with RMSD found for the other complexes (Table 1). Since these poorly resolved residues are placed on a distal region of chain A, far from the interface, the ΔΔG prediction precision with this model is in line with that of the rest of the dataset. We describe this complex in more detail later. For the single- and double-free models based on homologous templates the RMSD ranges from 2.76 to 4.86 Å. As expected the RMSDs of this sub-group are higher than the ones observed for the double-free models based on self-templates. However the differences are relatively small, which is in line with idea that structure is more conserved than sequence²⁷. We discuss this point in more detail below.

Table 1 Validation dataset.

Full size table

Based on the validation dataset described in Table 1, we compared the performance of ZEMu to FoldX-only (meaning FoldX with no prior MMB equilibration) in predicting ΔΔG. For the entire dataset the correlation between FoldX-only and experimental ΔΔG (ΔΔG_FoldX-only and ΔΔG_exp, respectively) is 0.12, while the Root Mean Square Error (RMSE) is 1.83 kcal/mol. To our knowledge this is the first time FoldX is evaluated with modeled complexes; this also serves as an external (non-ZEMu) validation of our modeling protocol. For ZEMu the correlation improves to 0.34 (p-value < 0.00001); the RMSE also improves, to 1.54 kcal/mol (Table 2, Fig. 1, Tables S1 and S2).

Table 2 Comparison between Foldx-only and ZEMu performance.

Full size table

The improvement over FoldX-only is reflected in the complete dataset as well as in the single and multiple mutant sub-groups (Table 2). For the single mutants, FoldX-only shows an RMSE of 1.89 kcal/mol and correlation of 0.06, while for ZEMu we obtain an RMSE of just 1.54 kcal/mol and a higher correlation of 0.24 (p-value = 0.00004) (Table 2, Fig. 1, Tables S1 and S2). In the case of the multiple mutants FoldX-only achieves an RMSE of 1.79 kcal/mol and correlation of 0.15, and ZEMu an RMSE of 1.54 kcal/mol and a correlation of 0.37 (p-value < 0.00001) (Table 2, Fig. 1, Tables S1 and S2).

From the main dataset we also created a sub-group comprising mutants for which self-template structures are available (Table 3). Based on this sub-group we compared the performance of ZEMu for modeled vs. crystallographic complexes. As expected, performance was better for crystallographic than for modelled complexes, but only moderately (RMSE of 1.58 vs. 1.76, Correlation of 0.61 vs. 0.38, respectively). This further highlights the quality of the modeling protocol. In the particular case of the TGF-βType II Receptor / TGF-β3 complex model, the RMSD vs. its self-template is 22.84 Å, when computed based on all resolved residues. As explained above the huge RMSD value found is due to poorly resolved residues in a distal region of chain A co-crystal (Table 1, Fig. S1) and so does not affect the quality and performance of the model at the interface. The RMSEs are 1.68 kcal/mol and 1.19 kcal/mol, and the correlations are is 0.28 and 0.83, for the model and co-crystal, respectively. The performance is thus in line with the other models.

Table 3 Comparing the performance of ZEMu on modeled vs. crystallographic complexes.

Full size table

We then tested a ZEMu variant with an additional flexibility radius (0–14 Å) (Fig. 2) centered on the mutation site, which can include residues from the binding partner. When a radius of 4 Å is used an RMSE very slightly higher than that of regular ZEMu is obtained, also an added computational cost is incurred, so there is no reason to use the added flexibility. Nonetheless, ZEMu variants with an extra flexibility radius of 4 to 10 Å still show better performance than FoldX-only.

Figure 2: Results of flexibilizing a spatial neighbourhood of the mutation sites for a dataset composed of 687 mutants (not includes the template models of FcγRIII/IgG1 and FcγRII/IgG1 based on a homologous co-crystal complex).

The application to the FcγRI/IgG1 complex illustrates some of the advantages of ZEMu. We created single-free and double-free template models, and a fitted model, of this complex (Table 4, Table S2), based on an FcγRIII/IgG1 template (sequence identity of FcγRI with FcγRIII is 45%)³³. When we performed this work no FcγRI/IgG1 co-crystal was available, but one was reported more recently³⁴. With the FcγRI/IgG1 single-free template model, FoldX-only yields an RMSE of 2.33 and a correlation of 0.12 while ZEMu RMSE is just 0.92 and correlation is 0.42. The difference in performance between the two protocols is overwhelmingly due to three mutations involving N-terminus residue G236. When the three mutants are left out, FoldX-only RMSE drops substantially to 0.32 and the correlation increases to 0.32. Though this is a small sample, it illustrates even more strikingly that FoldX’s rigid backbone is particularly unsuitable for modeling the terminal region, which has characteristically high mobility.

Table 4 Comparison between Foldx-only and ZEMu performance for FcγRI/IgG1 single-free and double-free template-based and fitted models.

Full size table

For the double-free and fitted FcγRI/IgG1 models the three mutations involving N-terminus residue G236 are immediately adjacent to unresolved residues. The free IgG1 structure (3DNK) was missing several residues from the N-terminus (residues 229 to 235, part of the lower hinge connecting Fc to Fab in a full-length IgG1), which were resolved in the template complex (1E4K). These three mutants plus one mutation in the missing region therefore could not be modeled in the double-free and fitted models. ZEMu still outperforms FoldX-only but by a smaller margin (Table 4). Directly comparing against the double-free and fitted models, we can conclude that for both FoldX-only and ZEMu the best performance was obtained for the fitted model (Table 4). If we analyze the RMSD of both models with respect to the co-crystal structure we can observe that the fitted model has a lower RMSD by ~0.32 Å. In the fitted model the hinges between the D1 and D2 domains on FcγRI, and between the CH2 and CH3 domains on Fc, were made flexible, to allow domain motions, explaining the better RMSD of chains A and C. This highlights the efficacy of the fitting protocol.

ZEMu and MMB performance for models of FcγRIII/IgG1 and FcγRII/IgG1 based on homologous templates further demonstrates the efficacy of the protocol in cases where the self-template is not available. For instance, for the FcγRIII/IgG1 model based on its self-template the RMSE is 0.85 kcal/mol and the RMSD is 1.98 Å. In the case of the FcγRIII/IgG1 model based on the FcγRII/IgG1 template (PDB:3RY6)³² (sequence identity of FcγRIII with FcγRII is 51%) the RMSE increases to 1.17 kcal/mol (0.32 kcal/mol higher). The RMSD (vs. the self-template) is 4.86 Å, which is low but still 2.4 fold higher than the RMSD found for the double-free model based on a self-template. Similarly, for the FcγRII/IgG1 model based on a self-template the RMSE is 0.90 kcal/mol while the RMSD (vs. the self-template) is 2.72 Å. For the FcγRII/IgG1 model based on FcγRIII/IgG1 template (PDB:1E4K)³³ (sequence identity of FcγRII with FcγRIII is 44%) the RMSE increases to just 1.13 kcal/mol, possibly because the RMSD increases only 1.3-fold to 3.61 Å. For these two complexes together (150 mutants, including some with single and some with multiple substitutions) the RMSE increased by 0.22 kcal/mol when lower sequence identity (44–51%) homologous templates were used in the modeling instead of high sequence identity (>93%) self-templates.

Positive Predictive Value (PPV) is defined as TP/(TP+FP) (see supporting information). In order to compute statistical quantities like this we would need a random sample of possible mutations. However many of the available experimental ΔΔG’s are the result of alanine scanning mutagenesis experiments and although 36% of the dataset have experimental ΔΔG < 0 kcal/mol, only 5% have experimental ΔΔG < −0.5 kcal/mol. Also given the interest in affinity maturation³⁶ it is likely that there are more affinity-improving mutations in the peer-reviewed literature and in SKEMPI than would occur through random mutagenesis. It is thus doubtful that SKEMPI, or our dataset, contains a representative sample of substituted residue types or a representative ratio of affinity increasing vs. affinity reducing mutations. We nonetheless computed the PPV which gives the odds of obtaining ΔΔG_exp ≤ c_exp for c_exp = 0, −0.5 and −1.0 kcal/mol, given ΔΔG_ZEMu ≤ c_ZEMu , for several c_ZEMu’s. Considering the entire dataset, for ΔΔG_ZEMu ≤ −0.5 kcal/mol, the probability of satisfying ΔΔG_exp ≤ −0.5 kcal/mol is 0.43. We emphasize strongly that this PPV is not applicable to the case that ΔΔG_ZEMu is computed for all possible point mutations at an interface, as would be done in a likely practical application.

Discussion

In prior work¹ we described a means to predict ΔΔG in crystallographically observed PPIs. A key goal in Structural Bioinformatics is the ability to compute ΔΔG even for the very common case of proteins whose structure is known crystallographically only in the free form. In many cases evolutionary information¹⁷ does not exist or is not applicable. We formed template models based on an available cocrystallized complex (which in principle could be e.g. isoforms or homologous of the free structures), with a new template modelling protocol that we describe. We here demonstrate that the protocol does not require a high sequence identity for building significant template models based on a homologous template (recall, sequence identity for homologous proteins ranged between 44% and 51%), whereas existing methods work only with high sequence identity templates^17,18. For low sequence identity, our method is only moderately less precise, again related to the idea that structure is more conserved than sequence²⁷. We also assembled protein complexes by flexibly fitting to a 10 Å resolution density map of an isoform protein²¹.

We emphasize that all results labelled “FoldX-only” were obtained on complexes modelled by us (albeit without subsequent ZEMu processing of the mutation sites), providing independent (non-ZEMu) validation of our template-docking protocol.

Globally, our success using template-docked and fitted models suggests that other means of docking under constraints, e.g. using biochemical or bioinformatical data, may lead to comparable results. We further suggest that if experimental ΔΔG data is available, ZEMu could be used to validate and/or refine the docked, modeled, or fitted structures. This could significantly improve docking results³⁷ but is important even for template-based modeling or fitting when the constraints come from an isoform or homolog, since the binding mode may not be conserved.

Our main validation dataset consists of template modeled complexes. We used several variations of ZEMu on these complexes and evaluated the accuracy of ΔΔG prediction. More-sophisticated variants of ZEMu, which flexibilized various spatial regions, had very similar results on the main dataset when compared to the published method¹, indicating that it is best to limit the flexibility to the immediate sequence neighborhood of the mutated residues. This validates the perturbative assumption underlying ZEMu, namely that the substitution mutations have the largest effect in the near neighborhood of the mutation site, and less effect farther from that position. Anecdotal observations suggest MD potentials introduce structural artifacts, so leaving as much as possible of the crystallographic structure unmodified may be key to ZEMu’s success. We introduce an MMB modeling protocol and show that it leads to a versatile method to predict ΔΔG on modeled and fitted protein-protein complexes.

Methods

We built models of protein-protein complexes using MMB template modeling to align the free protein structures to relevant protein crystal complexes and also by fitting to a low-resolution density maps using ICFF²¹ (described below). We then used ZEMu¹ to predict ΔΔG upon mutation and compared to the results of using FoldX-only¹³. Note that in all cases FoldX-only analysis was conducted with the MMB modeled or fitted structures. ZEMu was mostly used as described in^1,2,3. Though we also tested the effect of flexibilizing additional residues in the spatial neighborhood (up to 14 Å) of the mutation sites.

Dataset

The validation dataset comprises 846 mutants (each with 1-6 simultaneous substitutions) of 11 different protein-protein complexes models for which ΔΔG_exp is available (Table 1)^{36,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60}. The dataset consists mostly of double-free template models (two targets). In some cases the templates were the same proteins as the targets (self-templates, Table S1); such systems are useful for benchmarking purposes. In other cases we used homologous templates (Fig. 3A; Table S2). We also created a single-free template model, for which only one of the two targets comes from a free structure, while the other is retained from the template (Fig. 3B; Table S2). Finally, we also generated a model of the biomedically important FcγRI /IgG1⁶¹ by fitting to a low-resolution synthetic density map of FcγRIII/IgG1³³ using ICFF²¹.

**Figure 3: Creating complexes with MMB template modeling and ICFF.**

Template modeling in MMB

Several good template-based modeling methods exist^62,63,64. Our procedure starts with a sequence alignment⁶⁵ between unbound (target) and bound (template) target and template residues, followed by structural alignment. We then resolve any steric clashes, after which the model is ready to be used for ΔΔG prediction with ZEMu or potentially other purposes. The procedure (described in detail below) is straightforward and convenient to do in MMB.

Structural alignment based on gapped sequence alignment

We have previously described morphing^21,66 and homology modeling of RNA²⁰ and ribonucleoprotein complexes¹⁹, using springs which connect residues in a rigid molecule of known structure, with corresponding residues in a flexible molecule of unknown structure. The mentioned correspondence is determined by a simple SeqAn⁶⁵ gapped sequence alignment now available in MMB, with a mismatch and gap opening penalty of −1. In contrast with our homology modeling work (in which a fully-flexible unstructured target chain is aligned with a rigid template)¹⁹, or our morphing work (in which a partially-flexible structured target is aligned with a rigid template)^21,66, here those springs align one or more unbound targets (or their binding domains) onto corresponding domains in the template, (Fig. 3) with both the targets and the template remaining fully rigid during the process. The template is then deleted, leaving the modeled targets in their place (Fig. 3A,B).

Declashing

The thus-modeled complex typically has a small number of clashing residues, as the binding interface of the target was previously exposed to solvent allowing greater freedom to the side chains and to some extent the backbone. These clashes must be removed for accurate ΔΔG calculation (Fig. 3A,B). The problem of side chains that clash due to modeling is not unlike that of side chains that clash due to in silico mutagenesis, the problem that is addressed by ZEMu. Therefore we use a ZEMu-like method¹ to declash. We flexibilize only the clashing residue or, if needed, a 5-residue zone centered on the main clashing residue, and equilibrate under PARM99⁶⁷ interactions with near neighbors. We then minimize the structure under the FoldX potential¹³.

Flexible fitting to low-resolution density maps

In addition to the template models described above, we also flexibly fitted free FcγRI and IgG1 structures into a synthetic 10 Å-resolution density map (based on PDB ID 1E4K) of the FcγRIII-IgG1 complex. We previously described Internal Coordinate Flexible Fitting (ICFF). In ICFF, each atom in the molecule (or relevant fragment thereof) perceives a force which is proportional to the electronic density gradient at the nuclear position²¹. The molecule in question can have hinge and interface flexibility, and MD forces can be applied about such points of flexibility. Hinges between the D1 and D2 domains on FcγRI, and between the CH2 and CH3 domains on Fc, were made flexible, to allow domain motions. Interface residues 1236 and 1298 (on Fc) and 148 (on FcγRI) were granted side-chain flexibility, to relieve clashes which would otherwise occur as the two subunits come into contact to form the complex (Fig. 3C).

Evaluating ΔΔG for the modeled complexes

ZEMu involves first specifying a flexibility zone comprising five residues consecutive in sequence, centered on the mutation site. The flexibility zone is treated in torsion space, leaving the remainder of the protein rigid and fixed. Also, a physics zone of 12 Å around each flexible residue is defined, inside of which electrostatic and van der Waals forces are active. The system is then minimized by dynamics¹.

The interaction energy of the equilibrated complex is evaluated with the Knowledge Based (KB) potential FoldX¹³. We conduct the calculation for the MMB-equilibrated wild type and mutant complexes to obtain ΔG_{wild type} and ΔG_mutant, respectively. An estimate of ΔΔG_exp is obtained as follows⁶⁸:

ΔΔG_ZEMu ^≡ ΔG_mutant – ΔG_{wild type} ≈ ΔΔG_exp

ZEMu+ additional flexibility radius

We also tested a variation of ZEMu which grants flexibility to residues in the spatial neighborhood of the mutation sites. This was defined as all residues within a certain radius of the mutation site, potentially including residues of the adjacent protein. We evaluated radii ranging from 0 (equivalent to ordinary ZEMu) to 14 Å. After equilibration we evaluated ΔΔG with FoldX¹³ as before.

Additional Information

How to cite this article: Dourado, D. F.A.R. and Flores, S. C. Modeling and fitting protein-protein complexes to predict change of binding energy. Sci. Rep. 6, 25406; doi: 10.1038/srep25406 (2016).

References

Dourado, D. F. & Flores, S. C. A multiscale approach to predicting affinity changes in protein-protein interfaces. Proteins, doi: 10.1002/prot.24634 (2014).
Fiesel, F. C. et al. Structural and Functional Impact of Parkinson Disease-Associated Mutations in the E3 Ubiquitin Ligase Parkin. Hum Mutat 36, 774–786, doi: 10.1002/humu.22808 (2015).
Article CAS PubMed PubMed Central Google Scholar
Caulfield, T. R. et al. Phosphorylation by PINK1 releases the UBL domain and initializes the conformational opening of the E3 ubiquitin ligase Parkin. PLos Comput Biol 10, e1003935, doi: 10.1371/journal.pcbi.1003935 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kazlauskas, R. J. & Bornscheuer, U. T. Finding better protein engineering strategies. Nat Chem Biol 5, 526–529, doi: 10.1038/nchembio0809-526 (2009).
Article CAS PubMed Google Scholar
Levin, A. M. & Weiss, G. A. Optimizing the affinity and specificity of proteins with molecular display. Mol Biosyst 2, 49–57, doi: 10.1039/b511782h (2006).
Article CAS PubMed Google Scholar
Getts, D. R., Getts, M. T., McCarthy, D. P., Chastain, E. M. L. & Miller, S. D. Have we overestimated the benefit of human(ized) antibodies? Mabs-Austin 2, 682–694 doi:DOI 10.4161/mabs.2.6.13601 (2010).
Article Google Scholar
Marelius, J., Kolmodin, K., Feierberg, I. & Aqvist, J. Q: a molecular dynamics program for free energy calculations and empirical valence bond simulations in biomolecular systems. J Mol Graph Model 16, 213–225, 261 (1998).
Article CAS Google Scholar
Massova, I. & Kollman, P. A. Combined molecular mechanical and continuum solvent approach (MM-PBSA/GBSA) to predict ligand binding. Perspect Drug Discov 18, 113–135, doi: 10.1023/A:1008763014207 (2000).
Article CAS Google Scholar
Jorgensen, W. L. Free-Energy Calculations - a Breakthrough for Modeling Organic-Chemistry in Solution. Accounts Chem Res 22, 184–189, doi: 10.1021/Ar00161a004 (1989).
Article CAS Google Scholar
Kollman, P. Free-Energy Calculations - Applications to Chemical and Biochemical Phenomena. Chem Rev 93, 2395–2417, doi: 10.1021/Cr00023a004 (1993).
Article CAS Google Scholar
Dehouck, Y., Kwasigroch, J. M., Rooman, M. & Gilis, D. BeAtMuSiC: prediction of changes in protein-protein binding affinity on mutations. Nucleic Acids Res 41, W333–339, doi: 10.1093/nar/gkt450 (2013).
Article PubMed PubMed Central Google Scholar
Douglas, E. V., Pires, D. B. A., Tom, L. & Blundell . mCSM: predicting the effects of mutations in proteins using graph-based signatures. Bioinformatics 30, 335–342, doi: 10.1093/bioinformatics/btt691 (2014).
Article CAS Google Scholar
Guerois, R., Nielsen, J. E. & Serrano, L. Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. J Mol Biol 320, 369–387, doi: 10.1016/S0022-2836(02)00442-4 (2002).
Article CAS PubMed Google Scholar
Kortemme, T. & Baker, D. A simple physical model for binding energy hot spots in protein-protein complexes. Proceedings of the National Academy of Sciences of the United States of America 99, 14116–14121, doi: 10.1073/Pnas.202485799 (2002).
Article CAS ADS PubMed PubMed Central Google Scholar
Kamisetty, H., Ramanathan, A., Bailey-Kellogg, C. & Langmead, C. J. Accounting for conformational entropy in predicting binding free energies of protein-protein interactions. Proteins-Structure Function and Bioinformatics 79, 444–462, doi: 10.1002/Prot.22894 (2011).
Article CAS Google Scholar
Smith, C. A. & Kortemme, T. Backrub-like backbone simulation recapitulates natural protein conformational variability and improves mutant side-chain prediction. Journal of Molecular Biology 380, 742–756, doi: 10.1016/J.Jmb.2008.05.023 (2008).
Article CAS PubMed PubMed Central Google Scholar
Berliner, N., Teyra, J., Colak, R., Garcia Lopez, S. & Kim, P. M. Combining structural modeling with ensemble machine learning to accurately predict protein fold stability and binding affinity effects upon mutation. PLos One 9, e107353, doi: 10.1371/journal.pone.0107353 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Brender, J. R. & Zhang, Y. Predicting the Effect of Mutations on Protein-Protein Binding Interactions through Structure-Based Interface Profiles. PLos Comput Biol 11, e1004494, doi: 10.1371/journal.pcbi.1004494 (2015).
Article CAS ADS PubMed PubMed Central Google Scholar
Flores, S., Zemora, G. & Waldsich, C. Insights into diseases of human telomerase from dynamical modeling. Pacific Symposium on Biocomputing 18, 200–211 (2013).
Google Scholar
Flores, S., Sherman, M., Bruns, C., Eastman, P. & Altman, R. Fast flexible modeling of macromolecular structure using internal coordinates. IEEE Transactions in Computational Biology and Bioinformatics 8, 1247–1257 (2010).
Article Google Scholar
Flores, S. C. Fast fitting to low resolution density maps: elucidating large-scale motions of the ribosome. Nucleic Acids Res 42, e9, doi: 10.1093/nar/gkt906 (2014).
Article CAS PubMed Google Scholar
Hu, Z. et al. Structural and biochemical basis for induced self-propagation of NLRC4. Science 350, 399–404, doi: 10.1126/science.aac5489 (2015).
Article CAS ADS PubMed Google Scholar
Nogales, E. & Scheres, S. H. W. Cryo-EM: A Unique Tool for the Visualization of Macromolecular Complexity. Molecular Cell 58, 677–689, doi: 10.1016/j.molcel.2015.02.019 (2015).
Article CAS PubMed PubMed Central Google Scholar
Svergun, D. I. Restoring low resolution structure of biological macromolecules from solution scattering using simulated annealing (vol 76, pg 2879, 1999). Biophysical Journal 77, 2896–2896 (1999).
Article CAS Google Scholar
Chapman, H. N. et al. Femtosecond diffractive imaging with a soft-X-ray free-electron laser. Nature Physics 2, 839–843, doi: 10.1038/nphys461 (2006).
Article CAS ADS Google Scholar
Sinha, R., Kundrotas, P. J. & Vakser, I. A. Docking by structural similarity at protein-protein interfaces. Proteins-Structure Function and Bioinformatics 78, 3235–3241, doi: 10.1002/prot.22812 (2010).
Article CAS Google Scholar
Kundrotas, P. J., Zhu, Z., Janin, J. & Vakser, I. A. Templates are available to model nearly all complexes of structurally characterized proteins. P Natl Acad Sci USA 109, 9438–9441, doi: 10.1073/pnas.1200678109 (2012).
Article ADS Google Scholar
Chothia, C. & Lesk, A. M. The Relation between the Divergence of Sequence and Structure in Proteins. Embo Journal 5, 823–826 (1986).
Article CAS Google Scholar
Kundrotas, P. J. & Vakser, I. A. Global and local structural similarity in protein-protein complexes: implications for template-based docking. Proteins 81, 2137–2142, doi: 10.1002/prot.24392 (2013).
Article CAS PubMed PubMed Central Google Scholar
Murakami, H. et al. Phase 1 study of ganitumab (AMG 479), a fully human monoclonal antibody against the insulin-like growth factor receptor type I (IGF1R), in Japanese patients with advanced solid tumors. Cancer Chemother Pharmacol 70, 407–414, doi: 10.1007/s00280-012-1924-9 (2012).
Article CAS PubMed PubMed Central Google Scholar
Walsh, S. T. & Kossiakoff, A. A. Crystal structure and site 1 binding energetics of human placental lactogen. J Mol Biol 358, 773–784, doi: 10.1016/j.jmb.2006.02.038 (2006).
Article CAS PubMed Google Scholar
Ramsland, P. A. et al. Structural basis for Fc gammaRIIa recognition of human IgG and formation of inflammatory signaling complexes. J Immunol 187, 3208–3217, doi: 10.4049/jimmunol.1101467 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sondermann, P., Huber, R., Oosthuizen, V. & Jacob, U. The 3.2-A crystal structure of the human IgG1 Fc fragment-Fc gammaRIII complex. Nature 406, 267–273, doi: 10.1038/35018508 (2000).
Article CAS ADS PubMed Google Scholar
Lu, J. et al. Structure of FcgammaRI in complex with Fc reveals the importance of glycan recognition for high-affinity IgG binding. P Natl Acad Sci USA 112, 833–838, doi: 10.1073/pnas.1418812112 (2015).
Article CAS ADS Google Scholar
Rost, B. Twilight zone of protein sequence alignments. Protein Eng 12, 85–94 (1999).
Article CAS Google Scholar
Lippow, S. M., Wittrup, K. D. & Tidor, B. Computational design of antibody-affinity improvement beyond in vivo maturation. Nat Biotechnol 25, 1171–1176, doi: 10.1038/nbt1336 (2007).
Article CAS PubMed PubMed Central Google Scholar
Moreira, I. S., Martins, J. M., Coimbra, J. T., Ramos, M. J. & Fernandes, P. A. A new scoring function for protein-protein docking that identifies native structures with unprecedented accuracy. Phys Chem Chem Phys 17, 2378–2387, doi: 10.1039/c4cp04688a (2015).
Article CAS PubMed Google Scholar
Lim, D. et al. Crystal structure and kinetic analysis of beta-lactamase inhibitor protein-II in complex with TEM-1 beta-lactamase. Nat Struct Biol 8, 848–852, doi: 10.1038/nsb1001-848 (2001).
Article CAS PubMed Google Scholar
Petrosino, J., Rudgers, G., Gilbert, H. & Palzkill, T. Contributions of aspartate 49 and phenylalanine 142 residues of a tight binding inhibitory protein of beta-lactamases. J Biol Chem 274, 2394–2400, doi: 10.1074/jbc.274.4.2394 (1999).
Article CAS PubMed Google Scholar
Albeck, S., Unger, R. & Schreiber, G. Evaluation of direct and cooperative contributions towards the strength of buried hydrogen bonds and salt bridges. J Mol Biol 298, 503–520, doi: 10.1006/jmbi.2000.3656 (2000).
Article CAS PubMed Google Scholar
Selzer, T., Albeck, S. & Schreiber, G. Rational design of faster associating and tighter binding protein complexes. Nat Struct Biol 7, 537–541 (2000).
Article CAS Google Scholar
Reichmann, D. et al. The modular architecture of protein-protein binding interfaces. P Natl Acad Sci USA 102, 57–62, doi: 10.1073/pnas.0407280102 (2005).
Article CAS ADS Google Scholar
Braden, B. C. et al. 3-Dimensional Structures of the Free and the Antigen-Complexed Fab from Monoclonal Antilysozyme Antibody-D44.1. J Mol Biol 243, 767–781, doi: 10.1016/0022-2836(94)90046-9 (1994).
Article CAS PubMed Google Scholar
Bhat, T. N. et al. Bound water molecules and conformational stabilization help mediate an antigen-antibody association. P Natl Acad Sci USA 91, 1089–1093 (1994).
Article CAS ADS Google Scholar
DallAcqua, W., Goldman, E. R., Eisenstein, E. & Mariuzza, R. A. A mutational analysis of the binding of two different proteins to the same antibody. Biochemistry-Us 35, 9667–9676, doi: 10.1021/Bi960819i (1996).
Article CAS Google Scholar
Dall’Acqua, W. et al. A mutational analysis of binding interactions in an antigen-antibody protein-protein complex. Biochemistry-Us 37, 7981–7991, doi: 10.1021/bi980148j (1998).
Article Google Scholar
De Crescenzo, G. et al. Three key residues underlie the differential affinity of the TGFbeta isoforms for the TGFbeta type II receptor. J Mol Biol 355, 47–62, doi: 10.1016/j.jmb.2005.10.022 (2006).
Article CAS PubMed Google Scholar
Hart, P. J. et al. Crystal structure of the human TbetaR2 ectodomain–TGF-beta3 complex. Nat Struct Biol 9, 203–208, doi: 10.1038/nsb766 (2002).
Article CAS PubMed Google Scholar
Deisenhofer, J. Crystallographic refinement and atomic models of a human Fc fragment and its complex with fragment B of protein A from Staphylococcus aureus at 2.9- and 2.8-A resolution. Biochemistry-Us 20, 2361–2370 (1981).
Article CAS Google Scholar
Bottomley, S. P. et al. The Stability and Unfolding of an Igg Binding-Protein Based Upon the B-Domain of Protein-a from Staphylococcus-Aureus Probed by Tryptophan Substitution and Fluorescence Spectroscopy. Protein Eng 7, 1463–1470, doi: 10.1093/Protein/7.12.1463 (1994).
Article CAS PubMed Google Scholar
Cedergren, L., Andersson, R., Jansson, B., Uhlen, M. & Nilsson, B. Mutational Analysis of the Interaction between Staphylococcal Protein-a and Human Igg(1). Protein Eng 6, 441–448, doi: 10.1093/Protein/6.4.441 (1993).
Article CAS PubMed Google Scholar
Jendeberg, L. et al. Kinetic-Analysis of the Interaction between Protein-a Domain Variants and Human Fc Using Plasmon Resonance Detection. J Mol Recognit 8, 270–278, doi: 10.1002/jmr.300080405 (1995).
Article CAS PubMed Google Scholar
Li, Y. L., Li, H. M., Smith-Gill, S. J. & Mariuzza, R. A. Three-dimensional structures of the free and antigen-bound Fab from monoclonal antilysozyme antibody HyHEL-63. Biochemistry-Us 39, 6296–6309, doi: 10.1021/Bi000054l (2000).
Article CAS Google Scholar
Li, Y. L., Urrutia, M., Smith-Gill, S. J. & Mariuzza, R. A. Dissection of binding interactions in the complex between the anti-lysozyme antibody HyHEL-63 and its antigen. Biochemistry-Us 42, 11–22, doi: 10.1021/Bi020589+ (2003).
Article Google Scholar
Pelletier, H. & Kraut, J. Crystal-Structure of a Complex between Electron-Transfer Partners, Cytochrome-C Peroxidase and Cytochrome-C. Science 258, 1748–1755, doi: 10.1126/Science.1334573 (1992).
Article CAS ADS PubMed Google Scholar
Pielak, G. J. & Wang, X. M. Interactions between yeast iso-1-cytochrome c and its peroxidase. Biochemistry-Us 40, 422–428, doi: 10.1021/Bi002124u (2001).
Article CAS Google Scholar
Shields, R. L. et al. High resolution mapping of the binding site on human IgG1 for Fc gamma RI, Fc gamma RII, Fc gamma RIII, and FcRn and design of IgG1 variants with improved binding to the Fc gamma R. J Biol Chem 276, 6591–6604, doi: 10.1074/jbc.M009483200 (2001).
Article CAS PubMed Google Scholar
Tamm, A., Kister, A., Nolte, K. U., Gessner, J. E. & Schmidt, R. E. The IgG binding site of human FcgammaRIIIB receptor involves CC’ and FG loops of the membrane-proximal domain. J Biol Chem 271, 3659–3666 (1996).
Article CAS Google Scholar
Hibbs, M. L., Tolvanen, M. & Carpen, O. Membrane-proximal Ig-like domain of Fc gamma RIII (CD16) contains residues critical for ligand binding. J Immunol 152, 4466–4474 (1994).
CAS PubMed Google Scholar
Richards, J. O. et al. Optimization of antibody binding to FcgammaRIIa enhances macrophage phagocytosis of tumor cells. Mol Cancer Ther 7, 2517–2527, doi: 10.1158/1535-7163.MCT-08-0201 (2008).
Article CAS PubMed Google Scholar
Lu, J., Ellsworth, J. L., Hamacher, N., Oak, S. W. & Sun, P. D. Crystal structure of Fcgamma receptor I and its implication in high affinity gamma-immunoglobulin binding. J Biol Chem 286, 40608–40613, doi: 10.1074/jbc.M111.257550 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. Template-based modeling and free modeling by I-TASSER in CASP7. Proteins-Structure Function and Bioinformatics 69, 108–117, doi: 10.1002/prot.21702 (2007).
Article CAS Google Scholar
Zhang, Y. & Skolnick, J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Research 33, 2302–2309, doi: 10.1093/nar/gki524 (2005).
Article CAS PubMed PubMed Central Google Scholar
Tuncbag, N., Keskin, O., Nussinov, R. & Gursoy, A. Fast and accurate modeling of protein-protein interactions by combining template-interface-based docking with flexible refinement. Proteins 80, 1239–1249, doi: 10.1002/prot.24022 (2012).
Article CAS PubMed Google Scholar
Kehr, B., Weese, D. & Reinert, K. STELLAR: fast and exact local alignments. BMC Bioinformatics 12, doi: 10.1186/1471-2105-12-S9-S15 (2011).
Tek, A., Korostelev, A. & Flores, S. C. MMB-GUI: a fast morphing method demonstrates a possible ribosomal tRNA translocation trajectory. Nucleic Acids Res. 44(1), 95–105, doi: 10.1093/nar/gkv1457 (2015).
Article Google Scholar
Cornell, W. D. et al. A second generation force field for the simulation of proteins, nucleic acids, and organic molecules (vol 117, pg 5179, 1995). J Am Chem Soc 118, 2309–2309, doi: 10.1021/Ja955032e (1996).
Article CAS Google Scholar
Brandsdal, B. O., Aqvist, J. & Smalas, A. O. Computational analysis of binding of P1 variants to trypsin. Protein Sci 10, 1584–1595, doi: 10.1110/ps.940101 (2001).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors acknowledge funding from eSSENCE (essenceofescience.se), The Swedish Foundation for International Cooperation in Research and Higher Education (STINT), and Uppsala University, and a generous allocation of supercomputer time from the Swedish National Infrastructure for Computing (SNIC).

Author information

Authors and Affiliations

Department of Cell and Molecular Biology, Computational and Systems Biology, Uppsala University, Biomedical Center Box 596, 751 24, Uppsala, Sweden
Daniel F.A.R. Dourado & Samuel Coulbourn Flores

Authors

Daniel F.A.R. Dourado
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Coulbourn Flores
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.F. conceived the idea and designed the methodology. D.D. compiled the dataset and validated the methodology. Both authors wrote the paper.

Corresponding author

Correspondence to Samuel Coulbourn Flores.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (DOC 1293 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Dourado, D., Flores, S. Modeling and fitting protein-protein complexes to predict change of binding energy. Sci Rep 6, 25406 (2016). https://doi.org/10.1038/srep25406

Download citation

Received: 04 January 2016
Accepted: 18 April 2016
Published: 13 May 2016
DOI: https://doi.org/10.1038/srep25406
Springer Nature Limited

Modeling and fitting protein-protein complexes to predict change of binding energy

Abstract

Similar content being viewed by others

Modeling Binding Affinity of Pathological Mutations for Computational Protein Design

Methods for Detecting Protein Binding Interfaces

Prediction of Protein–Protein Binding Affinities from Unbound Protein Structures

Introduction

Results

Discussion

Methods

Dataset

Template modeling in MMB

Structural alignment based on gapped sequence alignment

Declashing

Flexible fitting to low-resolution density maps

Evaluating ΔΔG for the modeled complexes

ZEMu+ additional flexibility radius

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Supplementary Information (DOC 1293 kb)

Rights and permissions

About this article

Cite this article

Navigation

Modeling and fitting protein-protein complexes to predict change of binding energy

Abstract

Similar content being viewed by others

Modeling Binding Affinity of Pathological Mutations for Computational Protein Design

Methods for Detecting Protein Binding Interfaces

Prediction of Protein–Protein Binding Affinities from Unbound Protein Structures

Introduction

Results

Discussion

Methods

Dataset

Template modeling in MMB

Structural alignment based on gapped sequence alignment

Declashing

Flexible fitting to low-resolution density maps

Evaluating ΔΔG for the modeled complexes

ZEMu+ additional flexibility radius

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Supplementary Information (DOC 1293 kb)

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation