Introduction

Ligand docking is a well established computational technique that has been successfully employed in medicinal chemistry to assist drug discovery and lead optimization efforts [1]. The aim of ligand docking is to find the binding pose of a small organic molecule in a receptor pocket, and, if multiple ligands are compared, an estimate of the ligand binding affinity, referred to as the docking score. Several conformational search algorithms and scoring functions have been proposed and their performances have been compared and reviewed [2, 3]. However, most of the comparisons have been performed for the pocket conformations from the complexes with cognate ligands, so called self-docking, thus avoiding the critical problem of overcoming the obstacle of specific ligand-induced conformational changes, a.k.a. the induced fit [4]. Unfortunately, when such a change occurs, neither the ligand pose nor the meaningful binding score can be calculated if the ligand in question induces a pocket rearrangement that is incompatible with the initial pocket geometry.

Figure 1 illustrates the difference between a completely artificial and frequently misleading self-docking and a realistic cross-docking test. Since the soft docking algorithm of Jiang and Kim [5], several attempts have been made to include the induced fit effect in a highly automated and general-purpose protocol [68]. If the diversity of possible conformations is well covered by multiple experimental structures, the multiple receptor conformers docking (MRC) is a natural extension to a single rigid receptor docking. Multiple receptor conformations can be retrieved from NMR [9] or X-ray diffraction studies [10]. Several crystals can be employed to calculate average potential grids to be later employed in ligand docking [11]. Other approaches combinatorially merge flexible parts of different protein structures to generate new conformations [12]. Huang and Zou have proposed an ensemble docking procedure that, after an optimization step, automatically selects the structure where the ligand best fits [13]. When several experimental structures are not available, multiple conformations can be generated by means of computational methods like molecular dynamics simulations [1416], relevant normal modes analysis [17], and stochastic pseudo-Brownian and internal coordinate (ICM) sampling [18]. Later the receptor flexibility was introduced into docking and virtual screening with a more complex protocol based on the ICM stochastic optimizer [19]. Their protocol was tested on 33 members of four different protein kinase families and its efficacy was assessed both in terms of docking accuracy and of enrichment factors. Zhao and Sanner encoded the receptor conformational space as a variable for genetic algorithm [20]. Their test set mainly consisted of Hiv protease—inhibitor co-crystals. Sherman et al. developed a procedure where rigid receptor docking and protein modelling are combined to independently sample ligand and receptor degrees of freedom [21]. After a first soft potential docking simulation, the receptor structure is optimized around the tentative ligand poses. The protein conformations from the best scoring complexes are employed in a second docking run. This time a hard potential function is used. Twenty one complexes, encompassing 20 ligands co-crystallized at the binding site of 11 different proteins, were included in the validation benchmark. A ligand pose with RMSD below 2 Å from the crystal provided the best ranking solution in 18 out of 21 examples. Meiler and Baker extended to ligand docking the previously reported ROSETTADOCK [22] protein—protein docking algorithm [23]. The ligand flexibility was represented by a set of discrete conformations while the side chains are explicitly sampled during a Monte Carlo optimization. A backbone-dependent rotamer library was used in the calculations. Twenty cross docking runs were carried out on 18 structures of 8 different proteins. The best scoring solution had an RMSD lower than 2 Å in 14 examples. However, the reported RMSD values are artificially lower and incompatible with other publications since they included not only the ligand atoms but also the binding pocket side chains atoms.

Fig. 1
figure 1

Cartoon representation of a cross docking experiment: (a) different co-crystals of the same protein display the so called induced fit effect; (b) limitations of a rigid receptor cross docking protocol; (c) real life expectations from a flexible receptor cross docking protocol

Almost all those docking protocols were tested on small and hand-picked cross-docking benchmarks. The difficulty of those benchmarks was not carefully quantified. Secondly, any enhanced docking protocol introducing new variables, such as receptor variations, or water molecule positions, while solving some difficult cases, may result in a worse overall performance in cases where such variation is not an issue. That effect needs to be monitored. Finally, the influence of the “known answer” in the form of (i) the bounding box defined by the correct ligand pose or (ii) the knowledge of which parts of the proteins are flexible deduced from multiple receptor structures including those close or identical to the correctly induced ones, were not fully eliminated.

In this paper we presented a new induced fit docking algorithm, nicknamed SCARE, which docks a ligand to receptor structure or a model by homology represented by a single conformer. The algorithm used no prior information about the ligand binding geometry, pocket location, preferred bounding box, ligand covalent geometry, locations or extent of the flexible receptor regions. Only the best rank prediction was considered. The performance of the SCARE algorithm was evaluated on a diverse set of 30 cross docking ligand–receptor pairs, and was compared with a single static receptor ICM docking procedure.

Results and discussion

The outline of the algorithm

The SCARE algorithm is built on top of well established and optimized ICM docking and scoring algorithm to a single static receptor [2426]. The SCARE algorithm is based on the following general scheme:

  1. (i)

    produce multiple variants of the receptor pocket (e,g, gapped versions of a single pocket conformer);

  2. (ii)

    dock a flexible ligand to each of the variants of the receptor pocket and record one or several (up to 5) best scored poses; geometrically cluster them and select best scoring position from each cluster;

  3. (iii)

    restrain the ligand for each of the unique ligand poses and globally optimize the receptor pocket around its pose (geometrically redundant poses are eliminated before refinement);

  4. (iv)

    re-score all optimized ligand–receptor pairs and select the top scoring pose.

In this general form the algorithm can be applied to both multiple known receptor conformations or to a single receptor conformation combined with a receptor variant generation algorithm. That may include combinations of both experimental and generated variants.

The key additional features of the SCARE algorithm are the following.

  1. (1)

    The unbiased determination of the bounding box and pocket residues. Not all binding pockets are fully enclosed cavities with a fixed volume. Most frequently the site is relatively open and extended in several directions. Furthermore, we did not want to rely on any knowledge about the binding site of other ligands since in some cases no single binder is known, in the other, the binding locations of different ligands may differ substantially. Therefore, determining the extent of the binding pocket and, consequently, the extent of the bounding box for the grid potential becomes a critical determinant of the docking success. If the box is too small or shifted the failure is guaranteed, if it is too large, the chances of success diminish with the size. Here we used our previously published Lennard–Jones-convolution method [27, 28] to determine the bonding box and the residues under consideration. All the residues with at least one side chain non-hydrogen atom in the range of 3.0 Å from the generated mesh were considered part of the pocket.

  2. (2)

    The Alanine Scanning. To alter each experimental starting conformation, instead of producing different conformers for side-chains or backbone as in [21], we generated a variety of “gapped” receptor pocket models in which parts of the pockets were systematically omitted. The expectation was that in at least one of the models the main obstacle for the correct positioning of the ligand was removed, while the remaining “ungapped” part of the pocket was still able to position it correctly.

In a general form of the algorithm the gaps may include single side-chains, multiple side-chains, loops, and parts of the backbone; in this paper, we tested three scenarios: single, dual, and triple residues alanine scanning. To eliminate combinatorial explosion of the number of combinations for dual and triple alanines we noticed that it is sufficient to mutate only one group of proximal side-chains at a time. Rather than randomly scattered couples or triplets, mutations concentrated in a very specific sub-region were more likely to redefine the binding pocket steric profile significantly enough to affect the outcome of the docking simulations.

In Fig. 2, the effect of a double mutation on the steric profile of a binding site is described. In the receptor structure extracted from the PPARγ—Rosiglitazone co-crystal (PDBid: 2PRG), the pocket was too small to properly accommodate the agonist G1262570; the residues Phe310 and Phe391 blocked a small lipophilic region that was available in the reference structure (PDBid: 1FM9). A single rigid receptor cross docking simulation could not possibly reproduce the experimental binding mode (Fig. 2a). When the hampering residues were mutated into alanine, the pocket shape was modified enough for the extra region to become available and a near native conformation of the bigger agonist could be sampled (Fig. 2b).

  1. (3)

    The Full Pocket Refinement. The unique top-scoring ligand poses were restrained in a full side-chain pocket model and underwent a refinement procedure. The refinement was mainly focused on the receptor flexibility. While the ligand was strongly tethered during the whole procedure, the pocket residues were actively optimized for both side-chain and the backbone. After the refinement, a re-scoring was performed. Re-scoring was based on a version of the standard ICM scoring function modified as to include the receptor contributions at the free energy of binding [19, 29, 30].

Figure 3 schematically summarizes the SCARE stepwise process.

Fig. 2
figure 2

Effect of alanine scanning during posing: (a) the experimental binding mode of the inhibitor G1262570 (grey) cannot be reproduced because of the conformation adopted by Phe310 and Phe391; (b) the dual mutation into alanine removes the steric hindrance and thus the ligand can be accommodated

Fig. 3
figure 3

A schematic outline of the SCARE docking algorithm

Analysis of the benchmark

The presented benchmark is, to best of our knowledge, the only cross docking test set to include structures from as many as 16 different proteins. The complete list of structures is reported in Table 1.

Table 1 List of the crystallographic structures included in the cross docking benchmark

A detailed comparative analysis of the cross-docking pairs in our benchmark was performed and the actual challenge posed by each pair could be directly measured by the dissimilarity of the binding sites. For each cross-docking pair, the ligand-binding site was identified as the set of all residues with at least one non hydrogen atom within 3 Å from the mesh of the ligand experimental pose. The size of the binding site was ranging from 13 to 29 residues, with the mean value of 19 residues.

To evaluate the magnitude of the induced fit, the rigid cores of the binding sites in the receptor and the reference structures were superimposed. This was done by means of a procedure that iteratively down-weighted the contributions from the more flexible regions. After superimposition, the average RMSD between the binding regions of the receptor and reference structures was found to be 0.56 Å for backbone atoms and 1.32 Å for all non-hydrogen atoms. The number of residues that displayed above-average displacement for the given pair was always comparatively low, never more than four amino acids.

In Table 2, the RMSD values are reported for each pair of structures. The residues whose RMSD was over 1 Å higher than the pocket average and the residues that in the receptor adopted a conformation that would lead to a clash with the ligand native conformation, are listed explicitly. A clash is originated when one or more non hydrogen atoms of a residue lie within 1 Å from the ligand. A close inspection of this data suggested that the differences between the selected structures were mainly due to a few specific side chains displacements and very limited backbone movements rather than extended loop transitions and domain motions. Furthermore, the clashing residues were not scattered around the pocket but they rather tended to cluster, spatially reshaping a very specific sub-region.

Table 2 Description and comparison of the benchmark binding sites; residues with at least one side chain heavy atom in the range of 3.0 Å from the ligand are listed

Cross docking to a single receptor conformer

There were two objectives of this exercise. First, we needed to establish the ability and extent of a single receptor conformation docking procedure to tolerate the induced fit distortions and optimize the parameters of the single pocket ICM protocol for better performance; the resulting performance also reflected the difficulty of the benchmark. Second, we needed to establish a baseline performance to make sure that the introduction of dozens of pocket variations does not lead to a drop in the performance for those simpler cases.

We used the grid docking protocol implemented in ICM [2426], in which the ligand binding site at the receptor is represented by pre-calculated potential grid maps, and the van der Waals potential is modified to reduce the repulsive part of the standard 6–12 Lennard–Jones equation. The binding region location was assumed to be known, defined by all the residues with at least one side chain non-hydrogen atom in the range of ∼5.0 Å from the ligand experimental conformation. The protocol was tested with two different cut-off values for the repulsive part of van der Waals energy: 4 kcal/mol and 1 kcal/mol.

When a 4 kcal/mol truncated potential was considered, a near-native pose was found at rank one in 12 out 30 cases (40% success rate). In 11 more cases a near-native pose was found among top 20 solutions but was not ranked first, while in the 7 remaining cases, no pose close to the crystallographic binding mode was selected.

When the van der Waals potential was capped at a more permissive level of 1 Kcal/mol, the results improved. A near native pose could be sampled and assigned the most favorable score in 15 of 30 cases (50% success rate—see for example Fig. 4a). In 10 of the remaining 15 cases, a near native structure could be sampled but it was not assigned the best score (Fig. 4b). In 5 cases, the crystallographic ligand pose could never be found (even with very high scores) in a cross-docking run. This was due to the side chain conformation of at least one residue being significantly different from the reference structure (Fig. 4c). Table 3 summarizes the cross-docking performance of the single rigid receptor docking protocol.

Fig. 4
figure 4

Three possible outcomes in a single cross grid docking experiment: (a) the experimental pose of the 6-Bromoindirubin-3′-Oxime inhibitor at the binding site of GSK-3β can be correctly reproduced and ranked first according to the energy scoring; (b) the experimental pose of the inhibitor TNK-6123 at the binding site of Hiv1 RT can be correctly reproduced but it is not assigned the top scoring position; (c) the displacement of the Phe340 side chain prevents the selection of any near native pose of the inhibitor GW3965 at the binding site of LXRβ ligand binding domain. The ligand experimental poses are reported in grey, those predicted by single cross grid docking are coloured by atom types

Table 3 The ICM cross docking results with a single pocket conformer (the pocket boundaries are defined by all the residues within ∼5 Å from the ligand experimental pose)

Therefore, for one half of the cross-docking runs the near native pose was not the best scoring one, and for one third of those cases it was never found despite the already “softened” single grid docking protocol. To overcome this limitation the variations of that initial receptor pocket needed to be considered.

Scanning alanines through the pocket residues

In order to overcome the limitations imposed by the rigid structure approach, a protocol based on multiple generated receptor variants was developed. The key idea of this step of the procedure stems from the GAP approach used in [31]. It established that having no atoms (hence removing all side-chain atoms except Cb) for a part of the environment of the currently optimized element (e.g. side chain or a ligand) may be preferred over trying out different conformers for that part of the environment. Here we propose to generate multiple pockets variants by systematically removing clusters of side chain atoms. The cluster may be represented by a single side chain (code ‘S’ for single), two neighboring side chains (code ‘D’ for dual), three neighboring side chains (code ‘T’ for triple) and the combinations of ‘S’,’D’ and ‘T’ (code ‘A’ all) (see Table 4 for details). Table 5 shows the small numbers of scans for each pocket. Cross-docking the ligand to all pocket variants dramatically improved the results with most of the hit collection protocols. We tested each protocol to find the optimal combination of the scanning method and the number of retained top hits from each docking run.

Table 4 Codes for the compared SCARE protocols defining the number of alanines in a scanned group and the number of top solutions selected from each grid docking for all atom refinement
Table 5 The performance of the DA1 protocol: total number of poses to refine and fraction of near native poses among them

Table 4 contains a brief definition of each assembling protocol. The individual performance of each assembling protocol was assessed considering the average results among the thirty complexes in the benchmark. The AA and the DA protocols, no matter how many top scoring conformations were retrieved, always provided at least one near native conformation for every complex in the benchmark (Fig. 5a). Among those, DA1 and AA1 were the protocols that provided the higher fraction of near native conformations, 36.7% and 35.5% among the total poses to refine, respectively (Fig. 5b). However, adopting DA1 as the assembling protocol, one might expect an average lower number of poses to refine, namely 12.2, rather than the 17.6 returned by AA1 (Fig. 5c). The SA and TA protocols did not improve the quality of the final results when combined together with the dual alanine runs.

Fig. 5
figure 5

Comparison of different method to select the poses for refinement: (a) success rate of each method at including at least one near native conformation for each receptor in the benchmark; (b) average fraction of near native conformations among all the poses to refine provided by the six 100% successful methods; (c) average number of poses to refine provided by the six 100% successful methods

In light of the above results, the ensemble of ligand poses provided by DA1 (scanning pairs of interacting side chains and retaining only the top hit for clustering and refinement) was selected to undergo the refinement procedure. A clustering procedure was used to reduce the geometrical redundancy of the set. The heavy atom ligand pose RMSD cut-off distance was set equal to 1.0 Å. Table 5 demonstrates that the number of ligand poses needed to be refined is now relatively small (median of 13.7). As we expected, the near native ligand poses were now found for all 30 cross docking tasks.

Ligand poses after full receptor refinement

A near native conformation could always be selected among the ten best ranking positions if the bounding box for docking was derived from the residues surrounding the correct ligand position and expanded by 5 Å (a common practice, e.g. [3]). In as many as 27 out of 30 cases, the correct binding mode was predicted by the best ranking pose. In the remaining three difficult cases, the first near native conformation achieved the third, fifth, and sixth rank, respectively. Seven cross docking pairs that could not be ranked first after the single receptor grid docking received rank 1 after refinement and rescoring. For example, in single receptor grid docking, the first near native structure of FXV673 at the binding site of the coagulation factor Xa (PDBid: 1XKA) was predicted with a poor binding score and placed at the nineteenth position of the final ranking (Fig. 6a). The SCARE protocol both improved the geometrical accuracy and resulted in the top rank for the correct pose (Fig. 6b). In five examples where the single receptor protocol could not identify the near native pose at all, the alanine mutation protocol generated near-native conformations. For example, the experimental structure of the inhibitor Compound 1 at the binding site of JNK3 (Fig. 6c) could be accurately predicted and assigned the best binding score (Fig. 6d) with the full SCARE protocol. Both single docking and SCARE protocols could correctly reproduce the binding mode of the inhibitor I–5 at the binding site of GSK3-B (PDBid: 1UV5) and that of the inhibitor 1051U91 at the binding site of the Hiv1 reverse transcriptase (PDBid: 1C1C). However, both protocols failed at assigning them the top binding score. Finally, while the single receptor docking protocol could correctly reproduce the binding mode of the 4-anilinoquinazoline at the binding site of P38 kinase (PDBid: 1BMK) and assign it the best score, the first near native conformation was ranked third by the SCARE protocol.

Fig. 6
figure 6

Improved results due to the SCARE protocol: (a) docking the inhibitor FXV673 at the pocket of Factor Xa, the best result that can be achieved employing the single rigid receptor docking is a high energy pose with a RMSD of 2.0 Å; (b) optimizing the position of Gln416 side chain, the alanine mutation protocol provide a top scoring conformation of the inhibitor with RMSD as low as 0.6 Å; (c) the displacement of the Met146 side chain prevents the selection of any near native pose of the Compound 1 inhibitor at the binding site of JNK3; (d) optimizing the position of Met146 side chain, the alanine mutation protocol provide a top scoring conformation of the inhibitor with 1.1 Å RMSD. The ligand experimental poses are reported in grey, those predicted by the SCARE docking algorithm are coloured by atom types

The results of the SCARE docking are summarized in Table 6. The proposed protocol clearly outperformed the single receptor docking for both simple and hard tasks: it was always able to provide at least a near native conformation among the top ten positions during the posing phase and the success rate in terms of top-ranked conformations rose from 50% to 90%.

Table 6 The SCARE docking results (the pocket boundaries are defined by all the residues within ∼5 Å from the ligand experimental pose)

The effect of the bounding box on the SCARE algorithm performance

The size and location of the bounding box for the docking simulation may have a profound effect on the docking performance even with a generous margin of 5 Å, especially for relatively open binding sites. However, the usual practice in benchmark calculations was to use the correct ligand pose to deduce the bounding box. In an attempt to simulate a truly unbiased situation in which no assumptions are made about the binding pose for a particular ligand, we used the largest envelope predicted by the Pocketome Gaussian Convolution algorithm [27] applied to the initial receptor conformation to define the bounding box.

In most cases, the experimental pocket and the predicted pocket overlapped without significantly affecting the “correct” ligand volume (Fig. 7a). In several structures, the two pockets overlapped almost completely except for a comparatively low number of amino acids that remained specific of one pocket or the other (Fig. 7b). This situation was expected when the receptor and the reference structure differed only because of a bulky side chain that adopted a different conformation. However, when the ligand was bound at a very specific sub-region of a bigger cavity, the predicted pocket volume was much greater than the experimental pocket one (Fig. 7c). Finally, only in the two Thymidine Kinase structures, the experimental pocket and the predicted pocket perfectly matched (Fig. 7d).

Fig. 7
figure 7

Comparison between experimental pocket and predicted pocket: (a) relative number of amino acids that define the two pockets; the residues that define only the experimental pocket, the overlapping ones, and those that define only the predicted pocket, are represented by the red, the green, and the blue portion of the bar, respectively; (b) comparison of the experimental (red) and predicted (blue) binding pocket of Aldose Reductase; the residues that define only the experimental pocket, the overlapping ones, and those that define only the predicted pocket, are represented in red, green, and blue, respectively; (c) comparison of the experimental (red) and predicted (blue) binding pocket of GSK-3β ; the residues that define only the experimental pocket, the overlapping ones, and those that define only the predicted pocket, are represented in red, green, and blue, respectively; (d) comparison of the experimental (red) and predicted (blue) binding pocket of Thymidine Kinase; all the residues (green) overlap

The single rigid receptor docking was not deeply influenced by the use of the predicted bounding box (see Table 7). The van der Waals potential truncated at 4.0 kcal/mol runs provided 13 correct predictions, 11 cases a near native conformation was not ranked among the twenty best positions, and 6 cases where no near native conformation was found. The more permissive potential truncated at 1.0 kcal/mol provided better results: 14 correct predictions and only 5 cases with no near-native ligand conformation. Again, in 11 cases a near native conformation was found but it was not ranked in the top position. The binding mode of the 4-anilinoquinazoline at the binding site of the p38 kinase (PDBid: 1BMK) was affected by the unbiased box definition. Similarly, the first near native conformation of the inhibitor 3-TAPAP at the Trypsin binding site (PDBid: 1PPC) was ranked second. Surprisingly, in case of the inhibitor H8 bound to PKA (PDBid: 1YDS) we observed a reversed picture.

Table 7 The ICM cross docking results with a single pocket conformer using the predicted binding pocket and bounding box and different softness levels

The unbiased definition of the binding region was next implemented in the SCARE protocol. In Table 8, the results are reported. A near native conformation was ranked first in 24 cases out of 30 (80% of cross-docks vs 46% with a single receptor). For five ligands, a near native conformation could be found among the top ten refined poses. In only one case, the procedure could not provide any near native structure (see Table 8).

Table 8 The unbiased SCARE docking protocol results using a predicted binding pocket and the bounding box

The predicted bounding box affected the near native score in case of CDK2 (PDBid: 1DM2). Likewise, small differences in the pocket definition compromised the final ranking of the inhibitor 3-TAPAP at the binding site of Trypsin (PDBid: 1PPC). When docking the inhibitor I–5 at the binding site of GSK-3β (PDBid: 1UV5), no good predictions could be achieved. The predicted pocket included eight amino acids that were not part of the experimental range: the ligand experimental binding region only occupies a fraction of what the prediction algorithm correctly recognized as a larger pocket. While the bigger pocket volume led to a larger amount of false positives, the maleimidic moiety responsible for the main interaction with the receptor and a bifurcated hydrogen bond with the hinge region of the kinase, matched the experimental binding mode almost perfectly (RMSD 0.6 Å) in the top scoring pose.

Conclusions and outlook

Accurate prediction of the binding pose of a flexible ligand to partially flexible binding pocket remains one of the main challenges of computer aided drug discovery and virtual ligand screening. Here we presented a fully automated algorithm that would predict the binding mode of a single ligand to a single distorted pocket conformation taken from another ligand–receptor complex or from an apo-structure. The SCARE algorithm does not need any prior knowledge of the ligand binding location or information about the sites of potential variability of the receptor pocket. Yet it achieves a significant improvement of the number of correct predictions compared with highly optimized single pocket conformation docking procedures. Figure 8 summarizes the improvements for the two methods of determining the docking bounding box on a diverse benchmark of 30 cross docking ligand–protein pairs.

Fig. 8
figure 8

Summary of the results for the 30 complexes in the cross docking benchmark, employing (a) single cross grid docking; the pocket boundaries are defined by the residues within 5 Å from the experimental ligand position, (b) SCARE docking; the pocket boundaries are defined by the residues within 5 Å from the experimental ligand position, (c) single cross grid docking; the pocket boundaries are defined by the residues within 3 Å from the predicted pocket envelope, and (d) SCARE docking; the pocket boundaries are defined by the residues within 3 Å from the predicted pocket envelope

The idea of removing the movable parts of the environment (e.g. mutating side chains to alanines) rather than sampling them to generate candidates for refinement was successfully applied to side chain placements [31] and then to ligand docking for a preselected list of side-chains [21]. Here we used a fully automated dual alanine scan of instead of concurrent removal of all preselected “flexible” side chains in an attempt to create an unbiased general method applicable to a single protein binding pocket model. In this implementation of the SCARE protocol the number of pocket variants and, consequently, refined ligand poses, was relatively small, and only small adjustments of the protein backbone were allowed during the refinement. However the protocol is easily extendable to a more aggressive loop sampling algorithms. The SCARE paradigm can also be combined with the traditional MRC approaches relying on multiple crystal structures or generated structures [17, 32, 33].

The current benchmark was relatively limited, in part due to our desire to stick to the previously published induced fit benchmarks, and in part due to the time constraints of the procedure. The future algorithms would benefit from at least an order of magnitude larger high quality cross docking benchmarks that are now available [34] or under development (Kufareva et al., manuscript in preparation). Overall, we believe that SCARE represents a reliable and practical unbiased “induced fit” docking protocol to assist rational structure based drug design, optimization and virtual screening efforts.