Abstract
One reason why ab initio protein structure predictors do not perform very well is their inability to reliably identify long-range interactions between amino acids. To achieve reliable long-range interactions, all potential pairings of β-strands (β-topologies) of a given protein are enumerated, including the native β-topology. Two very different β-topology scoring methods from the literature are then used to rank all potential β-topologies. This has not previously been attempted for any scoring method. The main result of this paper is a justification that one of the scoring methods, in particular, consistently top-ranks native β-topologies. Since the number of potential β-topologies grows exponentially with the number of β-strands, it is unrealistic to expect that all potential β-topologies can be enumerated for large proteins. The second result of this paper is an enumeration scheme of a subset of β-topologies. It is shown that native-consistent β-topologies often are among the top-ranked β-topologies of this subset. The presence of the native or native-consistent β-topologies in the subset of enumerated potential β-topologies relies heavily on the correct identification of β-strands. The third contribution of this paper is a method to deal with the inaccuracies of secondary structure predictors when enumerating potential β-topologies. The results reported in this paper are highly relevant for ab initio protein structure prediction methods based on decoy generation. They indicate that decoy generation can be heavily constrained using top-ranked β-topologies as they are very likely to contain native or native-consistent β-topologies.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Cheng, J. Baldi, P.: Three-stage prediction of protein beta-sheets by neural networks, alignments and graph algorithms. Bioinformatics 21(Suppl 1), 75–84 (2005)
Cheng, J., Randall, A.Z., Sweredoski, M.J., Baldi, P.: SCRATCH: a protein structure and structural feature prediction server. Nucleic Acids Res. 33, W72–W76 (2005)
Cole, C., Barber, J.D., Barton, G.J.: The Jpred 3 secondary structure prediction server. Nucleic Acids Res. 36, W197–W201 (2008)
Cui, Y., Chen, R.S., Wong, W.H.: Protein folding simulation with genetic algorithm and supersecondary structure constraints. Proteins 31, 247–257 (1998)
Fokas, A.S., Gelfand, I.M., Kister, A.E.: Prediction of the structural motifs of sandwich proteins. Proc. Natl. Acad. Sci. USA 101, 16780–16783 (2004)
Griep, S. Hobohm, U.: PDBselect 1992–2009 and PDBfilter-select. Nucleic Acids Res. 38, D318–319 (2010)
Helles, G. Fonseca, R.: Predicting dihedral angle probability distributions for protein coil residues from primary sequence using neural networks. BMC Bioinformatics 10, 338 (2009)
Jeong, J., Berman, P., Przytycka, T.M.: Improving strand pairing prediction through exploring folding cooperativity. IEEE/ACM Trans. Comp. Bio. Bioinf. 5, 484–491 (2008)
Jones, D.T.: Protein secondary structure prediction based on position-specific scoring matrices’. J. Mol. Biol. 292, 195–202 (1999)
Klepeis, J.L. Floudas, C.A.: ASTRO-FOLD: a combinatorial and global optimization framework for ab initio prediction of three-dimensional structures of proteins from the amino acid sequence. Biophys. J. 85, 2119–2146 (2003)
Lippi, M. Frasconi, P.: Prediction of protein beta-residue contacts by Markov logic networks with grounding-specific weights. Bioinformatics 25, 2326–2333 (2009)
MacCallum, R.M.: Striped sheets and protein contact prediction. Bioinformatics 20(Suppl 1), i224–i231 (2004)
Max, N., Hu, C., Kreylos, O., Crivelli, S.: BuildBeta—a system for automatically constructing beta sheets. Proteins 78, 559–574 (2009)
Moult, J., Fidelis, K., Kryshtafovych, A., Rost, B., Tramontano, A.: Critical assessment of methods of protein structure prediction—Round VIII. Proteins 77(S9), 1–4 (2009)
Porwal, G., Jain, S., Babu, S.D., Singh, D., Nanavati, H., Noronha, S.: Protein structure prediction aided by geometrical and probabilistic constraints. J. Comput. Chem. 28, 1943–1952 (2007)
Rajgaria, R., Wei, Y., Floudas, C.A.: Contact prediction for beta and alpha-beta proteins using integer linear optimization and its impact on the first principles 3D structure prediction method ASTRO-FOLD. Proteins 78(8), 1825–1846 (2010)
Ruczinski, I.: Logic Regression and Statistical Issues Related to the Protein Folding Problem. Ph.D. thesis, Univ. of Washington (2002)
Ruczinski, I., Kooperberg, C., Bonneau, R., Baker, D.: Distributions of beta sheets in proteins with application to structure prediction. Proteins 48, 85–97 (2002)
Siepen, J.A., Radford, S.E., Westhead, D.R.: Beta edge strands in protein structure prediction and aggregation. Protein Sci. 12(10), 2348–2359 (2003)
Tegge, A.N., Wang, Z., Eickholt, J., Cheng, J.: NNcon: improved protein contact map prediction using 2D-recursive neural networks. Nucleic Acids Res. 37(37), W315–W318 (2009)
Zimmermann, O., Hansmann, U.H.E.: Support vector machines for prediction of dihedral angle regions. Bioinformatics 22, 3009–3015 (2006)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Fonseca, R., Helles, G. & Winter, P. Ranking Beta Sheet Topologies with Applications to Protein Structure Prediction. J Math Model Algor 10, 357–369 (2011). https://doi.org/10.1007/s10852-011-9162-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10852-011-9162-4