Abstract
In this paper, according to evolutionary information and physicochemical properties, we selected eight features, combined with Rotation Forest (RotF) to predict interaction sites. We built two models on both balanced datasets and imbalanced datasets, named balanced-RotF and unbalanced-RotF, respectively. The values of accuracy, F-Measure, precision, recall and CC of balanced-RotF were 0.8133, 0.8064, 0.8375, 0.7775 and 0.6283 respectively. The values of accuracy, precision and CC of unbalanced-RotF increased by 0.0679, 0.0122 and 0.0361 over balanced-RotF. Precision values of unbalanced-RotF on our four selected testing sets were 0.907, 0.875, 0.878 and, 0.889, respectively. Moreover, experiment only using two physicochemical features showed evolutionary information has effective effects for classification.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Kini, R.M., Evans, H.J.: Prediction of potential protein-protein interaction sites from amino acid sequence identification of a fibrin polymerization site. FEBS Lett. 385(1-2), 81–86 (1996)
Tuncbag, N., Keskin, O., Nussinov, R., Gursoy, A.: Fast and accurate modeling of protein–protein interactions by combining template-interface-based docking with flexible refinement. Proteins: Structure, Function, and Bioinformatics 80(4), 1239–1249 (2012)
Zhang, S.W., Hao, L.Y., Zhang, T.H.: Prediction of protein–protein interaction with pairwise kernel Support Vector Machine. International Journal of Molecular Sciences 15(2), 3220–3233 (2014)
Konc, J., Janezic, D.: Protein-protein binding-sites prediction by protein surface structure conservation. J. Chem. Inf. Model. 47(3), 940–944 (2007)
Yuehui, C., Jingru, X., Bin, Y., Yaou, Z., Wenxing, H.: A novel method for prediction for protein interaction sites based on integrated RBF neural networks. Computers in Biology and Medicine 42(4), 402–407 (2012)
You, Z.H., Lei, Y.K., Zhu, L., Xia, J., Wang, B.: Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis. BMC Bioinformatics 14(suppl. 8), S10 (2013)
Fariselli, P., Pazos, F., Valencia, A., Casadia, R.: Prediction of protein-protein interaction sites in heterocomplexes with neural networks. Eur. J. Biochem. 269, 1356–1361 (2002)
Sriwastava, B.K., Basu, S., Maulik, U., Plewczynski, D.: PPIcons: Identification of protein-protein interaction sites in selected organisms. J. Mol. Model. 19(9), 4059–4070 (2013)
Chen, C.T., Peng, H.P., Jian, J.W.: Protein-protein interaction site predictions with three-dimensional probability distributions of interacting atoms on protein surfaces. PLOS One 7(6), e37706 (2012)
Minhas, F.U., Geiss, B.J., Ben-hur, A.: PAIRpred: Partner-specific prediction of interacting residues from sequence and structure. Proteins: Structure, Function, and Bioinformatics (2013)
Chen, P., Li, J.: Sequence-based identification of interface residues by an integrative profile combining hydrophobic and evolutionary information. BMC Bioinformatics 11, 402–416 (2010)
Li, B.Q., Feng, K.Y., Chen, L., Huang, T., Cai, Y.D.: Prediction of Protein-Protein Interaction Sites by Random Forest Algorithm with mRMR and IFS. PLOS One 7(8), e43927 (2012)
Li, C.X., Drena, D., Vasant, H.: HomPPI: a class of sequence homology based protein-protein interface prediction methods. BMC Bioinformatics 12, 244–267 (2011)
Jordan, R.A., EI-Manzalawy, Y., Dobbs, D., Honavar, V.: Predicting protein-protein interface residues using local surface structural similarity. BMC Bioinformatics 13, 41–44 (2012)
Xu, B., Wei, X., Deng, L., Guan, J., Zhou, S.: A semi-supervised boosting SVM for predicting hot spots at protein-protein interfaces. BMC Syst. Biol. 6(suppl. 2) (2012)
Sriwastava, B.K., Basu, S., Maulik, U., Plewczynski, D.: PPIcons: identification of protein-protein interaction sites in selected organisms. J. Mol. Model. 19(9), 4059–4070 (2013)
Hwang, H., Vreven, T., Weng, Z.: Binding interface prediction by combining protein-protein docking results. Proteins 10, 1002 (2013)
Gallet, X., Charloteaux, B., Thomas, A., Brasseur, R.: A fast method to predict protein interaction sites from sequences. J. Mol. Biol. 302(4), 917–926 (2000)
Chris, S., Reinhard, S.: Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins Struct. Funct. Bioinforma. 9(1), 56–68 (1991)
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11(1) (2009)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2(27), 1–27 (2011)
Porollo, A., Meller, J.: Prediction-based fingerprints of protein-protein interactions. Proteins Struct. Funct. Bioinforma 66(3), 630–645 (2007)
Neqi, S.S., Schein, C.H., Oezquen, N., Power, T.D., Braun, W.: InterProSurf: a web server for predicting interacting sites on protein surfaces. Bioinformatics 23(24), 3397–3399 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Hu, X., Jing, A., Du, X. (2014). Predicting Protein-Protein Interaction Sites by Rotation Forests with Evolutionary Information. In: Huang, DS., Han, K., Gromiha, M. (eds) Intelligent Computing in Bioinformatics. ICIC 2014. Lecture Notes in Computer Science(), vol 8590. Springer, Cham. https://doi.org/10.1007/978-3-319-09330-7_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-09330-7_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09329-1
Online ISBN: 978-3-319-09330-7
eBook Packages: Computer ScienceComputer Science (R0)