Summary.
A newly synthesized secretory protein in cells bears a special sequence, called signal peptide or sequence, which plays the role of “address tag” in guiding the protein to wherever it is needed. Such a unique function of signal sequences has stimulated novel strategies for drug design or reprogramming cells for gene therapy. To realize these new ideas and plans, however, it is important to develop an automated method for fast and accurately identifying the signal sequences or their cleavage sites. In this paper, a new method is developed for predicting the signal sequence of a query secretory protein by fusing the results from a series of global alignments through a voting system. The very high success rates thus obtained suggest that the novel approach is very promising, and that the new method may become a useful vehicle in identifying signal sequence, or at least serve as a complementary tool to the existing algorithms of this field.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
P Arrigo F Giuliano F Scalia A Rapallo G Damiani (1991) ArticleTitleIdentification of a new motif on nucleic acid sequence data using Kohonen’s self-organizing map Comput Appl Biosci 7 353–357 Occurrence Handle1913218 Occurrence Handle1:CAS:528:DyaK38XhsVCqsrs%3D
P Baldi S Brunak (1998) Bioinformatics: the machine learning approach MIT Press Cambridge/Mass
JD Bendtsen H Nielsen G von Heijne S Brunak (2004) ArticleTitleImproved prediction of signal peptides: SignalP 3.0 J Mol Biol 340 783–795 Occurrence Handle15223320 Occurrence Handle10.1016/j.jmb.2004.05.028 Occurrence Handle1:CAS:528:DC%2BD2cXlt1emu7s%3D
JD Blake FE Cohen (2001) ArticleTitlePairwise sequence alignment below the twilight zone J Mol Biol 307 721–735 Occurrence Handle11254392 Occurrence Handle10.1006/jmbi.2001.4495 Occurrence Handle1:CAS:528:DC%2BD3MXhvFWns7o%3D
C Chen X Zhou Y Tian X Zou P Cai (2006) ArticleTitlePredicting protein structural class with pseudo-amino acid composition and support vector machine fusion network Anal Biochem 357 116–121 Occurrence Handle16920060 Occurrence Handle10.1016/j.ab.2006.07.022 Occurrence Handle1:CAS:528:DC%2BD28XpsVOgs78%3D
KC Chou (2001a) ArticleTitlePrediction of protein signal sequences and their cleavage sites Proteins Struct Function Genet 42 136–139 Occurrence Handle10.1002/1097-0134(20010101)42:1<136::AID-PROT130>3.0.CO;2-F Occurrence Handle1:CAS:528:DC%2BD3MXitVWntg%3D%3D
KC Chou (2001b) ArticleTitlePrediction of signal peptides using scaled window Peptides 22 1973–1979 Occurrence Handle10.1016/S0196-9781(01)00540-X Occurrence Handle1:CAS:528:DC%2BD38Xitlaguw%3D%3D
KC Chou (2001c) ArticleTitleUsing subsite coupling to predict signal peptides Protein Eng 14 75–79 Occurrence Handle10.1093/protein/14.2.75 Occurrence Handle1:CAS:528:DC%2BD3MXjsVektrs%3D
KC Chou (2002) ArticleTitleReview: Prediction of protein signal sequences Curr Protein Pep Sci 3 615–622 Occurrence Handle10.2174/1389203023380468 Occurrence Handle1:CAS:528:DC%2BD38XosF2ku7w%3D
KC Chou (2004) ArticleTitleReview: Structural bioinformatics and its impact to biomedical science Curr Med Chem 11 2105–2134 Occurrence Handle15279552 Occurrence Handle1:CAS:528:DC%2BD2cXlslWltbw%3D
KC Chou HB Shen (2006) ArticleTitlePredicting protein subcellular location by fusing multiple classifiers J Cell Biochem 99 517–527 Occurrence Handle16639720 Occurrence Handle10.1002/jcb.20879 Occurrence Handle1:CAS:528:DC%2BD28XhtVSktL3J
KC Chou CT Zhang (1995) ArticleTitleReview: Prediction of protein structural classes Crit Rev Biochem Mol Biol 30 275–349 Occurrence Handle7587280 Occurrence Handle1:CAS:528:DyaK2MXosFentb8%3D
R Durbin S Dear (1998) ArticleTitleBase qualities help sequencing software Genome Res 8 161–162 Occurrence Handle9521917 Occurrence Handle1:CAS:528:DyaK1cXitlWlur0%3D
RM Durbin SR Eddy A Krogh G Mitchison (1998) Biological sequence analysis Cambridge University Press Cambridge
O Emanuelsson H Nielsen G von Heijne (1999) ArticleTitleChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites Protein Sci 8 978–984 Occurrence Handle10338008 Occurrence Handle1:CAS:528:DyaK1MXjtFGmsL0%3D
ZP Feng (2001) ArticleTitlePrediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition Biopolymers 58 491–499 Occurrence Handle11241220 Occurrence Handle10.1002/1097-0282(20010415)58:5<491::AID-BIP1024>3.0.CO;2-I Occurrence Handle1:CAS:528:DC%2BD3MXisVSntb8%3D
ZP Feng (2002) ArticleTitleAn overview on predicting the subcellular location of a protein In Silico Biol 2 291–303 Occurrence Handle12542414 Occurrence Handle1:CAS:528:DC%2BD38Xpsl2lu7k%3D
RJ Folz JI Gordon (1987) ArticleTitleComputer-assisted predictions of signal peptidase processing sites Biochem Biophys Res Commun 146 870–877 Occurrence Handle3304288 Occurrence Handle10.1016/0006-291X(87)90611-5 Occurrence Handle1:CAS:528:DyaL2sXlvFeru74%3D
QB Gao ZZ Wang C Yan YH Du (2005) ArticleTitlePrediction of protein subcellular location using a combined feature of sequence FEBS Lett 579 3444–3448 Occurrence Handle15949806 Occurrence Handle10.1016/j.febslet.2005.05.021 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KjsL0%3D
YZ Guo M Li M Lu Z Wen K Wang G Li J Wu (2006) ArticleTitleClassifying G protein-coupled receptors and nuclear receptors based on protein power spectrum from fast Fourier transform Amino Acids 30 397–402 Occurrence Handle16773242 Occurrence Handle10.1007/s00726-006-0332-z Occurrence Handle1:CAS:528:DC%2BD28Xls1egs7o%3D
I Ladunga F Czako I Csabai T Geszti (1991) ArticleTitleImproving signal peptide prediction accuracy by simulated neural network Comput Appl Biosci 7 485–487 Occurrence Handle1747780 Occurrence Handle1:CAS:528:DyaK38Xht1Chu7g%3D
H Liu J Yang JG Ling KC Chou (2005) ArticleTitlePrediction of protein signal sequences and their cleavage sites by statistical rulers Biochem Biophys Res Commun 338 1005–1011 Occurrence Handle16256954 Occurrence Handle10.1016/j.bbrc.2005.10.046 Occurrence Handle1:CAS:528:DC%2BD2MXht1Wjur3F
G Lubec L Afjehi-Sadat JW Yang JP John (2005) ArticleTitleSearching for hypothetical proteins: theory and practice based upon original data and literature Prog Neurobiol 77 90–127 Occurrence Handle16271823 Occurrence Handle10.1016/j.pneurobio.2005.10.001 Occurrence Handle1:CAS:528:DC%2BD2MXht1GhtbvK
RY Luo ZP Feng JK Liu (2002) ArticleTitlePrediction of protein structural class by amino acid and polypeptide composition Eur J Biochem 269 4219–4225 Occurrence Handle12199700 Occurrence Handle10.1046/j.1432-1033.2002.03115.x Occurrence Handle1:CAS:528:DC%2BD38Xnt1eiur8%3D
DJ McGeoch (1985) ArticleTitleOn the predictive recognition of signal peptide sequences Virus Res 3 271–286 Occurrence Handle3000102 Occurrence Handle10.1016/0168-1702(85)90051-6 Occurrence Handle1:CAS:528:DyaL2MXmtV2is78%3D
K Nakai (2000) ArticleTitleProtein sorting signals and prediction of subcellular localization Adv Protein Chem 54 277–344 Occurrence Handle10829231 Occurrence Handle1:CAS:528:DC%2BD3cXltFSqs70%3D Occurrence Handle10.1016/S0065-3233(00)54009-1
SB Needleman CD Wunsch (1970) ArticleTitleA general method applicable to the search for similarities in the amino acid sequence of two proteins J Mol Biol 48 443–453 Occurrence Handle5420325 Occurrence Handle10.1016/0022-2836(70)90057-4 Occurrence Handle1:CAS:528:DyaE3cXktVShu74%3D
H Nielsen J Engelbrecht S Brunak G von Heijne (1997) ArticleTitleIdentification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites Protein Eng 10 1–6 Occurrence Handle9051728 Occurrence Handle10.1093/protein/10.1.1 Occurrence Handle1:CAS:528:DyaK2sXhsVersrs%3D
H Nielsen A Krogh (1998) ArticleTitlePrediction of signal peptides and signal anchors by a hidden Markov model Intell Syst Mol Biol 6 122–130 Occurrence Handle1:STN:280:DyaK1cvls1GntA%3D%3D
B Niu YD Cai WC Lu GY Zheng KC Chou (2006) ArticleTitlePredicting protein structural class with AdaBoost learner Protein Peptide Lett 13 489–492 Occurrence Handle10.2174/092986606776819619 Occurrence Handle1:CAS:528:DC%2BD28XlsVGqs7o%3D
G Schneider S Rohlk P Wrede (1993) ArticleTitleAnalysis of cleavage-site patterns in protein precusor sequences with a perceptron-type neural network Biochem Biophys Res Commun 194 951–959 Occurrence Handle8343174 Occurrence Handle10.1006/bbrc.1993.1913 Occurrence Handle1:CAS:528:DyaK3sXlsFGnt7Y%3D
G Schneider P Wrede (1993) ArticleTitleSignal analysis of protein targeting sequences Protein Seq Data Anal 5 227–236 Occurrence Handle1:CAS:528:DyaK2cXpvFyqtw%3D%3D
XD Sun RB Huang (2006) ArticleTitlePrediction of protein structural classes using support vector machines Amino Acids 30 469–475 Occurrence Handle16622605 Occurrence Handle10.1007/s00726-005-0239-0 Occurrence Handle1:CAS:528:DC%2BD28Xls1ehu7c%3D
G von Heijne (1986) ArticleTitleA new method for predicting signal sequence cleavage sites Nucleic Acids Res 14 4683–4690 Occurrence Handle3714490 Occurrence Handle10.1093/nar/14.11.4683 Occurrence Handle1:CAS:528:DyaL28XksFKmt7w%3D
M Wang J Yang KC Chou (2005a) ArticleTitleUsing string kernel to predict signal peptide cleavage site based on subsite coupling model Amino Acids 28 395–402 Occurrence Handle10.1007/s00726-005-0189-6 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KmtbY%3D
M Wang J Yang ZJ Xu KC Chou (2005b) ArticleTitleSLLE for predicting membrane protein types J Theor Biol 232 7–15 Occurrence Handle10.1016/j.jtbi.2004.07.023 Occurrence Handle1:CAS:528:DC%2BD2cXovVKkur4%3D
Wen Z, Li M, Li Y, Guo Y, Wang K (2006) Delaunay triangulation with partial least squares projection to latent structures: a model for G-protein coupled receptors classification and fast structure recognition. Amino Acids (in press)
X Xiao S Shao Y Ding Z Huang Y Huang KC Chou (2005) ArticleTitleUsing complexity measure factor to predict protein subcellular location Amino Acids 28 57–61 Occurrence Handle15611847 Occurrence Handle10.1007/s00726-004-0148-7 Occurrence Handle1:CAS:528:DC%2BD2MXhsVKqsro%3D
X Xiao SH Shao YS Ding ZD Huang KC Chou (2006a) ArticleTitleUsing cellular automata images and pseudo amino acid composition to predict protein sub-cellular location Amino Acids 30 49–54 Occurrence Handle10.1007/s00726-005-0225-6 Occurrence Handle1:CAS:528:DC%2BD28XhsFCksrk%3D
X Xiao SH Shao ZD Huang KC Chou (2006b) ArticleTitleUsing pseudo amino acid composition to predict protein structural classes: approached with complexity measure factor J Comput Chem 27 478–482 Occurrence Handle10.1002/jcc.20354 Occurrence Handle1:CAS:528:DC%2BD28XitFyqsr4%3D
SW Zhang Q Pan HC Zhang ZC Shao JY Shi (2006) ArticleTitlePrediction protein homo-oligomer types by pseudo amino acid composition: approached with an improved feature extraction and naive Bayes feature fusion Amino Acids 30 461–468 Occurrence Handle16773245 Occurrence Handle10.1007/s00726-006-0263-8 Occurrence Handle1:CAS:528:DC%2BD28Xls1egsr0%3D
GP Zhou (1998) ArticleTitleAn intriguing controversy over protein structural class prediction J Protein Chem 17 729–738 Occurrence Handle9988519 Occurrence Handle10.1023/A:1020713915365 Occurrence Handle1:CAS:528:DyaK1MXnslaltw%3D%3D
GP Zhou K Doctor (2003) ArticleTitleSubcellular location prediction of apoptosis proteins Proteins Struct Funct Genet 50 44–48 Occurrence Handle12471598 Occurrence Handle10.1002/prot.10251 Occurrence Handle1:CAS:528:DC%2BD3sXlsVKmug%3D%3D
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Liu, DQ., Liu, H., Shen, HB. et al. Predicting secretory protein signal sequence cleavage sites by fusing the marks of global alignments. Amino Acids 32, 493–496 (2007). https://doi.org/10.1007/s00726-006-0466-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00726-006-0466-z