Abstract
Several computational methods have been developed to predict amyloid propensity of a protein or peptide. These bioinformatics tools are time- and cost-saving alternatives to expensive and laborious experimental methods which are used to confirm self-aggregation of a protein. Computational approaches not only allow preselection of reliable candidates for amyloids but, most importantly, are capable of a thorough and informative analysis of a protein, indicating the sequence determinants of protein aggregation, identifying the potential causal mutations and likely mechanisms. Bioinformatics modeling applies several different approaches, which most typically include physicochemical or structure-based modeling, machine learning, or statistics based modeling. Bioinformatics methods typically use the amino acid sequence of a protein as an input, some also include additional information, for example, an available structure. This chapter describes the methods currently used to computationally predict amyloid propensity of a protein or peptide. Since the accuracy of bioinformatics methods may be highly dependent on reference data used to develop and evaluate the predictors, we also briefly present the main databases of amyloids used by the authors of bioinformatics tools.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Lopez de la Paz M, Serrano L (2004) Sequence determinants of amyloid fibril formation. Proc Natl Acad Sci U S A 101:87–92
Thompson MJ, Sievers SA, Karanicolas J et al (2006) The 3D profile method for identifying fibril-forming segments of proteins. Proc Natl Acad Sci U S A 103:4074–4078
Pawlicki S, Le Bechec A, Delamarche C (2008) AMYPdb: a database dedicated to amyloid precursor proteins. BMC Bioinformatics 9:273
Beerten J, Van Durme J, Gallardo R et al (2015) WALTZ-DB: a benchmark database of amyloidogenic hexapeptides. Bioinformatics 31:1698–1700
Louros N, Konstantoulea K, De Vleeschouwer M et al (2020) WALTZ-DB 2.0: an updated database containing structural information of experimentally determined amyloid-forming peptides. Nucleic Acids Res 48(D1):D389–D393
Wozniak PP, Kotulska M (2015) AmyLoad: website dedicated to amyloidogenic protein fragments. Bioinformatics 31:3395–3397
Thangakani AM, Nagarajan R, Kumar S et al (2016) CPAD, curated protein aggregation database: a repository of manually curated experimental data on protein and peptide aggregation. PLoS One 11:e0152949
Rawat P, Prabakaran R, Sakthivel R, Mary Thangakani A, Kumar S, Gromiha MM (2020) CPAD 2.0: a repository of curated experimental data on aggregating proteins and peptides. Amyloid 27:128
Varadi M, De Baets G, Vranken WF et al (2018) AmyPro: a database of proteins with validated amyloidogenic regions. Nucleic Acids Res 46(D1):D387–D392
Goldschmidt L, Teng PK, Riek R et al (2010) Identifying the amylome, proteins capable of forming amyloid-like fibrils. Proce Natl Acad Sci USA 107:3487–3492
Nastou KC, Tsiolaki PL, Iconomidou VA (2019) The amyloid interactome: mapping protein aggregation. Amyloid 26(Suppl 1):142–143
Fernandez-Escamilla AM, Rousseau F, Schymkowitz J, Serrano L (2004) Prediction of sequence-dependent and mutational effects on the aggregation of peptides and proteins. Nat Biotechnol 22(10):1302–1306
Rousseau F, Schymkowitz J, Serrano L (2006) Protein aggregation and amyloidosis: confusion of the kinds? Curr Opin Struct Biol 16(1):118–126
Trovato A, Chiti F, Maritan A, Seno F (2006) Insight into the structure of amyloid fibrils from the analysis of globular proteins. PLoS Comput Biol 2(12):e170
Walsh I, Seno F, Tosatto SC, Trovato A (2014) PASTA 2.0: an improved server for protein aggregation prediction. Nucleic Acids Res 42:W301–W307
Conchillo-Solé O, de Groot NS, Avilés FX et al (2007) AGGRESCAN: a server for the prediction and evaluation of “hot spots” of aggregation in polypeptides. BMC Bioinformatics 8:65
de Groot NS, Aviles FX, Vendrell J, Ventura S (2006) Mutagenesis of the central hydrophobic cluster in Abeta42 Alzheimer’s peptide. Side-chain properties correlate with aggregation propensities. FEBS J 273(3):658–668
Zambrano R, Jamroz M, Szczasiuk A et al (2015) AGGRESCAN3D (A3D): server for prediction of aggregation properties of protein structures. Nucleic Acids Res 43(W1):W306–W313
Kuriata A, Iglesias V, Kurcinski M et al (2019) Aggrescan3D standalone package for structure-based prediction of protein aggregation properties. Bioinformatics 35(19):3834–3835
Kuriata A, Iglesias V, Pujols J et al (2019) Aggrescan3D (A3D) 2.0: prediction and engineering of protein solubility. Nucleic Acids Res 47(W1):W300–W307
Jamroz M, Kolinski A, Kmiecik S (2013) CABS-flex: server for fast simulation of protein structure fluctuations. Nucleic Acids Res 41:W427–W431
Kuriata A, Gierut AM, Oleniecki T et al (2018) CABS-flex 2.0: a web server for fast simulations of flexibility of protein structures. Nucleic Acids Res 46(W1):W338–W343
Kim C, Choi J, Lee SJ et al (2009) NetCSSP: web application for predicting chameleon sequences and amyloid fibril formation. Nucleic Acids Res 37:W469–W473
Tian J, Wu N, Guo J, Fan Y (2009) Prediction of amyloid fibril-forming segments based on a support vector machine. BMC Bioinformatics 10(Suppl 1):S45
Kawashima S, Pokarowski P, Pokarowska M et al (2008) AAindex: amino acid index database, progress report 2008. Nucleic Acids Res 36:D202–D205
Kotulska M, Unold O (2013) On the amyloid datasets used for training PAFIG - how (not) to extend the experimental dataset of hexapeptides. BMC Bioinformatics 14:351
Frousios KK, Iconomidou VA, Karletidi CM, Hamodrakas SJ (2009) Amyloidogenic determinants are usually not buried. BMC Struct Biol 9:44
Tsolis AC, Papandreou NC, Iconomidou VA, Hamodrakas SJ (2013) A consensus method for the prediction of ‘aggregation-prone’ peptides in globular proteins. PLoS One 8(1):e54175
Hamodrakas SJ, Liappa C, Iconomidou VA (2007) Consensus prediction of amyloidogenic determinants in amyloid fibril-forming proteins. Int J Biol Macromol 41(3):295–300
Galzitskaya OV, Garbuzynskiy SO, Lobanov MY (2006) Prediction of amyloidogenic and disordered regions in protein chains. PLoS Comput Biol 2(12):e177
Garbuzynskiy SO, Lobanov MY, Galzitskaya OV (2010) FoldAmyloid: a method of prediction of amyloidogenic regions from protein sequence. Bioinformatics 26(3):326–332
Zhang Z, Chen H, Lai L (2007) Identification of amyloid fibril-forming segments based on structure and residue-based statistical potential. Bioinformatics 23(17):2218–2225
O’Donnell CW, Waldispühl J, Lis M et al (2011) A method for probing the mutational landscape of amyloid structure. Bioinformatics 27(13):i34–i42
Zibaee S, Makin OS, Goedert M, Serpell LC (2007) A simple algorithm locates beta-strands in the amyloid fibril core of alpha-synuclein, Abeta, and tau using the amino acid sequence alone. Protein Sci 16(5):906–918. Erratum in: Protein Sci (2007)16(6):1242
Maurer-Stroh S, Debulpaep M, Kuemmerer N et al (2010) Exploring the sequence determinants of amyloid structure using position-specific scoring matrices. Nat Methods 7(3):237–242
Emily M, Talvas A, Delamarche C (2013) MetAmyl: a METa-predictor for AMYLoid proteins. PLoS One 8(11):e79722
Gasior P, Kotulska M (2014) FISH Amyloid - a new method for finding amyloidogenic segments in proteins based on site specific co-occurrence of aminoacids. BMC Bioinformatics 15(1):54
Ahmed AB, Znassi N, Château MT, Kajava AV (2015) A structure-based approach to predict predisposition to amyloidosis. Alzheimers Dement 11(6):681–690
Bondarev SA, Bondareva OV, Zhouravleva GA, Kajava AV (2018) BetaSerpentine: a bioinformatics tool for reconstruction of amyloid structures. Bioinformatics 34(4):599–608
Smaoui MR, Orland H, Waldispühl J (2015) Probing the binding affinity of amyloids to reduce toxicity of oligomers in diabetes. Bioinformatics 31(14):2294–2302
Smaoui MR, Poitevin F, Delarue M et al (2013) Computational assembly of polymorphic amyloid fibrils reveals stable aggregates. Biophys J 104(3):683–693
Família C, Dennison SR, Quintas A, Phoenix DA (2015) Prediction of peptide and protein propensity for amyloid formation. PLoS One 10(8):e0134679
Burdukiewicz M, Sobczyk P, Rödiger S et al (2017) Amyloidogenic motifs revealed by n-gram analysis. Sci Rep 7(1):12961
Sankar K, Krystek SR Jr, Carl SM et al (2018) AggScore: prediction of aggregation-prone regions in proteins based on the distribution of surface patches. Proteins 86(11):1147–1156
Niu M, Li Y, Wang C, Han K (2018) RFAmyloid: a web server for predicting amyloid proteins. Int J Mol Sci 19(7):2071
Orlando G, Silva A, Macedo-Ribeiro S et al (2020) Accurate prediction of protein beta-aggregation with generalized statistical potentials. Bioinformatics 36(7):2076–2081
Wojciechowski JW, Kotulska M (2020) PATH - prediction of amyloidogenicity by threading and machine learning. Sci Rep 10(1):7721
Sali A, Blundell TL (1994) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234(3):779–815
Chaudhury S, Lyskov S, Gray JJ (2010) PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta. Bioinformatics 26(5):689–691
Acknowledgment
This work was partially supported by the National Science Centre, Poland, Grant 2019/35/B/NZ2/03997.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Kotulska, M., Wojciechowski, J.W. (2022). Bioinformatics Methods in Predicting Amyloid Propensity of Peptides and Proteins. In: Li, M.S., Kloczkowski, A., Cieplak, M., Kouza, M. (eds) Computer Simulations of Aggregation of Proteins and Peptides . Methods in Molecular Biology, vol 2340. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1546-1_1
Download citation
DOI: https://doi.org/10.1007/978-1-0716-1546-1_1
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1545-4
Online ISBN: 978-1-0716-1546-1
eBook Packages: Springer Protocols