Mustguseal and Sister Web-Methods: A Practical Guide to Bioinformatic Analysis of Protein Superfamilies

Suplatov, Dmitry; Sharapova, Yana; Švedas, Vytas

doi:10.1007/978-1-0716-1036-7_12

Part of the book series: Methods in Molecular Biology (MIMB, volume 2231)

Abstract

Bioinformatic analysis of functionally diverse superfamilies can help to study the structure-function relationship in proteins, but it is methodologically challenging. The Mustguseal web-server builds large structure-guided sequence alignments of thousands of homologs that cover all currently available sequence variants within a common structural fold. The input to the method is the PDB code of a query protein that represents the protein superfamily of interest. The collection and subsequent alignment of protein sequences and structures are fully automated and driven by the chosen parameters. Four integrated sister web-methods (Zebra, pocketZebra, visualCMAT, and Yosshi) are available to further analyze the resulting superimposition and identify conserved, subfamily-specific, and co-evolving residues, as well as to classify and study disulfide bonds in protein superfamilies. Together, these web-based bioinformatic tools provide an out-of-the-box, easy-to-use solution, the first of its kind, to study protein function and regulation and to design improved enzyme variants for practical applications and selective ligands to modulate their functional properties. In this chapter, we provide a step-by-step protocol for a comprehensive bioinformatic analysis of a protein superfamily using a web browser as the main tool, with notes on selecting appropriate values for the key algorithm parameters depending on the research objective. The web-servers are freely available to all users at https://biokinet.belozersky.msu.ru/m-platform with no login requirement.

References

Suplatov D, Kirilin E, Švedas V (2016) Bioinformatic analysis of protein families to select function-related variable positions. In: Svendsen A (ed) Understanding enzymes. Pan Stanford Publishing, Singapore
Rozewicki J, Li S, Amada KM, Standley DM, Katoh K (2019) MAFFT-DASH: integrated protein sequence and structural alignment. Nucleic Acids Res 47(W1):W5–W10
Suplatov DA, Kopylov KE, Popova NN, Voevodin VV, Švedas VK (2018) Mustguseal: a server for multiple structure-guided sequence alignment of protein families. Bioinformatics 34(9):1583–1585
Shegay MV, Suplatov DA, Popova NN, Švedas VK, Voevodin VV (2019) parMATT: parallel multiple alignment of protein 3D-structures with translations and twists for distributed-memory systems. Bioinformatics 35(21):4456–4458
Krissinel E, Henrick K (2004) Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr D Biol Crystallogr 60(12):2256–2268
Menke M, Berger B, Cowen L (2008) Matt: local flexibility aids protein multiple structure alignment. PLoS Comput Biol 4(1):e10
Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30(4):772–780
Suplatov D, Sharapova Y, Geraseva E, Švedas V (2020) Zebra2: advanced and easy-to-use web-server for bioinformatic analysis of subfamily-specific and conserved positions in diverse protein superfamilies. Nucleic Acids Res 48(W1):W65–W71
Suplatov D, Shalaeva D, Kirilin E, Arzhanik V, Švedas V (2014) Bioinformatic analysis of protein families for identification of variable amino acid residues responsible for functional diversity. J Biomol Struct Dyn 32(1):75–87
Suplatov D, Voevodin V, Švedas V (2015) Robust enzyme design: bioinformatic tools for improved protein stability. Biotechnol J 10(3):344–355
Suplatov D, Kirilin E, Arbatsky M, Takhaveev V, Švedas V (2014) pocketZebra: a web-server for automated selection and classification of subfamily-specific binding sites by bioinformatic analysis of diverse protein families. Nucleic Acids Res 42(W1):W344–W349
Suplatov D, Sharapova Y, Timonina D, Kopylov K, Švedas V (2018) The visualCMAT: a web-server to select and interpret correlated mutations/co-evolving residues in protein families. J Bioinform Comput Biol 16(02):1840005
Suplatov DA, Timonina DS, Sharapova YA, Švedas VK (2019) Yosshi: a web-server for disulfide engineering by bioinformatic analysis of diverse protein families. Nucleic Acids Res 47(W1):W308–W314
Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ (2009) Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics 25(9):1189–1191
Fesko K, Suplatov D, Švedas V (2018) Bioinformatic analysis of the fold type I PLP-dependent enzymes reveals determinants of reaction specificity in l-threonine aldolase from Aeromonas jandaei. FEBS Open Bio 8(6):1013–1028
Dong R, Peng Z, Zhang Y, Yang J (2018) mTM-align: an algorithm for fast and accurate multiple protein structure alignment. Bioinformatics 34(10):1719–1725
Pei J, Kim BH, Grishin NV (2008) PROMALS3D: a tool for multiple protein sequence and structure alignments. Nucleic Acids Res 36(7):2295–2300
Sharapova Y, Suplatov D, Švedas V (2018) Neuraminidase A from Streptococcus pneumoniae has a modular organization of catalytic and lectin domains separated by a flexible linker. FEBS J 285(13):2428–2445
Hanson RM, Prilusky J, Renjian Z, Nakane T, Sussman JL (2013) JSmol and the next-generation web-based representation of 3D molecular structure as applied to Proteopedia. Isr J Chem 53(3–4):207–216
Suplatov D, Sharapova Y, Shegay M, Popova N, Fesko K, Voevodin V, Švedas V (2019) High-performance hybrid computing for bioinformatic analysis of protein superfamilies. In: Voevodin V, Sobolev S (eds) Communications in computer and information science, vol 1129. Springer Nature, Switzerland AG, Basel
Gille C, Fähling M, Weyand B, Wieland T, Gille A (2014) Alignment-Annotator web server: rendering and annotating sequence alignments. Nucleic Acids Res 42(W1):W3–W6
Steffen-Munsberg F, Vickers C, Kohls H, Land H, Mallin H, Nobili A, Skalden L, van den Bergh T, Joosten HJ, Berglund P, Höhne M, Bornscheuer UT (2015) Bioinformatic analysis of a PLP-dependent enzyme superfamily suitable for biocatalytic applications. Biotechnol Adv 33(5):566–604
Webb B, Sali A (2017) Protein structure modeling with MODELLER. In: Kaufmann M, Klinger C, Savelsbergh A (eds) Functional genomics. Methods in molecular biology, vol 1654. Humana Press, New York
Suplatov D, Panin N, Kirilin E, Shcherbakova T, Kudryavtsev P, Švedas V (2014) Computational design of a pH stable enzyme: understanding molecular mechanism of penicillin acylase's adaptation to alkaline conditions. PLoS One 9(6):e100643
Söding J, Biegert A, Lupas AN (2005) The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33(Suppl. 2):W244–W248
Fischer JD, Mayer CE, Söding J (2008) Prediction of protein functional residues from sequence by probability density estimation. Bioinformatics 24(5):613–620
Craig DB, Dombkowski AA (2013) Disulfide by Design 2.0: a web-based tool for disulfide engineering in proteins. BMC Bioinformatics 14(1):346
Dani VS, Ramakrishnan C, Varadarajan R (2003) MODIP revisited: re-evaluation and refinement of an automated procedure for modeling of disulfide bonds in proteins. Protein Eng 16(3):187–193
Sadovnichy V, Tikhonravov A, Voevodin V, Opanasenko V (2017) “Lomonosov”: supercomputing at Moscow State University. In: Vetter JS (ed) Contemporary high performance computing. Chapman and Hall/CRC, New York

Acknowledgments

This work was supported by the Russian Foundation for Basic Research grant #18-29-13060 and carried out using the equipment of the shared research facilities of HPC computing resources at Lomonosov Moscow State University supported by the project RFMEFI62117X0011 [29].

Author information

Authors and Affiliations

Belozersky Institute of Physicochemical Biology, Lomonosov Moscow State University, Moscow, Russia
Dmitry Suplatov, Yana Sharapova & Vytas Švedas
Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
Yana Sharapova & Vytas Švedas

Corresponding author

Correspondence to Dmitry Suplatov.

Editor information

Editors and Affiliations

Research Institute for Microbial Disease, Osaka University, Osaka, Japan
Kazutaka Katoh

About this protocol

Cite this protocol

Suplatov, D., Sharapova, Y., Švedas, V. (2021). Mustguseal and Sister Web-Methods: A Practical Guide to Bioinformatic Analysis of Protein Superfamilies. In: Katoh, K. (ed) Multiple Sequence Alignment. Methods in Molecular Biology, vol 2231. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1036-7_12

DOI: https://doi.org/10.1007/978-1-0716-1036-7_12
Published: 09 December 2020
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1035-0
Online ISBN: 978-1-0716-1036-7
eBook Packages: Springer Protocols
