Abstract
The mouse, Mus musculus, is a popular model organism for the study of human genes involved in development, immunology, and disease phenotypes. Despite recent revolutions in gene-knockout technologies in mouse, identification of candidate genes for functions of interest can further accelerate the discovery of novel gene functions. The collaborative nature of genetic functions allows for the inference of gene functions based on the principle of guilt-by-association. Genome-scale co-functional networks could therefore provide functional predictions for genes via network analysis. We recently constructed such a network for mouse (MouseNet), which interconnects over 88% of protein-coding genes with 788,080 functional relationships. The companion web server (www.inetbio.org/mousenet) enables researchers with no bioinformatics expertise to generate predictions that facilitate discovery of novel gene functions. In this chapter, we present the theoretical framework for MouseNet, as well as step-by-step instructions and technical tips for functional prediction of genes and pathways in mouse and other model vertebrates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Farrell CM, O’Leary NA, Harte RA, Loveland JE, Wilming LG, Wallin C, Diekhans M, Barrell D, Searle SM, Aken B, Hiatt SM, Frankish A, Suner MM, Rajput B, Steward CA, Brown GR, Bennett R, Murphy M, Wu W, Kay MP, Hart J, Rajan J, Weber J, Snow C, Riddick LD, Hunt T, Webb D, Thomas M, Tamez P, Rangwala SH, McGarvey KM, Pujar S, Shkeda A, Mudge JM, Gonzalez JM, Gilbert JG, Trevanion SJ, Baertsch R, Harrow JL, Hubbard T, Ostell JM, Haussler D, Pruitt KD (2014) Current status and new features of the Consensus Coding Sequence database. Nucleic Acids Res 42(Database issue):D865–D872. doi:10.1093/nar/gkt1059
Blake JA, Christie KR, Dolan ME, Drabkin HJ, Hill DP, Ni L, Sitnikov D, Burgess S, Buza T, Gresham C, McCarthy F, Pillai L, Wang H, Carbon S, Dietze H, Lewis SE, Mungall CJ, Munoz-Torres MC, Feuermann M, Gaudet P, Basu S, Chisholm RL, Dodson RJ, Fey P, Mi H, Thomas PD, Muruganujan A, Poudel S, Hu JC, Aleksander SA, McIntosh BK, Renfro DP, Siegele DA, Attrill H, Brown NH, Tweedie S, Lomax J, Osumi-Sutherland D, Parkinson H, Roncaglia P, Lovering RC, Talmud PJ, Humphries SE, Denny P, Campbell NH, Foulger RE, Chibucos MC, Giglio MG, Chang HY, Finn R, Fraser M, Mitchell A, Nuka G, Pesseat S, Sangrador A, Scheremetjew M, Young SY, Stephan R, Harris MA, Oliver SG, Rutherford K, Wood V, Bahler J, Lock A, Kersey PJ, McDowall MD, Staines DM, Dwinell M, Shimoyama M, Laulederkind S, Hayman GT, Wang SJ, Petri V, D’Eustachio P, Matthews L, Balakrishnan R, Binkley G, Cherry JM, Costanzo MC, Demeter J, Dwight SS, Engel SR, Hitz BC, Inglis DO, Lloyd P, Miyasato SR, Paskov K, Roe G, Simison M, Nash RS, Skrzypek MS, Weng S, Wong ED, Berardini TZ, Li D, Huala E, Argasinska J, Arighi C, Auchincloss A, Axelsen K, Argoud-Puy G, Bateman A, Bely B, Blatter MC, Bonilla C, Bougueleret L, Boutet E, Breuza L, Bridge A, Britto R, Casals C, Cibrian-Uhalte E, Coudert E, Cusin I, Duek-Roggli P, Estreicher A, Famiglietti L, Gane P, Garmiri P, Gos A, Gruaz-Gumowski N, Hatton-Ellis E, Hinz U, Hulo C, Huntley R, Jungo F, Keller G, Laiho K, Lemercier P, Lieberherr D, MacDougall A, Magrane M, Martin M, Masson P, Mutowo P, O’Donovan C, Pedruzzi I, Pichler K, Poggioli D, Poux S, Rivoire C, Roechert B, Sawford T, Schneider M, Shypitsyna A, Stutz A, Sundaram S, Tognolli M, Wu C, Xenarios I, Chan J, Kishore R, Sternberg PW, Van Auken K, Muller HM, Done J, Li Y, Howe D, Westerfield M, Consortium GO (2015) Gene Ontology Consortium: going forward. Nucleic Acids Res 43(D1):D1049–D1056. doi:10.1093/nar/gku1179
Blake JA, Bult CJ, Eppig JT, Kadin JA, Richardson JE, Grp MGD (2014) The Mouse Genome Database: integration of and access to knowledge about the laboratory mouse. Nucleic Acids Res 42(D1):D810–D817. doi:10.1093/nar/gkt1225
Shim JE, Lee I (2015) Network-assisted approaches for human disease research. Animal Cells Syst 19(4):231–235. doi:10.1080/19768354.2015.1074108
Cho A, Shin J, Hwang S, Kim C, Shim H, Kim H, Kim H, Lee I (2014) WormNet v3: a network-assisted hypothesis-generating server for Caenorhabditis elegans. Nucleic Acids Res 42(Web Server issue):W76–W82. doi:10.1093/nar/gku367
Shin J, Yang S, Kim E, Kim CY, Shim H, Cho A, Kim H, Hwang S, Shim JE, Lee I (2015) FlyNet: a versatile network prioritization server for the Drosophila community. Nucleic Acids Res 43(W1):W91–W97. doi:10.1093/nar/gkv453
Snider J, Kotlyar M, Saraon P, Yao Z, Jurisica I, Stagljar I (2015) Fundamentals of protein interaction network mapping. Mol Syst Biol 11(12):848. doi:10.15252/msb.20156351
Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO (1999) Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc Natl Acad Sci U S A 96(8):4285–4288. doi:10.1073/pnas.96.8.4285
Overbeek R, Fonstein M, D’Souza M, Pusch GD, Maltsev N (1999) The use of gene clusters to infer functional coupling. Proc Natl Acad Sci U S A 96(6):2896–2901. doi:10.1073/pnas.96.6.2896
Stuart JM, Segal E, Koller D, Kim SK (2003) A gene-coexpression network for global discovery of conserved genetic modules. Science 302(5643):249–255. doi:10.1126/science.1087447
Kim E, Kim H, Lee I (2013) JiffyNet: a web-based instant protein network modeler for newly sequenced species. Nucleic Acids Res 41(W1):W192–W197. doi:10.1093/nar/gkt419
Lee I, Date SV, Adai AT, Marcotte EM (2004) A probabilistic functional network of yeast genes. Science 306(5701):1555–1558. doi:10.1126/science.1099511
Pena-Castillo L, Tasan M, Myers CL, Lee H, Joshi T, Zhang C, Guan YF, Leone M, Pagnani A, Kim WK, Krumpelman C, Tian WD, Obozinski G, Qi YJ, Mostafavi S, Lin GN, Berriz GF, Gibbons FD, Lanckriet G, Qiu J, Grant C, Barutcuoglu Z, Hill DP, Warde-Farley D, Grouios C, Ray D, Blake JA, Deng MH, Jordan MI, Noble WS, Morris Q, Klein-Seetharaman J, Bar-Joseph Z, Chen T, Sun FZ, Troyanskaya OG, Marcotte EM, Xu D, Hughes TR, Roth FP (2008) A critical assessment of Mus musculus gene function prediction using integrated genomic evidence. Genome Biol 9(Suppl 1):S2. doi:10.1186/Gb-2008-9-S1-S2
Kim WK, Krumpelman C, Marcotte EM (2008) Inferring mouse gene functions from genomic-scale data using a combined functional network/classification strategy. Genome Biol 9(Suppl 1):S5. doi:10.1186/Gb-2008-9-S1-S5
Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A (2013) NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res 41(Database issue):D991–D995. doi:10.1093/nar/gks1193
Kim E, Hwang S, Kim H, Shim H, Kang B, Yang S, Shim JH, Shin SY, Marcotte EM, Lee I (2016) MouseNet v2: a database of gene networks for studying the laboratory mouse and eight other model vertebrates. Nucleic Acids Res 44(D1):D848–D854. doi:10.1093/nar/gkv1155
Kim H, Shin J, Kim E, Kim H, Hwang S, Shim JE, Lee I (2014) YeastNet v3: a public database of data-specific and integrated functional gene networks for Saccharomyces cerevisiae. Nucleic Acids Res 42(D1):D731–D736. doi:10.1093/nar/gkt981
Lee I, Blom UM, Wang PI, Shim JE, Marcotte EM (2011) Prioritizing candidate disease genes by network-based boosting of genome-wide association data. Genome Res 21(7):1109–1121. doi:10.1101/gr.118992.110
Razick S, Magklaras G, Donaldson IM (2008) iRefIndex: a consolidated protein interaction database with provenance. BMC Bioinformatics 9:405. doi:10.1186/1471-2105-9-405
Shin J, Lee I (2015) Co-inheritance analysis within the domains of life substantially improves network inference by phylogenetic profiling. PLoS One 10(9):e0139006. doi:10.1371/journal.pone.0139006
Shin J, Lee T, Kim H, Lee I (2014) Complementarity between distance- and probability-based methods of gene neighbourhood identification for pathway reconstruction. Mol Biosyst 10(1):24–29. doi:10.1039/c3mb70366e
Kanehisa M, Goto S, Sato Y, Kawashima M, Furumichi M, Tanabe M (2014) Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res 42(Database issue):D199–D205. doi:10.1093/nar/gkt1076
Amberger JS, Bocchini CA, Schiettecatte F, Scott AF, Hamosh A (2015) OMIM.org: Online Mendelian Inheritance in Man (OMIM(R)), an online catalog of human genes and genetic disorders. Nucleic Acids Res 43(Database issue):D789–D798. doi:10.1093/nar/gku1205
Kibbe WA, Arze C, Felix V, Mitraka E, Bolton E, Fu G, Mungall CJ, Binder JX, Malone J, Vasant D, Parkinson H, Schriml LM (2015) Disease Ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data. Nucleic Acids Res 43(D1):D1071–D1078. doi:10.1093/nar/gku1011
Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, Klemm A, Flicek P, Manolio T, Hindorff L, Parkinson H (2014) The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 42(D1):D1001–D1006. doi:10.1093/nar/gkt1229
Li MJ, Liu Z, Wang P, Wong MP, Nelson MR, Kocher JP, Yeager M, Sham PC, Chanock SJ, Xia Z, Wang J (2016) GWASdb v2: an update database for human genetic variants identified by genome-wide association studies. Nucleic Acids Res 44(D1):D869–D876. doi:10.1093/nar/gkv1317
Caspi R, Altman T, Billington R, Dreher K, Foerster H, Fulcher CA, Holland TA, Keseler IM, Kothari A, Kubo A, Krummenacker M, Latendresse M, Mueller LA, Ong Q, Paley S, Subhraveti P, Weaver DS, Weerasinghe D, Zhang PF, Karp PD (2014) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 42(D1):D459–D471. doi:10.1093/nar/gkt1103
Smith CL, Eppig JT (2012) The mammalian phenotype ontology as a unifying standard for experimental and high-throughput phenotyping data. Mamm Genome 23(9-10):653–668. doi:10.1007/s00335-012-9421-3
Kohler S, Doelken SC, Mungall CJ, Bauer S, Firth HV, Bailleul-Forestier I, Black GCM, Brown DL, Brudno M, Campbell J, FitzPatrick DR, Eppig JT, Jackson AP, Freson K, Girdea M, Helbig I, Hurst JA, Jahn J, Jackson LG, Kelly AM, Ledbetter DH, Mansour S, Martin CL, Moss C, Mumford A, Ouwehand WH, Park SM, Riggs ER, Scott RH, Sisodiya S, Van Vooren S, Wapner RJ, Wilkie AOM, Wright CF, Vulto-van Silfhout AT, de Leeuw N, de Vries BBA, Washingthon NL, Smith CL, Westerfield M, Schofield P, Ruef BJ, Gkoutos GV, Haendel M, Smedley D, Lewis SE, Robinson PN (2014) The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data. Nucleic Acids Res 42(D1):D966–D974. doi:10.1093/nar/gkt1026
Han C, Choi E, Park I, Lee B, Jin S, Kim do H, Lee B, Nishimura H, Cho C (2009) Comprehensive analysis of reproductive ADAMs: relationship of ADAM4 and ADAM6 with an ADAM complex required for fertilization in mice. Biol Reprod 80(5):1001–1008. doi:10.1095/biolreprod.108.073700
Puthussery T, Gayet-Primo J, Taylor WR (2010) Localization of the calcium-binding protein secretagogin in cone bipolar cells of the mammalian retina. J Comp Neurol 518(4):513–525. doi:10.1002/cne.22234
Turczynska KM, Sward K, Hien TT, Wohlfahrt J, Mattisson IY, Ekman M, Nilsson J, Sjogren J, Murugesan V, Hultgardh-Nilsson A, Cidad P, Hellstrand P, Perez-Garcia MT, Albinsson S (2015) Regulation of smooth muscle dystrophin and synaptopodin 2 expression by actin polymerization and vascular injury. Arterioscler Thromb Vasc Biol 35(6):1489–1497. doi:10.1161/ATVBAHA.114.305065
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13(11):2498–2504. doi:10.1101/gr.1239303
Acknowledgments
This work was supported by grants from the National Research Foundation of Korea (2015R1A2A1A15055859, 2012M3A9B4028641, 2012M3A9C7050151).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media LLC
About this protocol
Cite this protocol
Kim, E., Lee, I. (2017). Network-Based Gene Function Prediction in Mouse and Other Model Vertebrates Using MouseNet Server. In: Kihara, D. (eds) Protein Function Prediction. Methods in Molecular Biology, vol 1611. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7015-5_14
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7015-5_14
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7013-1
Online ISBN: 978-1-4939-7015-5
eBook Packages: Springer Protocols