A Bioinformatics Primer to Data Science, with Examples for Metabolomics

Pittard, W. Stephen; Villaveces, Cecilia “Keeko”; Li, Shuzhao

doi:10.1007/978-1-0716-0239-3_14

Part of the book series: Methods in Molecular Biology (MIMB, volume 2104)

Abstract

With the increasing importance of big data in biomedicine, data science skills are a foundation both for individual career development and for the progress of science. This chapter is a practical guide to working with high-throughput biomedical data. It covers how to understand and set up the computing environment, how to start a research project with proper and effective data management, and how to perform common bioinformatics tasks such as data wrangling, quality control, statistical analysis, and visualization, with examples on metabolomics data. Concepts and tools related to coding and scripting are discussed. Version control, knitr, and Jupyter notebooks are important for project management, collaboration, and research reproducibility. Overall, this chapter describes a core set of skills for working in bioinformatics and can serve as a reference text at the level of a graduate course that interfaces with data science.

Acknowledgments

This work has been funded, in part, by the US National Institutes of Health via grants UH2 AI132345 (Li), U2C ES030163 (Jones, Li, Morgan, Miller), U01 CA235493 (Li, Xia, Siuzdak), U2C ES026560 (Miller), P30 ES019776 (Marsit), and P50 ES026071 (McCauley), and by the US EPA grant 83615301 (McCauley).

Author information

Authors and Affiliations

Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, USA
W. Stephen Pittard
Department of Mathematics, University of Georgia, Athens, GA, USA
Cecilia “Keeko” Villaveces
Department of Medicine, Emory University School of Medicine, Atlanta, GA, USA
Shuzhao Li

Corresponding author

Correspondence to Shuzhao Li.

Editor information

Editors and Affiliations

Department of Medicine, Emory University School of Medicine, Atlanta, GA, USA
Shuzhao Li

Cite this protocol

Pittard, W.S., Villaveces, C., Li, S. (2020). A Bioinformatics Primer to Data Science, with Examples for Metabolomics. In: Li, S. (eds) Computational Methods and Data Analysis for Metabolomics. Methods in Molecular Biology, vol 2104. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-0239-3_14

DOI: https://doi.org/10.1007/978-1-0716-0239-3_14
Published: 18 January 2020
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-0238-6
Online ISBN: 978-1-0716-0239-3
eBook Packages: Springer Protocols
