Abstract
Untargeted metabolite profiling based upon LC-MS methodology can be used to identify unique metabolic phenotypes associated with stress, disease or environmental exposure of cells using mathematical clustering. Here, we show how unsupervised data analysis is a powerful tool for both quality control and answering simple biological questions. We will demonstrate how to format untargeted mass spectrometry data for import into R, a programming language and software environment for statistical computing (R Development Core Team. R: A language and environment for statistical computing, reference index version 2.15. R Foundation for Statistical Computing, Vienna, 2012). Using R, we transform untargeted metabolite data using hierarchical clustering and principal component analysis (PCA) to create visual representations of change between biological samples and explore how these can be used predictively, in determining environmental stress, health and metabolic insight.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Development Core Team R (2012) R: A language and environment for statistical computing, reference index version 2.15.1. R Foundation for Statistical Computing, Vienna
Patti GJ, Tautenhahn R, Siuzdak G et al (2012) Meta-analysis of untargeted metabolomic data from multiple profiling experiments. Nat Protoc 7(3):508–516
Patti GJ, Yanes O, Shriver LP, Courade J, Tautenhahn R, Manchester M, Siuzdak G et al (2012) Metabolomics implicates altered sphingolipids in chronic pain of neuropathic origin. Nat Chem Biol 8(3):232–234
Everitt B (1974) Cluster analysis. Heinemann Educational Books, London
Hartigan JA (1975) Clustering algorithms. Wiley, New York
Anderberg MR (1973) Cluster analysis for applications. Academic Press, New York
Murtagh F (1985) Multidimensional Clustering Algorithms. In: COMPSTAT Lectures 4. Physica-Verlag, Wuerzburg
Becker RA, Chambers JM, Wilks AR (1988) The new S language. Wadsworth & Brooks/Cole Advanced Books & Software, Monterey
Mardia KV, Kent JT, Bibby JM (1979) Multivariate analysis. Academic Press, London
Venables WN, Ripley BD (2002) Modern applied statistics with S. Springer-Verlag, Berlin
Heinemann J, Hamerly T, Maaty WS, Movahed N, Steffens JD, Reeves BD, Hilmer JK, Therien J, Grieco PA, Peters JW, Bothner B et al (2014) Expanding the paradigm of thiol redox in the thermophilic root of life. Biochim Biophys Acta 1840:80–85
Maaty WS, Wiedenheft B, Tarlykov P, Schaff N, Heinemann J, Robison-Cox J, Valenzuela J, Bothner B et al (2009) Something old, something new, something borrowed; how the thermoacidophilic archaeon Sulfolobus solfataricus responds to oxidative stress. PLoS One 4(9):e6964
Gordon AD (1999) Classification. Chapman and Hall / CRC, London
McQuitty LL (1966) Similarity analysis by reciprocal pairs for discrete and continuous data. Educ Psychol Meas 26:825–831
Kessner D, Chambers M, Burke R, Agus D, Mallick P et al (2008) ProteoWizard: open source software for rapid proteomics tools development. Bioinformatics 24(21):2534–2536
Tautenhahn R, Böttcher C, Neumann S et al (2008) Highly sensitive feature detection for high resolution LC/MS. BMC bioinformatics 9:504
Yanes O, Tautenhahn R, Patti GJ, Siuzdak G et al (2011) Expanding coverage of the metabolome for global metabolite profiling. Anal Chem 83(6):2152–2161
https://stat.ethz.ch/R-manual/R-devel/library/stats/html/prcomp.html
https://www.rdocumentation.org/packages/rgl/versions/0.97.0/topics/plot3d
http://stat.ethz.ch/R-manual/R-devel/library/stats/html/hclust.html
Acknowledgments
The authors would also like to acknowledge that this work was part of the DOE Joint BioEnergy Institute (http://www.jbei.org) supported by the US Department of Energy, Office of Science, Office of Biological and Environmental Research, through contract DE-AC02-05CH11231 between Lawrence Berkeley National Laboratory and the US Department of Energy.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Heinemann, J. (2019). Cluster Analysis of Untargeted Metabolomic Experiments. In: Baidoo, E. (eds) Microbial Metabolomics. Methods in Molecular Biology, vol 1859. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8757-3_16
Download citation
DOI: https://doi.org/10.1007/978-1-4939-8757-3_16
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-8756-6
Online ISBN: 978-1-4939-8757-3
eBook Packages: Springer Protocols