Abstract
Gas chromatography and liquid chromatography coupled to mass spectrometry are used extensively in untargeted metabolomics, which involves the profiling of small metabolites in biological samples. The complex raw dataset produced from untargeted metabolomics requires proper processing before it can be statistically analyzed and interpreted. This chapter describes a high-throughput data processing workflow routinely used in our laboratory, including feature detection and alignment, data reduction, and spectral-matching-based annotation. This semiautomated workflow uses vendor neutral data file formats and freely available data processing tools and therefore can be readily implemented on datasets acquired from instruments of different vendors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hall R, Beale M, Fiehn O, Hardy N, Sumner L, Bino R (2002) The missing link in functional genomics strategies. Plant Cell 14:1437–1440
Rhee EP, Gerszten RE (2012) Metabolomics and cardiovascular biomarker discovery. Clin Chem 58:139–147
Newgard CB (2017) Metabolomics and metabolic diseases: where do we stand? Cell Metab 25:43–56
Gibney MJ, Walsh M, Brennan L, Roche HM, German B, van Ommen B (2005) Metabolomics in human nutrition: opportunities and challenges. Am J Clin Nutr 82:497–503
Plumb RS, Johnson KA, Rainville P, Smith BW, Wilson ID, Castro‐Perez JM, Nicholson JK (2006) UPLC/MSE; a new approach for generating molecular fragment information for biomarker structure elucidation. Rapid Commun Mass Spectrom 20:1989–1994
Smith CA, Want EJ, O’Maille G, Abagyan R, Siuzdak G (2006) XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal Chem 78:779–787
Broeckling CD, Afsar FA, Neumann S, Ben-Hur A, Prenni JE (2014) RAMClust: a novel feature clustering method enables spectral-matching-based annotation for metabolomics data. Anal Chem 86:6812–6817
Broeckling CD, Ganna A, Layer M, Brown K, Sutton B, Ingelsson E, Peers G, Prenni JE (2016) Enabling efficient and confident annotation of LC-MS metabolomics data through MS1 spectrum and time prediction. Anal Chem 88:9226–9234
R Core Team (2017) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. https://www.R-project.org/. Accessed 1 Feb 2018
Stacklies W, Redestig H, Scholz M, Walther D, Selbig J (2007) pcaMethods—a bioconductor package providing PCA methods for incomplete data. Bioinformatics 23:1164–1167
Ejigu BA, Valkenborg D, Baggerman G, Vanaerschot M, Witters E, Dujardin JC, Burzykowski T, Berg M (2013) Evaluation of normalization methods to pave the way towards large-scale LC-MS-based metabolomics profiling experiments. OMICS 17:473–485
Zelena E, Dunn WB, Broadhurst D, Francis-McIntyre S, Carroll KM, Begley P, O’Hagan S, Knowles JD, Halsall A, HUSERMET Consortium, Wilson ID, Kell DB (2009) Development of a robust and repeatable UPLC-MS method for the long-term metabolomic study of human serum. Anal Chem 81:1357–1364
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B 57:289–300
Chambers MC, MacLean B, Burke R, Amode D, Ruderman DL, Neumann S, Gatto L, Fischer B, Pratt B, Egertson J, Hoff K, Kessner D, Tasman N, Shulman N, Frewen B, Baker TA, Brusniak M-Y, Paulse C, Creasy D, Flashner L, Kani K, Moulding C, Seymour SL, Nuwaysir LM, Lefebvre B, Kuhlmann F, Roark J, Rainer P, Detlev S, Hemenway T, Huhmer A, Langridge J, Connolly B, Chadick T, Holly K, Eckels J, Deutsch EW, Moritz RL, Katz JE, Agus DB, MacCoss M, Tabb DL, Mallick P (2012) A cross-platform toolkit for mass spectrometry and proteomics. Nat Biotechnol 30:918–920
Li B, Tang J, Yang Q, Li S, Cui X, Li Y, Chen Y, Xue W, Li X, Zhu F (2017) NOREVA: normalization and evaluation of MS-based metabolomics data. Nucleic Acids Res. https://doi.org/10.1093/nar/gkx449
Bolstad BM, Irizarry RA, Astrand M, Speed TP (2003) A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19:185–193
Kopka J, Schauer N, Krueger S, Birkemeyer C, Usadel B, Bergmuller E, Dormann P, Weckwerth W, Gibon Y, Stitt M, Willmitzer L, Fernie AR, Steinhauser D (2005) GMD@CSB.DB: the Golm Metabolome Database. Bioinformatics 21:1635–1638
Linstrom PJ, Mallard WG (eds) (2005) NIST chemistry webbook, NIST standard reference database number 69. National Institute of Standards and Technology, Gaithersburg, MD. https://doi.org/10.18434/T4D303. Accessed 1 Feb 2018
Broeckling CD, Heuberger AL, Prince JA, Ingelsson E, Prenni JE (2013) Assigning precursor-product ion relationships in indiscriminant MS/MS data from non-targeted metabolite profiling studies. Metabolomics 9:33–43
Stein SE (1994) Optimization and testing of mass spectral library search algorithms for compound identification. J Am Soc Mass Spectrom 5:859–865
Xia J, Psychogios N, Young N, Wishart DS (2009) MetaboAnalyst: a web server for metabolomic data analysis and interpretation. Nucleic Acids Res 37:W652–W660
Smilde AK, Jansen JJ, Hoefsloot HC, Lamers RJA, Van Der Greef J, Timmerman ME (2005) ANOVA-simultaneous component analysis (ASCA): a new tool for analyzing designed metabolomics data. Bioinformatics 21:3043–3048
Wiklund S, Johansson E, Sjöström L, Mellerowicz EJ, Edlund U, Shockcor JP, Gottfries J, Moritz T, Trygg J (2008) Visualization of GC/TOF-MS-based metabolomics data for identification of biochemically interesting compounds using OPLS class models. Anal Chem 80:115–122
Worley B, Powers R (2013) Multivariate analysis in metabolomics. Curr Metabolomics 1:92–107
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Yao, L., Sheflin, A.M., Broeckling, C.D., Prenni, J.E. (2019). Data Processing for GC-MS- and LC-MS-Based Untargeted Metabolomics. In: D'Alessandro, A. (eds) High-Throughput Metabolomics. Methods in Molecular Biology, vol 1978. Humana, New York, NY. https://doi.org/10.1007/978-1-4939-9236-2_18
Download citation
DOI: https://doi.org/10.1007/978-1-4939-9236-2_18
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-4939-9235-5
Online ISBN: 978-1-4939-9236-2
eBook Packages: Springer Protocols