Abstract
Metabolomics aims to measure and characterise the complex composition of metabolites in a biological system. Metabolomics studies involve sophisticated analytical techniques such as mass spectrometry and nuclear magnetic resonance spectroscopy, and generate large amounts of high-dimensional and complex experimental data. Open source processing and analysis tools are of major interest in light of innovative, open and reproducible science. The scientific community has developed a wide range of open source software, providing freely available advanced processing and analysis approaches. The programming and statistics environment R has emerged as one of the most popular environments to process and analyse Metabolomics datasets. A major benefit of such an environment is the possibility of connecting different tools into more complex workflows. Combining reusable data processing R scripts with the experimental data thus allows for open, reproducible research. This review provides an extensive overview of existing packages in R for different steps in a typical computational metabolomics workflow, including data processing, biostatistics, metabolite annotation and identification, and biochemical network and pathway analysis. Multifunctional workflows, possible user interfaces and integration into workflow management systems are also reviewed. In total, this review summarises more than two hundred metabolomics specific packages primarily available on CRAN, Bioconductor and GitHub.
Original language | English |
---|---|
Article number | 200 |
Journal | Metabolites |
Volume | 9 |
Issue number | 10 |
Number of pages | 55 |
ISSN | 2218-1989 |
DOIs | |
Publication status | Published - Oct 2019 |
Keywords
- Faculty of Science
- Metabolomics
- Lipidomics
- Mass spectrometry
- NMR spectroscopy
- R
- CRAN
- Bioconductor
- Signal processing
- Statistical data analysis
- Feature selection
- Compound identification
- Metabolite networks
- Data integration