Forecasting Chronic Diseases Using Data Fusion

Acar Ataman Evrim, Gözde Gürdeniz, Francesco Savorani, Louise Hansen, Anja Olsen, Anne Tjønneland, Lars Ove Dragsted, Rasmus Bro

7 Citations (Scopus)

Abstract

Data fusion, that is, extracting information through the fusion of complementary data sets, is a topic of great interest in metabolomics because analytical platforms such as liquid chromatography-mass spectrometry (LC-MS) and nuclear magnetic resonance (NMR) spectroscopy commonly used for chemical profiling of biofluids provide complementary information. In this study, with a goal of forecasting acute coronary syndrome (ACS), breast cancer, and colon cancer, we jointly analyzed LC-MS, NMR measurements of plasma samples, and the metadata corresponding to the lifestyle of participants. We used supervised data fusion based on multiple kernel learning and exploited the linearity of the models to identify significant metabolites/features for the separation of healthy referents and the cases developing a disease. We demonstrated that (i) fusing LC-MS, NMR, and metadata provided better separation of ACS cases and referents compared with individual data sets, (ii) NMR data performed the best in terms of forecasting breast cancer, while fusion degraded the performance, and (iii) neither the individual data sets nor their fusion performed well for colon cancer. Furthermore, we showed the strengths and limitations of the fusion models by discussing their performance in terms of capturing known biomarkers for smoking and coffee. While fusion may improve performance in terms of separating certain conditions by jointly analyzing metabolomics and metadata sets, it is not necessarily always the best approach as in the case of breast cancer.

Original languageEnglish
JournalJournal of Proteome Research
Volume16
Pages (from-to)2435-2444
Number of pages10
ISSN1535-3893
DOIs
Publication statusPublished - 7 Jul 2017

Fingerprint

Dive into the research topics of 'Forecasting Chronic Diseases Using Data Fusion'. Together they form a unique fingerprint.

Cite this