TY - JOUR
T1 - Systematic identification of feature combinations for predicting drug response with Bayesian multi-view multi-task linear regression
AU - Ammad-Ud-Din, Muhammad
AU - Khan, Suleiman A
AU - Wennerberg, Krister
AU - Aittokallio, Tero
N1 - © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: [email protected]
PY - 2017/7/15
Y1 - 2017/7/15
N2 - Motivation: A prime challenge in precision cancer medicine is to identify genomic and molecular features that are predictive of drug treatment responses in cancer cells. Although there are several computational models for accurate drug response prediction, these often lack the ability to infer which feature combinations are the most predictive, particularly for high-dimensional molecular datasets. As increasing amounts of diverse genome-wide data sources are becoming available, there is a need to build new computational models that can effectively combine these data sources and identify maximally predictive feature combinations. Results: We present a novel approach that leverages on systematic integration of data sources to identify response predictive features of multiple drugs. To solve the modeling task we implement a Bayesian linear regression method. To further improve the usefulness of the proposed model, we exploit the known human cancer kinome for identifying biologically relevant feature combinations. In case studies with a synthetic dataset and two publicly available cancer cell line datasets, we demonstrate the improved accuracy of our method compared to the widely used approaches in drug response analysis. As key examples, our model identifies meaningful combinations of features for the well known EGFR, ALK, PLK and PDGFR inhibitors.
AB - Motivation: A prime challenge in precision cancer medicine is to identify genomic and molecular features that are predictive of drug treatment responses in cancer cells. Although there are several computational models for accurate drug response prediction, these often lack the ability to infer which feature combinations are the most predictive, particularly for high-dimensional molecular datasets. As increasing amounts of diverse genome-wide data sources are becoming available, there is a need to build new computational models that can effectively combine these data sources and identify maximally predictive feature combinations. Results: We present a novel approach that leverages on systematic integration of data sources to identify response predictive features of multiple drugs. To solve the modeling task we implement a Bayesian linear regression method. To further improve the usefulness of the proposed model, we exploit the known human cancer kinome for identifying biologically relevant feature combinations. In case studies with a synthetic dataset and two publicly available cancer cell line datasets, we demonstrate the improved accuracy of our method compared to the widely used approaches in drug response analysis. As key examples, our model identifies meaningful combinations of features for the well known EGFR, ALK, PLK and PDGFR inhibitors.
KW - Algorithms
KW - Antineoplastic Agents/pharmacology
KW - Bayes Theorem
KW - Computational Biology/methods
KW - Humans
KW - Linear Models
KW - Models, Biological
KW - Neoplasms/drug therapy
KW - Precision Medicine/methods
KW - Signal Transduction/drug effects
KW - Software
U2 - 10.1093/bioinformatics/btx266
DO - 10.1093/bioinformatics/btx266
M3 - Journal article
C2 - 28881998
SN - 1367-4811
VL - 33
SP - i359-i368
JO - Bioinformatics (Online)
JF - Bioinformatics (Online)
IS - 14
ER -