Modeling tissue contamination to improve molecular identification of the primary tumor site of metastases

Martin Vincent, Katharina Perell, Finn Cilius Nielsen, Gedske Daugaard, Niels Richard Hansen

5 Citations (Scopus)

Abstract

Motivation: Contamination of a cancer tissue by the surrounding benign (non-cancerous) tissue is a concern for molecular cancer diagnostics. This is because an observed molecular signature will be distorted by the surrounding benign tissue, possibly leading to an incorrect diagnosis. One example is molecular identification of the primary tumor site of metastases because biopsies of metastases typically contain a significant amount of benign tissue. Results: A model of tissue contamination is presented. This contamination model works independently of the training of a molecular predictor, and it can be combined with any predictor model. The usability of the model is illustrated on primary tumor site identification of liver biopsies, specifically, on a human dataset consisting of microRNA expression measurements of primary tumor samples, benign liver samples and liver metastases. For a predictor trained on primary tumor and benign liver samples, the contamination model decreased the test error on biopsies from liver metastases from 77 to 45%. A further reduction to 34% was obtained by including biopsies in the training data.

Original languageEnglish
JournalBioinformatics
Volume30
Issue number10
Pages (from-to)1417-1423
ISSN1367-4803
DOIs
Publication statusPublished - 15 May 2014

Fingerprint

Dive into the research topics of 'Modeling tissue contamination to improve molecular identification of the primary tumor site of metastases'. Together they form a unique fingerprint.

Cite this