Modeling tissue contamination to improve molecular identification of the primary tumor site of metastases

Martin Vincent; Katharina Perell; Finn Cilius Nielsen; Gedske Daugaard; Niels Richard Hansen

doi:10.1093/bioinformatics/btu044

Abstract

Motivation: Contamination of a cancer tissue by the surrounding benign (non-cancerous) tissue is a concern for molecular cancer diagnostics. This is because an observed molecular signature will be distorted by the surrounding benign tissue, possibly leading to an incorrect diagnosis. One example is molecular identification of the primary tumor site of metastases because biopsies of metastases typically contain a significant amount of benign tissue.

Results: A model of tissue contamination is presented. This contamination model works independently of the training of a molecular predictor, and it can be combined with any predictor model. The usability of the model is illustrated on primary tumor site identification of liver biopsies, specifically, on a human dataset consisting of microRNA expression measurements of primary tumor samples, benign liver samples and liver metastases. For a predictor trained on primary tumor and benign liver samples, the contamination model decreased the test error on biopsies from liver metastases from 77 to 45%. A further reduction to 34% was obtained by including biopsies in the training data.

Original language	English
Journal	Bioinformatics
Volume	30
Issue number	10
Pages (from-to)	1417-1423
ISSN	1367-4803
DOIs	https://doi.org/10.1093/bioinformatics/btu044
Publication status	Published - 15 May 2014

Cite this

@article{a0e0e79ea34948c8b95e9dc5a00e8bd2,

title = "Modeling tissue contamination to improve molecular identification of the primary tumor site of metastases",

author = "Vincent, Martin and Perell, Katharina and Nielsen, {Finn Cilius} and Daugaard, Gedske and Hansen, {Niels Richard}",

year = "2014",

month = may,

day = "15",

doi = "10.1093/bioinformatics/btu044",

language = "English",

volume = "30",

pages = "1417--1423",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "10",

}

TY - JOUR

T1 - Modeling tissue contamination to improve molecular identification of the primary tumor site of metastases

AU - Vincent, Martin

AU - Perell, Katharina

AU - Nielsen, Finn Cilius

AU - Daugaard, Gedske

AU - Hansen, Niels Richard

PY - 2014/5/15

Y1 - 2014/5/15

U2 - 10.1093/bioinformatics/btu044

DO - 10.1093/bioinformatics/btu044

M3 - Journal article

C2 - 24463184

SN - 1367-4803

VL - 30

SP - 1417

EP - 1423

JO - Bioinformatics

JF - Bioinformatics

IS - 10

ER -
