Confidence scores for prediction models

Thomas Alexander Gerds; MA van de Wiel

doi:10.1002/bimj.201000157

Confidence scores for prediction models

Thomas Alexander Gerds, MA van de Wiel

Section of Biostatistics

6 Citations (Scopus)

Abstract

In medical statistics, many alternative strategies are available for building a prediction model based on training data. Prediction models are routinely compared by means of their prediction performance in independent validation data. If only one data set is available for training and validation, then rival
strategies can still be compared based on repeated bootstraps of the same data. Often, however, the overall performance of rival strategies is similar and it is thus difficult to decide for one model. Here, we investigate the variability of the prediction models that results when the same modelling strategy
is applied to different training sets. For each modelling strategy we estimate a confidence score based on the same repeated bootstraps. A new decomposition of the expected Brier score is obtained, as well as the estimates of population average confidence scores. The latter can be used to distinguish rival prediction models with similar prediction performances. Furthermore, on the subject level a
confidence score may provide useful supplementary information for new patients who want to base a medical decision on predicted risk. The ideas are illustrated and discussed using data from cancer studies, also with high-dimensional predictor space.

Translated title of the contribution	Confidence scores for prediction models
Original language	English
Journal	Biometrical Journal
Volume	53
Issue number	2
Pages (from-to)	259-274
Number of pages	16
ISSN	0323-3847
DOIs	https://doi.org/10.1002/bimj.201000157
Publication status	Published - Mar 2011

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1002/bimj.201000157

Cite this

@article{c7cface76fea46bca8df94d8c8335c1a,

title = "Confidence scores for prediction models",

abstract = "In medical statistics, many alternative strategies are available for building a prediction model based on training data. Prediction models are routinely compared by means of their prediction performance in independent validation data. If only one data set is available for training and validation, then rival strategies can still be compared based on repeated bootstraps of the same data. Often, however, the overall performance of rival strategies is similar and it is thus difficult to decide for one model. Here, we investigate the variability of the prediction models that results when the same modelling strategy is applied to different training sets. For each modelling strategy we estimate a confidence score based on the same repeated bootstraps. A new decomposition of the expected Brier score is obtained, as well as the estimates of population average confidence scores. The latter can be used to distinguish rival prediction models with similar prediction performances. Furthermore, on the subject level a confidence score may provide useful supplementary information for new patients who want to base a medical decision on predicted risk. The ideas are illustrated and discussed using data from cancer studies, also with high-dimensional predictor space.",

author = "Gerds, {Thomas Alexander} and {van de Wiel}, MA",

year = "2011",

month = mar,

doi = "10.1002/bimj.201000157",

language = "English",

volume = "53",

pages = "259--274",

journal = "Biometrical Journal",

issn = "0323-3847",

publisher = "Wiley - V C H Verlag GmbH & Co. KGaA",

number = "2",

}

TY - JOUR

T1 - Confidence scores for prediction models

AU - Gerds, Thomas Alexander

AU - van de Wiel, MA

PY - 2011/3

Y1 - 2011/3

N2 - In medical statistics, many alternative strategies are available for building a prediction model based on training data. Prediction models are routinely compared by means of their prediction performance in independent validation data. If only one data set is available for training and validation, then rival strategies can still be compared based on repeated bootstraps of the same data. Often, however, the overall performance of rival strategies is similar and it is thus difficult to decide for one model. Here, we investigate the variability of the prediction models that results when the same modelling strategy is applied to different training sets. For each modelling strategy we estimate a confidence score based on the same repeated bootstraps. A new decomposition of the expected Brier score is obtained, as well as the estimates of population average confidence scores. The latter can be used to distinguish rival prediction models with similar prediction performances. Furthermore, on the subject level a confidence score may provide useful supplementary information for new patients who want to base a medical decision on predicted risk. The ideas are illustrated and discussed using data from cancer studies, also with high-dimensional predictor space.

AB - In medical statistics, many alternative strategies are available for building a prediction model based on training data. Prediction models are routinely compared by means of their prediction performance in independent validation data. If only one data set is available for training and validation, then rival strategies can still be compared based on repeated bootstraps of the same data. Often, however, the overall performance of rival strategies is similar and it is thus difficult to decide for one model. Here, we investigate the variability of the prediction models that results when the same modelling strategy is applied to different training sets. For each modelling strategy we estimate a confidence score based on the same repeated bootstraps. A new decomposition of the expected Brier score is obtained, as well as the estimates of population average confidence scores. The latter can be used to distinguish rival prediction models with similar prediction performances. Furthermore, on the subject level a confidence score may provide useful supplementary information for new patients who want to base a medical decision on predicted risk. The ideas are illustrated and discussed using data from cancer studies, also with high-dimensional predictor space.

U2 - 10.1002/bimj.201000157

DO - 10.1002/bimj.201000157

M3 - Journal article

C2 - 21328604

SN - 0323-3847

VL - 53

SP - 259

EP - 274

JO - Biometrical Journal

JF - Biometrical Journal

IS - 2

ER -

Confidence scores for prediction models

Abstract

UN SDGs

Access to Document

Fingerprint

Cite this