Why does synthesized data improve multi-sequence classification?

Gijs van Tulder, Marleen de Bruijne

36 Citations (Scopus)

Abstract

The classification and registration of incomplete multi-modal medical images, such as multi-sequence MRI with missing sequences, can sometimes be improved by replacing the missing modalities with synthetic data. This may seem counter-intuitive: synthetic data is derived from data that is already available, so it does not add new information. Why can it still improve performance? In this paper we discuss possible explanations. If the synthesis model is more flexible than the classifier, the synthesis model can provide features that the classifier could not have extracted from the original data. In addition, using synthetic information to complete incomplete samples increases the size of the training set. We present experiments with two classifiers, linear support vector machines (SVMs) and random forests, together with two synthesis methods that can replace missing data in an image classification problem: neural networks and restricted Boltzmann machines (RBMs). We used data from the BRATS 2013 brain tumor segmentation challenge, which includes multi-modal MRI scans with T1, T1 post-contrast, T2 and FLAIR sequences. The linear SVMs appear to benefit from the complex transformations offered by the synthesis models, whereas the random forests mostly benefit from having more training data. Training on the hidden representation from the RBM brought the accuracy of the linear SVMs close to that of random forests.
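The two explanations in the abstract can be illustrated with a toy sketch. It is not the authors' experimental setup. It assumes one-dimensional made-up data in place of the BRATS scans, a k-nearest-neighbour regressor in place of the neural network and RBM synthesis models, and a nearest-class-mean classifier in place of the linear SVM. All function and variable names are hypothetical. The observed sequence `a` relates non-linearly to the missing sequence `b`, so no single threshold on `a` separates the classes, while a flexible synthesis model can supply a feature that a linear classifier can use. The synthesized values also let the incomplete training samples be used alongside the complete ones.

```python
import random

def knn_synthesize(train_a, train_b, query_a, k=5):
    """Estimate the missing sequence b by averaging b over the k complete
    samples whose observed sequence a is closest to the query."""
    nearest = sorted(range(len(train_a)),
                     key=lambda i: abs(train_a[i] - query_a))[:k]
    return sum(train_b[i] for i in nearest) / k

def fit_nearest_mean(X, y):
    """Nearest-class-mean classifier; its decision boundary is linear.
    (Assumed stand-in for the linear SVM in the paper.)"""
    means = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X, y) if t == label]
        means[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return means

def predict(means, x):
    def dist(label):
        return sum((xi - mi) ** 2 for xi, mi in zip(x, means[label]))
    return 0 if dist(0) <= dist(1) else 1

def accuracy(means, X, y):
    return sum(predict(means, x) == t for x, t in zip(X, y)) / len(y)

def make_samples(n, rng):
    """Toy data (an assumption): a is always observed, b = a**2 may be
    missing, and the label depends on b."""
    a = [rng.uniform(-1, 1) for _ in range(n)]
    b = [v * v for v in a]
    y = [1 if v > 0.25 else 0 for v in b]
    return a, b, y

rng = random.Random(0)
a_full, b_full, y_full = make_samples(200, rng)  # complete training samples
a_part, _, y_part = make_samples(200, rng)       # training samples without b
a_test, _, y_test = make_samples(500, rng)       # test samples without b

train_a = a_full + a_part
train_y = y_full + y_part

# Baseline: linear classifier on the observed sequence only.
raw_model = fit_nearest_mean([[v] for v in train_a], train_y)
acc_raw = accuracy(raw_model, [[v] for v in a_test], y_test)

def complete(a_values):
    """Fill in b with the synthesis model fitted on the complete samples."""
    return [[v, knn_synthesize(a_full, b_full, v)] for v in a_values]

# With synthesis: real b where available, synthesized b elsewhere, so the
# incomplete samples enlarge the training set.
X_train = [[a, b] for a, b in zip(a_full, b_full)] + complete(a_part)
synth_model = fit_nearest_mean(X_train, train_y)
acc_synth = accuracy(synth_model, complete(a_test), y_test)

print(f"observed sequence only:    {acc_raw:.2f}")
print(f"with synthesized sequence: {acc_synth:.2f}")
```

In this sketch the synthesized feature adds no new information, since it is computed from `a` alone. It helps because the linear classifier cannot represent the non-linear mapping from `a` to `b` itself. A more flexible classifier such as a random forest could learn that mapping directly, which is consistent with the abstract's observation that random forests benefit mainly from the extra training data.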

Original language: English
Title: Medical image computing and computer assisted interventions - MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part I
Number of pages: 8
Publisher: Springer
Publication date: 2015
Pages: 531-538
ISBN (Print): 978-3-319-24552-2
ISBN (Electronic): 978-3-319-24553-9
Status: Published - 2015
Event: International Conference on Medical Image Computing and Computer Assisted Intervention 2015 - Munich, Germany
Duration: 5 Oct 2015 to 9 Oct 2015
Conference number: 18

Conference

Conference: International Conference on Medical Image Computing and Computer Assisted Intervention 2015
Number: 18
Country/Territory: Germany
City: Munich
Period: 05/10/2015 to 09/10/2015
Name: Lecture notes in computer science
Volume: 9349
ISSN: 0302-9743
