Learning Cross-Modality Representations from Multi-Modal Images

Gijs van Tulder; Marleen de Bruijne

doi:10.1109/TMI.2018.2868977

17 Citations (Scopus)

58 Downloads (Pure)

Abstract

Machine learning algorithms can have difficulties adapting to data from different sources, for example from different imaging modalities. We present and analyze three techniques for unsupervised cross-modality feature learning, using a shared autoencoder-like convolutional network that learns a common representation from multi-modal data. We investigate a form of feature normalization, a learning objective that minimizes cross-modality differences, and modality dropout, in which the network is trained with varying subsets of modalities. We measure the same-modality and cross-modality classification accuracies and explore whether the models learn modality-specific or shared features. This paper presents experiments on two public datasets, with knee images from two MRI modalities, provided by the Osteoarthritis Initiative, and brain tumor segmentation on four MRI modalities from the BRATS challenge. All three approaches improved the cross-modality classification accuracy, with modality dropout and per-feature normalization giving the largest improvement. We observed that the networks tend to learn a combination of cross-modality and modality-specific features. Overall, a combination of all three methods produced the most cross-modality features and the highest cross-modality classification accuracy, while maintaining most of the same-modality accuracy.
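
The abstract describes modality dropout only as training "with varying subsets of modalities". The sketch below illustrates that idea in plain Python and is not the authors' implementation: the function names, the keep probability, the zero fill value and the modality labels in the usage example are all assumptions made for illustration.

```python
import random


def sample_modality_subset(modalities, keep_prob=0.5, rng=random):
    """Pick a random non-empty subset of the available modalities.

    keep_prob is a hypothetical parameter; the abstract does not say
    how the subsets are drawn.
    """
    kept = [m for m in modalities if rng.random() < keep_prob]
    if not kept:
        # Never drop everything: fall back to one random modality.
        kept = [rng.choice(modalities)]
    return kept


def apply_modality_dropout(sample, kept, fill=0.0):
    """Replace the values of every dropped modality by `fill`.

    sample: dict mapping a modality name to a flat list of pixel values.
    Filling with zeros is an assumption, not taken from the paper.
    """
    return {
        name: list(values) if name in kept else [fill] * len(values)
        for name, values in sample.items()
    }


# Usage: draw a fresh subset for every training example.
rng = random.Random(0)
sample = {"T1": [0.2, 0.7], "T2": [0.5, 0.1], "FLAIR": [0.9, 0.3]}
kept = sample_modality_subset(list(sample), keep_prob=0.5, rng=rng)
network_input = apply_modality_dropout(sample, kept)
```

Because a new subset is drawn for each example, a network trained on such inputs sees every modality both present and absent, which is the property the abstract attributes to modality dropout.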

Original language	English
Journal	IEEE Transactions on Medical Imaging
Volume	38
Issue number	2
Pages (from-to)	638-648
ISSN	0278-0062
DOI	https://doi.org/10.1109/TMI.2018.2868977
Status	Published - Feb. 2019

Citation formats

@article{3255e7bb17e44a8eab0d5f1bf2d3018f,

title = "Learning Cross-Modality Representations from Multi-Modal Images",

abstract = "Machine learning algorithms can have difficulties adapting to data from different sources, for example from different imaging modalities. We present and analyze three techniques for unsupervised cross-modality feature learning, using a shared autoencoder-like convolutional network that learns a common representation from multi-modal data. We investigate a form of feature normalization, a learning objective that minimizes cross-modality differences, and modality dropout, in which the network is trained with varying subsets of modalities. We measure the same-modality and cross-modality classification accuracies and explore whether the models learn modality-specific or shared features. This paper presents experiments on two public datasets, with knee images from two MRI modalities, provided by the Osteoarthritis Initiative, and brain tumor segmentation on four MRI modalities from the BRATS challenge. All three approaches improved the cross-modality classification accuracy, with modality dropout and per-feature normalization giving the largest improvement. We observed that the networks tend to learn a combination of cross-modality and modality-specific features. Overall, a combination of all three methods produced the most cross-modality features and the highest cross-modality classification accuracy, while maintaining most of the same-modality accuracy.",

keywords = "Autoencoders, Deep learning, Representation learning, Transfer learning",

author = "{van Tulder}, Gijs and {de Bruijne}, Marleen",

year = "2019",

month = feb,

doi = "10.1109/TMI.2018.2868977",

language = "English",

volume = "38",

pages = "638--648",

journal = "IEEE Transactions on Medical Imaging",

issn = "0278-0062",

publisher = "Institute of Electrical and Electronics Engineers",

number = "2",

}

TY - JOUR

T1 - Learning Cross-Modality Representations from Multi-Modal Images

AU - van Tulder, Gijs

AU - de Bruijne, Marleen

PY - 2019/2

Y1 - 2019/2

AB - Machine learning algorithms can have difficulties adapting to data from different sources, for example from different imaging modalities. We present and analyze three techniques for unsupervised cross-modality feature learning, using a shared autoencoder-like convolutional network that learns a common representation from multi-modal data. We investigate a form of feature normalization, a learning objective that minimizes cross-modality differences, and modality dropout, in which the network is trained with varying subsets of modalities. We measure the same-modality and cross-modality classification accuracies and explore whether the models learn modality-specific or shared features. This paper presents experiments on two public datasets, with knee images from two MRI modalities, provided by the Osteoarthritis Initiative, and brain tumor segmentation on four MRI modalities from the BRATS challenge. All three approaches improved the cross-modality classification accuracy, with modality dropout and per-feature normalization giving the largest improvement. We observed that the networks tend to learn a combination of cross-modality and modality-specific features. Overall, a combination of all three methods produced the most cross-modality features and the highest cross-modality classification accuracy, while maintaining most of the same-modality accuracy.

KW - Autoencoders

KW - Deep learning

KW - Representation learning

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85052900021&partnerID=8YFLogxK

U2 - 10.1109/TMI.2018.2868977

DO - 10.1109/TMI.2018.2868977

M3 - Journal article

C2 - 30188817

SN - 0278-0062

VL - 38

SP - 638

EP - 648

JO - IEEE Transactions on Medical Imaging

JF - IEEE Transactions on Medical Imaging

IS - 2

ER -
