Learning Cross-Modality Representations from Multi-Modal Images

Gijs van Tulder; Marleen de Bruijne

doi:10.1109/TMI.2018.2868977

Abstract

Machine learning algorithms can have difficulties adapting to data from different sources, for example from different imaging modalities. We present and analyze three techniques for unsupervised cross-modality feature learning, using a shared autoencoder-like convolutional network that learns a common representation from multi-modal data. We investigate a form of feature normalization, a learning objective that minimizes cross-modality differences, and modality dropout, in which the network is trained with varying subsets of modalities. We measure the same-modality and cross-modality classification accuracies and explore whether the models learn modality-specific or shared features. This paper presents experiments on two public datasets, with knee images from two MRI modalities, provided by the Osteoarthritis Initiative, and brain tumor segmentation on four MRI modalities from the BRATS challenge. All three approaches improved the cross-modality classification accuracy, with modality dropout and per-feature normalization giving the largest improvement. We observed that the networks tend to learn a combination of cross-modality and modality-specific features. Overall, a combination of all three methods produced the most cross-modality features and the highest cross-modality classification accuracy, while maintaining most of the same-modality accuracy.
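As a rough illustration of the modality dropout idea mentioned in the abstract (training with varying subsets of modalities), the sketch below hides a random subset of modalities in one training sample. It is not the authors' implementation: this record does not describe their procedure, so the independent per-modality drop probability, the zero-filling of dropped modalities, the rule that at least one modality is always kept, the function name `modality_dropout`, and the modality names "T1" and "T2" are all assumptions made for the example.

```python
import random


def modality_dropout(sample, drop_prob=0.5, rng=None):
    """Return a copy of `sample` with a random subset of modalities zeroed.

    `sample` maps a modality name to a flat list of intensities.
    Assumption: each modality is dropped independently with probability
    `drop_prob`, and at least one modality is always kept.
    """
    rng = rng or random.Random()
    names = list(sample)
    # Decide independently for each modality whether it survives.
    kept = [name for name in names if rng.random() >= drop_prob]
    if not kept:
        # Never hide every modality: fall back to one random survivor.
        kept = [rng.choice(names)]
    return {
        name: list(values) if name in kept else [0.0] * len(values)
        for name, values in sample.items()
    }


# Hypothetical two-modality patch; the names and values are placeholders.
patch = {"T1": [0.2, 0.8, 0.5], "T2": [0.9, 0.1, 0.4]}
dropped = modality_dropout(patch, drop_prob=0.5, rng=random.Random(0))
```

Applying such a function to every sample in every training step would expose a shared network to all combinations of available modalities, which is one plausible way to encourage features that do not depend on a single modality.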

Original language	English
Journal	IEEE Transactions on Medical Imaging
Volume	38
Issue number	2
Pages (from-to)	638-648
ISSN	0278-0062
DOIs	https://doi.org/10.1109/TMI.2018.2868977
Publication status	Published - Feb 2019

Keywords

Autoencoders
Deep learning
Representation learning
Transfer learning

Cite this

@article{3255e7bb17e44a8eab0d5f1bf2d3018f,

title = "Learning Cross-Modality Representations from Multi-Modal Images",

abstract = "Machine learning algorithms can have difficulties adapting to data from different sources, for example from different imaging modalities. We present and analyze three techniques for unsupervised cross-modality feature learning, using a shared autoencoder-like convolutional network that learns a common representation from multi-modal data. We investigate a form of feature normalization, a learning objective that minimizes cross-modality differences, and modality dropout, in which the network is trained with varying subsets of modalities. We measure the same-modality and cross-modality classification accuracies and explore whether the models learn modality-specific or shared features. This paper presents experiments on two public datasets, with knee images from two MRI modalities, provided by the Osteoarthritis Initiative, and brain tumor segmentation on four MRI modalities from the BRATS challenge. All three approaches improved the cross-modality classification accuracy, with modality dropout and per-feature normalization giving the largest improvement. We observed that the networks tend to learn a combination of cross-modality and modality-specific features. Overall, a combination of all three methods produced the most cross-modality features and the highest cross-modality classification accuracy, while maintaining most of the same-modality accuracy.",

keywords = "Autoencoders, Deep learning, Representation learning, Transfer learning",

author = "{van Tulder}, Gijs and {de Bruijne}, Marleen",

year = "2019",

month = feb,

doi = "10.1109/TMI.2018.2868977",

language = "English",

volume = "38",

pages = "638--648",

journal = "IEEE Transactions on Medical Imaging",

issn = "0278-0062",

publisher = "Institute of Electrical and Electronics Engineers",

number = "2",

}

TY - JOUR

T1 - Learning Cross-Modality Representations from Multi-Modal Images

AU - van Tulder, Gijs

AU - de Bruijne, Marleen

PY - 2019/2

Y1 - 2019/2

N2 - Machine learning algorithms can have difficulties adapting to data from different sources, for example from different imaging modalities. We present and analyze three techniques for unsupervised cross-modality feature learning, using a shared autoencoder-like convolutional network that learns a common representation from multi-modal data. We investigate a form of feature normalization, a learning objective that minimizes cross-modality differences, and modality dropout, in which the network is trained with varying subsets of modalities. We measure the same-modality and cross-modality classification accuracies and explore whether the models learn modality-specific or shared features. This paper presents experiments on two public datasets, with knee images from two MRI modalities, provided by the Osteoarthritis Initiative, and brain tumor segmentation on four MRI modalities from the BRATS challenge. All three approaches improved the cross-modality classification accuracy, with modality dropout and per-feature normalization giving the largest improvement. We observed that the networks tend to learn a combination of cross-modality and modality-specific features. Overall, a combination of all three methods produced the most cross-modality features and the highest cross-modality classification accuracy, while maintaining most of the same-modality accuracy.

AB - Machine learning algorithms can have difficulties adapting to data from different sources, for example from different imaging modalities. We present and analyze three techniques for unsupervised cross-modality feature learning, using a shared autoencoder-like convolutional network that learns a common representation from multi-modal data. We investigate a form of feature normalization, a learning objective that minimizes cross-modality differences, and modality dropout, in which the network is trained with varying subsets of modalities. We measure the same-modality and cross-modality classification accuracies and explore whether the models learn modality-specific or shared features. This paper presents experiments on two public datasets, with knee images from two MRI modalities, provided by the Osteoarthritis Initiative, and brain tumor segmentation on four MRI modalities from the BRATS challenge. All three approaches improved the cross-modality classification accuracy, with modality dropout and per-feature normalization giving the largest improvement. We observed that the networks tend to learn a combination of cross-modality and modality-specific features. Overall, a combination of all three methods produced the most cross-modality features and the highest cross-modality classification accuracy, while maintaining most of the same-modality accuracy.

KW - Autoencoders

KW - Deep learning

KW - Representation learning

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85052900021&partnerID=8YFLogxK

U2 - 10.1109/TMI.2018.2868977

DO - 10.1109/TMI.2018.2868977

M3 - Journal article

C2 - 30188817

SN - 0278-0062

VL - 38

SP - 638

EP - 648

JO - IEEE Transactions on Medical Imaging

JF - IEEE Transactions on Medical Imaging

IS - 2

ER -
