Adversarial Evaluation of Multimodal Machine Translation

    Abstract

    The promise of combining vision and language in multimodal machine translation is that systems will produce better translations by leveraging the image data. However, inconsistent results have lead to uncertainty about whether the images actually improve translation quality. We present an adversarial evaluation method to directly examine the utility of the image data in this task. Our evaluation measures whether multimodal translation systems perform better given either the congruent image or a random incongruent image, in addition to the correct source language sentence. We find that two out of three publicly available systems are sensitive to this perturbation of the data, and recommend that all systems pass this evaluation in the future.

    OriginalsprogEngelsk
    TitelProceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
    Antal sider5
    Publikationsdato2018
    Sider2974-2978
    StatusUdgivet - 2018

    Fingeraftryk

    Dyk ned i forskningsemnerne om 'Adversarial Evaluation of Multimodal Machine Translation'. Sammen danner de et unikt fingeraftryk.

    Citationsformater