Abstract
Two studies on multilingual multimodal image description provide empirical evidence towards two questions at the core of the task: (i) whether target language speakers prefer descriptions generated directly in their native language, as compared to descriptions translated from a different language; (ii) whether images improve human translation of descriptions. These results provide guidance for future work in multimodal natural language processing by first showing that on the whole, translations are not distinguished from native language descriptions, and second delineating and quantifying the information gained from the image during the human translation task.
Original language | English |
---|---|
Journal | Journal of Natural Language Engineering |
Volume | 24 |
Issue number | 3 |
Pages (from-to) | 393-413 |
Number of pages | 21 |
Publication status | Published - 1 May 2018 |