What and where: An empirical investigation of pointing gestures and descriptions in multimodal referring actions

Albert Gatt; Patrizia Paggio

Centre for Language Technology

Abstract

Pointing gestures are pervasive in human referring actions, and are often combined with spoken descriptions. Combining gesture and speech naturally to refer to objects is an essential task in multimodal NLG systems. However, the way gesture and speech should be combined in a referring act remains an open question. In particular, it is not clear whether, in planning a pointing gesture in conjunction with a description, an NLG system should seek to minimise the redundancy between them, e.g. by letting the pointing gesture indicate locative information, with other, non-locative properties of a referent included in the description. This question has a bearing on whether the gestural and spoken parts of referring acts are planned separately or arise from a common underlying computational mechanism. This paper investigates this question empirically, using machine-learning techniques on a new corpus of dialogues involving multimodal references to objects. Our results indicate that human pointing strategies interact with descriptive strategies. In particular, pointing gestures are strongly associated with the use of locative features in referring expressions.

Original language	English
Title of host publication	Proceedings of the 14th European Workshop on Natural Language Generation : ENLG'13
Publisher	Association for Computational Linguistics
Publication date	2013
Pages	82-91
ISBN (Electronic)	978-1-937284-56-5
Publication status	Published - 2013
Event	14th European Workshop on Natural Language Generation (ENLG'13) - Sofia, Bulgaria. Duration: 8 Aug 2013 → 9 Aug 2013

Workshop

Workshop	14th European Workshop on Natural Language Generation (ENLG'13)
Country/Territory	Bulgaria
City	Sofia
Period	08/08/2013 → 09/08/2013

Access to Document

http://www.aclweb.org/anthology/siggen.html#2013_0

Cite this

What and where: An empirical investigation of pointing gestures and descriptions in multimodal referring actions. / Gatt, Albert; Paggio, Patrizia.
Proceedings of the 14th European Workshop on Natural Language Generation: ENLG'13. Association for Computational Linguistics, 2013. p. 82-91.

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Gatt, A & Paggio, P 2013, What and where: An empirical investigation of pointing gestures and descriptions in multimodal referring actions. in Proceedings of the 14th European Workshop on Natural Language Generation: ENLG'13. Association for Computational Linguistics, pp. 82-91, 14th European Workshop on Natural Language Generation (ENLG'13), Sofia, Bulgaria, 08/08/2013. <http://www.aclweb.org/anthology/siggen.html#2013_0>

@inproceedings{298ac97bdb424c88a4bb9685404b059d,

title = "What and where: An empirical investigation of pointing gestures and descriptions in multimodal referring actions",

abstract = "Pointing gestures are pervasive in human referring actions, and are often combined with spoken descriptions. Combining gesture and speech naturally to refer to objects is an essential task in multimodal NLG systems. However, the way gesture and speech should be combined in a referring act remains an open question. In particular, it is not clear whether, in planning a pointing gesture in conjunction with a description, an NLG system should seek to minimise the redundancy between them, e.g. by letting the pointing gesture indicate locative information, with other, non-locative properties of a referent included in the description. This question has a bearing on whether the gestural and spoken parts of referring acts are planned separately or arise from a common underlying computational mechanism. This paper investigates this question empirically, using machine-learning techniques on a new corpus of dialogues involving multimodal references to objects. Our results indicate that human pointing strategies interact with descriptive strategies. In particular, pointing gestures are strongly associated with the use of locative features in referring expressions.",

author = "Albert Gatt and Patrizia Paggio",

year = "2013",

language = "English",

pages = "82--91",

booktitle = "Proceedings of the 14th European Workshop on Natural Language Generation",

publisher = "Association for Computational Linguistics",

note = "14th European Workshop on Natural Language Generation (ENLG'13) ; Conference date: 08-08-2013 Through 09-08-2013",

}

TY - GEN

T1 - What and where: An empirical investigation of pointing gestures and descriptions in multimodal referring actions

AU - Gatt, Albert

AU - Paggio, Patrizia

PY - 2013

Y1 - 2013

N2 - Pointing gestures are pervasive in human referring actions, and are often combined with spoken descriptions. Combining gesture and speech naturally to refer to objects is an essential task in multimodal NLG systems. However, the way gesture and speech should be combined in a referring act remains an open question. In particular, it is not clear whether, in planning a pointing gesture in conjunction with a description, an NLG system should seek to minimise the redundancy between them, e.g. by letting the pointing gesture indicate locative information, with other, non-locative properties of a referent included in the description. This question has a bearing on whether the gestural and spoken parts of referring acts are planned separately or arise from a common underlying computational mechanism. This paper investigates this question empirically, using machine-learning techniques on a new corpus of dialogues involving multimodal references to objects. Our results indicate that human pointing strategies interact with descriptive strategies. In particular, pointing gestures are strongly associated with the use of locative features in referring expressions.

AB - Pointing gestures are pervasive in human referring actions, and are often combined with spoken descriptions. Combining gesture and speech naturally to refer to objects is an essential task in multimodal NLG systems. However, the way gesture and speech should be combined in a referring act remains an open question. In particular, it is not clear whether, in planning a pointing gesture in conjunction with a description, an NLG system should seek to minimise the redundancy between them, e.g. by letting the pointing gesture indicate locative information, with other, non-locative properties of a referent included in the description. This question has a bearing on whether the gestural and spoken parts of referring acts are planned separately or arise from a common underlying computational mechanism. This paper investigates this question empirically, using machine-learning techniques on a new corpus of dialogues involving multimodal references to objects. Our results indicate that human pointing strategies interact with descriptive strategies. In particular, pointing gestures are strongly associated with the use of locative features in referring expressions.

M3 - Article in proceedings

SP - 82

EP - 91

BT - Proceedings of the 14th European Workshop on Natural Language Generation

PB - Association for Computational Linguistics

T2 - 14th European Workshop on Natural Language Generation (ENLG'13)

Y2 - 8 August 2013 through 9 August 2013

ER -
