Analysis and modeling of "focus" in context

Dirk Hovy; Gopala Anumanchipalli; Alok Parlikar; Carolin Vaughn; Adam Lammert; Eduard Hovy; Alan W Black

Analysis and modeling of "focus" in context

Dirk Hovy, Gopala Anumanchipalli, Alok Parlikar, Carolin Vaughn, Adam Lammert, Eduard Hovy, Alan W Black

LUKKET: Center for Sprogteknologi

4 Citationer (Scopus)

Abstract

This paper uses a crowd-sourced definition of a speech phenomenon we have called focus. Given sentences, text and speech, in isolation and in context, we asked annotators to identify what we term the focus word. We present their consistency in identifying the focused word, when presented with text or speech stimuli. We then build models to show how well we predict that focus word from lexical (and higher) level features. Also, using spectral and prosodic information, we show the differences in these focus words when spoken with and without context. Finally, we show how we can improve speech synthesis of these utterances given focus information.

Originalsprog	Engelsk
Titel	Proceedings of the 14th Annual Conference of the International Speech Communication Association : Interspeech 2013
Antal sider	5
Forlag	International Speech Communication Association (ISCA)
Publikationsdato	2013
ISBN (Elektronisk)	978-1-62993-443-3
Status	Udgivet - 2013

Citationsformater

Analysis and modeling of "focus" in context. / Hovy, Dirk; Anumanchipalli, Gopala; Parlikar, Alok et al.

Proceedings of the 14th Annual Conference of the International Speech Communication Association : Interspeech 2013. International Speech Communication Association (ISCA), 2013.

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › peer review

@inproceedings{d8f79e7764a347fd9faab777fa392073,

title = "Analysis and modeling of {"}focus{"} in context",

abstract = "This paper uses a crowd-sourced definition of a speech phenomenon we have called focus. Given sentences, text and speech, in isolation and in context, we asked annotators to identify what we term the focus word. We present their consistency in identifying the focused word, when presented with text or speech stimuli. We then build models to show how well we predict that focus word from lexical (and higher) level features. Also, using spectral and prosodic information, we show the differences in these focus words when spoken with and without context. Finally, we show how we can improve speech synthesis of these utterances given focus information.",

author = "Dirk Hovy and Gopala Anumanchipalli and Alok Parlikar and Carolin Vaughn and Adam Lammert and Eduard Hovy and Black, {Alan W}",

note = "Submitted while first author was still at USC and visiting scholar at CMU",

year = "2013",

language = "English",

booktitle = "Proceedings of the 14th Annual Conference of the International Speech Communication Association",

publisher = "International Speech Communication Association (ISCA)",

}

TY - GEN

T1 - Analysis and modeling of "focus" in context

AU - Hovy, Dirk

AU - Anumanchipalli, Gopala

AU - Parlikar, Alok

AU - Vaughn, Carolin

AU - Lammert, Adam

AU - Hovy, Eduard

AU - Black, Alan W

N1 - Submitted while first author was still at USC and visiting scholar at CMU

PY - 2013

Y1 - 2013

N2 - This paper uses a crowd-sourced definition of a speech phenomenon we have called focus. Given sentences, text and speech, in isolation and in context, we asked annotators to identify what we term the focus word. We present their consistency in identifying the focused word, when presented with text or speech stimuli. We then build models to show how well we predict that focus word from lexical (and higher) level features. Also, using spectral and prosodic information, we show the differences in these focus words when spoken with and without context. Finally, we show how we can improve speech synthesis of these utterances given focus information.

AB - This paper uses a crowd-sourced definition of a speech phenomenon we have called focus. Given sentences, text and speech, in isolation and in context, we asked annotators to identify what we term the focus word. We present their consistency in identifying the focused word, when presented with text or speech stimuli. We then build models to show how well we predict that focus word from lexical (and higher) level features. Also, using spectral and prosodic information, we show the differences in these focus words when spoken with and without context. Finally, we show how we can improve speech synthesis of these utterances given focus information.

M3 - Article in proceedings

BT - Proceedings of the 14th Annual Conference of the International Speech Communication Association

PB - International Speech Communication Association (ISCA)

ER -

Analysis and modeling of "focus" in context

Abstract

Fingeraftryk

Citationsformater