Transfer learning of feedback head expressions in Danish and Polish comparable multimodal corpora

Costanza Navarretta; Magdalena Lis

Transfer learning of feedback head expressions in Danish and Polish comparable multimodal corpora

LUKKET: Center for Sprogteknologi

2 Citationer (Scopus)

Abstract

The paper is an investigation of the reusability of the annotations of head movements in a corpus in a language to predict the feedback functions of head movements in a comparable corpus in another language. The two corpora consist of naturally occurring triadic conversations in Danish and Polish, which were annotated according to the same scheme. The intersection of common annotation features was used in the experiments. A Naïve Bayes classifier was trained on the annotations of a corpus and tested on the annotations of the other corpus. Training and test datasets were then reversed and the experiments repeated. The results show that the classifier identifies more feedback behaviours than the majority baseline in both cases and the improvements are significant. The performance of the classifier decreases significantly compared with the results obtained when training and test data belong to the same corpus. Annotating multimodal data is resource consuming, thus the results are promising. However, they also confirm preceding studies that have identified both similarities and differences in the use of feedback head movements in different languages. Since our datasets are small and only regard a communicative behaviour in two languages, the experiments should be tested on more data types.

Originalsprog	Engelsk
Titel	Proceedings of the 9th International Conference on Language Resources and Evaluation : LREC2014
Udgivelsessted	Reykjavik, Iceland
Forlag	European Language Resources Association
Publikationsdato	2014
Sider	3597-3603
ISBN (Elektronisk)	978-2-9517408-8-4
Status	Udgivet - 2014

Adgang til dokumentet

http://www.lrec-conf.org/proceedings/lrec2014/index.html

Citationsformater

Transfer learning of feedback head expressions in Danish and Polish comparable multimodal corpora. / Navarretta, Costanza; Lis, Magdalena.

Proceedings of the 9th International Conference on Language Resources and Evaluation: LREC2014. Reykjavik, Iceland : European Language Resources Association, 2014. s. 3597-3603.

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › peer review

@inproceedings{751bc74879624615ab444699ecce671a,

title = "Transfer learning of feedback head expressions in Danish and Polish comparable multimodal corpora",

abstract = "The paper is an investigation of the reusability of the annotations of head movements in a corpus in a language to predict the feedback functions of head movements in a comparable corpus in another language. The two corpora consist of naturally occurring triadic conversations in Danish and Polish, which were annotated according to the same scheme. The intersection of common annotation features was used in the experiments. A Na{\"i}ve Bayes classifier was trained on the annotations of a corpus and tested on the annotations of the other corpus. Training and test datasets were then reversed and the experiments repeated. The results show that the classifier identifies more feedback behaviours than the majority baseline in both cases and the improvements are significant. The performance of the classifier decreases significantly compared with the results obtained when training and test data belong to the same corpus. Annotating multimodal data is resource consuming, thus the results are promising. However, they also confirm preceding studies that have identified both similarities and differences in the use of feedback head movements in different languages. Since our datasets are small and only regard a communicative behaviour in two languages, the experiments should be tested on more data types.",

author = "Costanza Navarretta and Magdalena Lis",

year = "2014",

language = "English",

pages = "3597--3603",

booktitle = "Proceedings of the 9th International Conference on Language Resources and Evaluation",

publisher = "European Language Resources Association",

}

TY - GEN

T1 - Transfer learning of feedback head expressions in Danish and Polish comparable multimodal corpora

AU - Navarretta, Costanza

AU - Lis, Magdalena

PY - 2014

Y1 - 2014

N2 - The paper is an investigation of the reusability of the annotations of head movements in a corpus in a language to predict the feedback functions of head movements in a comparable corpus in another language. The two corpora consist of naturally occurring triadic conversations in Danish and Polish, which were annotated according to the same scheme. The intersection of common annotation features was used in the experiments. A Naïve Bayes classifier was trained on the annotations of a corpus and tested on the annotations of the other corpus. Training and test datasets were then reversed and the experiments repeated. The results show that the classifier identifies more feedback behaviours than the majority baseline in both cases and the improvements are significant. The performance of the classifier decreases significantly compared with the results obtained when training and test data belong to the same corpus. Annotating multimodal data is resource consuming, thus the results are promising. However, they also confirm preceding studies that have identified both similarities and differences in the use of feedback head movements in different languages. Since our datasets are small and only regard a communicative behaviour in two languages, the experiments should be tested on more data types.

AB - The paper is an investigation of the reusability of the annotations of head movements in a corpus in a language to predict the feedback functions of head movements in a comparable corpus in another language. The two corpora consist of naturally occurring triadic conversations in Danish and Polish, which were annotated according to the same scheme. The intersection of common annotation features was used in the experiments. A Naïve Bayes classifier was trained on the annotations of a corpus and tested on the annotations of the other corpus. Training and test datasets were then reversed and the experiments repeated. The results show that the classifier identifies more feedback behaviours than the majority baseline in both cases and the improvements are significant. The performance of the classifier decreases significantly compared with the results obtained when training and test data belong to the same corpus. Annotating multimodal data is resource consuming, thus the results are promising. However, they also confirm preceding studies that have identified both similarities and differences in the use of feedback head movements in different languages. Since our datasets are small and only regard a communicative behaviour in two languages, the experiments should be tested on more data types.

M3 - Article in proceedings

SP - 3597

EP - 3603

BT - Proceedings of the 9th International Conference on Language Resources and Evaluation

PB - European Language Resources Association

CY - Reykjavik, Iceland

ER -

Transfer learning of feedback head expressions in Danish and Polish comparable multimodal corpora

Abstract

Adgang til dokumentet

Fingeraftryk

Citationsformater