An empirical study of differences between conversion schemes and annotation guidelines

Abstract

We establish quantitative methods for comparing and estimating the quality of dependency annotations or conversion schemes. We use generalized tree-edit distance to measure divergence between annotations and propose theoretical learnability, derivational perplexity and downstream performance for evaluation. We present systematic experiments with tree-to-dependency conversions of the Penn-III treebank, as well as observations from experiments using treebanks from multiple languages. Our most important observations are: (a) parser bias makes most parsers insensitive to non-local differences between annotations, but (b) choice of annotation nevertheless has significant impact on most downstream applications, and (c) while learnability does not correlate with downstream performance, learnable annotations will lead to more robust performance across domains.

OriginalsprogEngelsk
TitelDepLing 2013 : Proceedings of the Second International Conference on Dependency Linguistics 2013
Antal sider9
UdgivelsesstedPrag
ForlagAssociation for Computational Linguistics
Publikationsdato2013
Sider298-306
ISBN (Elektronisk)978-80-7378-240-5
StatusUdgivet - 2013
BegivenhedInternational Conference on Dependency Linguistics: DepLing - Prag, Tjekkiet
Varighed: 27 aug. 201330 aug. 2013
Konferencens nummer: 2

Konference

KonferenceInternational Conference on Dependency Linguistics
Nummer2
Land/OmrådeTjekkiet
ByPrag
Periode27/08/201330/08/2013

Fingeraftryk

Dyk ned i forskningsemnerne om 'An empirical study of differences between conversion schemes and annotation guidelines'. Sammen danner de et unikt fingeraftryk.

Citationsformater