Abstract
We establish quantitative methods for comparing and estimating the quality of dependency annotations or conversion schemes. We use generalized tree-edit distance to measure divergence between annotations and propose theoretical learnability, derivational perplexity and downstream performance for evaluation. We present systematic experiments with tree-to-dependency conversions of the Penn-III treebank, as well as observations from experiments using treebanks from multiple languages. Our most important observations are: (a) parser bias makes most parsers insensitive to non-local differences between annotations, but (b) choice of annotation nevertheless has significant impact on most downstream applications, and (c) while learnability does not correlate with downstream performance, learnable annotations will lead to more robust performance across domains.
Originalsprog | Engelsk |
---|---|
Titel | DepLing 2013 : Proceedings of the Second International Conference on Dependency Linguistics 2013 |
Antal sider | 9 |
Udgivelsessted | Prag |
Forlag | Association for Computational Linguistics |
Publikationsdato | 2013 |
Sider | 298-306 |
ISBN (Elektronisk) | 978-80-7378-240-5 |
Status | Udgivet - 2013 |
Begivenhed | International Conference on Dependency Linguistics: DepLing - Prag, Tjekkiet Varighed: 27 aug. 2013 → 30 aug. 2013 Konferencens nummer: 2 |
Konference
Konference | International Conference on Dependency Linguistics |
---|---|
Nummer | 2 |
Land/Område | Tjekkiet |
By | Prag |
Periode | 27/08/2013 → 30/08/2013 |