Abstract
High agreement is a common objective when annotating data for word senses. However, a number of factors make perfect agreement impossible, e.g. the limitations of sense inventories, the difficulty of the examples or the interpretation preferences of the annotators. Estimating potential agreement is thus a relevant task to supplement the evaluation of sense annotations. In this article we propose two methods to predict agreement on word-annotation instances. We experiment with a continuous representation and a three-way discretization of observed agreement. In spite of the difficulty of the task, we find that different levels of agreement can be identified-in particular, low-agreement examples are easier to identify.
Original language | English |
---|---|
Title of host publication | LSDSem 2015 : Linking Models of Lexical, Sentential and Discourse-level Semantics |
Number of pages | 6 |
Publisher | Association for Computational Linguistics |
Publication date | 2015 |
Pages | 89-94 |
ISBN (Print) | 978-1-941643-32-7 |
Publication status | Published - 2015 |