Predicting word sense annotation agreement

Hector Martinez Alonso, Anders Trærup Johannsen, Oier Lopez de Lacalle, Eneko Agirre

Abstract

High agreement is a common objective when annotating data for word senses. However, a number of factors make perfect agreement impossible, e.g. the limitations of sense inventories, the difficulty of the examples or the interpretation preferences of the annotators. Estimating potential agreement is thus a relevant task to supplement the evaluation of sense annotations. In this article we propose two methods to predict agreement on word-annotation instances. We experiment with a continuous representation and a three-way discretization of observed agreement. In spite of the difficulty of the task, we find that different levels of agreement can be identified-in particular, low-agreement examples are easier to identify.

Original languageEnglish
Title of host publicationLSDSem 2015 : Linking Models of Lexical, Sentential and Discourse-level Semantics
Number of pages6
PublisherAssociation for Computational Linguistics
Publication date2015
Pages89-94
ISBN (Print)978-1-941643-32-7
Publication statusPublished - 2015

Fingerprint

Dive into the research topics of 'Predicting word sense annotation agreement'. Together they form a unique fingerprint.

Cite this