A voting scheme to detect semantic underspecification

Hector Martinez Alonso, Nuria Bel, Bolette Sandford Pedersen

Abstract

The following work describes a voting system to automatically classify the sense selection of the complex types Location/Organization and Container/Content, which depend on regular polysemy, as described by the Generative Lexicon (Pustejovsky, 1995) . This kind of sense alternations very often presents semantic underspecificacion between its two possible selected senses. This kind of underspecification is not traditionally contemplated in word sense disambiguation systems, as disambiguation systems are still coping with the need of a representation and recognition of underspecification (Pustejovsky, 2009) The data are characterized by the morphosyntactic and lexical enviroment of the headwords and provided as input for a classifier. The baseline decision tree classifier is compared against an eight-member voting scheme obtained from variants of the training data generated by modifications on the class representation and from two different classification algorithms, namely decision trees and k-nearest neighbors. The voting system improves the accuracy for the non-underspecified senses, but the underspecified sense remains difficult to identify.

Original languageEnglish
Title of host publicationProceedings of the Eighth International Conference on Language Resources and Evaluation
Number of pages6
Place of PublicationIstanbul
PublisherEuropean Language Resources Association
Publication date2012
Pages569-575
ISBN (Electronic)978-2-9517408-7-7
Publication statusPublished - 2012
EventInternational Conference on Language Resources and Evaluation - Istanbul, Turkey
Duration: 23 May 201225 May 2012
Conference number: 8

Conference

ConferenceInternational Conference on Language Resources and Evaluation
Number8
Country/TerritoryTurkey
CityIstanbul
Period23/05/201225/05/2012

Fingerprint

Dive into the research topics of 'A voting scheme to detect semantic underspecification'. Together they form a unique fingerprint.

Cite this