A voting scheme to detect semantic underspecification

Hector Martinez Alonso, Nuria Bel, Bolette Sandford Pedersen

Abstract

The following work describes a voting system to automatically classify the sense selection of the complex types Location/Organization and Container/Content, which depend on regular polysemy, as described by the Generative Lexicon (Pustejovsky, 1995) . This kind of sense alternations very often presents semantic underspecificacion between its two possible selected senses. This kind of underspecification is not traditionally contemplated in word sense disambiguation systems, as disambiguation systems are still coping with the need of a representation and recognition of underspecification (Pustejovsky, 2009) The data are characterized by the morphosyntactic and lexical enviroment of the headwords and provided as input for a classifier. The baseline decision tree classifier is compared against an eight-member voting scheme obtained from variants of the training data generated by modifications on the class representation and from two different classification algorithms, namely decision trees and k-nearest neighbors. The voting system improves the accuracy for the non-underspecified senses, but the underspecified sense remains difficult to identify.

OriginalsprogEngelsk
TitelProceedings of the Eighth International Conference on Language Resources and Evaluation
Antal sider6
UdgivelsesstedIstanbul
ForlagEuropean Language Resources Association
Publikationsdato2012
Sider569-575
ISBN (Elektronisk)978-2-9517408-7-7
StatusUdgivet - 2012
BegivenhedInternational Conference on Language Resources and Evaluation - Istanbul, Tyrkiet
Varighed: 23 maj 201225 maj 2012
Konferencens nummer: 8

Konference

KonferenceInternational Conference on Language Resources and Evaluation
Nummer8
Land/OmrådeTyrkiet
ByIstanbul
Periode23/05/201225/05/2012

Fingeraftryk

Dyk ned i forskningsemnerne om 'A voting scheme to detect semantic underspecification'. Sammen danner de et unikt fingeraftryk.

Citationsformater