Abstract
This work describes a voting system that automatically classifies the sense selection of the complex types Location/Organization and Container/Content, which depend on regular polysemy, as described by the Generative Lexicon (Pustejovsky, 1995). These sense alternations very often present semantic underspecification between their two possible selected senses. This kind of underspecification is not traditionally contemplated in word sense disambiguation systems, which are still coping with the need for a representation and recognition of underspecification (Pustejovsky, 2009). The data are characterized by the morphosyntactic and lexical environment of the headwords and provided as input to a classifier. The baseline decision tree classifier is compared against an eight-member voting scheme obtained from variants of the training data, generated by modifications of the class representation, and from two different classification algorithms, namely decision trees and k-nearest neighbors. The voting system improves the accuracy for the non-underspecified senses, but the underspecified sense remains difficult to identify.
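The abstract does not say how the eight members' predictions are combined, so the following is only a minimal sketch of one plausible scheme: plain majority voting over per-member labels. The function name `majority_vote`, the label strings, and the choice to fall back to the underspecified label on a tie are all assumptions made for illustration, not details taken from the paper.

```python
from collections import Counter


def majority_vote(predictions, tie_break="underspecified"):
    """Return the label most members voted for.

    If the two most frequent labels are tied, return `tie_break`
    (an assumed policy, not one described in the abstract).
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    counts = Counter(predictions).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return tie_break
    return counts[0][0]


# Hypothetical votes from eight ensemble members for one headword occurrence.
votes = [
    "location", "location", "organization", "location",
    "underspecified", "location", "organization", "location",
]
print(majority_vote(votes))
```

In a setup like this, each member would be a decision tree or k-nearest-neighbors model trained on a different variant of the class representation, and the vote would be taken per headword occurrence.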
Original language | English |
---|---|
Title | Proceedings of the Eighth International Conference on Language Resources and Evaluation |
Number of pages | 6 |
Place of publication | Istanbul |
Publisher | European Language Resources Association |
Publication date | 2012 |
Pages | 569-575 |
ISBN (Electronic) | 978-2-9517408-7-7 |
Status | Published - 2012 |
Event | International Conference on Language Resources and Evaluation - Istanbul, Turkey Duration: 23 May 2012 → 25 May 2012 Conference number: 8 |
Conference
Conference | International Conference on Language Resources and Evaluation |
---|---|
Number | 8 |
Country/Territory | Turkey |
City | Istanbul |
Period | 23/05/2012 → 25/05/2012 |