Abstract
The following work describes a voting system to automatically classify the sense selection of the complex types Location/Organization and Container/Content, which depend on regular polysemy, as described by the Generative Lexicon (Pustejovsky, 1995) . This kind of sense alternations very often presents semantic underspecificacion between its two possible selected senses. This kind of underspecification is not traditionally contemplated in word sense disambiguation systems, as disambiguation systems are still coping with the need of a representation and recognition of underspecification (Pustejovsky, 2009) The data are characterized by the morphosyntactic and lexical enviroment of the headwords and provided as input for a classifier. The baseline decision tree classifier is compared against an eight-member voting scheme obtained from variants of the training data generated by modifications on the class representation and from two different classification algorithms, namely decision trees and k-nearest neighbors. The voting system improves the accuracy for the non-underspecified senses, but the underspecified sense remains difficult to identify.
Original language | English |
---|---|
Title of host publication | Proceedings of the Eighth International Conference on Language Resources and Evaluation |
Number of pages | 6 |
Place of Publication | Istanbul |
Publisher | European Language Resources Association |
Publication date | 2012 |
Pages | 569-575 |
ISBN (Electronic) | 978-2-9517408-7-7 |
Publication status | Published - 2012 |
Event | International Conference on Language Resources and Evaluation - Istanbul, Turkey Duration: 23 May 2012 → 25 May 2012 Conference number: 8 |
Conference
Conference | International Conference on Language Resources and Evaluation |
---|---|
Number | 8 |
Country/Territory | Turkey |
City | Istanbul |
Period | 23/05/2012 → 25/05/2012 |