Abstract
Our aim is to develop principled methods for sense clustering which can make existing lexical resources practically useful in NLP - not too fine-grained to be operational and yet fine-grained enough to be worth the trouble. Where traditional dictionaries have a highly structured sense inventory typically describing the vocabulary by means of main- and subsenses, wordnets are generally fine-grained and unstructured. We present a series of clustering and annotation experiments with 10 of the most polysemous nouns in Danish. We combine the structured information of a traditional Danish dictionary with the ontological types found in the Danish wordnet, DanNet. This constellation enables us to automatically cluster senses in a principled way and improve inter-annotator agreement and WSD performance.
Original language | English |
---|---|
Title | Proceedings of Global WordNet Conference 2018 |
Number of pages | 6 |
Place of publication | Singapore |
Publisher | Global WordNet Association |
Publication date | 2018 |
ISBN (Electronic) | 978-981-11-7087-4 |
Status | Published - 2018 |
Event | Global WordNet Conference - Singapore, Singapore. Duration: 8 Jan 2018 → 12 Jan 2018 |
Conference
Conference | Global WordNet Conference |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 8 Jan 2018 → 12 Jan 2018 |