Disambiguating explicit discourse connectives without oracles

Anders Trærup Johannsen, Anders Søgaard

Abstract

Deciding whether a word serves a discourse function in context is a prerequisite for discourse processing, and the performance of this subtask bounds performance on subsequent tasks. Pitler and Nenkova (2009) report 96.29% accuracy (F1 94.19%) relying on features extracted from gold-standard parse trees. This figure is an average over several connectives, some of which are extremely hard to classify. More importantly, performance drops considerably in the absence of an oracle providing gold-standard features. We show that a very simple model using only lexical and predicted part-of-speech features actually performs slightly better than Pitler and Nenkova (2009) and not significantly different from a state-of-the-art model, which combines lexical, part-of-speech, and parse features.

OriginalsprogEngelsk
TitelThe 6th International Joint Conference on Natural Language Processing (IJCNLP)
ForlagAssociation for Computational Linguistics
Publikationsdato2013
Sider997-1001
ISBN (Elektronisk)978-4-9907348-0-0
StatusUdgivet - 2013

Fingeraftryk

Dyk ned i forskningsemnerne om 'Disambiguating explicit discourse connectives without oracles'. Sammen danner de et unikt fingeraftryk.

Citationsformater