Abstract
Deciding whether a word serves a discourse function in context is a prerequisite for discourse processing, and the performance of this subtask bounds performance on subsequent tasks. Pitler and Nenkova (2009) report 96.29% accuracy (F1 94.19%) relying on features extracted from gold-standard parse trees. This figure is an average over several connectives, some of which are extremely hard to classify. More importantly, performance drops considerably in the absence of an oracle providing gold-standard features. We show that a very simple model using only lexical and predicted part-of-speech features actually performs slightly better than Pitler and Nenkova (2009) and not significantly different from a state-of-the-art model, which combines lexical, part-of-speech, and parse features.
Original language | English |
---|---|
Title of host publication | The 6th International Joint Conference on Natural Language Processing (IJCNLP) |
Publisher | Association for Computational Linguistics |
Publication date | 2013 |
Pages | 997-1001 |
ISBN (Electronic) | 978-4-9907348-0-0 |
Publication status | Published - 2013 |