Simple readable sub-sentences

Sigrid Klerke, Anders Søgaard

Abstract

We present experiments using a new unsupervised approach to automatic text simplification, which builds on sampling and ranking via a loss function informed by readability research. The main idea is that a loss function can distinguish good simplification candidates among randomly sampled sub-sentences of the input sentence. Our approach is rated as equally grammatical and beginner reader appropriate as a supervised SMT-based baseline system by native speakers, but our setup performs more radical changes that better resembles the variation observed in human generated simplifications.

OriginalsprogEngelsk
TitelThe 51st Annual Meeting of the Association for Computational Linguistics (ACL), Student Research Workshop
ForlagAssociation for Computational Linguistics
Publikationsdato2013
Sider142-149
ISBN (Elektronisk)978-1-62748-976-8
StatusUdgivet - 2013

Fingeraftryk

Dyk ned i forskningsemnerne om 'Simple readable sub-sentences'. Sammen danner de et unikt fingeraftryk.

Citationsformater