Abstract
We show that text readability prediction improves significantly from hard parameter sharing with models predicting first pass duration, total fixation duration and regression duration. Specifically, we induce multi-task Multilayer Perceptrons and Logistic Regression models over sentence representations that capture various aggregate statistics, from two different text readability corpora for English, as well as the Dundee eye-tracking corpus. Our approach leads to significant improvements over Single task learning and over previous systems. In addition, our improvements are consistent across train sample sizes, making our approach especially applicable to small datasets.
Original language | English |
---|---|
Title of host publication | Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications |
Publisher | Association for Computational Linguistics |
Publication date | 2017 |
Pages | 438-443 |
ISBN (Print) | 978-1-945626-85-2 |
Publication status | Published - 2017 |
Event | 12th Workshop on Innovative Use of NLP for Building Educational Applications - Copenhagen, Denmark Duration: 8 Sept 2017 → 8 Sept 2017 |
Conference
Conference | 12th Workshop on Innovative Use of NLP for Building Educational Applications |
---|---|
Country/Territory | Denmark |
City | Copenhagen |
Period | 08/09/2017 → 08/09/2017 |