Abstract
Most computational sociolinguistics studies have focused on phonological and lexical variation. We present the first large-scale study of syntactic variation among demographic groups (age and gender) across several languages. We harvest data from online user-review sites and parse it with universal dependencies. We show that several age and gender-specific variations hold across languages, for example that women are more likely to use VP conjunctions.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the Nineteenth Conference on Computational Natural Language Learning : CoNLL |
Antal sider | 10 |
Forlag | Association for Computational Linguistics |
Publikationsdato | 2015 |
Sider | 103-112 |
ISBN (Trykt) | 978-1-941643-77-8 |
Status | Udgivet - 2015 |