Abstract
Most computational sociolinguistics studies have focused on phonological and lexical variation. We present the first large-scale study of syntactic variation among demographic groups (age and gender) across several languages. We harvest data from online user-review sites and parse it with universal dependencies. We show that several age and gender-specific variations hold across languages, for example that women are more likely to use VP conjunctions.
Original language | English |
---|---|
Title of host publication | Proceedings of the Nineteenth Conference on Computational Natural Language Learning : CoNLL |
Number of pages | 10 |
Publisher | Association for Computational Linguistics |
Publication date | 2015 |
Pages | 103-112 |
ISBN (Print) | 978-1-941643-77-8 |
Publication status | Published - 2015 |