Cross-lingual Syntactic Variation over Age and Gender

Anders Trærup Johannsen, Dirk Hovy, Anders Søgaard

31 Citations (Scopus)

Abstract

Most computational sociolinguistics studies have focused on phonological and lexical variation. We present the first large-scale study of syntactic variation among demographic groups (age and gender) across several languages. We harvest data from online user-review sites and parse it using Universal Dependencies. We show that several age- and gender-specific variations hold across languages; for example, women are more likely to use VP conjunctions.

Original language: English
Title of host publication: Proceedings of the Nineteenth Conference on Computational Natural Language Learning: CoNLL
Number of pages: 10
Publisher: Association for Computational Linguistics
Publication date: 2015
Pages: 103-112
ISBN (Print): 978-1-941643-77-8
Publication status: Published - 2015
