Experiments with crowdsourced re-annotation of a POS tagging data set

Dirk Hovy, Barbara Plank, Anders Søgaard

21 Citations (Scopus)

Abstract

Crowdsourcing lets us collect multiple annotations for an item from several annotators. Typically, these are annotations for non-sequential classification tasks. While there has been some work on crowdsourcing named entity annotations, researchers have largely assumed that syntactic tasks such as part-of-speech (POS) tagging cannot be crowdsourced. This paper shows that workers can actually annotate sequential data almost as well as experts. Further, we show that the models learned from crowdsourced annotations fare as well as the models learned from expert annotations in downstream tasks.
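The abstract does not specify how the multiple worker annotations per item are combined. Purely as an illustrative sketch (not the aggregation scheme used in the paper), per-token majority voting over several workers' tag sequences could look like the following Python snippet; the function name, tag set, and example data are all hypothetical.

```python
from collections import Counter

def majority_vote(annotations):
    """Aggregate multiple crowdsourced tag sequences for one sentence.

    `annotations` is a list of tag sequences (one per annotator), all of
    the same length. Each token position receives its most frequent tag;
    ties are broken arbitrarily by Counter ordering.
    """
    aggregated = []
    for position_tags in zip(*annotations):
        most_common_tag, _ = Counter(position_tags).most_common(1)[0]
        aggregated.append(most_common_tag)
    return aggregated

# Hypothetical example: three workers tag the sentence "Dogs bark loudly"
worker_tags = [
    ["NOUN", "VERB", "ADV"],
    ["NOUN", "VERB", "ADJ"],
    ["NOUN", "NOUN", "ADV"],
]
print(majority_vote(worker_tags))  # ['NOUN', 'VERB', 'ADV']
```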

Original language: English
Title: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Place of publication: Baltimore, Maryland
Publisher: Association for Computational Linguistics
Publication date: June 2014
Pages: 377-382
Status: Published - June 2014
