Simple semi-supervised training of part-of-speech taggers

43 Citations (Scopus)

Abstract

Most attempts to train part-of-speech taggers on a mixture of labeled and unlabeled data have failed. In this work stacked learning is used to reduce tagging to a classification task. This simplifies semisupervised training considerably. Our prefered semi-supervised method combines tri-training (Li and Zhou, 2005) and disagreement-based co-training. On the Wall Street Journal, we obtain an error reduction of 4.2% with SVMTool (Gimenez and Marquez, 2004).

Original languageEnglish
Title of host publicationProceedings of the 48th Annual Meeting of the Association for Computational Linguistics
PublisherAssociation for Computational Linguistics
Publication date2010
ISBN (Electronic) 978-1-932432-67-1
Publication statusPublished - 2010

Fingerprint

Dive into the research topics of 'Simple semi-supervised training of part-of-speech taggers'. Together they form a unique fingerprint.

Cite this