Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens

Marek Rei, Anders Søgaard

    16 Citationer (Scopus)

    Abstract

    Can attention-or gradient-based visualization techniques be used to infer token-level labels for binary sequence tagging problems, using networks trained only on sentence-level labels? We construct a neural network architecture based on soft attention, train it as a binary sentence classifier and evaluate against tokenlevel annotation on four different datasets. Inferring token labels from a network provides a method for quantitatively evaluating what the model is learning, along with generating useful feedback in assistance systems. Our results indicate that attention-based methods are able to predict token-level labels more accurately, compared to gradient-based methods, sometimes even rivaling the supervised oracle network.

    OriginalsprogEngelsk
    TitelProceedings, 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies : (Long Ppaers)
    Vol/bind1
    ForlagAssociation for Computational Linguistics
    Publikationsdato2018
    Sider293–302
    StatusUdgivet - 2018
    Begivenhed16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - New Orleans, USA
    Varighed: 1 jun. 20186 jun. 2018

    Konference

    Konference16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
    Land/OmrådeUSA
    ByNew Orleans
    Periode01/06/201806/06/2018

    Fingeraftryk

    Dyk ned i forskningsemnerne om 'Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens'. Sammen danner de et unikt fingeraftryk.

    Citationsformater