Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction

Barbara Plank; Alessandro Moschitti

Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction

Barbara Plank, Alessandro Moschitti

Centre for Language Technology

73 Citations (Scopus)

Abstract

Relation Extraction (RE) is the task of extracting semantic relationships between entities in text. Recent studies on relation extraction are mostly supervised. The clear drawback of supervised methods is the need of training data: labeled data is expensive to obtain, and there is often a mismatch between the training data and the data the system will be applied to. This is the problem of domain adaptation. In this paper, we propose to combine (i) term generalization approaches such as word clustering and latent semantic analysis (LSA) and (ii) structured kernels to improve the adaptability of relation extractors to new text genres/domains. The empirical evaluation on ACE 2005 domains shows that a suitable combination of syntax and lexical generalization is very promising for domain adaptation.

Original language	English
Title of host publication	Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL)
Publisher	Association for Computational Linguistics
Publication date	2013
Pages	1498-1507
ISBN (Electronic)	978-1-62748-975-1
Publication status	Published - 2013

Cite this

Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction. / Plank, Barbara; Moschitti, Alessandro.

Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, 2013. p. 1498-1507.

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

@inproceedings{2cb2b4e8f8fc44ce8ed52fb2c44e02e5,

title = "Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction",

abstract = "Relation Extraction (RE) is the task of extracting semantic relationships between entities in text. Recent studies on relation extraction are mostly supervised. The clear drawback of supervised methods is the need of training data: labeled data is expensive to obtain, and there is often a mismatch between the training data and the data the system will be applied to. This is the problem of domain adaptation. In this paper, we propose to combine (i) term generalization approaches such as word clustering and latent semantic analysis (LSA) and (ii) structured kernels to improve the adaptability of relation extractors to new text genres/domains. The empirical evaluation on ACE 2005 domains shows that a suitable combination of syntax and lexical generalization is very promising for domain adaptation.",

author = "Barbara Plank and Alessandro Moschitti",

year = "2013",

language = "English",

pages = "1498--1507",

booktitle = "Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL)",

publisher = "Association for Computational Linguistics",

}

TY - GEN

T1 - Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction

AU - Plank, Barbara

AU - Moschitti, Alessandro

PY - 2013

Y1 - 2013

N2 - Relation Extraction (RE) is the task of extracting semantic relationships between entities in text. Recent studies on relation extraction are mostly supervised. The clear drawback of supervised methods is the need of training data: labeled data is expensive to obtain, and there is often a mismatch between the training data and the data the system will be applied to. This is the problem of domain adaptation. In this paper, we propose to combine (i) term generalization approaches such as word clustering and latent semantic analysis (LSA) and (ii) structured kernels to improve the adaptability of relation extractors to new text genres/domains. The empirical evaluation on ACE 2005 domains shows that a suitable combination of syntax and lexical generalization is very promising for domain adaptation.

AB - Relation Extraction (RE) is the task of extracting semantic relationships between entities in text. Recent studies on relation extraction are mostly supervised. The clear drawback of supervised methods is the need of training data: labeled data is expensive to obtain, and there is often a mismatch between the training data and the data the system will be applied to. This is the problem of domain adaptation. In this paper, we propose to combine (i) term generalization approaches such as word clustering and latent semantic analysis (LSA) and (ii) structured kernels to improve the adaptability of relation extractors to new text genres/domains. The empirical evaluation on ACE 2005 domains shows that a suitable combination of syntax and lexical generalization is very promising for domain adaptation.

M3 - Article in proceedings

SP - 1498

EP - 1507

BT - Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL)

PB - Association for Computational Linguistics

ER -

Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction

Abstract

Fingerprint

Cite this