Cross-domain answer ranking using importance sampling

Anders Trærup Johannsen, Anders Søgaard

Abstract

We consider the problem of learning how to rank answers across domains in community question answering using stylistic features. Our main contribution is an importance sampling technique for selecting training data per answer thread. Our approach is evaluated across 30 community sites and shown to be significantly better than random sampling. We show that the most useful features in our model relate to answer length and overlap with question.

OriginalsprogEngelsk
TitelThe 6th International Joint Conference on Natural Language Processing (IJCNLP)
ForlagAssociation for Computational Linguistics
Publikationsdato2013
Sider987-991
ISBN (Elektronisk)978-4-9907348-0-0
StatusUdgivet - 2013
BegivenhedThe 6th International Joint Conference on Natural Language Processing (IJCNLP): IJCNLP - Nagoya, Japan
Varighed: 14 okt. 201318 maj 2014
Konferencens nummer: 6

Konference

KonferenceThe 6th International Joint Conference on Natural Language Processing (IJCNLP)
Nummer6
Land/OmrådeJapan
ByNagoya
Periode14/10/201318/05/2014

Fingeraftryk

Dyk ned i forskningsemnerne om 'Cross-domain answer ranking using importance sampling'. Sammen danner de et unikt fingeraftryk.

Citationsformater