Abstract
Online ranker evaluation is a key challenge in information retrieval. An important task in the online evaluation of rankers is using implicit user feedback to infer preferences between rankers. Interleaving methods have been found to be efficient and sensitive, i.e., they can quickly detect even small differences in quality. It has recently been shown that multileaving methods exhibit similar sensitivity but can be more efficient than interleaving methods. This paper presents empirical results demonstrating that existing multileaving methods either do not scale well with the number of rankers or, more problematically, can produce outcomes that differ substantially from those of evaluation measures such as NDCG. The latter problem is caused by the fact that they do not correctly account for the similarities that can occur between the rankers being multileaved. We propose a new multileaving method that handles this problem and demonstrate that it substantially outperforms existing methods, in some cases reducing errors by as much as 50%.
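For context, the multileaving methods the abstract refers to combine the output of several rankers into a single result list and credit user clicks back to the contributing rankers. The sketch below illustrates one such existing scheme, team-draft multileaving, not the method proposed in the paper; the function names, tie-breaking, and click handling are illustrative assumptions. It also hints at the similarity issue the abstract raises: when two rankers rank the same document highly, only the team that happened to contribute it receives credit for a click on it.

```python
import random

def team_draft_multileave(rankings, list_length):
    """Team-draft style multileaving (illustrative sketch).

    rankings: one document list per ranker, best document first.
    Returns the combined list and a parallel list of team ids
    recording which ranker contributed each slot.
    """
    combined, teams, chosen = [], [], set()
    while len(combined) < list_length:
        added = False
        order = list(range(len(rankings)))
        random.shuffle(order)  # random team order in each round
        for r in order:
            if len(combined) >= list_length:
                break
            # take ranker r's highest-ranked document not yet in the list
            doc = next((d for d in rankings[r] if d not in chosen), None)
            if doc is not None:
                combined.append(doc)
                teams.append(r)
                chosen.add(doc)
                added = True
        if not added:
            break  # all rankings exhausted
    return combined, teams

def credit_clicks(teams, clicked_positions, n_rankers):
    """Credit each click to the ranker whose team owns the clicked slot."""
    credit = [0] * n_rankers
    for pos in clicked_positions:
        credit[teams[pos]] += 1
    return credit

# Hypothetical usage: rankers 0 and 1 are very similar, ranker 2 differs.
rankings = [["a", "b", "c", "d"], ["a", "c", "d", "b"], ["e", "a", "b", "c"]]
combined, teams = team_draft_multileave(rankings, 4)
print(credit_clicks(teams, clicked_positions=[0, 2], n_rankers=len(rankings)))
```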
Original language | English |
---|---|
Title | Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval: SIGIR '16 |
Number of pages | 4 |
Publisher | Association for Computing Machinery |
Publication date | 7 Jul 2016 |
Pages | 745-748 |
ISBN (Print) | 978-1-4503-4069-4 |
DOI | |
Status | Published - 7 Jul 2016 |
Event | International ACM SIGIR conference on Research and Development in Information Retrieval 2016: SIGIR '16 - Pisa, Italy. Duration: 17 Jul 2016 → 21 Jul 2016. Conference number: 39. http://sigir.org/sigir2016/ |
Conference
Conference | International ACM SIGIR conference on Research and Development in Information Retrieval 2016 |
---|---|
Number | 39 |
Country/Territory | Italy |
City | Pisa |
Period | 17/07/2016 → 21/07/2016 |
Internet address | http://sigir.org/sigir2016/ |