An uncertainty-aware query selection model for evaluation of IR systems

Mehdi Hosseini; Ingemar J Cox; Natasa Milic-Frayling; Milad Shokouhi; Emine Yilmaz

An uncertainty-aware query selection model for evaluation of IR systems

Mehdi Hosseini, Ingemar J Cox, Natasa Milic-Frayling, Milad Shokouhi, Emine Yilmaz

15 Citations (Scopus)

Abstract

We propose a mathematical framework for query selection as a mechanism for reducing the cost of constructing information retrieval test collections. In particular, our mathematical formulation explicitly models the uncertainty in the retrieval effectiveness metrics that is introduced by the absence of relevance judgments. Since the optimization problem is computationally intractable, we devise an adaptive query selection algorithm, referred to as Adaptive, that provides an approximate solution. Adaptive selects queries iteratively and assumes that no relevance judgments are available for the query under consideration. Once a query is selected, the associated relevance assessments are acquired and then used to aid the selection of subsequent queries. We demonstrate the effectiveness of the algorithm on two TREC test collections as well as a test collection of an online search engine with 1000 queries. Our experimental results show that the queries chosen by Adaptive produce reliable performance ranking of systems. The ranking is better correlated with the actual systems ranking than the rankings produced by queries that were selected using the considered baseline methods.

Original language	English
Title of host publication	Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Number of pages	10
Publication date	2012
Pages	901-910
Publication status	Published - 2012
Externally published	Yes

Cite this

An uncertainty-aware query selection model for evaluation of IR systems. / Hosseini, Mehdi; Cox, Ingemar J; Milic-Frayling, Natasa et al.
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval. 2012. p. 901-910.

Research output: Chapter in Book/Report/Conference proceeding › Book chapter › Research › peer-review

@inbook{a19454c7a6e540b0a5d2d7fb080886b6,

title = "An uncertainty-aware query selection model for evaluation of IR systems",

abstract = "We propose a mathematical framework for query selection as a mechanism for reducing the cost of constructing information retrieval test collections. In particular, our mathematical formulation explicitly models the uncertainty in the retrieval effectiveness metrics that is introduced by the absence of relevance judgments. Since the optimization problem is computationally intractable, we devise an adaptive query selection algorithm, referred to as Adaptive, that provides an approximate solution. Adaptive selects queries iteratively and assumes that no relevance judgments are available for the query under consideration. Once a query is selected, the associated relevance assessments are acquired and then used to aid the selection of subsequent queries. We demonstrate the effectiveness of the algorithm on two TREC test collections as well as a test collection of an online search engine with 1000 queries. Our experimental results show that the queries chosen by Adaptive produce reliable performance ranking of systems. The ranking is better correlated with the actual systems ranking than the rankings produced by queries that were selected using the considered baseline methods.",

author = "Mehdi Hosseini and Cox, {Ingemar J} and Natasa Milic-Frayling and Milad Shokouhi and Emine Yilmaz",

year = "2012",

language = "English",

pages = "901--910",

booktitle = "Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval",

}

TY - CHAP

T1 - An uncertainty-aware query selection model for evaluation of IR systems

AU - Hosseini, Mehdi

AU - Cox, Ingemar J

AU - Milic-Frayling, Natasa

AU - Shokouhi, Milad

AU - Yilmaz, Emine

PY - 2012

Y1 - 2012

N2 - We propose a mathematical framework for query selection as a mechanism for reducing the cost of constructing information retrieval test collections. In particular, our mathematical formulation explicitly models the uncertainty in the retrieval effectiveness metrics that is introduced by the absence of relevance judgments. Since the optimization problem is computationally intractable, we devise an adaptive query selection algorithm, referred to as Adaptive, that provides an approximate solution. Adaptive selects queries iteratively and assumes that no relevance judgments are available for the query under consideration. Once a query is selected, the associated relevance assessments are acquired and then used to aid the selection of subsequent queries. We demonstrate the effectiveness of the algorithm on two TREC test collections as well as a test collection of an online search engine with 1000 queries. Our experimental results show that the queries chosen by Adaptive produce reliable performance ranking of systems. The ranking is better correlated with the actual systems ranking than the rankings produced by queries that were selected using the considered baseline methods.

AB - We propose a mathematical framework for query selection as a mechanism for reducing the cost of constructing information retrieval test collections. In particular, our mathematical formulation explicitly models the uncertainty in the retrieval effectiveness metrics that is introduced by the absence of relevance judgments. Since the optimization problem is computationally intractable, we devise an adaptive query selection algorithm, referred to as Adaptive, that provides an approximate solution. Adaptive selects queries iteratively and assumes that no relevance judgments are available for the query under consideration. Once a query is selected, the associated relevance assessments are acquired and then used to aid the selection of subsequent queries. We demonstrate the effectiveness of the algorithm on two TREC test collections as well as a test collection of an online search engine with 1000 queries. Our experimental results show that the queries chosen by Adaptive produce reliable performance ranking of systems. The ranking is better correlated with the actual systems ranking than the rankings produced by queries that were selected using the considered baseline methods.

M3 - Book chapter

SP - 901

EP - 910

BT - Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval

ER -

An uncertainty-aware query selection model for evaluation of IR systems

Abstract

Fingerprint

Cite this