Measuring the variability in effectiveness of a retrieval system

Mehdi Hosseini; Ingemar J Cox; Natasa Millic-Frayling; Vishwa Vinay

Measuring the variability in effectiveness of a retrieval system

Mehdi Hosseini, Ingemar J Cox, Natasa Millic-Frayling, Vishwa Vinay

Abstract

A typical evaluation of a retrieval system involves computing an effectiveness metric, e.g. average precision, for each topic of a test collection and then using the average of the metric, e.g. mean average precision, to express the overall effectiveness. However, averages do not capture all the important aspects of effectiveness and, used alone, may not be an informative measure of systems' effectiveness. Indeed, in addition to the average, we need to consider the variation of effectiveness across topics. We refer to this variation as the variability in effectiveness. In this paper we explore how the variance of a metric can be used as a measure of variability. We define a variability metric, and illustrate how the metric can be used in practice.

Original language	English
Title of host publication	Advances in Multidisciplinary Retrieval
Number of pages	14
Publisher	Springer Science+Business Media
Publication date	2010
Pages	70-83
Publication status	Published - 2010
Externally published	Yes

Cite this

@inbook{c4d9debab2b147c5bdf4089838d33490,

title = "Measuring the variability in effectiveness of a retrieval system",

abstract = "A typical evaluation of a retrieval system involves computing an effectiveness metric, e.g. average precision, for each topic of a test collection and then using the average of the metric, e.g. mean average precision, to express the overall effectiveness. However, averages do not capture all the important aspects of effectiveness and, used alone, may not be an informative measure of systems' effectiveness. Indeed, in addition to the average, we need to consider the variation of effectiveness across topics. We refer to this variation as the variability in effectiveness. In this paper we explore how the variance of a metric can be used as a measure of variability. We define a variability metric, and illustrate how the metric can be used in practice.",

author = "Mehdi Hosseini and Cox, {Ingemar J} and Natasa Millic-Frayling and Vishwa Vinay",

year = "2010",

language = "English",

pages = "70--83",

booktitle = "Advances in Multidisciplinary Retrieval",

publisher = "Springer Science+Business Media",

address = "Singapore",

}

TY - CHAP

T1 - Measuring the variability in effectiveness of a retrieval system

AU - Hosseini, Mehdi

AU - Cox, Ingemar J

AU - Millic-Frayling, Natasa

AU - Vinay, Vishwa

PY - 2010

Y1 - 2010

N2 - A typical evaluation of a retrieval system involves computing an effectiveness metric, e.g. average precision, for each topic of a test collection and then using the average of the metric, e.g. mean average precision, to express the overall effectiveness. However, averages do not capture all the important aspects of effectiveness and, used alone, may not be an informative measure of systems' effectiveness. Indeed, in addition to the average, we need to consider the variation of effectiveness across topics. We refer to this variation as the variability in effectiveness. In this paper we explore how the variance of a metric can be used as a measure of variability. We define a variability metric, and illustrate how the metric can be used in practice.

AB - A typical evaluation of a retrieval system involves computing an effectiveness metric, e.g. average precision, for each topic of a test collection and then using the average of the metric, e.g. mean average precision, to express the overall effectiveness. However, averages do not capture all the important aspects of effectiveness and, used alone, may not be an informative measure of systems' effectiveness. Indeed, in addition to the average, we need to consider the variation of effectiveness across topics. We refer to this variation as the variability in effectiveness. In this paper we explore how the variance of a metric can be used as a measure of variability. We define a variability metric, and illustrate how the metric can be used in practice.

M3 - Book chapter

SP - 70

EP - 83

BT - Advances in Multidisciplinary Retrieval

PB - Springer Science+Business Media

ER -

Measuring the variability in effectiveness of a retrieval system

Abstract

Fingerprint

Cite this