User Perspectives on Query Difficulty

Christina Lioma; Birger Larsen; Hinrich Schütze

doi:10.1007/978-3-642-23318-0_3

2 Citations (Scopus)

586 Downloads (Pure)

Abstract

The difficulty of a user query can affect the performance of Information Retrieval (IR) systems. What makes a query difficult and how one may predict this is an active research area, focusing mainly on factors relating to the retrieval algorithm, to the properties of the retrieval data, or to statistical and linguistic features of the queries that may render them difficult. This work addresses query difficulty from a different angle, namely the users’ own perspectives on query difficulty. Two research questions are asked: (1) Are users aware that the query they submit to an IR system may be difficult for the system to address? (2) Are users aware of specific features in their query (e.g., domain-specificity, vagueness) that may render their query difficult for an IR system to address? A study of 420 queries from a Web search engine query log that are pre-categorised as easy, medium, hard by TREC based on system performance, reveals an interesting finding: users do not seem to reliably assess which query might be difficult; however, their assessments of which query features might render queries difficult are notably more accurate. Following this, a formal approach is presented for synthesising the user-assessed causes of query difficulty through opinion fusion into an overall assessment of query difficulty. The resulting assessments of query difficulty are found to agree notably more to the TREC categories than the direct user assessments.

Original language	English
Title of host publication	Advances in Information Retrieval Theory: Lecture Notes in Computer Science, 2011, Volume 6931/2011, 3-14, DOI: 10.1007/978-3-642-23318-0_3
Volume	6931/2011
Publisher	Springer
Publication date	2011
Pages	3-14
DOI	https://doi.org/10.1007/978-3-642-23318-0_3
Status	Published - 2011
Published externally	Yes
Event	International Conference on the Theory of Information Retrieval - Bertinoro, Italy. Duration: 12 Sep 2011 → 14 Sep 2011

Conference

Conference	International Conference on the Theory of Information Retrieval
Country/Territory	Italy
City	Bertinoro
Period	12/09/2011 → 14/09/2011

Name	Lecture notes in computer science

Access to document

10.1007/978-3-642-23318-0_3

ictir2011 (submitted manuscript, 407 KB)

http://www.springerlink.com/content/055242814q11h840/

Citation formats

User Perspectives on Query Difficulty. / Lioma, Christina; Larsen, Birger; Schütze, Hinrich.
Advances in Information Retrieval Theory: Lecture Notes in Computer Science, 2011, Volume 6931/2011, 3-14, DOI: 10.1007/978-3-642-23318-0_3. Vol. 6931/2011 Springer, 2011. pp. 3-14 (Lecture notes in computer science).

Publication: Contribution to book/anthology/report › Article in proceedings › Research › peer-review

Lioma, C, Larsen, B & Schütze, H 2011, User Perspectives on Query Difficulty. in Advances in Information Retrieval Theory: Lecture Notes in Computer Science, 2011, Volume 6931/2011, 3-14, DOI: 10.1007/978-3-642-23318-0_3. vol. 6931/2011, Springer, Lecture notes in computer science, pp. 3-14, International Conference on the Theory of Information Retrieval, Bertinoro, Italy, 12/09/2011. https://doi.org/10.1007/978-3-642-23318-0_3

@inproceedings{b8048411f77f41de9680ad987b1f11fe,

title = "User Perspectives on Query Difficulty",

abstract = "The difficulty of a user query can affect the performance of Information Retrieval (IR) systems. What makes a query difficult and how one may predict this is an active research area, focusing mainly on factors relating to the retrieval algorithm, to the properties of the retrieval data, or to statistical and linguistic features of the queries that may render them difficult. This work addresses query difficulty from a different angle, namely the users{\textquoteright} own perspectives on query difficulty. Two research questions are asked: (1) Are users aware that the query they submit to an IR system may be difficult for the system to address? (2) Are users aware of specific features in their query (e.g., domain-specificity, vagueness) that may render their query difficult for an IR system to address? A study of 420 queries from a Web search engine query log that are pre-categorised as easy, medium, hard by TREC based on system performance, reveals an interesting finding: users do not seem to reliably assess which query might be difficult; however, their assessments of which query features might render queries difficult are notably more accurate. Following this, a formal approach is presented for synthesising the user-assessed causes of query difficulty through opinion fusion into an overall assessment of query difficulty. The resulting assessments of query difficulty are found to agree notably more to the TREC categories than the direct user assessments. ",

author = "Christina Lioma and Birger Larsen and Hinrich Sch{\"u}tze",

note = "G. Amati and F. Crestani (Eds.): ICTIR 2011, LNCS 6931, pp. 3--14, 2011. {\textcopyright} Springer-Verlag Berlin Heidelberg 2011; International Conference on the Theory of Information Retrieval, ICTIR; Conference date: 12-09-2011 through 14-09-2011",

year = "2011",

doi = "10.1007/978-3-642-23318-0_3",

language = "English",

volume = "6931/2011",

series = "Lecture notes in computer science",

publisher = "Springer",

pages = "3--14",

booktitle = "Advances in Information Retrieval Theory",

}

TY - GEN

T1 - User Perspectives on Query Difficulty

AU - Lioma, Christina

AU - Larsen, Birger

AU - Schütze, Hinrich

N1 - G. Amati and F. Crestani (Eds.): ICTIR 2011, LNCS 6931, pp. 3–14, 2011. © Springer-Verlag Berlin Heidelberg 2011

PY - 2011

Y1 - 2011

AB - The difficulty of a user query can affect the performance of Information Retrieval (IR) systems. What makes a query difficult and how one may predict this is an active research area, focusing mainly on factors relating to the retrieval algorithm, to the properties of the retrieval data, or to statistical and linguistic features of the queries that may render them difficult. This work addresses query difficulty from a different angle, namely the users’ own perspectives on query difficulty. Two research questions are asked: (1) Are users aware that the query they submit to an IR system may be difficult for the system to address? (2) Are users aware of specific features in their query (e.g., domain-specificity, vagueness) that may render their query difficult for an IR system to address? A study of 420 queries from a Web search engine query log that are pre-categorised as easy, medium, hard by TREC based on system performance, reveals an interesting finding: users do not seem to reliably assess which query might be difficult; however, their assessments of which query features might render queries difficult are notably more accurate. Following this, a formal approach is presented for synthesising the user-assessed causes of query difficulty through opinion fusion into an overall assessment of query difficulty. The resulting assessments of query difficulty are found to agree notably more to the TREC categories than the direct user assessments.

U2 - 10.1007/978-3-642-23318-0_3

DO - 10.1007/978-3-642-23318-0_3

M3 - Article in proceedings

VL - 6931/2011

T3 - Lecture notes in computer science

SP - 3

EP - 14

BT - Advances in Information Retrieval Theory

PB - Springer

T2 - International Conference on the Theory of Information Retrieval

Y2 - 12 September 2011 through 14 September 2011

ER -
