University of glasgow at TREC 2006: Experiments in terabyte and enterprise tracks with terrier

Christina Lioma; C. Macdonald; V. Plachouras; J. Peng; B. He; I. Ounis

University of glasgow at TREC 2006: Experiments in terabyte and enterprise tracks with terrier

Christina Lioma, C. Macdonald, V. Plachouras, J. Peng, B. He, I. Ounis

Datalogisk Institut

Abstract

In TREC 2006, we participate in three tasks of the Terabyte and Enterprise tracks. We continue experiments using Terrier1, our modular and scalable Information Retrieval (IR) platform. Furthering our research into the Divergence From Randomness (DFR) framework of weighting models, we introduce two new effective and low-cost models, which combine evidence from document structure and capture term dependence and proximity, respectively. Additionally, in the Terabyte track, we improve on our query expansion mechanism on fields, presented in TREC 2005, with a new and more refined technique, which combines evidence in a linear, rather than uniform, way. We also introduce a novel, low-cost syntacticallybased noise reduction technique, which we flexibly apply to both the queries and the index. Furthermore, in the Named Page Finding task, we present a new technique for combining query-independent evidence, in the form of prior probabilities. In the Enterprise track, we test our new voting model for expert search. Our experiments focus on the need for candidate length normalisation, and on how retrieval performance can be enhanced by applying retrieval techniques to the underlying ranking of documents.

Originalsprog	Engelsk
Titel	University of glasgow at TREC 2006 : Experiments in terabyte and enterprise tracks with terrier
Publikationsdato	1 jan. 2006
Status	Udgivet - 1 jan. 2006

Navn	N I S T Special Publication
ISSN	1048-776X

Andre filer og links

Link to publication in Scopus

Citationsformater

University of glasgow at TREC 2006: Experiments in terabyte and enterprise tracks with terrier. / Lioma, Christina; Macdonald, C.; Plachouras, V. et al.
University of glasgow at TREC 2006: Experiments in terabyte and enterprise tracks with terrier. 2006. (N I S T Special Publication).

Publikation: Bidrag til bog/antologi/rapport › Bidrag til bog/antologi › Forskning › peer review

@inbook{a3737e38e3f243b9bf172a4956db7f37,

title = "University of glasgow at TREC 2006: Experiments in terabyte and enterprise tracks with terrier",

abstract = "In TREC 2006, we participate in three tasks of the Terabyte and Enterprise tracks. We continue experiments using Terrier1, our modular and scalable Information Retrieval (IR) platform. Furthering our research into the Divergence From Randomness (DFR) framework of weighting models, we introduce two new effective and low-cost models, which combine evidence from document structure and capture term dependence and proximity, respectively. Additionally, in the Terabyte track, we improve on our query expansion mechanism on fields, presented in TREC 2005, with a new and more refined technique, which combines evidence in a linear, rather than uniform, way. We also introduce a novel, low-cost syntacticallybased noise reduction technique, which we flexibly apply to both the queries and the index. Furthermore, in the Named Page Finding task, we present a new technique for combining query-independent evidence, in the form of prior probabilities. In the Enterprise track, we test our new voting model for expert search. Our experiments focus on the need for candidate length normalisation, and on how retrieval performance can be enhanced by applying retrieval techniques to the underlying ranking of documents.",

author = "Christina Lioma and C. Macdonald and V. Plachouras and J. Peng and B. He and I. Ounis",

year = "2006",

month = jan,

day = "1",

language = "English",

series = "N I S T Special Publication",

publisher = "National Instiute of Standards & Technology",

booktitle = "University of glasgow at TREC 2006",

}

TY - CHAP

T1 - University of glasgow at TREC 2006

T2 - Experiments in terabyte and enterprise tracks with terrier

AU - Lioma, Christina

AU - Macdonald, C.

AU - Plachouras, V.

AU - Peng, J.

AU - He, B.

AU - Ounis, I.

PY - 2006/1/1

Y1 - 2006/1/1

N2 - In TREC 2006, we participate in three tasks of the Terabyte and Enterprise tracks. We continue experiments using Terrier1, our modular and scalable Information Retrieval (IR) platform. Furthering our research into the Divergence From Randomness (DFR) framework of weighting models, we introduce two new effective and low-cost models, which combine evidence from document structure and capture term dependence and proximity, respectively. Additionally, in the Terabyte track, we improve on our query expansion mechanism on fields, presented in TREC 2005, with a new and more refined technique, which combines evidence in a linear, rather than uniform, way. We also introduce a novel, low-cost syntacticallybased noise reduction technique, which we flexibly apply to both the queries and the index. Furthermore, in the Named Page Finding task, we present a new technique for combining query-independent evidence, in the form of prior probabilities. In the Enterprise track, we test our new voting model for expert search. Our experiments focus on the need for candidate length normalisation, and on how retrieval performance can be enhanced by applying retrieval techniques to the underlying ranking of documents.

AB - In TREC 2006, we participate in three tasks of the Terabyte and Enterprise tracks. We continue experiments using Terrier1, our modular and scalable Information Retrieval (IR) platform. Furthering our research into the Divergence From Randomness (DFR) framework of weighting models, we introduce two new effective and low-cost models, which combine evidence from document structure and capture term dependence and proximity, respectively. Additionally, in the Terabyte track, we improve on our query expansion mechanism on fields, presented in TREC 2005, with a new and more refined technique, which combines evidence in a linear, rather than uniform, way. We also introduce a novel, low-cost syntacticallybased noise reduction technique, which we flexibly apply to both the queries and the index. Furthermore, in the Named Page Finding task, we present a new technique for combining query-independent evidence, in the form of prior probabilities. In the Enterprise track, we test our new voting model for expert search. Our experiments focus on the need for candidate length normalisation, and on how retrieval performance can be enhanced by applying retrieval techniques to the underlying ranking of documents.

UR - http://www.scopus.com/inward/record.url?scp=84873545645&partnerID=8YFLogxK

M3 - Book chapter

AN - SCOPUS:84873545645

T3 - N I S T Special Publication

BT - University of glasgow at TREC 2006

ER -

University of glasgow at TREC 2006: Experiments in terabyte and enterprise tracks with terrier

Abstract

Andre filer og links

Fingeraftryk

Citationsformater