Random walk term weighting for information retrieval

R. Blanco; Christina Lioma

doi:10.1145/1277741.1277930

Department of Computer Science

14 Citations (Scopus)

Abstract

We present a way of estimating term weights for Information Retrieval (IR), using term co-occurrence as a measure of dependency between terms. We use the random walk graph-based ranking algorithm on a graph that encodes terms and co-occurrence dependencies in text, from which we derive term weights that represent a quantification of how a term contributes to its context. Evaluation on two TREC collections and 350 topics shows that the random walk-based term weights perform at least comparably to the traditional tf-idf term weighting, while they outperform it when the distance between co-occurring terms is between 6 and 30 terms.
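The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the graph construction, the damping factor, or the iteration scheme, so the PageRank-style update, the undirected edges, and the names `cooccurrence_graph`, `random_walk_weights`, `window`, and `damping` are all assumptions made for illustration.

```python
from collections import defaultdict


def cooccurrence_graph(tokens, window=6):
    """Link distinct terms that occur at most `window` positions apart.

    Assumption: edges are undirected and unweighted.
    """
    neighbours = defaultdict(set)
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                neighbours[term].add(other)
                neighbours[other].add(term)
    return neighbours


def random_walk_weights(neighbours, damping=0.85, iterations=50):
    """PageRank-style scores over the term graph (assumed update rule)."""
    terms = sorted(neighbours)
    score = {t: 1.0 / len(terms) for t in terms}
    for _ in range(iterations):
        # Each term receives an equal share of every neighbour's score.
        score = {
            t: (1 - damping) / len(terms)
            + damping * sum(score[n] / len(neighbours[n]) for n in neighbours[t])
            for t in terms
        }
    return score


# Hypothetical toy document; real input would be a tokenised TREC document.
tokens = "random walk term weighting uses term graph random walk".split()
weights = random_walk_weights(cooccurrence_graph(tokens, window=2))
for term, weight in sorted(weights.items(), key=lambda kv: -kv[1]):
    print(f"{term}\t{weight:.3f}")
```

Terms that co-occur with many other well-connected terms receive higher scores. In a retrieval setting such a score would stand in for the term-frequency component of a weighting scheme, and `window` corresponds to the co-occurrence distance that the abstract reports varying.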

Original language	English
Title of host publication	Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
Number of pages	2
Publication date	1 Jan 2007
Pages	829-830
ISBN (Print)	9781595935977
DOIs	https://doi.org/10.1145/1277741.1277930
Publication status	Published - 1 Jan 2007

Cite this

@inproceedings{11b9e2e02a5c43778b53cb53dbc7a057,
title = "Random walk term weighting for information retrieval",
abstract = "We present a way of estimating term weights for Information Retrieval (IR), using term co-occurrence as a measure of dependency between terms. We use the random walk graph-based ranking algorithm on a graph that encodes terms and co-occurrence dependencies in text, from which we derive term weights that represent a quantification of how a term contributes to its context. Evaluation on two TREC collections and 350 topics shows that the random walk-based term weights perform at least comparably to the traditional tf-idf term weighting, while they outperform it when the distance between co-occurring terms is between 6 and 30 terms.",
author = "R. Blanco and Christina Lioma",
year = "2007",
month = jan,
day = "1",
doi = "10.1145/1277741.1277930",
language = "English",
isbn = "9781595935977",
pages = "829--830",
booktitle = "Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07",
}

TY  - CHAP
T1  - Random walk term weighting for information retrieval
AU  - Blanco, R.
AU  - Lioma, Christina
PY  - 2007/1/1
Y1  - 2007/1/1
AB  - We present a way of estimating term weights for Information Retrieval (IR), using term co-occurrence as a measure of dependency between terms. We use the random walk graph-based ranking algorithm on a graph that encodes terms and co-occurrence dependencies in text, from which we derive term weights that represent a quantification of how a term contributes to its context. Evaluation on two TREC collections and 350 topics shows that the random walk-based term weights perform at least comparably to the traditional tf-idf term weighting, while they outperform it when the distance between co-occurring terms is between 6 and 30 terms.
UR  - http://www.scopus.com/inward/record.url?scp=36448963046&partnerID=8YFLogxK
U2  - 10.1145/1277741.1277930
DO  - 10.1145/1277741.1277930
M3  - Book chapter
AN  - SCOPUS:36448963046
SN  - 9781595935977
SP  - 829
EP  - 830
BT  - Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
ER  -
