An Exploration of Retrieval-Enhancing Methods for Integrated Search in a Digital Library

Diana Ransgaard Sørensen, Toine Bogers, Birger Larsen

505 Downloads (Pure)

Abstract

Integrated search is defined as searching across different document types and representations simultaneously, with the goal of presenting the user with a single ranked result list containing the optimal mix of document types. In this paper, we compare various approaches to integrating three different types of documents (bibliographic records for articles and books as well as full-text articles) using the iSearch collection: combining all document types in a single index, weighting the different document types using priors, and using collection fusion techniques to merge the retrieval results on three separate indexes corresponding to each of the document types. We find that a properly optimized retrieval model on a single combined index containing all documents without any special treatment performs no worse than our weighting and fusion methods, suggesting that more work is needed on alternative approaches to integrated search.
Original languageEnglish
Title of host publicationProceedings of the ECIR 2012 Workshop on Task-Based and Aggregated Search (TBAS2012)
EditorsBirger Larsen, Christina Lioma, Arjen P. de Vries
Number of pages5
Publication date1 Apr 2012
Pages4-8
Publication statusPublished - 1 Apr 2012
EventTBAS 2012: ECIR Workshop on Task-based and Aggregated Search - Barcelona, Spain
Duration: 1 Apr 20121 Apr 2012

Workshop

WorkshopTBAS 2012: ECIR Workshop on Task-based and Aggregated Search
Country/TerritorySpain
CityBarcelona
Period01/04/201201/04/2012

Keywords

  • information retrieval
  • integrated search
  • collection fusion
  • data fusion
  • prior weighting
  • ISearch collection

Fingerprint

Dive into the research topics of 'An Exploration of Retrieval-Enhancing Methods for Integrated Search in a Digital Library'. Together they form a unique fingerprint.

Cite this