Abstract
Integrated search is defined as searching across different document types and representations simultaneously, with the goal of presenting the user with a single ranked result list containing the optimal mix of document types. In this paper, we compare various approaches to integrating three different types of documents (bibliographic records for articles and books as well as full-text articles) using the iSearch collection: combining all document types in a single index, weighting the different document types using priors, and using collection fusion techniques to merge the retrieval results on three separate indexes corresponding to each of the document types. We find that a properly optimized retrieval model on a single combined index containing all documents without any special treatment performs no worse than our weighting and fusion methods, suggesting that more work is needed on alternative approaches to integrated search.
Original language | English |
---|---|
Title of host publication | Proceedings of the ECIR 2012 Workshop on Task-Based and Aggregated Search (TBAS2012) |
Editors | Birger Larsen, Christina Lioma, Arjen P. de Vries |
Number of pages | 5 |
Publication date | 1 Apr 2012 |
Pages | 4-8 |
Publication status | Published - 1 Apr 2012 |
Event | TBAS 2012: ECIR Workshop on Task-based and Aggregated Search - Barcelona, Spain Duration: 1 Apr 2012 → 1 Apr 2012 |
Workshop
Workshop | TBAS 2012: ECIR Workshop on Task-based and Aggregated Search |
---|---|
Country/Territory | Spain |
City | Barcelona |
Period | 01/04/2012 → 01/04/2012 |
Keywords
- information retrieval
- integrated search
- collection fusion
- data fusion
- prior weighting
- ISearch collection