Improving query correctness using centralized probably approximately correct (PAC) search

Ingemar Cox, Jianhan Zhu, Ruoxun Fu, Lars Kai Hansen

3 Citations (Scopus)

Abstract

A non-deterministic architecture for information retrieval, known as probably approximately correct (PAC) search, has recently been proposed. However, for equivalent storage and computational resources, the performance of PAC is only 63% of a deterministic system. We propose a modification to the PAC architecture, introducing a centralized query coordination node. To respond to a query, random sampling of computers is replaced with pseudo-random sampling using the query as a seed. Then, for queries that occur frequently, this pseudo-random sample is iteratively refined so that performance improves with each iteration. A theoretical analysis is presented that provides an upper bound on the performance of any iterative algorithm. Two heuristic algorithms are then proposed to iteratively improve the performance of PAC search. Experiments on the TREC-8 dataset demonstrate that performance can improve from 67% to 96% in just 10 iterations, and continues to improve with each iteration. Thus, for queries that occur 10 or more times, the performance of a non-deterministic PAC architecture can closely match that of a deterministic system.
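The abstract's first modification, replacing random sampling of computers with pseudo-random sampling seeded by the query, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `sample_nodes`, the parameters `num_nodes` and `sample_size`, the query normalization, and the use of SHA-256 to derive the seed are all assumptions made for illustration. The point it shows is only that a repeated query always reaches the same subset of computers, which is what makes iterative refinement of that subset possible.

```python
import hashlib
import random


def sample_nodes(query: str, num_nodes: int, sample_size: int) -> list[int]:
    """Return a pseudo-random subset of node indices determined by the query.

    Hypothetical helper: the seeding scheme here is an assumption, not the
    scheme used in the paper.
    """
    # Derive a stable integer seed from the normalized query text.
    # hashlib is used because Python's built-in hash() of a str is salted
    # per process and would not be reproducible across runs.
    normalized = query.strip().lower()
    digest = hashlib.sha256(normalized.encode("utf-8")).digest()
    seed = int.from_bytes(digest[:8], "big")

    # A private generator keeps the global random state untouched.
    rng = random.Random(seed)

    # Sample without replacement from the node indices 0..num_nodes-1.
    return sorted(rng.sample(range(num_nodes), sample_size))


if __name__ == "__main__":
    first = sample_nodes("information retrieval", num_nodes=1000, sample_size=10)
    second = sample_nodes("information retrieval", num_nodes=1000, sample_size=10)
    # The same query maps to the same nodes on every call.
    print(first == second)
```

Because the sample depends only on the query, a coordination node could in principle record which of the sampled computers contributed useful results and adjust the sample on later occurrences of the same query. The refinement step itself is not sketched here, since the abstract does not describe the two heuristic algorithms in detail.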

Translated title of the contribution: Improving query correctness using centralized probably approximately correct (PAC) search
Original language: English
Title: Advances in Information Retrieval
Number of pages: 16
Publisher: Springer Science+Business Media
Publication date: 2010
Pages: 265-280
Status: Published - 2010
Published externally: Yes
