Dynamic ordered sets with approximate queries, approximate heaps and soft heaps

Mikkel Thorup, Or Zamir, Uri Zwick

    Abstract

    We consider word RAM data structures for maintaining ordered sets of integers whose select and rank operations are allowed to return approximate results, i.e., ranks, or items whose rank, differ by less than ∆ from the exact answer, where ∆ = ∆(n) is an error parameter. Related to approximate select and rank is approximate (one-dimensional) nearest-neighbor. A special case of approximate select queries are approximate min queries. Data structures that support approximate min operations are known as approximate heaps (priority queues). Related to approximate heaps are soft heaps, which are approximate heaps with a different notion of approximation. We prove the optimality of all the data structures presented, either through matching cell-probe lower bounds, or through equivalences to well studied static problems. For approximate select, rank, and nearest-neighbor operations we get matching cell-probe lower bounds. We prove an equivalence between approximate min operations, i.e., approximate heaps, and the static partitioning problem. Finally, we prove an equivalence between soft heaps and the classical sorting problem, on a smaller number of items. Our results have many interesting and unexpected consequences. It turns out that approximation greatly speeds up some of these operations, while others are almost unaffected. In particular, while select and rank have identical operation times, both in comparison-based and word RAM implementations, an interesting separation emerges between the approximate versions of these operations in the word RAM model. Approximate select is much faster than approximate rank. It also turns out that approximate min is exponentially faster than the more general approximate select. Next, we show that implementing soft heaps is harder than implementing approximate heaps. The relation between them corresponds to the relation between sorting and partitioning. Finally, as an interesting byproduct, we observe that a combination of known techniques yields a deterministic word RAM algorithm for (exactly) sorting n items in O(n log logw n) time, where w is the word length. Even for the easier problem of finding duplicates, the best previous deterministic bound was O(min{n log log n, n logw n}). Our new unifying bound is an improvement when w is sufficiently large compared with n.

    OriginalsprogEngelsk
    Titel46th International Colloquium on Automata, Languages, and Programming, ICALP 2019
    RedaktørerIoannis Chatzigiannakis, Christel Baier, Stefano Leonardi, Paola Flocchini
    Antal sider13
    ForlagSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
    Publikationsdato2019
    Artikelnummer95
    ISBN (Elektronisk)9783959771092
    DOI
    StatusUdgivet - 2019
    Begivenhed46th International Colloquium on Automata, Languages, and Programming, ICALP 2019 - Patras, Grækenland
    Varighed: 9 jul. 201912 jul. 2019

    Konference

    Konference46th International Colloquium on Automata, Languages, and Programming, ICALP 2019
    Land/OmrådeGrækenland
    ByPatras
    Periode09/07/201912/07/2019
    SponsorCenter for Perspicuous Computing (CPEC), University of Patras
    NavnLeibniz International Proceedings in Informatics, LIPIcs
    Vol/bind132
    ISSN1868-8969

    Fingeraftryk

    Dyk ned i forskningsemnerne om 'Dynamic ordered sets with approximate queries, approximate heaps and soft heaps'. Sammen danner de et unikt fingeraftryk.

    Citationsformater