Dynamic ordered sets with approximate queries, approximate heaps and soft heaps

Mikkel Thorup; Or Zamir; Uri Zwick

doi:10.4230/LIPIcs.ICALP.2019.95

Dynamic ordered sets with approximate queries, approximate heaps and soft heaps

Mikkel Thorup, Or Zamir, Uri Zwick

Abstract

We consider word RAM data structures for maintaining ordered sets of integers whose select and rank operations are allowed to return approximate results, i.e., ranks, or items whose rank, differ by less than ∆ from the exact answer, where ∆ = ∆(n) is an error parameter. Related to approximate select and rank is approximate (one-dimensional) nearest-neighbor. A special case of approximate select queries are approximate min queries. Data structures that support approximate min operations are known as approximate heaps (priority queues). Related to approximate heaps are soft heaps, which are approximate heaps with a different notion of approximation. We prove the optimality of all the data structures presented, either through matching cell-probe lower bounds, or through equivalences to well studied static problems. For approximate select, rank, and nearest-neighbor operations we get matching cell-probe lower bounds. We prove an equivalence between approximate min operations, i.e., approximate heaps, and the static partitioning problem. Finally, we prove an equivalence between soft heaps and the classical sorting problem, on a smaller number of items. Our results have many interesting and unexpected consequences. It turns out that approximation greatly speeds up some of these operations, while others are almost unaffected. In particular, while select and rank have identical operation times, both in comparison-based and word RAM implementations, an interesting separation emerges between the approximate versions of these operations in the word RAM model. Approximate select is much faster than approximate rank. It also turns out that approximate min is exponentially faster than the more general approximate select. Next, we show that implementing soft heaps is harder than implementing approximate heaps. The relation between them corresponds to the relation between sorting and partitioning. Finally, as an interesting byproduct, we observe that a combination of known techniques yields a deterministic word RAM algorithm for (exactly) sorting n items in O(n log log_w n) time, where w is the word length. Even for the easier problem of finding duplicates, the best previous deterministic bound was O(min{n log log n, n log_w n}). Our new unifying bound is an improvement when w is sufficiently large compared with n.

Originalsprog	Engelsk
Titel	46th International Colloquium on Automata, Languages, and Programming, ICALP 2019
Redaktører	Ioannis Chatzigiannakis, Christel Baier, Stefano Leonardi, Paola Flocchini
Antal sider	13
Forlag	Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
Publikationsdato	2019
Artikelnummer	95
ISBN (Elektronisk)	9783959771092
DOI	https://doi.org/10.4230/LIPIcs.ICALP.2019.95
Status	Udgivet - 2019
Begivenhed	46th International Colloquium on Automata, Languages, and Programming, ICALP 2019 - Patras, Grækenland Varighed: 9 jul. 2019 → 12 jul. 2019

Konference

Konference	46th International Colloquium on Automata, Languages, and Programming, ICALP 2019
Land/Område	Grækenland
By	Patras
Periode	09/07/2019 → 12/07/2019
Sponsor	Center for Perspicuous Computing (CPEC), University of Patras

Navn	Leibniz International Proceedings in Informatics, LIPIcs
Vol/bind	132
ISSN	1868-8969

Adgang til dokumentet

10.4230/LIPIcs.ICALP.2019.95

Thorup-Dynamic Ordered Sets with Approximate Queries,Forlagets udgivne version, 457 KBLicens: CC BY

Andre filer og links

Link to publication in Scopus

Citationsformater

Thorup, M., Zamir, O., & Zwick, U. (2019). Dynamic ordered sets with approximate queries, approximate heaps and soft heaps. I I. Chatzigiannakis, C. Baier, S. Leonardi, & P. Flocchini (red.), 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019 Artikel 95 Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing. https://doi.org/10.4230/LIPIcs.ICALP.2019.95

Dynamic ordered sets with approximate queries, approximate heaps and soft heaps. / Thorup, Mikkel; Zamir, Or; Zwick, Uri.
46th International Colloquium on Automata, Languages, and Programming, ICALP 2019. red. / Ioannis Chatzigiannakis; Christel Baier; Stefano Leonardi; Paola Flocchini. Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, 2019. 95 (Leibniz International Proceedings in Informatics, LIPIcs, Bind 132).

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › peer review

Thorup, M, Zamir, O & Zwick, U 2019, Dynamic ordered sets with approximate queries, approximate heaps and soft heaps. i I Chatzigiannakis, C Baier, S Leonardi & P Flocchini (red), 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019., 95, Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, Leibniz International Proceedings in Informatics, LIPIcs, bind 132, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, Patras, Grækenland, 09/07/2019. https://doi.org/10.4230/LIPIcs.ICALP.2019.95

Thorup M, Zamir O, Zwick U. Dynamic ordered sets with approximate queries, approximate heaps and soft heaps. I Chatzigiannakis I, Baier C, Leonardi S, Flocchini P, red., 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019. Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing. 2019. 95. (Leibniz International Proceedings in Informatics, LIPIcs, Bind 132). doi: 10.4230/LIPIcs.ICALP.2019.95

Thorup, Mikkel ; Zamir, Or ; Zwick, Uri. / Dynamic ordered sets with approximate queries, approximate heaps and soft heaps. 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019. red. / Ioannis Chatzigiannakis ; Christel Baier ; Stefano Leonardi ; Paola Flocchini. Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, 2019. (Leibniz International Proceedings in Informatics, LIPIcs, Bind 132).

@inproceedings{452f13229f064d8390a6c01a18d36f48,

title = "Dynamic ordered sets with approximate queries, approximate heaps and soft heaps",

abstract = "We consider word RAM data structures for maintaining ordered sets of integers whose select and rank operations are allowed to return approximate results, i.e., ranks, or items whose rank, differ by less than ∆ from the exact answer, where ∆ = ∆(n) is an error parameter. Related to approximate select and rank is approximate (one-dimensional) nearest-neighbor. A special case of approximate select queries are approximate min queries. Data structures that support approximate min operations are known as approximate heaps (priority queues). Related to approximate heaps are soft heaps, which are approximate heaps with a different notion of approximation. We prove the optimality of all the data structures presented, either through matching cell-probe lower bounds, or through equivalences to well studied static problems. For approximate select, rank, and nearest-neighbor operations we get matching cell-probe lower bounds. We prove an equivalence between approximate min operations, i.e., approximate heaps, and the static partitioning problem. Finally, we prove an equivalence between soft heaps and the classical sorting problem, on a smaller number of items. Our results have many interesting and unexpected consequences. It turns out that approximation greatly speeds up some of these operations, while others are almost unaffected. In particular, while select and rank have identical operation times, both in comparison-based and word RAM implementations, an interesting separation emerges between the approximate versions of these operations in the word RAM model. Approximate select is much faster than approximate rank. It also turns out that approximate min is exponentially faster than the more general approximate select. Next, we show that implementing soft heaps is harder than implementing approximate heaps. The relation between them corresponds to the relation between sorting and partitioning. Finally, as an interesting byproduct, we observe that a combination of known techniques yields a deterministic word RAM algorithm for (exactly) sorting n items in O(n log logw n) time, where w is the word length. Even for the easier problem of finding duplicates, the best previous deterministic bound was O(min{n log log n, n logw n}). Our new unifying bound is an improvement when w is sufficiently large compared with n.",

keywords = "Lower bounds, Order queries, Word RAM",

author = "Mikkel Thorup and Or Zamir and Uri Zwick",

year = "2019",

doi = "10.4230/LIPIcs.ICALP.2019.95",

language = "English",

series = "Leibniz International Proceedings in Informatics, LIPIcs",

publisher = "Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing",

editor = "Ioannis Chatzigiannakis and Christel Baier and Stefano Leonardi and Paola Flocchini",

booktitle = "46th International Colloquium on Automata, Languages, and Programming, ICALP 2019",

note = "46th International Colloquium on Automata, Languages, and Programming, ICALP 2019 ; Conference date: 09-07-2019 Through 12-07-2019",

}

TY - GEN

T1 - Dynamic ordered sets with approximate queries, approximate heaps and soft heaps

AU - Thorup, Mikkel

AU - Zamir, Or

AU - Zwick, Uri

PY - 2019

Y1 - 2019

N2 - We consider word RAM data structures for maintaining ordered sets of integers whose select and rank operations are allowed to return approximate results, i.e., ranks, or items whose rank, differ by less than ∆ from the exact answer, where ∆ = ∆(n) is an error parameter. Related to approximate select and rank is approximate (one-dimensional) nearest-neighbor. A special case of approximate select queries are approximate min queries. Data structures that support approximate min operations are known as approximate heaps (priority queues). Related to approximate heaps are soft heaps, which are approximate heaps with a different notion of approximation. We prove the optimality of all the data structures presented, either through matching cell-probe lower bounds, or through equivalences to well studied static problems. For approximate select, rank, and nearest-neighbor operations we get matching cell-probe lower bounds. We prove an equivalence between approximate min operations, i.e., approximate heaps, and the static partitioning problem. Finally, we prove an equivalence between soft heaps and the classical sorting problem, on a smaller number of items. Our results have many interesting and unexpected consequences. It turns out that approximation greatly speeds up some of these operations, while others are almost unaffected. In particular, while select and rank have identical operation times, both in comparison-based and word RAM implementations, an interesting separation emerges between the approximate versions of these operations in the word RAM model. Approximate select is much faster than approximate rank. It also turns out that approximate min is exponentially faster than the more general approximate select. Next, we show that implementing soft heaps is harder than implementing approximate heaps. The relation between them corresponds to the relation between sorting and partitioning. Finally, as an interesting byproduct, we observe that a combination of known techniques yields a deterministic word RAM algorithm for (exactly) sorting n items in O(n log logw n) time, where w is the word length. Even for the easier problem of finding duplicates, the best previous deterministic bound was O(min{n log log n, n logw n}). Our new unifying bound is an improvement when w is sufficiently large compared with n.

AB - We consider word RAM data structures for maintaining ordered sets of integers whose select and rank operations are allowed to return approximate results, i.e., ranks, or items whose rank, differ by less than ∆ from the exact answer, where ∆ = ∆(n) is an error parameter. Related to approximate select and rank is approximate (one-dimensional) nearest-neighbor. A special case of approximate select queries are approximate min queries. Data structures that support approximate min operations are known as approximate heaps (priority queues). Related to approximate heaps are soft heaps, which are approximate heaps with a different notion of approximation. We prove the optimality of all the data structures presented, either through matching cell-probe lower bounds, or through equivalences to well studied static problems. For approximate select, rank, and nearest-neighbor operations we get matching cell-probe lower bounds. We prove an equivalence between approximate min operations, i.e., approximate heaps, and the static partitioning problem. Finally, we prove an equivalence between soft heaps and the classical sorting problem, on a smaller number of items. Our results have many interesting and unexpected consequences. It turns out that approximation greatly speeds up some of these operations, while others are almost unaffected. In particular, while select and rank have identical operation times, both in comparison-based and word RAM implementations, an interesting separation emerges between the approximate versions of these operations in the word RAM model. Approximate select is much faster than approximate rank. It also turns out that approximate min is exponentially faster than the more general approximate select. Next, we show that implementing soft heaps is harder than implementing approximate heaps. The relation between them corresponds to the relation between sorting and partitioning. Finally, as an interesting byproduct, we observe that a combination of known techniques yields a deterministic word RAM algorithm for (exactly) sorting n items in O(n log logw n) time, where w is the word length. Even for the easier problem of finding duplicates, the best previous deterministic bound was O(min{n log log n, n logw n}). Our new unifying bound is an improvement when w is sufficiently large compared with n.

KW - Lower bounds

KW - Order queries

KW - Word RAM

UR - http://www.scopus.com/inward/record.url?scp=85069199527&partnerID=8YFLogxK

U2 - 10.4230/LIPIcs.ICALP.2019.95

DO - 10.4230/LIPIcs.ICALP.2019.95

M3 - Article in proceedings

AN - SCOPUS:85069199527

T3 - Leibniz International Proceedings in Informatics, LIPIcs

BT - 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019

A2 - Chatzigiannakis, Ioannis

A2 - Baier, Christel

A2 - Leonardi, Stefano

A2 - Flocchini, Paola

PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing

T2 - 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019

Y2 - 9 July 2019 through 12 July 2019

ER -

Dynamic ordered sets with approximate queries, approximate heaps and soft heaps

Abstract

Konference

Adgang til dokumentet

Andre filer og links

Fingeraftryk

Citationsformater