The power of simple tabulation hashing

Mihai Patrascu, Mikkel Thorup

26 Citations (Scopus)

Abstract

Randomized algorithms are often enjoyed for their simplicity, but the hash functions used to yield the desired theoretical guarantees are often neither simple nor practical. Here we show that the simplest possible tabulation hashing provides unexpectedly strong guarantees. The scheme itself dates back to Carter and Wegman (STOC'77). Keys are viewed as consisting of c characters. We initialize c tables T1, ..., Tc mapping characters to random hash codes. A key x = (x1, ..., xc) is hashed to T1[x1] ⊕ ... ⊕ Tc[xc], where ⊕ denotes xor. While this scheme is not even 4-independent, we show that it provides many of the guarantees that are normally obtained via higher independence, e.g., Chernoff-type concentration, min-wise hashing for estimating set intersection, and cuckoo hashing.

Original languageEnglish
Title of host publicationProceedings of the 43rd annual ACM symposium on Theory of computing
Number of pages10
PublisherAssociation for Computing Machinery
Publication date2011
Pages1-10
ISBN (Print)978-1-4503-0691-1
DOIs
Publication statusPublished - 2011
Externally publishedYes
Event43rd ACM Symposium on Theory of Computing - San Jose, California, United States
Duration: 6 Jun 20118 Jun 2011
Conference number: 43

Conference

Conference43rd ACM Symposium on Theory of Computing
Number43
Country/TerritoryUnited States
CitySan Jose, California
Period06/06/201108/06/2011

Fingerprint

Dive into the research topics of 'The power of simple tabulation hashing'. Together they form a unique fingerprint.

Cite this