Fast HyperLogLog
Project description
Fast HyperLogLog for Python.
The HyperLogLog algorithm [1] is a space efficient method to estimate the cardinality of extraordinarily large data sets. This library implements a 64 bit variant [2] for Python, written in C, that uses a MurmurHash64A hash function.
[1] Flajolet, Philippe; Fusy, Eric; Gandouet, Olivier; Meunier, Frederic (2007). “Hyperloglog: The analysis of a near-optimal cardinality estimation algorithm” (PDF). Disc. Math. and Theor. Comp. Sci. Proceedings. AH: 127146. CiteSeerX 10.1.1.76.4286.
[2] Omar Ertl, “New cardinality estimation algorithms for HyperLogLog Sketches” arXiv:1702.01284 [cs] Feb. 2017.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.