Skip to main content

A SuperMinHash implementation

Project description

SuperMinHash, Simhash and SimhashIndex

SuperMinHash

A New Minwise Hashing Algorithm for Jaccard Similarity Estimation

This is an implementation of Otmar Ertl's paper with the same title. The implementation is still in progress but almost there...

It is fork to Python from Go (source https://github.com/seiflotfy/superminhash)

Simhash and SimhashIndex

It is fork and redesign (source https://github.com/leonsim/simhash)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

superminhash-0.1.0.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

superminhash-0.1.0-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file superminhash-0.1.0.tar.gz.

File metadata

  • Download URL: superminhash-0.1.0.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.4.3 requests-toolbelt/0.9.1 tqdm/4.30.0 CPython/3.7.2

File hashes

Hashes for superminhash-0.1.0.tar.gz
Algorithm Hash digest
SHA256 181fe4aa119ceaa139ba0569f21fb5804431df65f85a25131f5002ec78ec142a
MD5 17a9c12df438e93bbb102a07b9be2d83
BLAKE2b-256 f26f08af3c8e44ce654c825cd9fab5412cdfcdcee4b866479099e261c9c48055

See more details on using hashes here.

File details

Details for the file superminhash-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: superminhash-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.4.3 requests-toolbelt/0.9.1 tqdm/4.30.0 CPython/3.7.2

File hashes

Hashes for superminhash-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3a33e56b3cd82638ba36ac22e8542bd070939841d10f9825c5a18a6493273e5a
MD5 2888bc0c90a68f9e222664c0660ce0f3
BLAKE2b-256 eeebee7b424b604e5938132294f48bbad03f06efecc667eae85c3531fc378135

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page