# cNilsimsa

A C implementation of Nilsimsa for Python.

```shell
$ pip install cnilsimsa
```

We are building this module one piece at a time. So far it provides only `compare_hexdigests`, because a faster way to compare hex digests was the primary motivation for starting this project.

```python
from cnilsimsa import compare_hexdigests
```

It works exactly like the method of the same name from pynilsimsa but is more than an order of magnitude faster, so if you need to do lots of deduplication over a large corpus of documents via nilsimsa hex digests from Python, this should be helpful.
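For readers unfamiliar with what the comparison computes, here is a minimal pure-Python sketch. It assumes the conventional Nilsimsa score: 128 minus the number of differing bits between two 256-bit digests, so identical digests score 128. The names `compare_hexdigests_sketch` and `dedupe`, and the threshold of 100, are hypothetical and chosen for illustration. This is not the cnilsimsa source, and it will be far slower than the C version.

```python
def compare_hexdigests_sketch(hex_a, hex_b):
    """Hypothetical reference: score two 64-character Nilsimsa hex digests.

    Assumes the conventional score of 128 minus the count of differing bits,
    giving a range of -128 (all bits differ) to 128 (identical).
    """
    a = bytes.fromhex(hex_a)
    b = bytes.fromhex(hex_b)
    if len(a) != 32 or len(b) != 32:
        raise ValueError("expected 64 hex characters (256 bits) per digest")
    # XOR each byte pair and count the set bits, i.e. the differing bits.
    differing_bits = sum(bin(x ^ y).count("1") for x, y in zip(a, b))
    return 128 - differing_bits


def dedupe(hexdigests, compare=compare_hexdigests_sketch, threshold=100):
    """Hypothetical greedy filter: keep a digest only if it scores below
    `threshold` against every digest kept so far (quadratic in the worst case).
    """
    kept = []
    for digest in hexdigests:
        if all(compare(digest, other) < threshold for other in kept):
            kept.append(digest)
    return kept


if __name__ == "__main__":
    zeros = "00" * 32
    one_bit_off = "01" + "00" * 31
    print(compare_hexdigests_sketch(zeros, one_bit_off))
    print(len(dedupe([zeros, one_bit_off, "ff" * 32])))
```

If cnilsimsa is installed, passing `compare=compare_hexdigests` to a loop like `dedupe` should be the natural place to benefit from the faster C comparison, assuming it returns the same score as this sketch.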

Building out the rest of the methods for representing and cooking LSHs, to provide a full drop-in replacement for pynilsimsa, is the longer-term goal.

```python
import cnilsimsa as nilsimsa
```

The more complete pure Python implementation is here:

Thanks to the authors of the Ruby/C implementation from which our fillpopcount() function is taken.

Thanks to the Perl/C implementation that inspired both predecessors.

Contributions welcome.

Download Files

File type: Source. Size: 3.1 kB. Python version: None. Upload date: Jan 22, 2014.
