Skip to main content

Reference baseline for high-performance cheminformatics fingerprint search benchmarking

Project description

This version of chemfp includes command-lines tools to generate
cheminformatics fingerprints and search those fingerprints by
Tanimoto similarity, as well as a Python library which you can use
to build new tools.

It is the no-cost/open source chemfp release track. It only supports
Python 2.7. It is being maintained only to provide a good reference
baseline for benchmarking other similarity search tools.

The commercial track, currently chemfp 3.4, includes faster
performance, many new features, and support for Python 3.

Chemfp is designed for the dense, 100-10,000 bit fingerprints which
occur in small-molecule/pharmaceutical chemisty. The Tanimoto search
algorithms are implemented in C and assembly for performance, and
support both threshold and k-nearest searches using the BitBound
algorithm of Swamidass and Baldi.

Fingerprint generation can be done either by extracting existing
fingerprint data from an SD file or by using an existing chemistry
toolkit. chemfp supports the Python libraries from Open Babel,
OpenEye, and RDKit toolkits. Be aware that those vendors no longer
support Python 2.7.

The main web site is .

Extensive documentation is at .

To cite chemfp use:
Dalke, Andrew. The chemfp project. J. Cheminformatics 11, 76

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chemfp-1.6.1.tar.gz (560.4 kB view hashes)

Uploaded source

Built Distributions

chemfp-1.6.1-cp27-cp27m-manylinux1_x86_64.whl (502.6 kB view hashes)

Uploaded cp27

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page