Fast hierarchical clustering routines for R and Python.

Project description

This library provides Python functions for hierarchical clustering. It generates hierarchical clusters from distance matrices or from vector data.

This module is intended to replace the functions

    linkage, single, complete, average, weighted, centroid, median, ward

in the module scipy.cluster.hierarchy with the same functionality but much faster algorithms. Moreover, the function linkage_vector provides memory-efficient clustering for vector data.
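As a minimal sketch of the drop-in usage described above, the following calls `linkage` on a condensed distance matrix and `linkage_vector` on the raw vectors. The random data, the variable names, and the fallback to `scipy.cluster.hierarchy` when fastcluster is not installed are assumptions made for illustration; they are not part of the package.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster
from scipy.spatial.distance import pdist

try:
    import fastcluster as hc  # fast implementation
except ImportError:
    # Assumption for this sketch: fall back to SciPy, which offers
    # a linkage() function with the same calling convention.
    import scipy.cluster.hierarchy as hc

# Illustrative data: 50 points in 3 dimensions.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))

# Cluster from a condensed distance matrix, as with SciPy's linkage().
D = pdist(X, metric="euclidean")
Z = hc.linkage(D, method="average")

# The result is a SciPy-style linkage matrix with n - 1 rows, so the
# other scipy.cluster.hierarchy tools can consume it unchanged.
labels = fcluster(Z, t=3, criterion="maxclust")

# linkage_vector works on the vectors directly and avoids storing all
# pairwise distances. It exists only in fastcluster, hence the check.
if hasattr(hc, "linkage_vector"):
    Z_vec = hc.linkage_vector(X, method="ward", metric="euclidean")
```

Because the output format matches SciPy's, switching an existing script usually amounts to changing the import, although results should still be checked against the user manual for the chosen method and metric.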

The interface is very similar to MATLAB's Statistics Toolbox API to make code easier to port from MATLAB to Python/NumPy. The core implementation of this library is in C++ for efficiency.

User manual: fastcluster.pdf.

The “Yule” distance function changed in fastcluster version 1.2.0, following a change in SciPy 1.6.3. It is recommended to use fastcluster version 1.1.x together with SciPy versions before 1.6.3, and fastcluster 1.2.x with SciPy ≥1.6.3.
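The version pairing above can be checked programmatically. The sketch below is only an illustration: the helper names `parse_version`, `yule_versions_match` and `installed_versions_match` are hypothetical, and it assumes that every fastcluster release from 1.2.0 onwards uses the new “Yule” definition.

```python
from importlib import metadata


def parse_version(text):
    """Turn a string such as '1.6.3' or '1.6.3rc1' into (1, 6, 3)."""
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def yule_versions_match(fastcluster_version, scipy_version):
    """True if both packages use the same 'Yule' definition.

    Assumption: fastcluster >= 1.2.0 pairs with SciPy >= 1.6.3,
    and older fastcluster pairs with older SciPy.
    """
    new_fastcluster = parse_version(fastcluster_version) >= (1, 2, 0)
    new_scipy = parse_version(scipy_version) >= (1, 6, 3)
    return new_fastcluster == new_scipy


def installed_versions_match():
    """Check the installed packages; None if either one is missing."""
    try:
        return yule_versions_match(
            metadata.version("fastcluster"), metadata.version("scipy")
        )
    except metadata.PackageNotFoundError:
        return None
```

Calling `installed_versions_match()` at the start of a script that relies on the “Yule” metric gives an early warning when the two packages are out of step.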

The fastcluster package is considered stable and will undergo few changes from now on. If there have been no updates some years from now, this does not necessarily mean that the package is unmaintained; it may simply not have been necessary to correct anything. Please still report potential bugs and incompatibilities to daniel@danifold.net. You may also use my GitHub repository for bug reports, pull requests etc.

Note that PyPI and my GitHub repository host the source code for the Python interface only. The archive with both the R and the Python interface is available on CRAN and in the GitHub repository “cran/fastcluster”. Although I also appear as the author of this second GitHub repository, it is just an automatic, read-only mirror of the CRAN archive, so please do not attempt to report bugs or contact me via this repository.

Installation files for Windows are provided on PyPI and on Christoph Gohlke's web page.

Christoph Dalitz wrote a pure C++ interface to fastcluster.

Reference: Daniel Müllner, fastcluster: Fast Hierarchical, Agglomerative Clustering Routines for R and Python, Journal of Statistical Software, 53 (2013), no. 9, 1–18, https://www.jstatsoft.org/v53/i09/.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for fastcluster, version 1.2.3
Size      File type  Python version  Filename
202.1 kB  Wheel      cp37            fastcluster-1.2.3-cp37-cp37m-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
146.2 kB  Wheel      cp37            fastcluster-1.2.3-cp37-cp37m-manylinux_2_5_i686.manylinux1_i686.whl
155.6 kB  Wheel      cp37            fastcluster-1.2.3-cp37-cp37m-manylinux_2_5_x86_64.manylinux1_x86_64.whl
33.0 kB   Wheel      cp37            fastcluster-1.2.3-cp37-cp37m-win32.whl
36.4 kB   Wheel      cp37            fastcluster-1.2.3-cp37-cp37m-win_amd64.whl
201.9 kB  Wheel      cp38            fastcluster-1.2.3-cp38-cp38-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
145.5 kB  Wheel      cp38            fastcluster-1.2.3-cp38-cp38-manylinux_2_5_i686.manylinux1_i686.whl
155.6 kB  Wheel      cp38            fastcluster-1.2.3-cp38-cp38-manylinux_2_5_x86_64.manylinux1_x86_64.whl
33.0 kB   Wheel      cp38            fastcluster-1.2.3-cp38-cp38-win32.whl
36.5 kB   Wheel      cp38            fastcluster-1.2.3-cp38-cp38-win_amd64.whl
39.9 kB   Wheel      cp39            fastcluster-1.2.3-cp39-cp39-macosx_11_0_x86_64.whl
201.2 kB  Wheel      cp39            fastcluster-1.2.3-cp39-cp39-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
145.1 kB  Wheel      cp39            fastcluster-1.2.3-cp39-cp39-manylinux_2_5_i686.manylinux1_i686.whl
155.1 kB  Wheel      cp39            fastcluster-1.2.3-cp39-cp39-manylinux_2_5_x86_64.manylinux1_x86_64.whl
33.0 kB   Wheel      cp39            fastcluster-1.2.3-cp39-cp39-win32.whl
36.4 kB   Wheel      cp39            fastcluster-1.2.3-cp39-cp39-win_amd64.whl
173.5 kB  Source     None            fastcluster-1.2.3.tar.gz
