Skip to main content

A Python package for common-nearest neighbour (CNN) clustering

Project description

Build Status

Common nearest neighbours (CNN) clustering and core-set Markov-state model estimation


NOTE

This project is currently under development in the alpha state. The implementation may change in the future. Check the examples and the documentation for up-to-date information.


cnnclustering

The cnnclustering Python package provides a flexible interface to use the common-nearest-neighbours cluster algorithm. While the method can be applied to abitrary data, this implementation was made before the background of processing trajectories from Molecular Dynamics simulations. In this context the cluster result can serve as a suitable basis for the construction of a core-set Markov-state (csMSM) model to capture the essential dynamics of the underlying molecular processes.

The package provides two modules:

  • cnn: (Hierarchical) CNN clustering and analysis
  • cmsm: csMSM estimation and analysis

Features:

  • Flexible: Clustering can be done for data sets in different input formats. Easy interfacing with external methods.
  • Convenient: Integration of functionality, handy in the context of Molecular Dynamics.
  • Fast: Core functionalities use Cython.

Please refer to the following papers for the scientific background (and consider citing if you find the method useful):

  • B. Keller, X. Daura, W. F. van Gunsteren J. Chem. Phys., 2010, 132, 074110.
  • O. Lemke, B.G. Keller J. Chem. Phys., 2016, 145, 164104.
  • O. Lemke, B.G. Keller Algorithms, 2018, 11, 19.

Documentation

The package documentation (under developement) is available here.

Quickstart

Alternative scikit-learn implementation

We provide an alternative approach to CNN clustering in the spirit of the scikit-learn project over this fork of scikit-learn-contrib/scikit-learn-extra.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cnnclustering-0.3.5.tar.gz (12.4 MB view details)

Uploaded Source

File details

Details for the file cnnclustering-0.3.5.tar.gz.

File metadata

  • Download URL: cnnclustering-0.3.5.tar.gz
  • Upload date:
  • Size: 12.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.3.1 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.8.0

File hashes

Hashes for cnnclustering-0.3.5.tar.gz
Algorithm Hash digest
SHA256 3f2d9950a610f6ece6454313609bfc3ac0d631674c129064761a4d8fc6c73280
MD5 f7d822ac12b66c56a5ace7911f3bacec
BLAKE2b-256 3951b7bc5e7062602f225eb2550aa0f0a24e84b0d47582bd515f53453e87f0f8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page