Skip to main content

a flexible n-ary clustering package for all applications.

Project description

MDANCE (Molecular Dynamics Analysis with N-ary Clustering Ensembles) is a flexible n-ary clustering package that provides a set of tools for clustering Molecular Dynamics trajectories. The package is written in Python and an extension of the n-ary similarity framework. The package is designed to be modular and extensible, allowing for the addition of new clustering algorithms and similarity metrics.

MDANCE

Background

Molecular Dynamics (MD) simulations are a powerful tool for studying the dynamics of biomolecules. However, the analysis of MD trajectories is challenging due to the large amount of data generated. Clustering is an unsupervised machine learning approach to group similar frames into clusters. The clustering results can be used to reveal the structure of the data, identify the most representative structures, and to study the dynamics of the system.

Installation

$ pip install mdance

To check for proper installation, run the following command:

>>> import mdance
>>> mdance.__version__

Clustering Algorithms

NANI

k-Means N-Ary Natural Initiation (NANI) is an algorithm for selecting initial centroids for k-Means clustering. NANI is an extension of the k-Means++ algorithm. NANI stratifies the data to high density region and perform diversity selection on top of the it to select the initial centroids. This is a deterministic algorithm that will always select the same initial centroids for the same dataset and improve on k-means++ by reducing the number of iterations required to converge and improve the clustering quality.

A tutorial is available for NANI.

Please refer to the NANI paper.

Example usage:

from mdance.cluster.nani import KmeansNANI

data = np.load('data.npy')
n_clusters = 4
mod = KmeansNANI(data=data, n_clusters=n_clusters, metric='MSD', N_atoms=1,
                 init_type='comp_sim', percentage=10)
initiators = mod.initiate_kmeans()
initiators = initiators[:n_clusters]
kmeans = KMeans(n_clusters, init=initiators, n_init=1, random_state=None)
kmeans.fit(data)

Clustering Postprocessing

PRIME

Protein Retrieval via Integrative Molecular Ensembles (PRIME) is a novel algorithm that predicts the native structure of a protein from simulation or clustering data. These methods perfectly mapped all the structural motifs in the studied systems and required unprecedented linear scaling.

A tutorial is available for PRIME.

Please refer to the PRIME paper.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mdance-0.2.6.tar.gz (31.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

MDANCE-0.2.6-py3-none-any.whl (31.0 MB view details)

Uploaded Python 3

File details

Details for the file mdance-0.2.6.tar.gz.

File metadata

  • Download URL: mdance-0.2.6.tar.gz
  • Upload date:
  • Size: 31.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for mdance-0.2.6.tar.gz
Algorithm Hash digest
SHA256 31d61e312bdb265aad5fb9db358b441019e0dd6defc95dede551674cf0823929
MD5 68005e5919eba085736b254543f10835
BLAKE2b-256 e00332c22e87d0bc8cc2738a0feccfe46196526ac10b18cca31dce2fc5fcaf77

See more details on using hashes here.

File details

Details for the file MDANCE-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: MDANCE-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 31.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for MDANCE-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 f084dd8da1af018456762050fa0090b33dad38f34430d913c5cbfa31535f1b7c
MD5 53d59e66ef57efc3e9ae3bbe7787a33e
BLAKE2b-256 87f298824fa9d9940e49d8e4964d5b80a1afb23ab8a5444d411da4405e3262e2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page