Skip to main content

Fast and memory-efficient clustering of long Molecular Dynamics

Project description

BitClust: Fast and memory-efficient clustering of long Molecular Dynamics

Home Page

BitClust´s latest documentation is available here

Description

BitClust is a Python command-line interface (CLI) conceived for fast clustering of relatively long Molecular Dynamics trajectories following Daura's algorithm [1]. Retrieved clusters are roughly equivalent to those reported by VMD's internal command measure cluster but they are computed in a much faster way (see benchmark section for more details).

Motivation

Nowadays very long simulations are carried on routinely. Enhanced sampling methods like metadynamics, REMD, and accelerated dynamics allow escaping from potential energy minima, returning trajectories that are conformationally sparsed and where every cluster can be potentially important to detect and analyze. Improvements on software designed to address this task is an important field of research.

BitClust offer is a classical tradeoff; RAM for speed. It can calculate all pairwise distances between frames to run a clustering job and then store them in memory instead of recalculating them whenever a cluster is found. It is worth noting that used memory has been deeply optimized by encoding similarity distances as bits (0 if the distance is less equal than a specified threshold, 1 otherwise). This encoding result in a storage reduction as high as 32X/64X compared to similar algorithms that saves the same information as single-precision/double-precision float values.

Main Dependencies

BitClust is built on the shoulders of two giants:

  • MDTraj software that allows a very fast calculation of RMSD pairwise distances between all frames of trajectories in a parallelized fashion and

  • bitarray third-party python library which offers a memory-efficient data structure of bit-vectors (bit arrays) and a set of bitwise operations that are the very heart of our clustering implementation.

Citation

If you make use of BitClust in your scientific work, BitCool and cite it ;)

Licence

BitClust is licensed under GNU General Public License v3.0.

Reference

[1] Daura, X.; van Gunsteren, W. F.; Jaun, B.; Mark, A. E.; Gademann, K.; Seebach, D. Peptide Folding: When Simulation Meets Experiment. Angew. Chemie Int. Ed. 1999, 38 (1/2), 236–240.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bitclust-0.0.13.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

bitclust-0.0.13-py3-none-any.whl (7.8 kB view details)

Uploaded Python 3

File details

Details for the file bitclust-0.0.13.tar.gz.

File metadata

  • Download URL: bitclust-0.0.13.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.6.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for bitclust-0.0.13.tar.gz
Algorithm Hash digest
SHA256 79ee3fe0a9c2c1c24dfeb0d5cb348c65eb742f7c2bd1ef41f60d473c3ee9fb45
MD5 bbd6ec5d87c024d8f4d5a11e4ac62f83
BLAKE2b-256 39170e1da2990de5fb96a1a936c20a5508c6fccc27cf87dd163c64e3ad99070c

See more details on using hashes here.

File details

Details for the file bitclust-0.0.13-py3-none-any.whl.

File metadata

  • Download URL: bitclust-0.0.13-py3-none-any.whl
  • Upload date:
  • Size: 7.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.6.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for bitclust-0.0.13-py3-none-any.whl
Algorithm Hash digest
SHA256 bf327f22128e25e57fbee9c967641c1541fa5874475304ffc5f1589b488ead68
MD5 cbaef9d34adf2c43fabfb3f65aa61cae
BLAKE2b-256 7447ccdb1f16194cee0282f5af0016137c8c787e740a97effb88a56815e33b05

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page