Skip to main content

a python package to analyze short molecular dynamics simulations for the effect of point mutations on stability

Project description

BoostMut

Powered by MDAnalysis

BoostMut is a python package to analyze high-throughput molecular dynamics simulations to investigate the effect of point mutations on stability. Since molecular dynamics simulations are costly, the simulations are short (50ps-1ns) and their analysis by BoostMut are meant to serve as a secondary filter after an initial selection has been made using a primary predictor such a FoldX. As input, BoostMut requires a directory with subdirectories for the wildtype and each of the mutations, each subdirectory in turn containing a set of short trajectories and a topology:

input_directory
├── Subdir_D24K
├── ...
├── Subdir_T85V
└── Subdir_template
    ├── trajectory_1.xtc
    ├── ...
    ├── trajectory_5.xtc
    └── topology.tpr

BoostMut is pip installable. It can be installed using pip. To avoid conflicts with other packages, it is safest to do this within a virtual environment:

python -m venv boostmut-venv
source boostmut-venv/bin/activate
pip install BoostMut

After installing, BoostMut is run in the command line. To get an overview of the input flags, use:

boostmut_run -h

BoostMut can analyze hydrogen bonding, RMSF of backbone and sidechains, hydrophobic surface exposure, and other structural checks. This can be done on three selections: the whole protein, 8Å surrounding a given mutation, or just the residue of the mutation. The final output returns a .csv with for each analysis and mutation the difference between mutant and wildtype. The analyses and the selections for each analysis can be customized in the command line. For example, if you want the surrounding and residue selections for hydrogen bonding, but only the whole protein selection for the other analyses, this can be specified with the -s flag with the desired analysis and selection divided by a colon:

boostmut_run -i input_directory -s hbse:sr c:p 

where the analyses are specified using:

  • h : hydrogen bonding
  • b : RMSF of backbone
  • s : flexibility score of sidechains
  • e : hydrophobic surface exposure
  • c : other structural checks

and the selections are specified using:

  • p : whole protein selection
  • s : 8Å surrounding selection
  • r : residue selection

By default, BoostMut assumes each trajectory is 50ps long and simulated with an amber forcefield. The analyses for the sidechain score and hydrophobic exposure rely on benchmark data for specific simulation lengths. The other analyses do not use benchmarks and therefore work the same regardless of timestep or forcefield. Benchmarks for different forcefields (available options are amber99, yamber3, charm27, and opls) in 50ps intervals for simulations up to 1000ps long are provided. The appropriate benchmark files can be selected by providing the simulation length in the commandline interface:

boostmut_run -i input_directory_500ps -t 500 -f opls

For faster calculations (the main computational cost comes from re-aligning the structure for each side-chain when calculating the sidechain score) or simulations for which no forcefield benchmarks are available, the sidechain score and hydrophobic exposure can be left out of the analysis selection:

boostmut_run -i input_directory -s hb:sr c:p 

After the calculations have finished, the output can be processed with one of the tools in boostmut_process if needed. If the calculation of the mutations has been split up into separate parallel runs, the output has to be combined and rescaled. Combining can be done using boostmut_process combine. Rescaling the newly combined output file, or adding additional metrics can be done using boostmut_process scale. To obtain an easy human-readable excel version of the data, use boostmut_process excel.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

boostmut-1.0.4.tar.gz (2.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

boostmut-1.0.4-py3-none-any.whl (2.5 MB view details)

Uploaded Python 3

File details

Details for the file boostmut-1.0.4.tar.gz.

File metadata

  • Download URL: boostmut-1.0.4.tar.gz
  • Upload date:
  • Size: 2.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for boostmut-1.0.4.tar.gz
Algorithm Hash digest
SHA256 03e4c0f23b34249f9ccff72afd3db14cf8053a5f4dfa3feebdea2ff95a2c3bc7
MD5 a38fe6e4391769c8f3714ae62e4ae36a
BLAKE2b-256 1fff4895bcd223f2f12ba457569c6343b13bac54ee6dcd9540813a9d49aa2855

See more details on using hashes here.

File details

Details for the file boostmut-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: boostmut-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 2.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.7

File hashes

Hashes for boostmut-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 6acfdf3f340d9bec4377c8033cc5b2c585c563b62f01a08e7ad88926574a995c
MD5 805b244b3b5614e5c76091a89d4c4b01
BLAKE2b-256 b3b92cc8bc5f23ed2ce8c07e37d20a25973bce04a2daca3612a250efaad53fad

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page