Skip to main content

A Python tool for identifying and mapping water-sites from MD-based HTP mutational series

Project description

WaSiMap

WaSiMap is a Python tool for identifying and mapping persistent water-sites from molecular dynamics (MD) simulations of biomolecular complexes.

The software detects regions where solvent molecules repeatedly occupy similar spatial positions across simulation frames and clusters them into water-sites, enabling comparative analysis across multiple simulations.


Overview

Water molecules frequently mediate hydrogen bonding and stabilize interaction patterns that extend beyond direct residue–residue contacts in biomolecular recognition. Identifying solvent-mediated interactions is particularly important in protein–protein and protein–DNA interfaces, where conserved water-sites may contribute to binding specificity and stability.

WaSiMap analyzes explicitly solvated MD simulations to:

  • Detect water molecules with increased interfacial persistence
  • Determine the heavy atoms contacted by these waters
  • Cluster water positions into spatial water-site centroids
  • Enable visual comparison of water-sites across many closely related MD simulations

The tool is designed to facilitate post-processing of MD-based HTP mutational series, allowing researchers to investigate how solvent-mediated interactions evolve across closely related molecular systems.


Features

  • Detection of persistent water molecules at biomolecular interfaces
  • Identification of contacted heavy atoms for each water molecule
  • Clustering of water coordinates into water-site centroids
  • Aggregation of results from multiple MD simulations
  • Generation of an interactive HTML visualization interface
  • Compatible with multiple MD trajectory formats (based on MDTraj)

Installation

The installation of WaSiMap is straightforward.

Directly from PIP (Make sure to use python >=3.11):

pip install wasimap

Or install from source:

git clone https://github.com/pisabarro-group/WaSiMap
cd wasimap
pip install .

Example usage (with test data)

The following commands download three (3) MD simulations into the current folder (~320 Mb) and perform a WaSiMap analysis on all test simulations.

wasimap --testdata
wasimap --gui

Upon completion, the WaSiMap viewer should open automatically in the default browser. Else, manually open http://localhost:8080/wasimap.html

To exit the program, use CTRL-C.

Basic Usage

WaSiMap searches for MD trajectory files and their corresponding topology files in the folder where it is executed. Each trajectory must share the same base filename as its topology file. For example, an AMBER simulation named SIM1 should exist as SIM1.mdcrd and SIM1.prmtop. In contrast, H5 trajectories contain topology information as embedded metadata and therefore exist as a single file. WaSiMap automatically attempts to identify and pair trajectory files with their matching topologies.

The program supports MD trajectories readable by MDTraj, including: .".xtc", ".trr", ".dcd", ".nc", ".h5", ".hdf5", ".lh5", ".binpos" - and also topology formats ".pdb", ".gro", ".prmtop", ".parm7", ".psf", and ".top".

Standard usage syntax:

wasimap --flag --flag ... [input]

For a typical automated execution, use:

wasimap --gui

To process a single trajectory, use:

wasimap --gui <<trajectory_filename>> (name only, no extension)

The command "wasimap" accepts several flags, described as follows:

 -c --cuttoff      H-bonding distance cutoff (default: 0.35 nm)
 -r --persistence  Minimum percentage of frames to consider water as resident (default: 5)
 --gui             Start the WaSiMap explorer after execution
 --onlygui         Do not process anything. Show GUI for existing results.
 --testdata        Download test MD simulations to the current folder
 -h --help         Displays help
 -v --version      Displays current version

Navigating Results

WaSiMap automatically creates a folder called ./wasimap_outputs and stores:

  • The 3D structure of the first frame of each trajectory, in PDB format
  • A collection of water-site data, centroids and water ids in file wasimap.json
  • A copy of the HTML viewer wasimap.html

A small web server binds to all interfaces (port 8080) upon completion, and the viewer is launched on the default browser. This web application embeds a 3D viewer (based on NGL Viewer) that superposes all structures. Water-sites are shown as spheres. The size of each sphere is proportional to the residence time of all waters in the water-site.

  • Each structure is assigned a color for ease of identification
  • Interfacial residues involved in water-sites are displayed as "ball+sticks"
  • Molecular backbones are displayed as ribbons
  • An intuitive menu (left panel) allows toggling structures and water-sites on/off
  • The stage can be conveniently navigated with the mouse (see NGL Viewer documentation)
  • Clicking on a water-site sphere shows the water-site table for the given structure.
  • The table displays one water-site per row, and also its resident waters.
  • Resident waters at each water-site are shown as ovals, each containing its AtomID.
  • For convenience, hovering the mouse over waters highlights its occurence nearby other residues.

Citation

If you use this software in academic work, please cite our work. (DOI PENDING)

Contributing

Contributions, bug reports, and feature requests are welcome. Please open an issue or submit a pull request.

Dependencies

The tool should run on windows and linux (x86 arch).

WaSiMap makes use of these great open-source projects:

  • MDTraj – Trajectory management library
  • NGL Viewer – Molecular visualization

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wasimap-0.1.5.tar.gz (38.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wasimap-0.1.5-py3-none-any.whl (39.3 kB view details)

Uploaded Python 3

File details

Details for the file wasimap-0.1.5.tar.gz.

File metadata

  • Download URL: wasimap-0.1.5.tar.gz
  • Upload date:
  • Size: 38.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for wasimap-0.1.5.tar.gz
Algorithm Hash digest
SHA256 571636740467125c0a7bbcf35d133a7258c99b5a133b05169c195b8c5698fdbb
MD5 3fcb9bf5e5ec6504e44a8cfd9473dd87
BLAKE2b-256 db47ba27b51cda24cc9ebed03ee38a947b12c954ce2c21db83a9a09d4eeb76a2

See more details on using hashes here.

File details

Details for the file wasimap-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: wasimap-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 39.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for wasimap-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 a32b9ecd8687fa779c673df43a1ea3065d916684aa26d6fa8f7e54e04ecb5c40
MD5 e26f06435a33ffab43fac011980b2390
BLAKE2b-256 37eff79ae496aec8b8ae09a859cb475427b2ef3272ebdb294821b4552c6f4d41

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page