Skip to main content

A Weighted Ensemble simulation framework in pure Python with a focus on molecular dynamics.

Project description

* Weighted Ensemble Python (wepy)


Modular implementation and framework for running weighted ensemble
simulations in pure python, where the aim is to have simple things
simple and complicated things possible. The latter being the priority.

The goal of the architecture is that it should be highly modular to
allow extension, but provide a "killer app" for most uses that just
works, no questions asked.

Comes equipped with support for [[https://github.com/pandegroup/openmm][OpenMM]] molecular dynamics,
parallelization using multiprocessing, the [[http://pubs.acs.org/doi/abs/10.1021/jp411479c][WExplore]]
and REVO (Resampling Ensembles by Variance Optimization) resampling
algorithms, and an HDF5 file format and library for storing and
querying your WE datasets that can be used from the command line.

The deeper architecture of ~wepy~ is intended to be loosely coupled,
so that unforeseen use cases can be accomodated, but tightly
integrated for the most common of use cases, i.e. molecular dynamics.

This allows freedom for fast development of new methods.

** Community

Discussion takes place on riot.im (#wepy:matrix.org) which is a slack-like app that works
on the Matrix protocol:
[[https://riot.im/app/#/room/#wepy:matrix.org]]

You can also contact me directly:

samuel.lotz@salotz.info

** Installation

Wepy is still in beta but you can install by cloning this repository,
switching to the last release and installing with pip:

#+BEGIN_SRC bash
git clone https://github.com/ADicksonLab/wepy
cd wepy
pip install -e .
#+END_SRC

PyPI and Anaconda repos are planned.

The only absolutely necessary dependencies are ~numpy~ and ~h5py~
which are used in the core classes.

Outside of the core classes there are a couple of dependencies which
are needed for this distribution but not in general if you use ~wepy~ as
a library:
- OpenMM (http://openmm.org/) (7.2 suggested)
- pandas (https://pandas.pydata.org/)
- mdtraj (http://mdtraj.org)
- networkx >=2 (https://networkx.github.io/)
- geomm (https://github.com/ADicksonLab/geomm)

To install these see their pages and the instructions below for geomm.

The default ~Runner~ is for ~OpenMM~ and should also be installed
automatically when following these instructions (defined in
~setup.py~) although it is not necessary to use openmm when using ~wepy~
as a library.

Currently, some things are still coupled to ~mdtraj~ and thus this is
also a dependency for some functionality, although this will be
relaxed in the future and replaced with a dependency on our nascent
project ~geomm~.

To install geomm:
#+BEGIN_SRC bash
git clone https://github.com/ADicksonLab/geomm.git
cd geomm
# compile the cython modules
python setup.py build_ext --inplace
# install it
pip install -e .

#+END_SRC


There are other uses for ~mdtraj~ such as export of trajectories to
mdtraj trajectories, which will not be removed.

Pandas is used for outputting some data records, for which there is
always a non-pandas option.

NetworkX is used in the WExplore resampler for the region tree and
also for the tree module for manipulating the walker cloning/merging
trees.

** Roadmap

- [X] Weighted Ensemble Layer
- [X] simulation manager
- [X] Resampling sub-module
- [X] Random clone-merge resampler
- [X] WExplore
- [X] REVO
- [X] OpenMM support
- [X] HDF5 output and API
- [ ] Command Line Interface
- [ ] PyPI and Anaconda repositories


** Getting Started

There are a few examples here (https://github.com/ADicksonLab/wepy/tree/master/examples).

There is an example with a pair of Lennard-Jones particles that runs
on the reference implementation. This is the "Hello World" example and
should be your starting point.

A more advanced (and interesting) example is a non-equilibrium
unbinding WExplore simulation of the soluble epoxide hydrolase (sEH)
protein with the inhibitor TPPU, which was the subject of this paper:

Lotz and Dickson. 2018. JACS 140 (2) pp. 618-628 (DOI: 10.1021/jacs.7b08572)

Be sure to install the extra dependencies for the examples as above in
the installation instructions.

** Architecture

The overall architecture of the project is broken into separate modules:
- Simulation Management :: a framework for running simulations, needs:
- Runner :: module that implements whatever dynamics you want to run
- e.g.
- OpenMM
- Resampler :: the key functionality of the Weighted Ensemble
resampling procedure is implemented here
- e.g.
- WExplore
- WorkMapper :: a function that implements the map function that
allows for arbitrary methods of parallelization
- Reporter :: Responsible for the collection and saving of data from wepy runs
- e.g. HDF5 or plaintext
- BoundaryConditions :: describes and performs boundary condition
transformations as the simulation progresses
- simulation manager :: coordinates all of these components to run simulations

- helper sub-modules will make the construction of new simulation
management modules easier and standardized
- Application Layer :: This is a convenience layer for building the
CLI and perhaps high level functions for users to write their own
scripts

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wepy-0.9.3.tar.gz (112.1 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page