Dimensionality reduction method for MD trajectories
Project description
Unrolr
Conformational analysis of MD trajectories based on (pivot-based) Stochastic Proximity Embedding using dihedral distance as a metric (https://github.com/jeeberhardt/unrorl).
Prerequisites
You need, at a minimum (requirements.txt):
- Python 2.7 or python 3
- NumPy
- H5py
- Pandas
- Matplotlib
- PyOpenCL
- MDAnalysis (>=0.17)
Installation on UNIX (Debian/Ubuntu)
I highly recommand you to install the Anaconda distribution (https://www.continuum.io/downloads) if you want a clean python environnment with nearly all the prerequisites already installed (NumPy, H5py, Pandas, Matplotlib).
1 . First, you have to install OpenCL:
- MacOS: Good news, you don't have to install OpenCL, it works out-of-the-box. (Update: bad news, OpenCL is now depreciated in macOS 10.14. Thanks Apple.)
- AMD: You have to install the AMD OpenCL™ 2.0 Driver.
- Nvidia: You have to install the CUDA toolkit.
- Intel: And of course it's working also on CPU just by installing this runtime software package. Alternatively, the CPU-based OpenCL driver can be also installed through the package
pocl
(http://portablecl.org/) with the conda package manager.
For any other informations, the official installation guide of PyOpenCL is available here.
2 . As a final step,
# Get the package
wget https://github.com/jeeberhardt/unrolr/archive/master.zip
unzip unrolr-master.zip
rm unrolr-master.zip
cd unrolr-master
# Install the package
python setup.py install
If somehow pip is having problem to install all the dependencies,
conda config --append channels conda-forge
conda install pyopencl mdanalysis
# Try again
python setup.py install
OpenCL context
Before running Unrolr, you need to define the OpenCL context. And it is a good way to see if everything is working correctly.
python -c 'import pyopencl as cl; cl.create_some_context()'
Here in my example, I have the choice between 3 differents computing device (2 graphic cards and one CPU).
Choose platform:
[0] <pyopencl.Platform 'AMD Accelerated Parallel Processing' at 0x7f97e96a8430>
Choice [0]:0
Choose device(s):
[0] <pyopencl.Device 'Tahiti' on 'AMD Accelerated Parallel Processing' at 0x1e18a30>
[1] <pyopencl.Device 'Tahiti' on 'AMD Accelerated Parallel Processing' at 0x254a110>
[2] <pyopencl.Device 'Intel(R) Core(TM) i7-3820 CPU @ 3.60GHz' on 'AMD Accelerated Parallel Processing' at 0x21d0300>
Choice, comma-separated [0]:1
Set the environment variable PYOPENCL_CTX='0:1' to avoid being asked again.
Now you can set the environment variable.
export PYOPENCL_CTX='0:1'
Example
from __future__ import print_function
from unrolr import Unrolr
from unrolr.feature_extraction import Dihedral
from unrolr.utils import save_dataset
top_file = 'examples/inputs/villin.psf'
trj_file = 'examples/inputs/villin.dcd'
# Extract all calpha dihedral angles from trajectory and store them into a HDF5 file (start/stop/step are optionals)
d = Dihedral(top_file, trj_file, selection='all', dihedral_type='calpha', start=0, stop=None, step=1).run()
X = d.result
save_dataset('dihedral_angles.h5', "dihedral_angles", X)
# Fit X using Unrolr (pSPE + dihedral distance) and save the embedding into a csv file
U = Unrolr(r_neighbor=0.27, n_iter=50000, verbose=1)
U.fit(X)
U.save(fname='embedding.csv')
print('%4.2f %4.2f' % (U.stress, U.correlation))
Todo list
- Compatibility with python 3
- Compatibility with the latest version of MDAnalysis (==0.17)
- Unit tests
- Accessible directly from pip
- Improve OpenCL performance (global/local memory)
Citation
Eberhardt, J., Stote, R. H., & Dejaegere, A. (2018). Unrolr: Structural analysis of protein conformations using stochastic proximity embedding. Journal of Computational Chemistry, 39(30), 2551-2557. https://doi.org/10.1002/jcc.25599
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.