Python package that calculates Lipinsky descriptors, predicts pIC50 and performs docking
Project description
MLDockKit
This is a simple platform for computing Lipinsky's Rule of five using the rdkit package, predicting pIC50 of canonical SMILES that are potential targets against Oestrogen receptor alpha protein as ant-prostate cancer agaents using a preformatted RandomForest model, and docking of the canonical SMILE with the Oestrogen receptor alpha protein using Audodock Vina package.
Purpose of the Package
The purpose of the package is to provide a unified platform for computing prostate cancer drug likeness indicess and performing docking on the same compounds.
Features
Important chemoinformatics features of Oestrogen receptor alpha antagonists such as: - Lipinsky descriptors - Prediction of pIC50 - Docking and visiualization
Getting Started
The package is found on pypi hence can be installed with pip
Pre-requisites
Installation of Vina requires boost and swig
Importantly: Install pymol as the first package in the new environment, this is due to environmental package conflict.
Installation
It is important to ensure that all the required dependencies are installed in your working environment. It would be much easier if you create a conda environment before installation of packages. The following packages are required, pymol, rdkit, pandas, padelpy, joblib, meeko, Autodock Vina, java, scipy, and scikit-learn.
conda create -n MLDockKit
conda activate MLDockKit
Then, install pymol before installing other packages:
conda install -c conda-forge pymol-open-source
conda install -c conda-forge openbabel
conda install -c cyclus java-jre
pip install -U numpy vina
pip install MLDockKit
Run MLDockKit pipeline
>>>from MLDockKit import MLDockKit
>>>MLDockKit("Oc1ccc2c(c1)S[C@H](c1ccco1)[C@H](c1ccc(OCCN3CCCCC3)cc1)O2")
Output
The pipeline's output is an MLDockKit_output.txt file which contains Lipinsky descriptos, predicted pIC50 value and the docking score. Docking image is rentered in pymol for further analysis by the user. Also, the ligand's and protein's .sfd and .pdpqt files are rentered in the user's working directory.
Acknowledgment
Autodock Vina and pymol were greatily used in writing the codes for molecular docking and visualization. If you use these functions in your work, please cite the original publications for vina and pymol
We extracted part of Angel Ruiz Moreno's Jupyter_Dock Jupyter Dock to include it in our visualization function.
Contribution
We welcome any contributions. Should you notice a bug, please let us know through issues in the, GitHub Issue Tracker
Authors
Edwin mwakio, Dr. Clabe Wekesa and Dr. Patrick Okoth
Department of Biological Sciences, Masinde Muliro University of Science and Technology
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file MLDockKit-0.0.5.tar.gz
.
File metadata
- Download URL: MLDockKit-0.0.5.tar.gz
- Upload date:
- Size: 4.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d3d67bd69648485a591f731f2798aaa33e59486b755cabe5085bf8043f933260 |
|
MD5 | b9f79007dcd0cf43eaff8e51afe5d2a0 |
|
BLAKE2b-256 | e215e9e1173a17759ac20980d0dd55352b403f15219f0c4a7a587834365f3bf5 |
File details
Details for the file MLDockKit-0.0.5-py3-none-any.whl
.
File metadata
- Download URL: MLDockKit-0.0.5-py3-none-any.whl
- Upload date:
- Size: 4.8 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f8328bbf2a0330788e8bcb3b090686ad97298a7ea524d001db0692d8512be55 |
|
MD5 | f4122e6ebbf7e3b5c0e6ebd680dc2369 |
|
BLAKE2b-256 | ed78d7b48f6c689b97ac4a28ef03294c353afacaa160a4aedc2a4e184e2c14cb |