Skip to main content

Python package that calculates Lipinski descriptors, predicts pIC50, and performs docking

Project description

MLDockKit

This is a simple platform for computing Lipinsky's Rule of five using the rdkit package, predicting pIC50 of canonical SMILES that are potential targets against Oestrogen receptor alpha protein as ant-prostate cancer agaents using a preformatted RandomForest model, and docking of the canonical SMILE with the Oestrogen receptor alpha protein using Audodock Vina package.

Purpose of the Package

The purpose of the package is to provide a unified platform for computing prostate cancer drug likeness indicess and performing docking on the same compounds.

Features

Important chemoinformatics features of Oestrogen receptor alpha antagonists such as: - Lipinsky descriptors - Prediction of pIC50 - Docking and visiualization

Getting Started

The package is found on pypi hence can be installed with pip

Pre-requisites

Installation of Vina requires boost and swig
Importantly: Install pymol as the first package in the new environment, this is due to environmental package conflict.

Installation

It is important to ensure that all the required dependencies are installed in your working environment. It would be much easier if you create a conda environment before installation of packages. The following packages are required, pymol, rdkit, pandas, gemmi, padelpy, joblib,openbabel, Autodock Vina, java, scipy, MDAnalysis and scikit-learn.

conda create -n MLDockKit
conda activate MLDockKit

Then, install pymol before installing other packages:

conda install -c conda-forge pymol-open-source vina mdanalysis 

conda install -c cyclus java-jre

pip install MLDockKit

MLDockKit requires the installation of Meeko for ligand preparation. Please note that the current release of Meeko is only compatible with Python 3.11 or lower. If you're using a Python version higher than 3.11, you must install the development version from the source for compatibility within the MLDockKit environment.

To install Meeko in the MLDockKit environment, follow these steps:

git clone https://github.com/forlilab/Meeko.git
cd Meeko
git checkout develop
pip install .

Run MLDockKit pipeline

>>>from MLDockKit import MLDockKit

>>>MLDockKit(smiles,output_file,presentation,show_iAA,color,num_poses,exhaustiveness)

Params:

1. smiles (str): SMILES string for ligand.
2. output_file (str): File path for saving output [default: "MLDockKit_results.txt"].
3. presentation (str): How to display the receptor [[e.g., 'surface', 'sticks', 'spheres', 'cartoon', etc.] default: 'cartoon')].
4. show_iAA (bool): Option to show interacting amino acid residues only (default: True).
5. color (str): Color for the receptor and interacting residues (default: 'green')
6. num_poses (int): Number of docking poses to generate (default: 10).
7. exhaustiveness (int): Controls search thoroughness (default: 10).

Example running with default settings

>>>from MLDockKit import MLDockKit

>>>MLDockKit(smiles="Oc1ccc2c(c1)S[C@H](c1ccco1)[C@H](c1ccc(OCCN3CCCCC3)cc1)O2")

Output

The pipeline's output is an MLDockKit_output.txt file which contains Lipinsky descriptos, predicted pIC50 value and the docking score. Docking image is rentered in pymol for further analysis by the user. Also, the ligand's and protein's .sfd and .pdpqt files are rentered in the user's working directory.

Acknowledgment

Autodock Vina and pymol were greatily used in writing the codes for molecular docking and visualization. If you use these functions in your work, please cite the original publications for vina and pymol

Contribution

We welcome any contributions. Should you notice a bug, please let us know through issues in the, GitHub Issue Tracker

Authors

Edwin mwakio, Clabe Wekesa and Patrick Okoth
Department of Biological Sciences, Masinde Muliro University of Science and Technology

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mldockkit-0.1.2.tar.gz (4.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mldockkit-0.1.2-py3-none-any.whl (4.8 MB view details)

Uploaded Python 3

File details

Details for the file mldockkit-0.1.2.tar.gz.

File metadata

  • Download URL: mldockkit-0.1.2.tar.gz
  • Upload date:
  • Size: 4.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for mldockkit-0.1.2.tar.gz
Algorithm Hash digest
SHA256 50ee84e34f9ef00bdf12f553edb3a3f9eea22664d14874a07a95e6732d26a4c7
MD5 326111500c68804f4e13408f1d1927c2
BLAKE2b-256 1fd2259c019ca9f97135961df0ab18619be4bcfe64c76c4d8a506b19456fd015

See more details on using hashes here.

File details

Details for the file mldockkit-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: mldockkit-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 4.8 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for mldockkit-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 9daa648b9d52aed8e718940591479d49776239bc9480d65aabfea7853d57144e
MD5 c7feca4e59ff09c87790a9232f6e9868
BLAKE2b-256 d2e507e2bd83df69e3faa20e28e84e34a481f63ff0a24181b9071888b3f62d42

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page