Applicability domains for cheminformatics.

MLChemAD

Applicability domain definitions for cheminformatics modelling.

Getting Started

Install

pip install mlchemad

Example Usage

from mlchemad import TopKatApplicabilityDomain, data

# Create the applicability domain
app_domain = TopKatApplicabilityDomain()
# Fit it to the training set
app_domain.fit(data.training)

# Determine outliers from multiple samples (rows) ...
print(app_domain.contains(data.test))

# ... or a single sample
sample = data.test.iloc[5]  # Select the row at position 5 (zero-based) as a pandas.Series object
print(app_domain.contains(sample))

Depending on the definition of the applicability domain, some samples of the training set might be outliers themselves.
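
To illustrate how a training sample can fall outside its own domain, here is a minimal pure-Python sketch of a distance-to-centroid rule. The class name CentroidDistanceDomain, the n_std parameter and the "mean plus n_std standard deviations" threshold are assumptions made for this example; they are not MLChemAD's implementation, which may define its threshold differently.

```python
import math


class CentroidDistanceDomain:
    """Toy applicability domain (hypothetical, not part of MLChemAD).

    A sample is inside when its Euclidean distance to the training centroid
    is at most mean + n_std * std of the training distances.
    """

    def __init__(self, n_std=1.0):
        self.n_std = n_std

    def fit(self, rows):
        n_rows = len(rows)
        n_features = len(rows[0])
        # Centroid: per-feature mean of the training set
        self.centroid_ = tuple(
            sum(row[j] for row in rows) / n_rows for j in range(n_features)
        )
        # Distance of every training sample to the centroid
        dists = [math.dist(row, self.centroid_) for row in rows]
        mean = sum(dists) / n_rows
        std = math.sqrt(sum((d - mean) ** 2 for d in dists) / n_rows)
        self.threshold_ = mean + self.n_std * std
        return self

    def contains(self, rows):
        return [math.dist(row, self.centroid_) <= self.threshold_ for row in rows]


# Four clustered points and one extreme point, all part of the training set
training = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (10.0, 10.0)]
domain = CentroidDistanceDomain(n_std=1.0).fit(training)
inside = domain.contains(training)
print(inside)  # [True, True, True, True, False]
```

The extreme point shifts the centroid and the threshold, yet it still lies beyond the threshold it helped to set, so it is reported as an outlier of its own training set.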

Applicability domains

MLChemAD defines the following applicability domains:

  • Bounding Box
  • PCA Bounding Box
  • Convex Hull
    (does not scale well)
  • TOPKAT's Optimum Prediction Space
    (recommended with molecular descriptors)
  • Leverage
  • Hotelling T²
  • Distance to Centroids
  • k-Nearest Neighbors
    (recommended with molecular fingerprints; for ECFP fingerprints, use dist='rogerstanimoto', scaling=None and hard_threshold=0.75)
  • Isolation Forests
  • Non-parametric Kernel Densities
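
As a rough illustration of the k-Nearest Neighbors recommendation above, the following pure-Python sketch computes the Rogers-Tanimoto dissimilarity between binary fingerprints and applies a hard threshold of 0.75. The function names and the decision rule (mean distance to the k nearest training fingerprints must not exceed the threshold) are assumptions made for this example; how MLChemAD applies hard_threshold internally is not described here. The 8-bit vectors are made-up stand-ins for real ECFP fingerprints.

```python
def rogers_tanimoto(a, b):
    """Rogers-Tanimoto dissimilarity between two non-empty, equal-length binary vectors."""
    mismatches = sum(1 for x, y in zip(a, b) if bool(x) != bool(y))
    matches = len(a) - mismatches
    # Mismatching positions are weighted twice as much as matching ones
    return 2 * mismatches / (matches + 2 * mismatches)


def knn_contains(training, sample, k=1, hard_threshold=0.75):
    """Hypothetical rule: inside when the mean distance to the k nearest
    training fingerprints is at most hard_threshold."""
    nearest = sorted(rogers_tanimoto(sample, fp) for fp in training)[:k]
    return sum(nearest) / len(nearest) <= hard_threshold


# Made-up 8-bit fingerprints standing in for ECFP bit vectors
training = [
    (1, 1, 0, 0, 1, 0, 0, 0),
    (1, 1, 0, 0, 0, 0, 0, 0),
    (0, 1, 0, 0, 1, 0, 0, 1),
]
near = (1, 1, 0, 0, 1, 0, 0, 1)  # differs from the first fingerprint by one bit
far = (0, 0, 1, 1, 0, 1, 1, 1)   # shares few bits with any training fingerprint

print(knn_contains(training, near))  # True
print(knn_contains(training, far))   # False
```

A dissimilarity of 0 means identical fingerprints and 1 means no matching positions, so a threshold of 0.75 only rejects samples that are far from every training fingerprint.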
