Log-likelihood estimation of molecules.

molll: Data Driven Estimation of Molecular Log-Likelihood using Fingerprint Key Counting

This software provides models for estimating the likelihood that a molecule belongs to a specific dataset, based on simple fingerprint key counting. The models, AtomLL and MolLL, are designed for outlier detection and class membership assignment, with potential applications in molecular generation and optimization. PropLL is also included; it uses scikit-learn kernel density estimates on user-selectable, RDKit-derived properties.
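
The idea of key counting can be illustrated without the library. The sketch below is not the MolLL implementation. It assumes each molecule has already been reduced to a set of integer fingerprint keys, estimates each key's frequency in a reference set with add-one smoothing, and scores a molecule as the sum of the log frequencies of its keys. The function names, the smoothing and the scoring formula are all assumptions made for illustration; the preprint describes the actual method.

```python
import math
from collections import Counter


def fit_key_counts(reference_keysets):
    """Count in how many reference molecules each fingerprint key occurs."""
    counts = Counter()
    for keys in reference_keysets:
        counts.update(set(keys))
    return counts, len(reference_keysets)


def log_likelihood(keys, counts, n_molecules):
    """Sum of log key frequencies, with add-one smoothing for unseen keys.

    Illustrative formula only; not necessarily what MolLL computes.
    """
    return sum(
        math.log((counts.get(key, 0) + 1) / (n_molecules + 2))
        for key in set(keys)
    )


# Toy data: each molecule is represented by a set of integer keys (hypothetical).
reference = [{1, 2, 3}, {1, 2}, {1, 3, 4}, {1, 2, 3}]
counts, n = fit_key_counts(reference)

typical = log_likelihood({1, 2}, counts, n)
unusual = log_likelihood({1, 99}, counts, n)  # key 99 never occurs in the reference set
```

In this toy setup a molecule containing a key that never occurs in the reference set receives a lower score than one built only from common keys, which is the behaviour an outlier detector needs.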

Installation

Clone and install directly from the main directory:

pip install .

or directly from the repository without cloning:

pip install git+https://github.com/EBjerrum/molll.git

(PyPI package is underway)

Usage

The code works on lists of RDKit Mol objects:

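from rdkit import Chem
from molll import MolLL

# Illustrative input: any list of RDKit Mol objects works
mols_list = [Chem.MolFromSmiles(smi) for smi in ("CCO", "c1ccccc1", "CC(=O)O")]
single_mol = mols_list[0]

molll = MolLL()
molll.analyze_dataset(mols_list)  # build the model from the reference dataset
molll.calculate_lls(mols_list)  # score a list: the same molecules or a different set
# Or a single Mol object
molll.calculate_ll(single_mol)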
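
Since the models are intended for outlier detection, one way to use the scores is to flag molecules whose log-likelihood falls below a low percentile of the reference set's own scores. The sketch below assumes the scores are available as plain lists of floats (the return type of calculate_lls is not stated here). The helper name flag_outliers, the 5% cutoff and the numbers are hypothetical choices.

```python
from statistics import quantiles


def flag_outliers(reference_lls, query_lls, percentile=5):
    """Return (threshold, flags); flags[i] is True if query_lls[i] is below threshold.

    The threshold is the given percentile of the reference log-likelihoods.
    """
    # quantiles(..., n=100) returns the 99 cut points for percentiles 1..99
    cut_points = quantiles(reference_lls, n=100, method="inclusive")
    threshold = cut_points[percentile - 1]
    return threshold, [ll < threshold for ll in query_lls]


# Made-up scores standing in for the output of molll.calculate_lls(...)
reference_lls = [-40.0 - 0.5 * i for i in range(100)]  # -40.0 down to -89.5
query_lls = [-45.0, -95.0, -60.0]

threshold, flags = flag_outliers(reference_lls, query_lls)
```

The cutoff is a trade-off: a higher percentile flags more molecules, including some that are merely uncommon rather than out of distribution.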

Saving and loading from a text-based format:

molll.save("MySaveFile.json")

molll_clone = MolLL()
molll_clone.load("MySaveFile.json")

For convenience, some classes with precomputed data are available, currently based on the LibInvent training data:

from molll import LibInventMolLLr1
molll = LibInventMolLLr1()
molll.calculate_lls(mols_list)
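
For class membership assignment, a natural approach is to score each molecule with one model per class and pick the class with the highest log-likelihood. The sketch below assumes the per-class scores have already been computed (for example by calling calculate_lls on one fitted model per dataset) and are given as equal-length lists of floats. The function name assign_classes, the class labels and the numbers are hypothetical.

```python
def assign_classes(lls_by_class):
    """Assign each molecule to the class whose model gives the highest log-likelihood.

    lls_by_class maps a class label to a list of per-molecule log-likelihoods;
    all lists must have the same length and the same ordering of molecules.
    """
    labels = list(lls_by_class)
    lengths = {len(lls_by_class[label]) for label in labels}
    if len(lengths) != 1:
        raise ValueError("all classes must score the same number of molecules")
    n_molecules = lengths.pop()
    return [
        max(labels, key=lambda label: lls_by_class[label][i])
        for i in range(n_molecules)
    ]


# Made-up scores for three molecules under two hypothetical models
scores = {
    "dataset_a": [-50.2, -71.9, -64.0],
    "dataset_b": [-58.7, -60.3, -64.5],
}
assigned = assign_classes(scores)
```

Raw log-likelihoods from models built on very different datasets may not be directly comparable, so treat this comparison as a starting point rather than a calibrated classifier.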

Additional Reading

There's a preprint on ChemRxiv with some example usages: https://doi.org/10.26434/chemrxiv-2024-hzddj
