Skip to main content

RetroMol is retrosynthetic analysis tool for modular natural products

Project description

RetroMol logo

RetroMol

testing & quality PyPI PyPI - Python Version DOI

RetroMol is retrosynthetic parsing and fingerprinting tool for modular natural products.

RetroMol is designed to facilitate clustering modular natural products based on biosynthetic similarity, and to enable cross-modal retrieval between modular natural products and their coding biosynthetic gene clusters.

RetroMol powers the RetroMol webapp, available at here. The webapp can be used to query NPAtlas, MIBiG, and antiSMASH database entries for modular natural products based on biosynthetic similarity.

Installation

The most recent code and data can be installed directly from GitHub with:

pip install git+https://github.com/MolTools/RetroMol.git

The latest stable release can be installed from PyPI with:

pip install retromol

RetroMol has been developed for Linux and MacOS.

Getting started

The retromol command line tool is automatically installed alongside installation of the package. The command line tool can be used from the shell with the --help flag to show all subcommands:

python3 -m retromol --help

The RetroMol CLI has two modes, single and batch:

  • single: process a single compound at a time.
  • batch: process multiple compounds in a single command.

In either case a the output folder will contain a log file together with the results in either JSON or JSONL format. JSONL is standard output mode for batch mode to allow for easy parsing of large result sets. Batch mode also supports parallel processing.

Any column, field, or property in the input file, either CSV, TSV, SDF, or JSON, is preserved as props in the output JSON or JSONL.

Stereochemistry parsing is supported by supplying the --matchstereochem flag. This flag annotates the identifier of every identified monomer with R/S and E/Z annotation where applicable.

Result JSONs or lines from a JSONL file can be loaded into Python using RetroMol's Result class for further downstream analyses:

from retromol.io import Result

result = Result.from_serialized(<json_dict>)

# e.g., calculate coverage
coverage = result.best_total_coverage()

Check out the examples folder for example scripts demonstrating how to use RetroMol as a library:

Using custom rules

RetroMol comes with a default set of retrosynthetic rules for modular natural products.

You can also provide your own custom rules and configurations. See the custom rules documentation for details on the YAML formats supported.

Attribution

License

The code in this package is licensed under the MIT License.

For Developers

The final section of the README is for if you want to get involved by making a code contribution.

Development installation

First fork the repository on GitHub, then clone your fork locally and install the package in "editable" mode with the development dependencies:

git clone git+https://github.com/MolTools/RetroMol.git
cd RetroMol
pip install -e .[dev]
pip install hatch     # if you don't have hatch installed yet; needed for building the package
hatch env create dev  # create the development environment

You can now make code changes locally and have them immediately available for testing.

After testing your changes, you can commit and push them to your fork, and then open a pull request on the main repository explaining your changes.

Testing

After cloning the repository, the unit tests in the tests/ folder can be run reproducibly with:

pytest tests

Additionally, these tests are automatically re-run with each push and pull request on main in a GitHub Action.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

retromol-1.1.0.tar.gz (117.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

retromol-1.1.0-py3-none-any.whl (77.0 kB view details)

Uploaded Python 3

File details

Details for the file retromol-1.1.0.tar.gz.

File metadata

  • Download URL: retromol-1.1.0.tar.gz
  • Upload date:
  • Size: 117.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for retromol-1.1.0.tar.gz
Algorithm Hash digest
SHA256 d17d56da2bc71fffcb98736c9902a33bae9c84125f5ea3a6a95e31c7afc37c5d
MD5 acae234d1635d6e4a06c957a3deffb72
BLAKE2b-256 d22e3e45220d2375839888500015ce0cc27ef710b57bd734beb1430bc7c2ed2c

See more details on using hashes here.

Provenance

The following attestation bundles were made for retromol-1.1.0.tar.gz:

Publisher: release.yml on moltools/RetroMol

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file retromol-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: retromol-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 77.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for retromol-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 11bb13d3e6ef1b5369923d68599911392b02929d78be9b36b450ada0a7536beb
MD5 e2ba6c9d8d0a0ad68de7ed5284ded8a0
BLAKE2b-256 ce74e2eb4c1b72dc02c953244f1f31db1e2f640e1e609bc115fc8eedbb7394db

See more details on using hashes here.

Provenance

The following attestation bundles were made for retromol-1.1.0-py3-none-any.whl:

Publisher: release.yml on moltools/RetroMol

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page