Skip to main content

Reaction and Resource-aware SAScore

Project description

BR-SAScore

Implementation of BR-SAScore developed by Prof. Yousung Jung group at Seoul National University (contact: yousung@gmail.com).

Contents

Developer

Shuan Chen (shuan.micc@gmail.com)

OS Requirements

This repository has been tested on both Linux and Windows operating systems.

Python Dependencies

  • Python (version >= 3.6)
  • Numpy (version >= 1.16.4)
  • rdkit (version >= 2019)

Installation Guide

From Github

git clone https://github.com/snu-micc/BR-SAScore.git
cd BR-SAScore
pip install -e .

Usage

from BRSAScore import SAScorer
scorer = SAScorer()
smi = 'CC(OC1=CC=CC=C1C(O)=O)=O' # Aspirin
score, contribution = scorer.calculateScore(smi)

The expected output of score should be

0.8789047205405656

See Demo.ipynb for the examples of estimating the synthetic accessibility of 18 structurally complex molecules shown in the paper.

Data

USPTO reaction dataset

The reactions of of the training reactions train_data.csv are downloaded from the Dropbox link available at the GitHub repo of GLN, originally from Figshare.

Building blocks

The zip file of eMolecules building blocks origin_dict.csv can be downloaded from the Dropbox link available the GitHub repo of Retro*.

Reproduce the results

[1] Download reaction and building-block data

Downlaod the data can put them in ./data/ and rename them to uspto.csv and emolecules.csv with column titled reactants>reagents>production and SMILES.

Feel free to change the (atom-mapped) reaction data and building blocks data as long as the data is correctly formatted. If the reaction data is not atom-mapped, we recommend to use LocalMapper to prepare high-quality the atom-mappings for your reactions.

[2] Prepare the BRScores from reaction and building-block data

Prepare the BRSAScores by

python scripts/prepare_Scores.py -r uspto -b emolecules

[3] Use BRSAScocre

from BRSAScore import SAScorer
reaction_from = 'uspto'
buildingblock_from = 'emolecules'
scorer = SAScorer(reaction_from=reaction_from, buildingblock_from=buildingblock_from)

To use the BRSAScore on you own data, replace the reaction_from and buildingblock_from

Publication

@article{chen2021deep, title={Estimating the synthetic accessibility of molecules with building block and reaction-aware SAScore}, author={Chen, Shuan and Jung, Yousung}, journal={Journal of Cheminformatics}, volume={}, number={}, pages={}, year={2024}, publisher={Springer} }

License

This project is covered under the MIT Liscence.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

BRSAScore-0.1.0.tar.gz (13.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

BRSAScore-0.1.0-py3-none-any.whl (13.0 MB view details)

Uploaded Python 3

File details

Details for the file BRSAScore-0.1.0.tar.gz.

File metadata

  • Download URL: BRSAScore-0.1.0.tar.gz
  • Upload date:
  • Size: 13.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.10.0 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/1.0.0 urllib3/1.26.18 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.13

File hashes

Hashes for BRSAScore-0.1.0.tar.gz
Algorithm Hash digest
SHA256 99858421427ee24c488fd6d511fb772c347286dc84008100cfece02d34400747
MD5 2c89cdf1cb5c7700fafa1aef4b5ce53f
BLAKE2b-256 b03de67f46dcbaf53a8c8dbcd656ce32fb81b53adc26dd4d1be698f1fc4d80fc

See more details on using hashes here.

File details

Details for the file BRSAScore-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: BRSAScore-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 13.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.10.0 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/1.0.0 urllib3/1.26.18 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.13

File hashes

Hashes for BRSAScore-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ad8c0a9bfbd499e377feff4dea962bec1e037f3c82f08c94ff91ca8c6a5de4ef
MD5 44f33149d4d2a28abc99c817924b1f2b
BLAKE2b-256 fe6b37577ac71a4bcb4d9f8d011d727a26d8d897c00d18be5a3e597d8bf7c376

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page