RXN insight package
Project description
Rxn-INSIGHT: Fast Chemical Reaction Analysis Using Bond-Electron Matrices
Rxn-INSIGHT is an open-source algorithm, written in python, to classify and name chemical reactions, and suggest reaction conditions based on similarity and popularity.
- https://doi.org/10.1186/s13321-024-00834-z: Peer-reviewed publication on Rxn-INSIGHT
1. Installation
Rxn-INSIGHT relies on NumPy, Pandas, RDKit, RDChiral, and RXNMapper.
A virtual environment can be installed with Anaconda as follows:
conda create -n rxn-insight python=3.10
conda activate rxn-insight
Option 1: Installing via PyPI:
pip install rxn-insight
Option 2: Installing directly from source:
git clone https://github.com/mrodobbe/Rxn-INSIGHT.git
cd Rxn-INSIGHT
pip install .
Or, for developing with the optional dependencies, which are required to run the tests and build the docs:
pip install -e ".[test,doc]"
All of the test environments can be run using the command tox from the top directory.
Alternatively, individual test environments can be run using the -e flag as
in tox -e env-name. To run the tests, tests with coverage report, style checks, and
docs build, respectively:
tox -e py3
tox -e py3-coverage
tox -e style
tox -e docs
2. Usage
Basic Usage
from rxn_insight.reaction import Reaction
r = "c1ccccc1I.C=CC(=O)OC>>COC(=O)/C=C/c1ccccc1" # Define a Reaction SMILES identifier
rxn = Reaction(r)
ri = rxn.get_reaction_info()
The reaction info contains most of the information:
{'REACTION': 'C=CC(=O)OC.Ic1ccccc1>>COC(=O)/C=C/c1ccccc1',
'MAPPED_REACTION': '[CH3:1][O:2][C:3](=[O:4])[CH:5]=[CH2:6].I[c:7]1[cH:8][cH:9][cH:10][cH:11][cH:12]1>>[CH3:1][O:2][C:3](=[O:4])/[CH:5]=[CH:6]/[c:7]1[cH:8][cH:9][cH:10][cH:11][cH:12]1',
'N_REACTANTS': 2,
'N_PRODUCTS': 1,
'FG_REACTANTS': ('Aromatic halide', 'Vinyl'),
'FG_PRODUCTS': (),
'PARTICIPATING_RINGS_REACTANTS': ('c1ccccc1',),
'PARTICIPATING_RINGS_PRODUCTS': ('c1ccccc1',),
'ALL_RINGS_PRODUCTS': ('c1ccccc1',),
'BY-PRODUCTS': ('HI',),
'CLASS': 'C-C Coupling',
'TAG': '55becfded1a3842d5a03bbf3e1610411c659aff0806930400c4db2ef61f9c87f',
'SOLVENT': ('',),
'REAGENT': ('',),
'CATALYST': ('',),
'REF': '',
'NAME': 'Heck terminal vinyl',
'SCAFFOLD': 'c1ccccc1'}
Similarity Search
A similarity search can be performed when a database with similar reactions is provided as a pandas DataFrame (df in this case). Another Pandas DataFrame is returned.
df_nbs = rxn.find_neighbors(df, fp="MACCS", concatenate=True, threshold=0.5, broaden=True, full_search=False)
Condition Suggestion
Reaction conditions can be suggested when a Pandas DataFrame is provided.
rxn.suggest_conditions(df)
suggested_solvents = rxn.suggested_solvent
suggested_catalysts = rxn.suggested_catalyst
suggested_reagents = rxn.suggested_reagent
3. Datasets
The complete USPTO dataset that is analyzed by Rxn-INSIGHT,
as described in the manuscript, can be found on
Zenodo: https://doi.org/10.5281/zenodo.10171745.
The gzip file should be downloaded and placed in the folder data/.
4. Reference
When using Rxn-INSIGHT for your own work, please refer to the original publication:
M. R. Dobbelaere, I. Lengyel, C. V. Stevens, and K. M. Van Geem, ‘Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices’, J. Cheminform., vol. 16, no. 1, Mar. 2024.
@ARTICLE{Dobbelaere2024-es,
title = "{Rxn-INSIGHT}: fast chemical reaction analysis using
bond-electron matrices",
author = "Dobbelaere, Maarten R and Lengyel, Istv{\'a}n and Stevens,
Christian V and Van Geem, Kevin M",
journal = "J. Cheminform.",
publisher = "Springer Science and Business Media LLC",
volume = 16,
number = 1,
month = mar,
year = 2024,
copyright = "https://creativecommons.org/licenses/by/4.0",
language = "en"
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file rxn_insight-0.0.3.tar.gz.
File metadata
- Download URL: rxn_insight-0.0.3.tar.gz
- Upload date:
- Size: 961.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
875bb6f68f94fefdf139b2d8bee56c3fdd697cb60bba5b251e30574a96394745
|
|
| MD5 |
fdb9cec7a3f29489e8b3468454eded1a
|
|
| BLAKE2b-256 |
746129b378c49a8663ebe0458b0466e00e6ea535cf4df33dd17976f8ba0af477
|
File details
Details for the file rxn_insight-0.0.3-py3-none-any.whl.
File metadata
- Download URL: rxn_insight-0.0.3-py3-none-any.whl
- Upload date:
- Size: 47.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f476b870973315db8809dac46297da7a3dc55534920e82be3e51bd080c12749c
|
|
| MD5 |
573b5f42ad48c7d847ac8d60de71c202
|
|
| BLAKE2b-256 |
f6b19009599e0ca379f9dcf41d94c973bce772c7a8910238e17dfd3790cbd238
|