A Python package that automatically generates derived variables from a column with SMILES (Simplified Molecular-Input Line-Entry System).
Project description
Development Status :: 3 - Alpha
SMILES featurizer
A Python package that automatically generates derived variables from a column with SMILES (Simplified Molecular-Input Line-Entry System)
Install
$ pip install smilesfeaturizer
$ pip install git+https://github.com/dsdanielpark/SMILES-featurizer.git
Usage
The dataset assumes the presence of SMILES strings in a column named SMILES
.
Feature generation
-
Create fingerprint columns for SMILES representations based on various packages RDKit, Mol2Vec, DataMol, MolFeat, Scikit-Learn.
from smilesfeaturizer import generate_smiles_feature df = generate_smiles_feature(df) # default method="simple" df = generate_smiles_feature(df, method="specific")
Create dashboard
-
Through the dashboard, you can determine which compounds exhibit what prediction performance.
from smilesfeaturizer import create_inline_dash_dashboard # Load your DataFrame and specify the columns true_col = 'pIC50' predicted_col = 'predicted_pIC50' # Create and run the Dash dashboard create_inline_dash_dashboard(df, true_col, predicted_col)
Save reporting images
- Molecular images, basic information, and the prediction versus actual values are visually represented in bar graphs for easy viewing.
from smilesfeaturizer import smiles_insight_plot selected_metric = 'RMSE' # Choose the error metric you want to display true_col = 'pIC50' # Replace with your true column name predicted_col = 'predicted_pIC50' # Replace with your predicted column name smiles_insight_plot(df[:1], true_col, predicted_col, selected_metric, 'output_folder', show=True)
License
Bugs and Issues
Sincerely grateful for any reports on new features or bugs. Your valuable feedback on the code is highly appreciated.
Contacts
- Core maintainer: Daniel Park, South Korea
- E-mail: parkminwoo1991@gmail.com
Copyright (c) 2023 MinWoo Park, South Korea
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file smilesfeaturizer-0.1.2.tar.gz
.
File metadata
- Download URL: smilesfeaturizer-0.1.2.tar.gz
- Upload date:
- Size: 19.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 372a0fa24e54cad48c8e531ca00795a2b1b571eebab4dde5c798cdb3671dcf44 |
|
MD5 | 45b64749355c47731515a2b4aa16d7d9 |
|
BLAKE2b-256 | 674a370b09cbc89d6ebcf4e6206d682ebf74fe46fb9d4b2381c6db343968819a |
File details
Details for the file smilesfeaturizer-0.1.2-py3-none-any.whl
.
File metadata
- Download URL: smilesfeaturizer-0.1.2-py3-none-any.whl
- Upload date:
- Size: 19.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7178d533fd7156294f6a3739c53d88449c7f05183a3da0a493901d0de3505dd8 |
|
MD5 | 578c29f7ad74a644d4d25e303660f5a2 |
|
BLAKE2b-256 | 4feefe55349b4ef1c018d282fc83b1541a3b592ff8b9c63e3742fb807bf6ba55 |