a *biosynformatic* fingerprint to explore natural product distance and diversity
Project description
🌿 a biosynformatic molecular fingerprint tailored to natural product chem- and bioinformatic research 🌿
________________________________________________________________________________________
bi·o·syn·for·ma·tic
/ˌbaɪ oʊ sɪn fərˈ mæt ɪk/
adjective Computers, Biochemistry
relating to biosynthetic information and biochemical logic.
as a concatenation of biosynthetic and bioinformatics, it was coined
during the creation of BioSynFoni.
_________________________________________________________________________________________
Getting started 🌿
Predict biosynthetic class
We have trained a biosynthetic class predictor on biosynfoni fingerprints.
You can try out the predictor on your own molecules here!
Installation
Biosynfoni requires Python 3.9 or later. RDKit is installed as a dependency when installing Biosynfoni.
To install the package, you can use pip:
pip install biosynfoni
Now you can import the biosynfoni package in your Python code or use the command line tool.
Usage in Python
Convert a SMILES string to a fingerprint:
from biosynfoni import Biosynfoni
from rdkit import Chem
smi = <SMILES>
mol = Chem.MolFromSmiles(smi)
fp = Biosynfoni(mol).fingerprint # returns biosynfoni's count fingerprint of the molecule
Usage in the command line
Create a fingerprint from a SMILES string:
biosynfoni <SMILES>
Create a fingerprint from an InChI string:
biosynfoni <InChI>
Write the fingerprints of all molecules in an SDF file to a CSV file:
biosynfoni <molecule_supplier.sdf>
Data availability
We created several biosynthetic class predictors for our manuscript, which can be downloaded from Zenodo here.
We have used data from the COCONUT natural product database (DOI) and ZINC compound database (DOI). The parsed data used for the analysis in our manuscript can be downloaded from Zenodo here.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file biosynfoni-1.0.0.tar.gz.
File metadata
- Download URL: biosynfoni-1.0.0.tar.gz
- Upload date:
- Size: 23.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.21
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d9996bf8dfbab7bec4319855cf217ef621c6393711a36b32c05adff99cef13ee
|
|
| MD5 |
aabe5993dfaf98ebb0d6b0e060c70bf9
|
|
| BLAKE2b-256 |
4141299a70e955e9bce176c7cbb6923ca778b25658b338b1d9dab0e9a8d1aa8d
|
File details
Details for the file biosynfoni-1.0.0-py3-none-any.whl.
File metadata
- Download URL: biosynfoni-1.0.0-py3-none-any.whl
- Upload date:
- Size: 26.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.21
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5075089807f9b02b161e2be3e73502577d88f3b7f3e874cd44b277bf327f1a36
|
|
| MD5 |
506cd89a41780e847f58522cb75a67bc
|
|
| BLAKE2b-256 |
fb92729e604ca890ef69c72cab714d50a781d125c021b18e9411ae59dd9a7da4
|