A python library for extracting molecular SMILES embeddings from language models pre-trained with various objectives and/or architectures.
Project description
SMILES Featurizers
Extract SMILES embeddings from language models pre-trained with various objectives architectures.
Getting Started
pip install smiles-featurizers
Model List
Our released models are listed as following. You can import these models by using the smiles-featurizers
package or using HuggingFace's Transformers.
Model | Type |
---|---|
shahrukhx01/smole-bert | Bert |
shahrukhx01/smole-bert-mtr | Bert |
shahrukhx01/smole-bart | Bart |
shahrukhx01/muv2x-simcse-smole-bart | Simcse |
shahrukhx01/siamese-smole-bert-muv-1x | SentenceTransformer |
Use SMILES Featurizers
Bert Featurizer
from smiles_featurizers import BertFeaturizer
import torch
## set device
use_gpu = True if torch.cuda.is_available() else False
featurizer = BertFeaturizer("shahrukhx01/smole-bert", use_gpu=use_gpu)
embeddings = featurizer.embed(["CCC(C)(C)Br"])
Bart (Encoder) Featurizer
from smiles_featurizers import BartFeaturizer
featurizer = BartFeaturizer("shahrukhx01/smole-bart")
embeddings = featurizer.embed(["CCC(C)(C)Br"], embedder="encoder")
Bart (Decoder) Featurizer
from smiles_featurizers import BartFeaturizer
featurizer = BartFeaturizer("shahrukhx01/smole-bart")
embeddings = featurizer.embed(["CCC(C)(C)Br"], embedder="decoder")
SimCSE Featurizer
from smiles_featurizers import SimcseFeaturizer
import torch
## set device
device = "cuda" if torch.cuda.is_available() else "cpu"
featurizer = SimcseFeaturizer("shahrukhx01/mv2x-simcse-smole-bert")
embeddings = featurizer.embed(["CCC(C)(C)Br"], device=device)
SentenceTransformer Featurizer
from smiles_featurizers import SentenceTransformersFeaturizer
import torch
## set device
device = "cuda" if torch.cuda.is_available() else "cpu"
featurizer = SentenceTransformersFeaturizer("shahrukhx01/siamese-smole-bert-muv-1x")
embeddings = featurizer.embed(["CCC(C)(C)Br"], device=device)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file smiles_featurizers-1.0.6-py2.py3-none-any.whl
.
File metadata
- Download URL: smiles_featurizers-1.0.6-py2.py3-none-any.whl
- Upload date:
- Size: 9.2 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9658c42d8b8ff9c34ce861c41e12f4687a3c1e2b25d3acd4f587416ea492815d |
|
MD5 | d9c72024aba6d834b0f9a46860811ed8 |
|
BLAKE2b-256 | e32f2796094f2dea8ccd6d4dcf48c4be19b81ab6ad70a8799163eba3fdb56f7b |