A python library for extracting molecular SMILES embeddings from language models pre-trained with various objectives and/or architectures.
Project description
SMILES Featurizers
Extract SMILES embeddings from language models pre-trained with various objectives architectures.
Getting Started
pip install smiles-featurizers
Model List
Our released models are listed as following. You can import these models by using the smiles-featurizers
package or using HuggingFace's Transformers.
Model | Type |
---|---|
shahrukhx01/smole-bert | Bert |
shahrukhx01/smole-bert-mtr | Bert |
shahrukhx01/smole-bart | Bart |
shahrukhx01/muv2x-simcse-smole-bart | Simcse |
shahrukhx01/siamese-smole-bert-muv-1x | SentenceTransformer |
Use SMILES Featurizers
Bert Featurizer
from smiles_featurizers import BertFeaturizer
featurizer = BertFeaturizer("shahrukhx01/smole-bert")
embeddings = featurizer.embed(["CCC(C)(C)Br"])
Bart (Encoder) Featurizer
from smiles_featurizers import BartFeaturizer
featurizer = BartFeaturizer("shahrukhx01/smole-bart")
embeddings = featurizer.embed(["CCC(C)(C)Br"], embedder="encoder")
Bart (Decoder) Featurizer
from smiles_featurizers import BartFeaturizer
featurizer = BartFeaturizer("shahrukhx01/smole-bart")
embeddings = featurizer.embed(["CCC(C)(C)Br"], embedder="decoder")
SimCSE Featurizer
from smiles_featurizers import SimcseFeaturizer
featurizer = SimcseFeaturizer("shahrukhx01/mv2x-simcse-smole-bart")
embeddings = featurizer.embed(["CCC(C)(C)Br"])
SentenceTransformer Featurizer
from smiles_featurizers import SentenceTransformersFeaturizer
featurizer = SentenceTransformersFeaturizer("shahrukhx01/siamese-smole-bert-muv-1x")
embeddings = featurizer.embed(["CCC(C)(C)Br"])
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for smiles_featurizers-1.0.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f5be85335f3159ed7e1a6c73464d9572cf921d38ca3c77e9899fb425981a07b |
|
MD5 | 612ab3a0705a22f4c65e6e8eeee99f53 |
|
BLAKE2b-256 | 06d137a02629fd5283ccce09505022d49755faaf86904796fd3423e486fb5893 |