Tokenize SMILES with substructure units
SMILES Pair Encoding.
SMILES Pair Encoding (SmilesPE) trains a substructure tokenizer on a large set of SMILES strings (e.g., from ChEMBL) using byte-pair encoding (BPE).
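To make the idea concrete, here is a minimal sketch of byte-pair encoding applied to SMILES. It is not the SmilesPE implementation: the regular expression `ATOM_PATTERN` and the helper names `atom_tokenize`, `apply_merge`, `learn_merges`, and `tokenize` are assumptions made for illustration, and the atom-level pattern is deliberately simplified. The sketch splits each SMILES string into atom-level tokens, then repeatedly merges the most frequent adjacent pair into a new substructure token.

```python
import re
from collections import Counter

# Simplified atom-level pattern (an assumption for this sketch):
# bracket atoms, two-letter halogens, two-digit ring closures, then any single character.
ATOM_PATTERN = re.compile(r"\[[^\]]+\]|Br|Cl|%\d{2}|.")


def atom_tokenize(smiles):
    """Split a SMILES string into atom-level tokens."""
    return ATOM_PATTERN.findall(smiles)


def apply_merge(tokens, pair):
    """Replace every adjacent occurrence of `pair` with one merged token."""
    merged = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
            merged.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged


def learn_merges(corpus, num_merges):
    """Learn up to `num_merges` merge rules from a list of SMILES strings."""
    sequences = [atom_tokenize(s) for s in corpus]
    merges = []
    for _ in range(num_merges):
        # Count every adjacent token pair across the corpus.
        pair_counts = Counter()
        for seq in sequences:
            pair_counts.update(zip(seq, seq[1:]))
        if not pair_counts:
            break  # Every sequence is already a single token.
        best = pair_counts.most_common(1)[0][0]
        merges.append(best)
        sequences = [apply_merge(seq, best) for seq in sequences]
    return merges


def tokenize(smiles, merges):
    """Tokenize a SMILES string by replaying the learned merges in order."""
    tokens = atom_tokenize(smiles)
    for pair in merges:
        tokens = apply_merge(tokens, pair)
    return tokens


if __name__ == "__main__":
    # Toy corpus for illustration only; a real vocabulary would be trained on a large set such as ChEMBL.
    corpus = ["c1ccccc1", "CCO", "CC(=O)O", "c1ccccc1O"]
    merges = learn_merges(corpus, num_merges=10)
    print(tokenize("CC(=O)Oc1ccccc1", merges))
```

Because merges only ever concatenate adjacent tokens, joining the output tokens always reproduces the input string. A practical tokenizer would typically also stop merging once the best pair falls below a frequency threshold.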
Install
pip install SmilesPE
Use the Pre-trained