Bengali Transformer for natural language processing using a state-of-the-art transformer language model
Project description
Bengali Transformer
Thanks to Hugging Face Transformers.
Installation
pip install bntransformer
Tokenizer
BERT Multilingual Tokenizer
from bntransformer.bnbert import Tokenizer
tokenizer = Tokenizer()
tokens = tokenizer.tokenize('আমি ভাত খাই।')
print(tokens)
# output: ['আ', '##মি', 'ভ', '##াত', 'খা', '##ই', '।']
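The `##` prefixes in the output above mark sub-word pieces that continue a word, as produced by WordPiece tokenization in BERT models. The following is a minimal sketch of greedy longest-match-first WordPiece splitting, not the package's actual implementation. The names `TOY_VOCAB`, `wordpiece`, and `toy_tokenize` are hypothetical, and the vocabulary is a hand-picked toy set chosen only to reproduce the example sentence; the real multilingual BERT vocabulary is far larger and its tokenizer also applies text normalization.

```python
# Toy vocabulary (assumption): just enough entries to split the example sentence.
TOY_VOCAB = {"আ", "##মি", "ভ", "##াত", "খা", "##ই", "।"}


def wordpiece(word, vocab, unk="[UNK]"):
    """Split one word into pieces using greedy longest-match-first lookup."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Try the longest remaining substring first, then shrink from the right.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                # Pieces that do not begin the word carry the "##" prefix.
                candidate = "##" + candidate
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece matches at this position: the whole word is unknown.
            return [unk]
        pieces.append(match)
        start = end
    return pieces


def toy_tokenize(text, vocab=TOY_VOCAB):
    """Separate the danda from words, split on whitespace, then apply WordPiece."""
    words = text.replace("।", " । ").split()
    tokens = []
    for word in words:
        tokens.extend(wordpiece(word, vocab))
    return tokens


if __name__ == "__main__":
    print(toy_tokenize("আমি ভাত খাই।"))
```

A word with no matching piece at some position collapses to a single `[UNK]` token in this sketch, which is why coverage of the vocabulary matters for Bengali text.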