Multi-Launguage RoBERTa trained by RIKEN-AIP LIAT.
Project description
liat_ml_roberta
Multi-Language RoBERTa trained by RIKEN-AIP LIAT.
This repository is not yet complete.
How to install
Can use pypi to install.
pip install liat_ml_roberta
How to use
from liat_ml_roberta import RoBERTaTokenizer
def main():
tokenizer = RoBERTaTokenizer.from_pretrained(version="en_20190121_m10000_v24000_base")
print(tokenizer.tokenize("This is a pen."))
if __name__ == "__main__":
main()
Versions
name | lang | size | bpe merges | vocab size | wikipedia version |
---|---|---|---|---|---|
ja_20190121_m10000_v24000_base | ja | base | 10000 | 24000 | 20190121 |
en_20190121_m10000_v24000_base | en | base | 10000 | 24000 | 20190121 |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
liat_ml_roberta-0.0.4.tar.gz
(368.8 kB
view details)
Built Distribution
File details
Details for the file liat_ml_roberta-0.0.4.tar.gz
.
File metadata
- Download URL: liat_ml_roberta-0.0.4.tar.gz
- Upload date:
- Size: 368.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 41b54a7bef859e1c2e38e0cf73470ac3e666c2299837f82626cbea20ac9f8567 |
|
MD5 | cff59cd3a2fc235e0552f2e4201bda40 |
|
BLAKE2b-256 | ff67cf947d17b895aa5f28150a0c77413e22b91918e24a191a551965ee9a6aa1 |
File details
Details for the file liat_ml_roberta-0.0.4-py3-none-any.whl
.
File metadata
- Download URL: liat_ml_roberta-0.0.4-py3-none-any.whl
- Upload date:
- Size: 373.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a8aef65246deb7b175895f523583d3da4a053108db0f397883a04b8642f80a8c |
|
MD5 | 7cd34feb345201b5821b6d2657e23350 |
|
BLAKE2b-256 | 8319cb3359a46b150473284d746c71cf0c2ba15f762b1f08d0d49e9becfffccb |