Multi-Language RoBERTa trained by RIKEN-AIP LIAT.

Project description

liat_ml_roberta

Multi-Language RoBERTa trained by RIKEN-AIP LIAT.
This repository is not yet complete.

How to install

Install from PyPI with pip:

pip install liat_ml_roberta

How to use

from liat_ml_roberta import RoBERTaTokenizer


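def main():
    # Load the tokenizer for one of the version names listed under "Versions".
    tokenizer = RoBERTaTokenizer.from_pretrained(version="en_20190121_m10000_v24000_base")
    # Split the sentence into tokens and print them.
    print(tokenizer.tokenize("This is a pen."))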


if __name__ == "__main__":
    main()

Versions

name                            lang  size  bpe merges  vocab size  wikipedia version
ja_20190121_m10000_v24000_base  ja    base  10000       24000       20190121
en_20190121_m10000_v24000_base  en    base  10000       24000       20190121
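
The version names in the table appear to encode the other columns (language, Wikipedia version, BPE merges, vocab size, model size). The sketch below splits a name back into those fields. It assumes the naming pattern inferred from the two rows above; `parse_version` and `VersionInfo` are hypothetical helpers for illustration and are not part of liat_ml_roberta.

```python
import re
from typing import NamedTuple


class VersionInfo(NamedTuple):
    """Fields recovered from a version name (hypothetical helper type)."""
    lang: str
    wikipedia_version: str
    bpe_merges: int
    vocab_size: int
    size: str


# Assumed pattern: <lang>_<wikipedia version>_m<bpe merges>_v<vocab size>_<size>
_VERSION_PATTERN = re.compile(
    r"^(?P<lang>[a-z]+)_(?P<wiki>\d{8})_m(?P<merges>\d+)_v(?P<vocab>\d+)_(?P<size>[a-z]+)$"
)


def parse_version(name: str) -> VersionInfo:
    """Split a version name into its fields, or raise ValueError if it does not match."""
    match = _VERSION_PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized version name: {name!r}")
    return VersionInfo(
        lang=match["lang"],
        wikipedia_version=match["wiki"],
        bpe_merges=int(match["merges"]),
        vocab_size=int(match["vocab"]),
        size=match["size"],
    )


if __name__ == "__main__":
    print(parse_version("ja_20190121_m10000_v24000_base"))
```

A name that does not follow the assumed pattern raises ValueError rather than returning partial fields.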

Download files

Source Distribution

liat_ml_roberta-0.0.4.tar.gz (368.8 kB)

Uploaded Source

Built Distribution

liat_ml_roberta-0.0.4-py3-none-any.whl (373.6 kB)

Uploaded Python 3

File details

Details for the file liat_ml_roberta-0.0.4.tar.gz.

File metadata

  • Download URL: liat_ml_roberta-0.0.4.tar.gz
  • Size: 368.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.11

File hashes

Hashes for liat_ml_roberta-0.0.4.tar.gz
Algorithm    Hash digest
SHA256       41b54a7bef859e1c2e38e0cf73470ac3e666c2299837f82626cbea20ac9f8567
MD5          cff59cd3a2fc235e0552f2e4201bda40
BLAKE2b-256  ff67cf947d17b895aa5f28150a0c77413e22b91918e24a191a551965ee9a6aa1

File details

Details for the file liat_ml_roberta-0.0.4-py3-none-any.whl.

File hashes

Hashes for liat_ml_roberta-0.0.4-py3-none-any.whl
Algorithm    Hash digest
SHA256       a8aef65246deb7b175895f523583d3da4a053108db0f397883a04b8642f80a8c
MD5          7cd34feb345201b5821b6d2657e23350
BLAKE2b-256  8319cb3359a46b150473284d746c71cf0c2ba15f762b1f08d0d49e9becfffccb
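
A downloaded file can be checked against the SHA256 digests listed above. The following is a minimal sketch using only the Python standard library. The digests are copied from this page; `sha256_of` and `verify` are hypothetical helper names, and the files are assumed to be saved under their original names.

```python
import hashlib
import sys
from pathlib import Path

# SHA256 digests copied from the hash tables on this page.
EXPECTED_SHA256 = {
    "liat_ml_roberta-0.0.4.tar.gz":
        "41b54a7bef859e1c2e38e0cf73470ac3e666c2299837f82626cbea20ac9f8567",
    "liat_ml_roberta-0.0.4-py3-none-any.whl":
        "a8aef65246deb7b175895f523583d3da4a053108db0f397883a04b8642f80a8c",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's digest matches the one listed for its name."""
    path = Path(path)
    expected = EXPECTED_SHA256[path.name]  # KeyError for file names not listed
    return sha256_of(path) == expected


if __name__ == "__main__":
    # Usage: python verify_hashes.py liat_ml_roberta-0.0.4.tar.gz ...
    for name in sys.argv[1:]:
        print(name, "OK" if verify(name) else "MISMATCH")
```

Alternatively, pip can enforce digests at install time when a requirements file pins them and is installed with `--require-hashes`.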
