Skip to main content

Multi-Launguage RoBERTa trained by RIKEN-AIP LIAT.

Project description

liat_ml_roberta

Multi-Language RoBERTa trained by RIKEN-AIP LIAT.

How to install

Can use pip to install.

pip install liat_ml_roberta

How to use

The loaded models and configurations can be used in the same way as transformers.roberta.

from liat_ml_roberta import RoBERTaTokenizer


def main():
    tokenizer = RoBERTaTokenizer.from_pretrained(version="en_20190121_m10000_v24000_base")
    print(tokenizer.tokenize("This is a pen."))

    config = RoBERTaConfig.from_pretrained("roberta_base_en_20190121_m10000_v24000_u125000")
    model = RoBERTaModel.from_pretrained("roberta_base_en_20190121_m10000_v24000_u125000", config=config)


if __name__ == "__main__":
    main()

Models

name lang size bpe merges vocab size updates wikipedia version
roberta_base_ja_20190121_m10000_v24000_u125000 ja roberta-base 10000 24000 125000 20190121
roberta_base_en_20190121_m10000_v24000_u125000 en roberta-base 10000 24000 125000 20190121

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

liat_ml_roberta-1.0.1.tar.gz (369.2 kB view details)

Uploaded Source

Built Distribution

liat_ml_roberta-1.0.1-py3-none-any.whl (744.3 kB view details)

Uploaded Python 3

File details

Details for the file liat_ml_roberta-1.0.1.tar.gz.

File metadata

  • Download URL: liat_ml_roberta-1.0.1.tar.gz
  • Upload date:
  • Size: 369.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.11

File hashes

Hashes for liat_ml_roberta-1.0.1.tar.gz
Algorithm Hash digest
SHA256 357f22b43ca2d0bee6febe7187884d9ae59c99e2b16968c2eb3a5dbef581f702
MD5 0e04b712af4ab065143d5d49835f7951
BLAKE2b-256 448249fadf41bd9d0309dc2888c558628edf2eeb4e3f4c5d5f1dc8ccd10f2925

See more details on using hashes here.

File details

Details for the file liat_ml_roberta-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for liat_ml_roberta-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 da4aaf77f04490e90cd975c6a89d507492ba4c166fcdb66f0952d07f72caa33a
MD5 125e31dc1f0005711fc929667271efca
BLAKE2b-256 96d3063cd766d076c40bbe9ac42abce157fdeac94d09690df82586c9d93ba7b8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page