Skip to main content

Multi-Launguage RoBERTa trained by RIKEN-AIP LIAT.

Project description

liat_ml_roberta

Multi-Language RoBERTa trained by RIKEN-AIP LIAT.

How to install

Can use pip to install.

pip install liat_ml_roberta

How to use

The loaded models and configurations can be used in the same way as transformers.roberta.

from liat_ml_roberta import RoBERTaTokenizer


def main():
    tokenizer = RoBERTaTokenizer.from_pretrained(version="en_20190121_m10000_v24000_base")
    print(tokenizer.tokenize("This is a pen."))

    config = RoBERTaConfig.from_pretrained("roberta_base_en_20190121_m10000_v24000_u125000")
    model = RoBERTaModel.from_pretrained("roberta_base_en_20190121_m10000_v24000_u125000", config=config)


if __name__ == "__main__":
    main()

Models

name lang size bpe merges vocab size updates wikipedia version
roberta_base_ja_20190121_m10000_v24000_u125000 ja roberta-base 10000 24000 125000 20190121
roberta_base_en_20190121_m10000_v24000_u125000 en roberta-base 10000 24000 125000 20190121

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

liat_ml_roberta-1.0.0.tar.gz (369.2 kB view details)

Uploaded Source

Built Distribution

liat_ml_roberta-1.0.0-py3-none-any.whl (744.3 kB view details)

Uploaded Python 3

File details

Details for the file liat_ml_roberta-1.0.0.tar.gz.

File metadata

  • Download URL: liat_ml_roberta-1.0.0.tar.gz
  • Upload date:
  • Size: 369.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.11

File hashes

Hashes for liat_ml_roberta-1.0.0.tar.gz
Algorithm Hash digest
SHA256 9229292a558a7489cd199c3c98b522c0ac70dd12503a6ffc79292b0bdf01ca5a
MD5 9177905b1c48830ba9040516b26ddd04
BLAKE2b-256 0e61e23cefb51d440448b1330d2de9fe901554d305e00299ddc669ff93ba4985

See more details on using hashes here.

File details

Details for the file liat_ml_roberta-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for liat_ml_roberta-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e806011c6080352d2f9a9488ef669f44ea9b40cbf412906c50a2f72247e2a053
MD5 8c83c177d2f6889ad570ba0ef31ae353
BLAKE2b-256 6220c73a8b3ad2a9685086eaba55898ebcad88fc73d3c34089a4d8f3ed03f8f6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page