ailia Tokenizer

ailia Tokenizer Python API

!! CAUTION !! "ailia" IS NOT OPEN SOURCE SOFTWARE (OSS). As long as the user complies with the conditions stated in the License Document, the user may use the Software free of charge, but the Software is basically paid software.

About ailia Tokenizer

The ailia Tokenizer is an NLP tokenizer that can be used from Unity or C++. The tokenizer is an API for converting text into tokens (sequences of symbols) that AI can handle, or for converting tokens back into text.
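To make the text-to-tokens and tokens-to-text round trip concrete, here is a minimal sketch in plain Python. It is a toy illustration only and does not use or represent the ailia Tokenizer API: the vocabulary and the `encode` and `decode` helpers are hypothetical, and real tokenizers split text into subwords rather than whole words.

```python
# Toy illustration only: this is NOT the ailia Tokenizer API.
# The vocabulary and the function names below are hypothetical.

VOCAB = ["[UNK]", "hello", "world", "tokenizers", "are", "useful"]
TOKEN_TO_ID = {token: idx for idx, token in enumerate(VOCAB)}


def encode(text: str) -> list[int]:
    """Map lowercased, whitespace-separated words to integer ids."""
    unk_id = TOKEN_TO_ID["[UNK]"]
    return [TOKEN_TO_ID.get(word, unk_id) for word in text.lower().split()]


def decode(ids: list[int]) -> str:
    """Map integer ids back to words joined by single spaces."""
    return " ".join(VOCAB[i] for i in ids)


ids = encode("Hello world")
print(ids)          # [1, 2]
print(decode(ids))  # hello world
```

Words missing from the vocabulary map to the `[UNK]` id, so decoding does not always restore the original text exactly.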

Traditionally, tokenization has been performed using the Hugging Face Transformers library, typically alongside PyTorch. However, since Transformers only works with Python, it has not been possible to tokenize from applications on Android or iOS.

ailia Tokenizer solves this problem by performing NLP tokenization directly, without using Transformers. This makes it possible to perform tokenization on Android and iOS as well.

Since ailia Tokenizer includes MeCab and SentencePiece, it can perform complex tokenization on the device, such as that required for BERT Japanese or Sentence Transformer.

Install from pip

You can install the ailia SDK free evaluation package with the following command.

pip3 install ailia_tokenizer
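After running the pip command above, one way to confirm that the package is visible to the current interpreter is to query its installed version with the standard library. This is a small sketch: `installed_version` is a hypothetical helper name, and the only assumption about the package is the distribution name `ailia_tokenizer` taken from the command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


# "ailia_tokenizer" is the distribution name used in the pip command above.
print(installed_version("ailia_tokenizer"))
```

If this prints `None`, the package was probably installed for a different Python interpreter than the one running the script.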

Install from package

You can install the ailia SDK from the package with the following commands.

python3 bootstrap.py
pip3 install .

API specification

https://github.com/ailia-ai/ailia-sdk
