Skip to main content

Tokeniser toolkit: a collection of Pythonic subword tokenisers and supporting tools.

Project description

TkTkT

A collection of Pythonic subword tokenisers.

Pronunciation

The acronym stands for ToKeniser ToolKiT and is supposed to be pronounced fast and with beatbox hi-hats (kind of like "tuh-kuh-tuh-kuh-ts" but as fast as you can). It is mandatory that you do this, because I said so.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tktkt-2024.2.1.tar.gz (43.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tktkt-2024.2.1-py3-none-any.whl (52.9 kB view details)

Uploaded Python 3

File details

Details for the file tktkt-2024.2.1.tar.gz.

File metadata

  • Download URL: tktkt-2024.2.1.tar.gz
  • Upload date:
  • Size: 43.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.13.12 HTTPX/0.28.1

File hashes

Hashes for tktkt-2024.2.1.tar.gz
Algorithm Hash digest
SHA256 031546621895caf5786e3adf79598e267a9f5b43e549a36ecf2f2be0d135db92
MD5 f0df7b5b3eba579fc3c7b62582b2b077
BLAKE2b-256 5555bc41124b6f185abd57fe95226f39dccf57916cda8a1ba39c355967cfcba4

See more details on using hashes here.

File details

Details for the file tktkt-2024.2.1-py3-none-any.whl.

File metadata

  • Download URL: tktkt-2024.2.1-py3-none-any.whl
  • Upload date:
  • Size: 52.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.13.12 HTTPX/0.28.1

File hashes

Hashes for tktkt-2024.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 94b225a90a2b0e1cc1552bb8de433a664b9992cac91a4d5da7ab0ed5beb58c69
MD5 18f5a98b204ddcc8de27f7ca0f46eee2
BLAKE2b-256 70e586f3740eb96f661ce8f534d2a444cf6ac46c078a421439758dc22d087751

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page