Skip to main content

Encoder/Decoder and tokens counter for GPT3

Project description

An OpenAI GPT3 helper library for encoding/decoding strings and counting tokens.
Counting tokens gives the same output as OpenAI’s tokenizer

Supported python versions: >=2.7 <3.0 OR >=3.3

Installing

pip install gpt3_tokenizer

Examples

Encoding/decoding a string

import gpt3_tokenizer

a_string = "That's my beautiful and sweet string"
encoded = gpt3_tokenizer.encode(a_string) # outputs [2504, 338, 616, 4950, 290, 6029, 4731]
decoded = gpt3_tokenizer.decode(encoded) # outputs "That's my beautiful and sweet string"

Counting tokens

import gpt3_tokenizer

a_string = "That's my beautiful and sweet string"
tokens_count = gpt3_tokenizer.count_tokens(a_string) # outputs 7

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpt3_tokenizer-0.1.5.tar.gz (560.7 kB view hashes)

Uploaded Source

Built Distribution

gpt3_tokenizer-0.1.5-py2.py3-none-any.whl (567.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page