Count the number of tokens in a text string or file, similar to the Unix 'wc' utility.
Token Count
Token Count is a command-line utility that counts the number of tokens in a text string, file, or directory, similar to the Unix wc utility. It uses the OpenAI tiktoken library for tokenization, so its counts are compatible with those of GPT-3.5-turbo or any other OpenAI model.
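As a rough illustration of what counting tokens with tiktoken involves, here is a minimal sketch. It is not Token Count's own source: the function name count_tokens, the encode parameter, and the fallback to the cl100k_base encoding for unrecognised model names are all assumptions made for this example.

```python
from typing import Callable, Optional, Sequence


def count_tokens(
    text: str,
    model_name: str = "gpt-3.5-turbo",
    encode: Optional[Callable[[str], Sequence]] = None,
) -> int:
    """Return the number of tokens in `text`.

    `encode` is a hypothetical hook: any callable that turns a string into
    a sequence of tokens. When omitted, tiktoken is used.
    """
    if encode is None:
        # Imported lazily so the function can be tried without tiktoken installed.
        import tiktoken

        try:
            encoding = tiktoken.encoding_for_model(model_name)
        except KeyError:
            # Assumption: fall back to cl100k_base when the model name is unknown.
            encoding = tiktoken.get_encoding("cl100k_base")
        encode = encoding.encode
    return len(encode(text))


# Whitespace splitting stands in for a real tokenizer here, so this is a
# word count, not a model token count.
print(count_tokens("Your text here", encode=str.split))  # prints 3
```

Calling count_tokens("Your text here") without the encode argument uses tiktoken, which needs the package installed and may download the encoding data on first use.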
Installation
To install Token Count, run the following command in your terminal:
pip install token-count
Usage - Python Library
from token_count import TokenCount

# Create a counter for the chosen model
tc = TokenCount(model_name="gpt-3.5-turbo")

# Count tokens in a string
text = "Your text here"
tokens = tc.num_tokens_from_string(text)
print(f"Tokens in the string: {tokens}")

# Count tokens in a file
file_path = "path/to/your/file.txt"
tokens = tc.num_tokens_from_file(file_path)
print(f"Tokens in the file: {tokens}")

# Count tokens across the files in a directory
dir_path = "path/to/your/directory"
tokens = tc.num_tokens_from_directory(dir_path)
print(f"Tokens in the directory: {tokens}")
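To make the directory case more concrete, the sketch below shows one way a recursive count could be put together: walk the tree, read each file, and sum the per-file counts. This is an assumption about the general approach, not the library's actual implementation. The name count_tokens_in_directory, the count parameter, and the choice to ignore undecodable bytes are all hypothetical.

```python
from pathlib import Path
from typing import Callable


def count_tokens_in_directory(dir_path: str, count: Callable[[str], int]) -> int:
    """Sum `count(text)` over every regular file below `dir_path`.

    `count` is any function mapping a string to a token count, for example
    a bound `num_tokens_from_string` method or a tiktoken-based helper.
    """
    total = 0
    # rglob("*") yields entries at every depth; sorting keeps the order stable.
    for path in sorted(Path(dir_path).rglob("*")):
        if path.is_file():
            # Assumption: undecodable bytes are dropped instead of raising an error.
            text = path.read_text(encoding="utf-8", errors="ignore")
            total += count(text)
    return total
```

Passing the counter in as a function keeps the directory walk independent of the tokenizer, so the same walk works with any model's encoding.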
Usage - Command Line
Token Count has three main options:
Count tokens in a text string:
token-count --text "Your text here"
Count tokens in a file:
token-count --file path/to/your/file.txt
Count tokens in a directory (recursively):
token-count --directory path/to/your/directory
You can provide any combination of these options. Token Count will print the token count for each input type.
Additionally, you can pass any OpenAI model name (for example, gpt-4) to count tokens according to that model. The default is "gpt-3.5-turbo".
token-count --model_name "gpt-4"
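The sketch below illustrates how a command line with these four flags could be wired up with argparse so that any combination of inputs yields one count per input type. It is a hypothetical reconstruction, not the tool's source: only the flag names and the default model come from the options described above, while build_parser, run, the count callback, and the output labels are invented for illustration.

```python
import argparse
from pathlib import Path
from typing import Callable, List


def build_parser() -> argparse.ArgumentParser:
    """Declare the flags described above; all three inputs are optional."""
    parser = argparse.ArgumentParser(prog="token-count")
    parser.add_argument("--text", help="text string to count")
    parser.add_argument("--file", help="path to a file to count")
    parser.add_argument("--directory", help="directory to count recursively")
    parser.add_argument("--model_name", default="gpt-3.5-turbo")
    return parser


def run(argv: List[str], count: Callable[[str, str], int]) -> List[str]:
    """Return one output line per input type that was supplied.

    `count(text, model_name)` is a hypothetical callback that does the
    actual tokenization.
    """
    args = build_parser().parse_args(argv)
    lines = []
    if args.text is not None:
        lines.append(f"text: {count(args.text, args.model_name)}")
    if args.file is not None:
        content = Path(args.file).read_text(encoding="utf-8", errors="ignore")
        lines.append(f"file: {count(content, args.model_name)}")
    if args.directory is not None:
        total = sum(
            count(p.read_text(encoding="utf-8", errors="ignore"), args.model_name)
            for p in Path(args.directory).rglob("*")
            if p.is_file()
        )
        lines.append(f"directory: {total}")
    return lines
```

Each input is checked independently, so supplying --text and --file together produces two lines, and --model_name applies to every count in the same invocation.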
License
This project is licensed under the MIT License.
File details
Details for the file token-count-0.2.1.tar.gz.
File metadata
- Download URL: token-count-0.2.1.tar.gz
- Upload date:
- Size: 4.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.4
File hashes
Algorithm | Hash digest
---|---
SHA256 | f6462985a5ca2b925cc869932c3a2f64f57d91e78d9f5dc04dfdf717c206293a
MD5 | 7bcf87f236db069e2db86d5e5061093e
BLAKE2b-256 | 207ff779c0ef6b683c3b9c104b64e0b83f0435950a8d692eb7e0db1e443fbe98
File details
Details for the file token_count-0.2.1-py3-none-any.whl.
File metadata
- Download URL: token_count-0.2.1-py3-none-any.whl
- Upload date:
- Size: 4.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.4
File hashes
Algorithm | Hash digest
---|---
SHA256 | b71fd116e8e5f15002570e00c1aa1c8a5e11e3b76599f70d81bfb98b2daf6bb4
MD5 | e22fe3476a65c3358f482b77ab61ce6d
BLAKE2b-256 | 9553010fcaf4021799d74452eddeec9ad6bb244e41b98485d036cf6253172d2f