Trims the messages array for ChatGPT API
Project description
gpt-trim
This is a (slightly) faster version of KillianLucas/tokentrim for longer message arrays.
In average, gpt-trim is ~80% faster than tokentrim, and that tokentrim is around 5x~7x slower.
Although gpt-trim is fast, I still need to finish my LeetCode problems that I left years ago, just so that I can make it 20x faster than 95% of people.
Usage
The usage is quite similiar to tokentrim
.
import gpt_trim
trimmed = gpt_trim.trim(
messages,
model="gpt-3.5-turbo"
)
print(trimmed)
Alternatively, you can assign the token limit manually:
gpt_trim.trim(
messages,
max_tokens=100
)
You can also add system messages with ease:
import gpt_trim
messages = [
..., # long, long content
{
"role": "user",
"content": "It's about drive, it's about power"
}
]
trimmed = gpt_trim.advanced_trim(
messages,
system_messages=[
{
"role": "system",
"content": "You'll act like the celebrity: The Rock."
}
],
model="gpt-3.5-turbo",
)
print(trimmed)
The catch? It's slower. With great power comes great... patience.
Comparison
You can compare this project to KillianLucas/tokentrim like so:
import time
import gpt_trim
import tiktoken
import tokentrim
pattern = "d!3h.l7$fj" # 10 tokens
messages = [
{
"role": "user",
"content": pattern * 5000 # 50000 tokens
}
]
# cache first
enc = tiktoken.get_encoding("cl100k_base")
gpt_trim.num_tokens_from_messages(
messages,
enc
)
def test(provider):
print("Testing", provider.__name__)
s = time.time()
result = provider.trim(
messages,
model="gpt-3.5-turbo",
)
print(f"took {(time.time() - s):.4f}s\n")
# Swap the following for every test and see tokentrim
# struggles when dealing with longer context.
test(gpt_trim)
test(tokentrim)
Right. I was bored.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file gpt-trim-0.1.tar.gz
.
File metadata
- Download URL: gpt-trim-0.1.tar.gz
- Upload date:
- Size: 4.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | edda599ed622283d7b002ef992976a89f216c10655927f84fac68aae2e4a5526 |
|
MD5 | 52dd44b8f350a21987f200cf716aaeba |
|
BLAKE2b-256 | b5768689552465e3a4678982170f2a2509edccbc1dae4656518b97d0d52a4801 |