Token Optimized Object Notation - Reduce LLM token usage by 40-60%

These details have not been verified by PyPI

Project links

Project description

TOON Converter

Token Optimized Object Notation — A Python library for reducing LLM token usage by 40-60% when sending structured data.

The Problem

When you send JSON arrays to LLMs, you repeat attribute names for every single record:

[
  {"customerId": "C12345", "firstName": "John", "status": "active"},
  {"customerId": "C12346", "firstName": "Jane", "status": "active"},
  {"customerId": "C12347", "firstName": "Bob", "status": "inactive"}
]

The strings "customerId", "firstName", and "status" appear three times each. Every occurrence costs tokens. At enterprise scale with thousands of records, this redundancy becomes expensive.

The Solution

TOON separates the schema from the data, declaring attribute names once:

@schema:customerId,firstName,status
C12345|John|active
C12346|Jane|active
C12347|Bob|inactive

Result: 40-60% fewer tokens for the same data.

Installation

# Clone the repository
git clone https://github.com/prashantdudami/toon-converter.git
cd toon-converter

# Install in development mode
pip install -e .

# Or install dependencies directly
pip install -r requirements.txt

Quick Start

Convert JSON to TOON

from toon_converter import json_to_toon

data = [
    {"name": "John", "age": 30, "city": "NYC"},
    {"name": "Jane", "age": 25, "city": "LA"},
    {"name": "Bob", "age": 35, "city": "Chicago"},
]

toon = json_to_toon(data)
print(toon)

Output:

@schema:name,age,city
John|30|NYC
Jane|25|LA
Bob|35|Chicago

Convert TOON back to JSON

from toon_converter import toon_to_json

toon_string = """@schema:name,age,city
John|30|NYC
Jane|25|LA"""

data = toon_to_json(toon_string)
print(data)
# [{'name': 'John', 'age': '30', 'city': 'NYC'}, {'name': 'Jane', 'age': '25', 'city': 'LA'}]

Features

Nested Object Flattening

Nested objects are automatically flattened using dot notation:

data = [{"customer": {"name": "John", "address": {"city": "NYC"}}}]
toon = json_to_toon(data)

Output:

@schema:customer.name,customer.address.city
John|NYC

Array Serialization

Arrays of simple values are serialized as comma-separated strings:

data = [{"tags": ["premium", "active", "verified"]}]
toon = json_to_toon(data)

Output:

@schema:tags
premium,active,verified

Special Character Handling

Pipe characters in values are automatically escaped:

data = [{"description": "A|B|C", "id": "1"}]
toon = json_to_toon(data)
# Values with pipes are escaped as \|

Null and Empty Values

Missing or null values become empty strings:

data = [
    {"name": "John", "email": "john@test.com"},
    {"name": "Jane", "email": None},
]
toon = json_to_toon(data)

Output:

@schema:name,email
John|john@test.com
Jane|

Advanced Usage

Using the TOONConverter Class

For more control, use the TOONConverter class directly:

from toon_converter import TOONConverter

# Create converter with custom options
converter = TOONConverter(
    flatten_nested=True,    # Flatten nested objects with dot notation
    serialize_arrays=True,  # Serialize arrays as comma-separated values
)

data = [{"user": {"name": "John"}, "tags": ["a", "b"]}]
toon = converter.json_to_toon(data)

Disabling Flattening

If you need to preserve nested structure as JSON strings:

converter = TOONConverter(flatten_nested=False)
data = [{"user": {"name": "John", "role": "admin"}}]
toon = converter.json_to_toon(data)
# Nested object is serialized as JSON string

Using TOON with LLMs

Example: OpenAI API

import openai
from toon_converter import json_to_toon

# Your data
customers = [
    {"id": "C001", "name": "Acme Corp", "status": "active", "tier": "premium"},
    {"id": "C002", "name": "TechStart", "status": "active", "tier": "basic"},
    # ... hundreds more records
]

# Convert to TOON
toon_data = json_to_toon(customers)

# Use in prompt
prompt = f"""Analyze these customer records and identify upsell opportunities.

Data format: TOON (schema on first line, pipe-delimited values)
{toon_data}

Provide your analysis:"""

response = openai.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": prompt}]
)

Example: Anthropic Claude

import anthropic
from toon_converter import json_to_toon

client = anthropic.Anthropic()

# Convert data to TOON
toon_data = json_to_toon(your_data)

message = client.messages.create(
    model="claude-3-sonnet-20240229",
    max_tokens=1024,
    messages=[{
        "role": "user",
        "content": f"""The following data is in TOON format (Token Optimized Object Notation).
The first line defines the schema, subsequent lines are pipe-delimited values.

{toon_data}

Summarize this data."""
    }]
)

TOON Format Specification

Element	Description
`@schema:`	Schema line prefix (required)
`,`	Attribute separator in schema line
`\|`	Value delimiter in data rows
`\\|`	Escaped pipe character in values
`.`	Nested key separator (e.g., `user.name`)
Empty between `\|\|`	Null or empty value

Example

@schema:id,user.name,user.email,tags,status
C001|John Doe|john@example.com|premium,active|active
C002|Jane Smith||basic|pending

Running Tests

# Install test dependencies
pip install pytest

# Run all tests
pytest tests/ -v

# Run with coverage
pip install pytest-cov
pytest tests/ --cov=toon_converter --cov-report=term-missing

Token Savings Analysis

Records	JSON Tokens	TOON Tokens	Savings
10	~850	~340	60%
100	~8,500	~3,200	62%
1,000	~85,000	~31,000	64%

Based on typical customer records with 8-10 attributes each.

When to Use TOON

✅ Use TOON when:

Processing hundreds or thousands of records
All records share the same schema
Token costs are a significant concern
Batch/analytical workloads (not real-time chat)
RAG context injection

⚠️ Consider alternatives when:

Under 10 records (schema overhead not justified)
Objects have varying schemas
Users see raw prompts/responses
You need the model to return structured JSON

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT License - see LICENSE for details.

Author

Prashant Dudami

LinkedIn: linkedin.com/in/prashantdudami
GitHub: github.com/prashantdudami

TOON was developed as part of research into token-efficient data representation for enterprise LLM systems.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.0

Jan 20, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

toon_token_optimizer-1.0.0.tar.gz (14.0 kB view details)

Uploaded Jan 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

toon_token_optimizer-1.0.0-py3-none-any.whl (9.0 kB view details)

Uploaded Jan 20, 2026 Python 3

File details

Details for the file toon_token_optimizer-1.0.0.tar.gz.

File metadata

Download URL: toon_token_optimizer-1.0.0.tar.gz
Upload date: Jan 20, 2026
Size: 14.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.16

File hashes

Hashes for toon_token_optimizer-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`0d8823e3efda49cd9bb31cceaf44c68f0d67d19b48087bdc420b03ab86933a9d`
MD5	`2b1096b8191a96524f7a2e17254755ea`
BLAKE2b-256	`ed04047a73f44ff93f925462c313ae1b105e3c6ecc38eb1e4da19ec589431f95`

See more details on using hashes here.

File details

Details for the file toon_token_optimizer-1.0.0-py3-none-any.whl.

File metadata

Download URL: toon_token_optimizer-1.0.0-py3-none-any.whl
Upload date: Jan 20, 2026
Size: 9.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.16

File hashes

Hashes for toon_token_optimizer-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`988bd65c6c5a83301f70c782cccbf394282bae76db0f1e24b557f7cf35cc17ad`
MD5	`e8ce2be1b7cc4e7c6dfd6bc73c1a2014`
BLAKE2b-256	`0831409ad31727c5f5171208c9e56a11d9300623bb4df4efa6a1a82b83627862`

See more details on using hashes here.

toon-token-optimizer 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TOON Converter

The Problem

The Solution

Installation

Quick Start

Convert JSON to TOON

Convert TOON back to JSON

Features

Nested Object Flattening

Array Serialization

Special Character Handling

Null and Empty Values

Advanced Usage

Using the TOONConverter Class

Disabling Flattening

Using TOON with LLMs

Example: OpenAI API

Example: Anthropic Claude

TOON Format Specification

Example

Running Tests

Token Savings Analysis

When to Use TOON

Contributing

License

Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes