Compress LLM context by rendering text as optimized images

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

gventuri

These details have not been verified by PyPI

Project links

Documentation

Project description

PixelPrompt

Compress LLM context by rendering text as optimized images. Based on the research paper "Pixels Beat Tokens: Multimodal LLMs See Better With Image Sources for Text-Rich VQA".

Why PixelPrompt?

When working with LLMs, token counts directly impact cost and latency. PixelPrompt converts text content into visually optimized PNG images, achieving 4-8x compression compared to raw text tokens, while maintaining or improving accuracy.

Key benefits:

🎯 Significant token savings — text rendered as images uses fewer tokens
📊 Flexible formatting — control font size, layout, and visual hierarchy
🔄 Automatic splitting — large content automatically split across multiple images
🎨 Configurable rendering — customize fonts, colors, background
🚀 Easy integration — simple API for any LLM workflow

Installation

uv pip install pixelprompt

Or with pip:

pip install pixelprompt

Quick Start

from pixelprompt import PixelPrompt

# Initialize with default settings
pxl = PixelPrompt()

# Render text as image(s)
text = "Your long context here..."
images = pxl.render(text)

# Use with Claude API
from anthropic import Anthropic

client = Anthropic()
message = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "Analyze this document:"
                },
                *[
                    {
                        "type": "image",
                        "source": {
                            "type": "base64",
                            "media_type": "image/png",
                            "data": img.base64()
                        }
                    }
                    for img in images
                ],
                {
                    "type": "text",
                    "text": "What are the key points?"
                }
            ]
        }
    ]
)

print(message.content[0].text)

Configuration

from pixelprompt import PixelPrompt, RenderConfig

config = RenderConfig(
    font_size=9,  # Default: 9 (range: 6-20)
    font_family="monospace",  # Default: "monospace"
    width=1568,  # Image width in pixels (default: 1568)
    height=1568,  # Image height in pixels (default: 1568)
    background_color=(255, 255, 255),  # RGB tuple (default: white)
    text_color=(0, 0, 0),  # RGB tuple (default: black)
    padding=20,  # Padding in pixels (default: 20)
    line_spacing=1.2,  # Line height multiplier (default: 1.2)
)

pxl = PixelPrompt(config=config)
images = pxl.render(text)

Advanced Usage

Analyze compression metrics

from pixelprompt import estimate_tokens

text = "Your long context..."
original_tokens = estimate_tokens(text)
compressed_tokens = estimate_tokens(f"[Image with compressed content]")

compression_ratio = original_tokens / compressed_tokens
print(f"Compression: {compression_ratio:.1f}x")

Handle large documents

# Automatically splits into multiple images if content exceeds limits
images = pxl.render(long_document)
print(f"Generated {len(images)} images")

# Access individual images
for i, img in enumerate(images):
    img.save(f"page_{i}.png")
    print(f"Image {i}: {img.width}x{img.height}, size: {img.size_bytes} bytes")

Custom fonts

config = RenderConfig(
    font_family="serif",  # Options: "monospace", "serif", "sans-serif"
    font_size=10,
)
pxl = PixelPrompt(config=config)

API Reference

`PixelPrompt`

Main class for rendering text to images.

class PixelPrompt:
    def __init__(self, config: RenderConfig | None = None):
        """Initialize with optional configuration."""

    def render(self, text: str) -> list[RenderedImage]:
        """
        Render text to one or more PNG images.

        Args:
            text: Text content to render

        Returns:
            List of RenderedImage objects
        """

`RenderConfig`

Configuration dataclass for rendering parameters.

@dataclass
class RenderConfig:
    font_size: int = 9
    font_family: str = "monospace"
    width: int = 1568
    height: int = 1568
    background_color: tuple[int, int, int] = (255, 255, 255)
    text_color: tuple[int, int, int] = (0, 0, 0)
    padding: int = 20
    line_spacing: float = 1.2

`RenderedImage`

Represents a single rendered image.

class RenderedImage:
    width: int
    height: int
    size_bytes: int

    def png_bytes(self) -> bytes:
        """Get raw PNG bytes."""

    def base64(self) -> str:
        """Get base64-encoded PNG for API integration."""

    def save(self, path: str) -> None:
        """Save to file."""

Performance

Typical compression ratios (depends on content):

Code: 4-6x compression
Technical prose: 5-8x compression
JSON/Structured data: 3-5x compression
Natural language: 4-7x compression

Rendering time: ~100-200ms per image on modern hardware.

Contributing

Contributions welcome! Please open issues or PRs on GitHub.

License

MIT License — see LICENSE file for details.

Citation

If you use PixelPrompt in research, please cite:

@software{pixelprompt,
  author = {Venturi, Gabriele},
  title = {PixelPrompt: Compress LLM Context by Rendering Text as Images},
  year = {2026},
  url = {https://github.com/sinaptik-ai/pixelprompt}
}

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

gventuri

These details have not been verified by PyPI

Project links

Documentation

Release history Release notifications | RSS feed

0.4.0

Feb 22, 2026

This version

0.3.2

Feb 16, 2026

0.1.1

Feb 16, 2026

0.1.0

Feb 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pixelprompt-0.3.2.tar.gz (15.5 kB view details)

Uploaded Feb 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pixelprompt-0.3.2-py3-none-any.whl (10.7 kB view details)

Uploaded Feb 16, 2026 Python 3

File details

Details for the file pixelprompt-0.3.2.tar.gz.

File metadata

Download URL: pixelprompt-0.3.2.tar.gz
Upload date: Feb 16, 2026
Size: 15.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pixelprompt-0.3.2.tar.gz
Algorithm	Hash digest
SHA256	`07ad1411e60470d58f17fb5b773f1d4d89ced207e557334f52cd44a762560b05`
MD5	`dcd0a85c3f2371b0ae4f3bc2462d1dc3`
BLAKE2b-256	`97ffff7ad8953ced6509962dd3613925f07f0e48915077efa0f3446f12bf3024`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pixelprompt-0.3.2.tar.gz:

Publisher: publish.yml on sinaptik-ai/pixelprompt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pixelprompt-0.3.2.tar.gz
- Subject digest: 07ad1411e60470d58f17fb5b773f1d4d89ced207e557334f52cd44a762560b05
- Sigstore transparency entry: 955869866
- Sigstore integration time: Feb 16, 2026
Source repository:
- Permalink: sinaptik-ai/pixelprompt@6979a0c94495b9365a8572564d2cb1053fdc097d
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/sinaptik-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6979a0c94495b9365a8572564d2cb1053fdc097d
- Trigger Event: release

File details

Details for the file pixelprompt-0.3.2-py3-none-any.whl.

File metadata

Download URL: pixelprompt-0.3.2-py3-none-any.whl
Upload date: Feb 16, 2026
Size: 10.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pixelprompt-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`01d6076cc5a4c5b172a86292da7b4cbc065e34c24e3b99fb647628468f7651fb`
MD5	`0fd015b8def822034f8d6b9288fce807`
BLAKE2b-256	`c932ef65a0219a6bf87d4cd8c69a368b819034fbbad1c937aa80ddc4dd7bb690`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pixelprompt-0.3.2-py3-none-any.whl:

Publisher: publish.yml on sinaptik-ai/pixelprompt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pixelprompt-0.3.2-py3-none-any.whl
- Subject digest: 01d6076cc5a4c5b172a86292da7b4cbc065e34c24e3b99fb647628468f7651fb
- Sigstore transparency entry: 955869873
- Sigstore integration time: Feb 16, 2026
Source repository:
- Permalink: sinaptik-ai/pixelprompt@6979a0c94495b9365a8572564d2cb1053fdc097d
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/sinaptik-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6979a0c94495b9365a8572564d2cb1053fdc097d
- Trigger Event: release

pixelprompt 0.3.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

PixelPrompt

Why PixelPrompt?

Installation

Quick Start

Configuration

Advanced Usage

Analyze compression metrics

Handle large documents

Custom fonts

API Reference

PixelPrompt

RenderConfig

RenderedImage

Performance

Contributing

License

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`PixelPrompt`

`RenderConfig`

`RenderedImage`