Convert code repositories into structured PDF collections for LLM collaboration.

Reason this release was yanked:

PNG is not suitable for long code

Project description

pixrep

📉 SAVE UP TO 90% TOKENS

Turn Codebases into Visual Context for Multimodal LLMs

📖 Introduction

pixrep is a developer tool designed to bridge the gap between large code repositories and Multimodal Large Language Models.

Instead of feeding raw text that consumes massive context windows, pixrep converts your repository into a structured, hierarchical set of PDFs. This allows you to:

Save 90% Tokens: Visual encoding is far more efficient than text tokenization.
Test for Free: Easily share your entire codebase with premium models (like Claude Opus 4.6) on platforms like arena.ai without hitting text limits.

🚀 Why Visual Code?

Traditional text tokenization is expensive. Visual encoding compresses structure efficiently.

Comparison in Google AI Studio (Gemini 3 Pro):

Raw Files (Text Input)	pixrep OnePDF (Visual Input)

31,812 Tokens ❌ (Cluttered context)	19,041 Tokens ✅ (Clean, single file)

🎓 Academic Backing

The core philosophy of pixrep (rendering code → PDF with syntax highlighting + heatmaps) has been validated by top-tier papers from 2025–2026:

Text or Pixels? It Takes Half (arXiv:2510.18279): Rendering text as images saves ~50% decoder tokens while maintaining or improving performance.
DeepSeek-OCR (arXiv:2510.18234): Visual encoding achieves 10–20× compression ratios for dense, structured text.
CodeOCR (arXiv:2602.01785, Feb 2026): A code-specific study showing that visual input with syntax highlighting improves performance even at 4× compression. In tasks like clone detection, the visual approach outperforms plain text.

Verdict: In the multimodal era, the optimal way to feed code is via "visual perception" rather than "text reading."

✨ Features

📉 High Efficiency: Drastically reduces context window usage for large repos.
⚡ Faster Scanning: Single-pass file loading (binary check + line count + optional content decode) to reduce I/O overhead.
🎨 Syntax Highlighting: Supports 50+ languages (Python, JS, Rust, Go, C++, etc.) with a "One Dark" inspired theme.
🧠 Semantic Minimap: Auto-generates per-file micro UML / call graph summaries to expose structure at a glance.
🔥 Linter Heatmap: Integrates ruff / eslint findings and marks risky lines with red/yellow visual overlays.
🔎 Query Mode: Search by text or semantic symbols, then render only matched snippets to PDF/PNG.
🗂️ Hierarchical Output: Generates a clean 00_INDEX.pdf summary and separate files for granular access.
🌏 CJK Support: Built-in font fallback for Chinese/Japanese/Korean characters (Auto-detects OS fonts).
🛡️ Smart Filtering: Respects .gitignore patterns and supports custom ignore rules.
📊 Insightful Stats: Calculates line counts and language distribution automatically.
🧾 Scan Diagnostics: Prints scan summary (seen/loaded/ignored/binary/errors) for faster troubleshooting.

📦 Installation

pip install pixrep

For PNG output support (--format png), install optional extras:

pip install "pixrep[png]"

🛠️ Usage

Quick Start

Convert the current directory to hierarchial PDFs in ./pixrep_output/<repo_name>:

pixrep .

Or pack everything into a single, token-optimized PDF (Recommended for LLMs):

pixrep onepdf .

Or generate the exact same all-in-one layout as a single long PNG:

pixrep onepng .

Common Commands

Generate PDFs for a specific repo:

pixrep generate /path/to/my-project -o ./my-project-pdfs

Pack core code into a single minimized PDF (all-in-one):

pixrep onepdf /path/to/my-project -o ./ONEPDF_CORE.pdf

Notes:

Defaults to git ls-files (tracked files) when available.
Defaults to "core-only" filtering (skips docs/tests); use --no-core-only to include them.

Pack core code into a single long PNG with the same layout as onepdf:

pixrep onepng /path/to/my-project -o ./ONEPDF_CORE.png --png-dpi 200

Preview structure and stats (without generating PDFs):

pixrep list /path/to/my-project

list mode now uses lightweight scanning (no file content decode), so large repos respond significantly faster.

Show only top 5 languages in the summary:

pixrep list . --top-languages 5

Query and render only matching snippets:

pixrep query . -q "cache" --glob "*.py" --format png

Semantic query (Python symbols) with interactive terminal preview:

pixrep query . -q "CodeInsight" --semantic --tui

CLI Reference

Argument	Description	Default
`repo`	Path to the code repository.	`.` (Current Dir)
`-o`, `--output`	Directory to save the generated PDFs.	`./pixrep_output/<repo>`
`--max-size`	Max file size to process (in KB). Files larger than this are skipped.	`512` KB
`--ignore`	Additional glob patterns to ignore (e.g., `.json` `test/`).	`[]`
`--index-only`	Generate only the `00_INDEX.pdf` (Directory tree & stats).	`False`
`--disable-semantic-minimap`	Turn off per-file semantic UML/callgraph panel.	`False`
`--disable-lint-heatmap`	Turn off linter-based line heatmap background.	`False`
`--linter-timeout`	Timeout seconds for each linter command.	`20`
`--list-only`	Print the directory tree and stats to console, then exit.	`False`
`-V`, `--version`	Show version information.	-

⚙️ Performance Notes

pixrep now applies two execution paths:

Light scan path (pixrep list, pixrep generate --index-only, --list-only): only metadata and line counts are collected; file content is not loaded.
Full scan path (regular pixrep generate): file content is decoded only when needed for PDF rendering.

This reduces memory pressure and disk I/O for repository exploration workflows.

Lint/semantic caches are now stored in user cache directories by default:

Windows: %LOCALAPPDATA%/pixrep/cache/<repo_name>
Linux/macOS: $XDG_CACHE_HOME/pixrep/<repo_name> or ~/.cache/pixrep/<repo_name>

You can override with PIXREP_CACHE_DIR.

📂 Output Structure

After running pixrep ., you will get a folder structure optimized for LLM upload:

pixrep_output/pixrep/
├── 00_INDEX.pdf             # <--- Upload this first! Contains tree & stats
├── 001_LICENSE.pdf
├── 002_README.md.pdf
├── 003_pixrep___init__.py.pdf
├── 005_pixrep_cli.py.pdf
└── ...

🧩 Supported Languages

pixrep automatically detects and highlights syntax for:

Core: Python, C, C++, Java, Rust, Go
Web: HTML, CSS, JavaScript, TypeScript, Vue, Svelte
Config: JSON, YAML, TOML, XML, Dockerfile, Ini
Scripting: Bash, Lua, Perl, Ruby, PHP
And more: Swift, Kotlin, Scala, Haskell, OCaml, etc.

🤝 Contributing

We welcome contributions! Please feel free to submit a Pull Request.

Fork the repository.
Create your feature branch (git checkout -b feature/AmazingFeature).
Commit your changes (git commit -m 'Add some AmazingFeature').
Push to the branch (git push origin feature/AmazingFeature).
Open a Pull Request.

📄 License

Distributed under the MIT License. See LICENSE for more information.

Project details

Release history Release notifications | RSS feed

0.7.0

Mar 8, 2026

0.6.2 yanked

Mar 7, 2026

Reason this release was yanked:

PNG is not suitable for long code

This version

0.6.1 yanked

Mar 7, 2026

Reason this release was yanked:

PNG is not suitable for long code

0.6.0

Feb 22, 2026

0.5.1

Feb 21, 2026

0.5.0

Feb 21, 2026

0.4.0

Feb 20, 2026

0.3.0

Feb 20, 2026

0.2.2

Feb 19, 2026

0.2.1

Feb 19, 2026

0.1.7

Feb 19, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pixrep-0.6.1.tar.gz (58.1 kB view details)

Uploaded Mar 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pixrep-0.6.1-py3-none-any.whl (57.5 kB view details)

Uploaded Mar 7, 2026 Python 3

File details

Details for the file pixrep-0.6.1.tar.gz.

File metadata

Download URL: pixrep-0.6.1.tar.gz
Upload date: Mar 7, 2026
Size: 58.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pixrep-0.6.1.tar.gz
Algorithm	Hash digest
SHA256	`4b274d7d68bd528bcd3839a1a9b7754939c8ba6286df72adbd170c75d5e4bc7f`
MD5	`7e7d9a64f240822728ba1c4affa5d419`
BLAKE2b-256	`b3b7023c703e9749348c27ded15adaf87a9c42e28fb3b93a1e5bafa74373fcc4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pixrep-0.6.1.tar.gz:

Publisher: publish.yml on TingjiaInFuture/pixrep

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pixrep-0.6.1.tar.gz
- Subject digest: 4b274d7d68bd528bcd3839a1a9b7754939c8ba6286df72adbd170c75d5e4bc7f
- Sigstore transparency entry: 1056171896
- Sigstore integration time: Mar 7, 2026
Source repository:
- Permalink: TingjiaInFuture/pixrep@91a1a294d99ddda54bc3b9b21d34815d84a8b642
- Branch / Tag: refs/tags/v0.6.1
- Owner: https://github.com/TingjiaInFuture
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@91a1a294d99ddda54bc3b9b21d34815d84a8b642
- Trigger Event: release

File details

Details for the file pixrep-0.6.1-py3-none-any.whl.

File metadata

Download URL: pixrep-0.6.1-py3-none-any.whl
Upload date: Mar 7, 2026
Size: 57.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pixrep-0.6.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7484aab3f8af73ca45c1bdb563f2c4a12ba6a7874e2adccd0d764f479395c642`
MD5	`76f0aa0976c72ce173aadf409306baff`
BLAKE2b-256	`257531df5af91b37c6db500461debeb76d51b134e0ee57c7daf6dad4b5fbc795`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pixrep-0.6.1-py3-none-any.whl:

Publisher: publish.yml on TingjiaInFuture/pixrep

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pixrep-0.6.1-py3-none-any.whl
- Subject digest: 7484aab3f8af73ca45c1bdb563f2c4a12ba6a7874e2adccd0d764f479395c642
- Sigstore transparency entry: 1056171918
- Sigstore integration time: Mar 7, 2026
Source repository:
- Permalink: TingjiaInFuture/pixrep@91a1a294d99ddda54bc3b9b21d34815d84a8b642
- Branch / Tag: refs/tags/v0.6.1
- Owner: https://github.com/TingjiaInFuture
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@91a1a294d99ddda54bc3b9b21d34815d84a8b642
- Trigger Event: release

pixrep 0.6.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

pixrep

📉 SAVE UP TO 90% TOKENS

Turn Codebases into Visual Context for Multimodal LLMs

📖 Introduction

🚀 Why Visual Code?

🎓 Academic Backing

✨ Features

📦 Installation

🛠️ Usage

Quick Start

Common Commands

CLI Reference

⚙️ Performance Notes

📂 Output Structure

🧩 Supported Languages

🤝 Contributing

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance