Squeeze verbose LLM agent tool output down to only the relevant lines

These details have not been verified by PyPI

Project links

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Squeez

Squeeze out the juice, leave the pulp behind.

The Problem

LLM coding agents waste 80-95% of context tokens on irrelevant tool output. When an agent reads a 500-line file to find one function, or runs git log to find a specific commit, most of the output is noise.

Squeez trains a small (2-3B) generative model to identify and extract only the lines that matter for the task at hand — compressing tool output by ~86% on average.

Example

Task: "Fix the CSRF validation bug in the referer check"

Before (42 lines, ~1,200 tokens):

class CsrfViewMiddleware(MiddlewareMixin):
    def _check_referer(self, request):
        referer = request.META.get('HTTP_REFERER')
        if referer is None:
            raise RejectRequest('No referer')
        good_referer = request.get_host()
        if not same_origin(referer, good_referer):
            raise RejectRequest('Bad referer')

    def process_view(self, request, callback, ...):
        if getattr(request, 'csrf_processing_done', False):
            return None
        csrf_token = request.META.get('CSRF_COOKIE')
        if csrf_token is None:
            return self._reject(request, 'No CSRF cookie')
        return self._accept(request)

class SessionMiddleware(MiddlewareMixin):
    def process_request(self, request):
        session_key = request.COOKIES.get(...)
        request.session = self.SessionStore(session_key)

    def process_response(self, request, response):
        if request.session.modified:
            request.session.save()
        return response

class CommonMiddleware(MiddlewareMixin):
    def process_request(self, request):
        host = request.get_host()
        if settings.PREPEND_WWW and ...:
            return redirect(...)

    def process_response(self, request, response):
        if settings.USE_ETAGS:
            response['ETag'] = hashlib.md5(...)
        return response

class SecurityMiddleware(MiddlewareMixin):
    def process_request(self, request):
        if settings.SECURE_SSL_REDIRECT and ...:
            return redirect(...)

After (8 lines, ~150 tokens):

class CsrfViewMiddleware(MiddlewareMixin):
    def _check_referer(self, request):
        referer = request.META.get('HTTP_REFERER')
        if referer is None:
            raise RejectRequest('No referer')
        good_referer = request.get_host()
        if not same_origin(referer, good_referer):
            raise RejectRequest('Bad referer')

87% compression: only the CSRF referer logic survives. Session, Common, and Security middleware are irrelevant to the task and get dropped.

$ cat django/middleware.py | squeez "Fix the CSRF validation bug in the referer check"
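
The compression figure for this example follows from the approximate token counts quoted above. A minimal sketch of the arithmetic (the helper name `compression_ratio` is illustrative and not part of squeez):

```python
def compression_ratio(before_tokens: int, after_tokens: int) -> float:
    """Return the fraction of tokens removed (0.875 means 87.5% fewer tokens)."""
    if before_tokens <= 0:
        raise ValueError("before_tokens must be positive")
    return 1.0 - after_tokens / before_tokens


# Approximate token counts taken from the CSRF example.
ratio = compression_ratio(before_tokens=1200, after_tokens=150)
print(f"{ratio:.1%}")  # prints 87.5%
```

Because both token counts are rounded estimates, this agrees with the quoted figure of roughly 87% only approximately.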

Another example — filtering git log

Task: "Find the commit that changed the authentication timeout"

Before — 25 commits of noise:

a1b2c3d Fix typo in README
e4f5g6h Update CI pipeline
i7j8k9l Bump version to 2.3.1
m0n1o2p Add docker-compose.yml
q3r4s5t Refactor database migrations
u6v7w8x Change auth timeout from 30m to 1h
y9z0a1b Fix linting warnings
c2d3e4f Update dependencies
...

After — the one commit that matters:

u6v7w8x Change auth timeout from 30m to 1h

$ git log --oneline -25 | squeez "find the commit that changed the authentication timeout"

Installation

pip install squeez

Quick Start

CLI

# Pipe tool output through squeez
cat output.txt | squeez "Fix the CSRF validation bug"

# Or with a file
squeez "Fix the CSRF bug" --input-file output.txt

# Explicit extract subcommand also works
squeez extract "Fix the CSRF bug" --input-file output.txt
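
The same pipe can be driven from a Python script by handing the tool output to the CLI on stdin. This is a sketch that assumes only the `squeez "<task>"` stdin usage shown above; the helper name `squeez_filter` and its injectable `run` parameter are hypothetical conveniences, not part of the package.

```python
import subprocess


def squeez_filter(task: str, tool_output: str, run=subprocess.run) -> str:
    """Pipe tool_output into `squeez "<task>"` and return the filtered text.

    `run` defaults to subprocess.run; it is a parameter only so the helper
    can be exercised without squeez installed.
    """
    result = run(
        ["squeez", task],      # same as: <tool output> | squeez "<task>"
        input=tool_output,     # sent to the CLI on stdin
        capture_output=True,
        text=True,
        check=True,            # raise CalledProcessError on a non-zero exit
    )
    return result.stdout


# Example use (requires git and squeez on PATH):
#   log = subprocess.run(["git", "log", "--oneline", "-25"],
#                        capture_output=True, text=True, check=True).stdout
#   print(squeez_filter("find the commit that changed the authentication timeout", log))
```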

Python API

from squeez.inference.extractor import ToolOutputExtractor

# Load model from config/env
extractor = ToolOutputExtractor()

# Or load model locally
extractor = ToolOutputExtractor(model_path="./output/squeez_qwen")

# Or connect to a server explicitly
extractor = ToolOutputExtractor(base_url="http://localhost:8000/v1", model_name="squeez")

filtered = extractor.extract(
    task="Fix the CSRF validation bug in middleware",
    tool_output=raw_output,
)
print(filtered)  # Only the relevant lines

The model returns JSON of the form {"relevant_lines": ["line1", "line2", ...]}, and the extract() method joins those lines into the filtered text.
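
To make the output format concrete, here is a rough sketch of turning such a JSON response into filtered text. It is not the library's actual implementation: the function name `join_relevant_lines` is hypothetical, and joining with newlines is an assumption about how the lines are combined.

```python
import json


def join_relevant_lines(model_response: str) -> str:
    """Parse {"relevant_lines": [...]} and join the lines (assumed: with newlines)."""
    payload = json.loads(model_response)
    lines = payload.get("relevant_lines", [])
    return "\n".join(lines)


# Build a response in the documented shape and convert it back to text.
response = json.dumps(
    {
        "relevant_lines": [
            "def _check_referer(self, request):",
            "    referer = request.META.get('HTTP_REFERER')",
        ]
    }
)
print(join_relevant_lines(response))
```

An empty `relevant_lines` list yields an empty string, which matches the case where nothing in the tool output is relevant to the task.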

Configuration

The backend is resolved in this order of precedence: CLI args > env vars > config file (squeez.yaml or configs/default.yaml).

# squeez.yaml
backend: "transformers"  # optional preference
local_model_path: "./output/squeez_qwen"
# server_url: "https://api.groq.com/openai/v1"
# server_model: "squeez"

# Or via environment variables
export SQUEEZ_LOCAL_MODEL=./output/squeez_qwen
export SQUEEZ_SERVER_URL=https://api.groq.com/openai/v1
export SQUEEZ_SERVER_MODEL=squeez
export SQUEEZ_API_KEY=gsk_...

The CLI accepts descriptive flag names; the older names are kept as aliases:

squeez "Fix the bug" --local-model ./output/squeez_qwen
squeez "Fix the bug" --server-url http://localhost:8000/v1 --server-model squeez
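
The precedence rule (CLI args, then env vars, then config file) can be sketched as a simple first-match lookup. The environment variable names and config keys are the ones listed above; the `SETTINGS` table, the `resolve` function, and the short setting names are hypothetical and only illustrate the ordering, not how squeez implements it.

```python
import os
from typing import Mapping, Optional

# setting name -> (environment variable, config-file key), as documented above
SETTINGS = {
    "local_model": ("SQUEEZ_LOCAL_MODEL", "local_model_path"),
    "server_url": ("SQUEEZ_SERVER_URL", "server_url"),
    "server_model": ("SQUEEZ_SERVER_MODEL", "server_model"),
}


def resolve(
    name: str,
    cli_args: Mapping[str, str],
    config: Mapping[str, str],
    env: Mapping[str, str] = os.environ,
) -> Optional[str]:
    """Return the first non-empty value: CLI arg, then env var, then config file."""
    env_var, config_key = SETTINGS[name]
    for value in (cli_args.get(name), env.get(env_var), config.get(config_key)):
        if value:
            return value
    return None


config = {"local_model_path": "./output/squeez_qwen"}
env = {"SQUEEZ_SERVER_URL": "http://localhost:8000/v1"}
print(resolve("local_model", {}, config, env))  # falls through to the config file
print(resolve("server_url", {}, config, env))   # taken from the environment
```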

Use with Claude Code

Add this to your project's CLAUDE.md (or ~/.claude/CLAUDE.md for global):

Whenever you invoke a shell command, pipe its output through `squeez` and state exactly what you want to know.

Examples:
- `bun test 2>&1 | squeez "did the tests pass?"`
- `git log --oneline -50 | squeez "find the commit that broke CSRF"`
- `cat src/auth/middleware.py | squeez "find the referer validation logic"`

Do NOT use squeez when:
- You need exact, uncompressed output (e.g. writing a patch)
- The command is interactive

This saves context tokens by replacing verbose tool output with only the relevant lines.

Also works with other coding agents (Codex CLI, OpenCode, etc.) via their equivalent instruction files.

Training

1. Download the dataset

python scripts/download_data.py

This pulls the SWE-bench tool output dataset (7,148 train + 436 eval samples) from Hugging Face.

2. Train with LoRA

squeez train \
    --train-file data/train.jsonl \
    --eval-file data/eval.jsonl

Default: Qwen 3.5 2B with LoRA (r=16, alpha=32). See configs/default.yaml for all hyperparameters.
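
For readers unfamiliar with the two LoRA numbers: under the standard LoRA formulation, the low-rank update is scaled by alpha / r, and each adapted weight matrix of shape d_out x d_in adds r * (d_in + d_out) trainable parameters. The sketch below works through that arithmetic; the 2048 x 2048 layer shape is a hypothetical example, not a confirmed dimension of the default model.

```python
def lora_scaling(r: int, alpha: int) -> float:
    """Scaling applied to the low-rank update B @ A in standard LoRA."""
    return alpha / r


def lora_trainable_params(d_in: int, d_out: int, r: int) -> int:
    """Parameters in A (r x d_in) plus B (d_out x r) for one adapted matrix."""
    return r * (d_in + d_out)


print(lora_scaling(r=16, alpha=32))           # prints 2.0
print(lora_trainable_params(2048, 2048, 16))  # prints 65536
print(2048 * 2048)                            # prints 4194304 (the full matrix)
```

For this hypothetical layer the adapter trains about 1.6% as many parameters as the full matrix would.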

3. Evaluate

squeez eval \
    --extractor-model output/squeez_qwen \
    --eval-file data/eval.jsonl

Dataset

Training data: KRLabsOrg/tool-output-extraction-swebench

Statistic	Value
Train samples	7,148
Eval samples	436
With relevant lines	3,985 (53%)
Empty (not relevant)	3,599 (47%)
Avg compression	86%

Built from 2,294 SWE-bench instances with real tool execution (git grep, git blame, pytest, ruff, etc.) against 12 repos. Labels were distilled from a teacher model, gpt-oss-120b, running on Groq.

Tool types

Tool Type	Count
read_file	4,309
git_log	840
grep	575
build_output	380
ls	376
test_output	344
python	310
git_blame	201
lint_output	101
curl	95
git_diff	53
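
As a quick consistency check, the per-tool counts above add up to the combined train and eval total (7,148 + 436 = 7,584). The short sketch below recomputes that total and each tool type's share, using only the numbers from the tables.

```python
TOOL_COUNTS = {
    "read_file": 4309,
    "git_log": 840,
    "grep": 575,
    "build_output": 380,
    "ls": 376,
    "test_output": 344,
    "python": 310,
    "git_blame": 201,
    "lint_output": 101,
    "curl": 95,
    "git_diff": 53,
}

total = sum(TOOL_COUNTS.values())
print(total)  # prints 7584, i.e. 7,148 train + 436 eval samples

# Share of the dataset contributed by each tool type, largest first.
for tool, count in sorted(TOOL_COUNTS.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{tool:<13}{count:>6}{count / total:>8.1%}")
```

More than half of the samples come from read_file, so file reads dominate what the model sees during training.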

How It Works

Source: SWE-bench test split (2,294 real GitHub issues)
Tool calls: 3-7 synthetic tool calls per instance
Real execution: All commands run against bare-cloned repos at the correct commit
Teacher distillation: gpt-oss-120b selects relevant line ranges via JSON spans
Zero-hallucination extraction: Teacher spans matched against original output — no generated text
Assembly: Extracted lines formatted as {"relevant_lines": [...]} for SFT training
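
The last three steps can be sketched as follows: teacher spans select line ranges, the lines are copied verbatim from the original output, and the result is wrapped in the training format. The span schema used here (1-based inclusive `start` and `end` keys) and both function names are assumptions for illustration; the pipeline's actual span format is not documented in this section.

```python
import json
from typing import Dict, Iterable, List


def extract_spans(tool_output: str, spans: Iterable[Dict[str, int]]) -> List[str]:
    """Copy lines verbatim from tool_output for each 1-based inclusive span."""
    lines = tool_output.splitlines()
    selected = set()
    for span in spans:
        # Clamp to the valid range so a bad span cannot index outside the output.
        first = max(span["start"], 1)
        last = min(span["end"], len(lines))
        selected.update(range(first, last + 1))
    # Every returned line exists in the original output; nothing is generated.
    return [lines[i - 1] for i in sorted(selected)]


def to_sft_target(relevant_lines: List[str]) -> str:
    """Format the extracted lines as the {"relevant_lines": [...]} target."""
    return json.dumps({"relevant_lines": relevant_lines})


output = "\n".join(
    [
        "a1b2c3d Fix typo in README",
        "u6v7w8x Change auth timeout from 30m to 1h",
        "y9z0a1b Fix linting warnings",
    ]
)
kept = extract_spans(output, [{"start": 2, "end": 2}])
print(to_sft_target(kept))
```

Selecting by line number and copying from the source is what keeps the training targets free of invented text: the teacher can pick the wrong lines, but it cannot introduce lines that were never in the tool output.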

Data Generation

To regenerate the dataset from scratch:

squeez pipeline --phase 1 2 3 4 5 6 7 8 \
    --output-dir data \
    --github-token $GITHUB_TOKEN \
    --teacher-api-key $GROQ_API_KEY \
    --teacher-base-url https://api.groq.com/openai/v1

Citation

@software{kovacs2026squeez,
    title={Squeez: Compressing Tool Output for LLM Coding Agents},
    author={Adam Kovacs},
    year={2026},
    url={https://github.com/KRLabsOrg/squeez}
}

Built on top of SWE-bench:

@inproceedings{jimenez2024swebench,
    title={SWE-bench: Can Language Models Resolve Real-world Github Issues?},
    author={Carlos E Jimenez and John Yang and Alexander Wettig and Shunyu Yao and Kexin Pei and Ofir Press and Karthik R Narasimhan},
    booktitle={The Twelfth International Conference on Learning Representations},
    year={2024}
}

License

Apache 2.0

Release history

0.1.3 (Mar 18, 2026)
0.1.2 (Mar 8, 2026)
0.1.1 (Mar 8, 2026), this version
0.1.0 (Mar 7, 2026)

Download files

Source Distribution

squeez-0.1.1.tar.gz (45.4 kB)

Uploaded Mar 8, 2026 Source

Built Distribution

squeez-0.1.1-py3-none-any.whl (48.3 kB)

Uploaded Mar 8, 2026 Python 3

File details

Details for the file squeez-0.1.1.tar.gz.

File metadata

Download URL: squeez-0.1.1.tar.gz
Upload date: Mar 8, 2026
Size: 45.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for squeez-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`4708db667b63e2815c58cc14332ef09cae018b2b1ae968ccf6a58c70662275f1`
MD5	`e4d266fcf4e9d3024c1b366e08ba3b51`
BLAKE2b-256	`6883b01d00e57ac2f4b12358eee08f39a077b94fce3c8fff78949773ed0a0012`

File details

Details for the file squeez-0.1.1-py3-none-any.whl.

File metadata

Download URL: squeez-0.1.1-py3-none-any.whl
Upload date: Mar 8, 2026
Size: 48.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for squeez-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f32663ef8bed6446054e540f58569a3386b678368e8902ed5a7b6780ad8bf2ef`
MD5	`852bc8fcab76f681a4022ccc2df819ba`
BLAKE2b-256	`a742ccc8d4089a939b0522da9ac66838bda1918e072a2a6f984d5c4789e01dc8`
