Clear Your Tools (CYT) — dynamic tool gating for eliminating the MCP/tools tax

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Clear Your Tools

Clear Your Tools is a reverse proxy for coding agents such as Claude Code. It sits between the agent and upstream LLM providers (Anthropic-compatible APIs on OpenRouter, Novita, DeepInfra, and others), intercepts each request, and shrinks the tool payload before forwarding it upstream. Can be easily adopted for other harness agents.

Large MCP catalogs can add tens of thousands of tokens of tool-schema overhead on every turn. Clear Your Tools removes irrelevant tools and trims irrelevant optional parameters while always keeping required fields for tools that stay in the request.

How it works

Agent (Claude Code, etc.)
        │
        ▼
Clear Your Tools proxy  ──► extract user query from messages
        │                   decompose each tool schema
        │                   score / filter with reranker (or LLM pruning)
        │                   recompose pruned tool list
        ▼
Upstream provider (OpenRouter, Anthropic, Novita, …)

On each intercepted request the proxy:

Extracts the user query from the conversation (latest user turn, with message cleanup).
Decomposes tool schemas into a catalog of chunks: each tool root keeps required properties; optional properties are split into separate searchable units.
Runs the pruning pipeline configured in config.yaml (default: rerank; or llm).
Recomposes surviving tools — required properties always remain; only optional properties that look relevant to the query are merged back in.
Forwards the modified request to the upstream provider with the smaller tools array.

Pruning pipeline

Stage	Model (default)	When it runs	What it does
`rerank`	Qwen3-Reranker-8B (DeepInfra)	≥ `models.rerankers.minimum_tools` tools (default 29)	Scores every catalog chunk against the user query; drops low-scoring tools and optional props.
`llm`	Mercury 2 or GPT-OSS-120B (OpenRouter)	≥ `models.llm.minimum_tools` tools (default 50), after `rerank`	LLM selects which catalog chunks to keep; can remove entire tools more aggressively.

Tool Recommendations:

50+ tools — keep rerank or use llm. rerank can be pipelined into LLM as a second stage (pipeline: [rerank, llm]) for stronger tool-level filtering on large catalogs.

Pipeline & Model Recommendations: Choose your pipeline based on model cost:

Expensive models (≥$3/M input tokens, e.g. Sonnet): Use an LLM pruner pipeline.
Cheap models ($0.10–$1/M input tokens, e.g. Haiku, Gemini 3 Flash): Use a rerank pipeline with a low-cost model.
Premium models (e.g. Opus): Use an LLM pruner + rerank combined pipeline.

Quick start

Requires uv tool. Install uv

1. Install proxy

From PyPI (proxy + pruners):

uv tool install 'clear-your-tools[all]'

Though we strongly recommend using password vaults like macOS KeyChain

# Store key in secure vault
security add-generic-password -s "nono" -a "OPENROUTER_API_KEY" -w "sk-..."  # macOS

# Now you can access the key like this:
export ANTHROPIC_AUTH_TOKEN="$(security find-generic-password -s "nono" -a "OPENROUTER_API_KEY" -w)"

2. Configure the proxy

Interactive wizard (writes ~/.config/cyt/config.yaml and optionally ~/.config/cyt/.env):

uv run cyt-rproxy setup

Or edit ~/.config/cyt/config.yaml manually — see CONFIG.md.

3. Run the proxy

Installed CLI:

uv run cyt-rproxy serve

Default listen port: 8834 (from bundled defaults.yaml or ~/.config/cyt/config.yaml).

4. Run the the Agent

Point Claude Code at the proxy:

export ANTHROPIC_BASE_URL="http://localhost:8834/anthropic"
export OPENROUTER_API_KEY="..."
export ANTHROPIC_AUTH_TOKEN="${OPENROUTER_API_KEY}"
claude --model haiku 'say hi' -p

The default upstream in config.yaml is OpenRouter's Anthropic-compatible endpoint. Change network.proxy.reverse.upstreams to target a different provider URL.

5. View pruning stats savings

uv run cyt-rproxy stats totals
uv run cyt-rproxy stats summary --period day
uv run cyt-rproxy stats events --limit 20

Stats are stored in ~/.config/cyt/stats.db by default.

FAQ

Doesn't pruning burn more tokens than it saves?

The reranker and weak LLM used for pruning are much cheaper per token than the main model (e.g. Claude Sonnet). You may spend extra tokens on pruning, but they cost a fraction of what you save on the main request. Set input_cost_per_token and output_cost_per_token in ~/.config/cyt/config.yaml to track savings.

Example pricing (input tokens):

Model	Cost per 1M input tokens
Claude Sonnet 4.6	$3.00
Qwen-Reranker-8B	$0.050
GPT-OSS-120B	$0.14
Inception Mercury 2	$0.25

The weak models such as Mercury 2 or GPT-OSS-120B returns only the IDs of tools to keep, so its output stays extremely small. Rerankers do not count output tokens and are usually much cheaper than a strong LLM.

Rule of thumb: saving 1M Sonnet input tokens is still worthwhile even if pruning uses up to ~10M Mercury tokens — roughly a 1:10 cost ratio. The reranker has roughly a 1:60 cost ratio.

In practice, pruning usually adds modest overhead. Worst case (no tools pruned), you might pay ~$3.30 instead of $3.00. With typical pruning (40–95% of tool tokens removed), tool-schema cost drops from ~$3.00 to roughly $0.15–$1.80, plus ~$0.30 for pruning — about $0.45–$2.10 total for tool-related cost, or roughly 30–85% savings depending on policy.

Why don't I see 30–85% savings on my total request?

Those numbers apply to tool schemas only of the input tokens only, not the full prompt (system message, conversation history, user message, etc.). Clear Your Tools prunes tools based on the user request; the rest of the request is unchanged.

How much you save overall depends on:

How many tools you have — more MCP servers mean a larger share of the request is tool schemas. We do not recommend using CYT below 50 tools.
Which pruning policy you use — see Pruning policies.

To estimate savings on a captured request JSON, see DEV.md. To see statistics of actual net savings (input tokens) run:

uv run cyt-rproxy stats totals

With ~100 tools and prune_all, expect ~85–95% savings on tool tokens and typically ~30%+ savings on the full request. The more tools you have the more overall savings you'll see.

Where can I see how many tools and parameters an MCP server has?

The popular Fetch MCP server is a good example. On its Tools tab: 4 tools, each with 4 parameters (1 required, 3 optional) — 16 parameters total.

If the user asks to "fetch the Markdown of a webpage", the prune_all typically keeps only the Fetch Markdown tool with its required parameter plus any optional parameters that look relevant. Unrelated tools (e.g. Read file) are dropped entirely.

Is my provider/model supported?

CYT's pruner models (the cheap reranker and LLM that decide which tools to keep) call providers through LiteLLM. If LiteLLM supports your provider and model, you can use them in CYT.

When you run cyt-rproxy setup and add a pruner model, you'll be prompted for:

Provider — LiteLLM provider route, without a trailing slash (e.g. openai, openrouter).
Model name — LiteLLM model string (see the provider docs).
API key env var — the name of the environment variable that holds your key, not the key itself (e.g. OPENAI_API_KEY, OPENROUTER_API_KEY).
domain_match — hostname from the provider's API base URL (e.g. openai.com for OpenAI, openrouter.ai for OpenRouter). Used to match outgoing requests to the right model config.

Claude Code reports ZlibError when using the proxy

Install missing zlib:

npm install -g zlib
brew install zlib

This usually means the proxy returned a Content-Encoding: gzip (or deflate) header with a body that was already decompressed. Claude Code’s fetch then tries to inflate plain JSON/SSE and fails. It is not a missing zlib install on your machine or in CYT.

Fix: upgrade to a cyt-rproxy build that streams upstream bytes unchanged (aiter_raw pass-through). After upgrading, verify:

curl --raw -sS -D - -o /tmp/cyt-msg.body \
  -H 'Accept-Encoding: gzip' \
  ... # your POST to http://127.0.0.1:8834/anthropic/v1/messages
head -c 4 /tmp/cyt-msg.body | xxd   # should show 1f8b when header says gzip

Also check: ANTHROPIC_BASE_URL must use http:// for the default plain-HTTP server, e.g. http://localhost:8834/anthropic. Using https:// against cyt-rproxy serve (without TLS/http2.serve) causes uvicorn’s Invalid HTTP request received and broken API calls.

Uvicorn logs Invalid HTTP request received

cyt-rproxy serve listens for HTTP/1.1 on the configured port (default 8834). This warning almost always means a client connected with the wrong protocol:

https://localhost:8834 while the proxy is plain HTTP → TLS handshake bytes, not HTTP
HTTP/2 prior knowledge to uvicorn (use http2.serve + TLS certs only if you intend HTTPS)

Use http://localhost:8834/anthropic unless you have enabled Hypercorn TLS in config.

Development

See DEV.md for checkout setup, repository layout, library usage, and configuration reference.

Limitations

See LIMITATIONS.md for deployment constraints, token accounting caveats, and MCP aggregator trade-offs.

Debug

See details to debug pruning in debug/.

License

Inspiration

This project is inspired by the ideas explored in the tool-attention project, particularly around improving tool selection efficiency and reducing unnecessary tool exposure to the model.

It also aims to limit the effects of context rot by pruning irrelevant or confusing tools from the available toolset based on the current user prompt and execution context.

Reducing irrelevant tools helps decrease prompt noise, lowers cognitive load on the model, and can improve tool selection accuracy and overall agent reliability.

See LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

qdrddr

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.5

May 31, 2026

0.1.4

May 31, 2026

This version

0.1.3

May 31, 2026

0.1.2

May 31, 2026

0.1.0

May 30, 2026

0.0.12

May 30, 2026

0.0.11

May 29, 2026

0.0.10

May 29, 2026

0.0.9

May 28, 2026

0.0.8

May 28, 2026

0.0.7

May 28, 2026

0.0.6

May 28, 2026

0.0.5

May 27, 2026

0.0.4

May 27, 2026

0.0.1

May 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

clear_your_tools-0.1.3.tar.gz (387.1 kB view details)

Uploaded May 31, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

clear_your_tools-0.1.3-py3-none-any.whl (93.9 kB view details)

Uploaded May 31, 2026 Python 3

File details

Details for the file clear_your_tools-0.1.3.tar.gz.

File metadata

Download URL: clear_your_tools-0.1.3.tar.gz
Upload date: May 31, 2026
Size: 387.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for clear_your_tools-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`a66bc950ad48d859982c780e047530a5bcc0d3a04769d91b2f689da4c6f582ec`
MD5	`3481b1014add12e9dbea1b6c1d7ede72`
BLAKE2b-256	`50240c7afa447574b05b3d10bfb1e0a0085ad9c0046072843f5ca58df33f396d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for clear_your_tools-0.1.3.tar.gz:

Publisher: publish-pypi.yml on qdrddr/clear-your-tools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: clear_your_tools-0.1.3.tar.gz
- Subject digest: a66bc950ad48d859982c780e047530a5bcc0d3a04769d91b2f689da4c6f582ec
- Sigstore transparency entry: 1684691308
- Sigstore integration time: May 31, 2026
Source repository:
- Permalink: qdrddr/clear-your-tools@0a70cd244775db8f4d70068445a7389dd4d4a937
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/qdrddr
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@0a70cd244775db8f4d70068445a7389dd4d4a937
- Trigger Event: release

File details

Details for the file clear_your_tools-0.1.3-py3-none-any.whl.

File metadata

Download URL: clear_your_tools-0.1.3-py3-none-any.whl
Upload date: May 31, 2026
Size: 93.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for clear_your_tools-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`54a9d697da06db9fbd41e1209a24828461a087cf4b9f89d7ed37f4dc6ac0ff7c`
MD5	`14e33aead5aa7548bde1e601eb298fe2`
BLAKE2b-256	`0269896aa5003d68be1a643bf4355f3de20669f96e1c113bef5bce00bac6b4dd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for clear_your_tools-0.1.3-py3-none-any.whl:

Publisher: publish-pypi.yml on qdrddr/clear-your-tools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: clear_your_tools-0.1.3-py3-none-any.whl
- Subject digest: 54a9d697da06db9fbd41e1209a24828461a087cf4b9f89d7ed37f4dc6ac0ff7c
- Sigstore transparency entry: 1684691400
- Sigstore integration time: May 31, 2026
Source repository:
- Permalink: qdrddr/clear-your-tools@0a70cd244775db8f4d70068445a7389dd4d4a937
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/qdrddr
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@0a70cd244775db8f4d70068445a7389dd4d4a937
- Trigger Event: release

clear-your-tools 0.1.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Clear Your Tools

How it works

Pruning pipeline

Quick start

1. Install proxy

2. Configure the proxy

3. Run the proxy

4. Run the the Agent

5. View pruning stats savings

FAQ

Development

Limitations

Debug

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance