MCP proxy that compresses tool schemas: up to 98% fewer tokens, 100% signal preserved

Refract

Cuts up to 97% of the tokens your AI agents spend using MCP tools — without losing anything.


The problem in one sentence

When your AI agent (Claude, Cursor...) connects to an external tool — your calendar, your emails, GitHub — it downloads the full description of every available tool, every single time, even if it only ends up using one.

It's like asking someone to read the entire store catalogue just to buy bread.

Refract fixes that. It sits between your agent and the tool server, and only lets through what's actually needed.


What it actually changes

| Tool set | Without Refract | With Refract |
|---|---|---|
| Filesystem tools (14 tools) | 1,892 tokens | 236 tokens (−88%) |
| Google Calendar tools (5 tools) | 5,010 tokens | 660 tokens (−87%) |
| Enterprise pack: Calendar + Gmail + Drive (12 tools) | 8,649 tokens | 882 tokens (−90%) |

Fewer tokens sent = lower API bills, faster responses.

And nothing is lost. Every check confirmed tools stay 100% usable after compression — no required information is ever stripped.
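
The percentages quoted for each tool set follow from the before and after token counts. A minimal sketch of that arithmetic, where `reduction_percent` is a hypothetical helper written for illustration and not part of Refract:

```python
def reduction_percent(before: int, after: int) -> int:
    """Percentage of tokens saved, rounded to the nearest whole percent."""
    return round(100 * (before - after) / before)


# Token counts taken from the figures quoted in this section.
measurements = {
    "Filesystem tools": (1892, 236),
    "Google Calendar tools": (5010, 660),
    "Enterprise pack": (8649, 882),
}

for name, (before, after) in measurements.items():
    print(f"{name}: {before} -> {after} tokens ({reduction_percent(before, after)}% reduction)")
```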


Install

pip install refract-mcp

That's it. No API key required, no account needed.


How to use it

With Claude Desktop

Open your Claude Desktop config file and add:

{
  "mcpServers": {
    "my-tool-via-refract": {
      "command": "refract-proxy",
      "args": [
        "--target",
        "npx @modelcontextprotocol/server-filesystem /path/to/folder",
        "--verbose"
      ]
    }
  }
}

Replace the command after --target with the launch command of any MCP server you already use. Restart Claude Desktop and Refract runs in the background.
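
If you manage several servers, the same entry can be generated programmatically. The sketch below only builds the JSON structure shown in this section; `add_refract_entry` is a hypothetical helper, not something Refract ships, and writing the result into your actual config file is left to you.

```python
import json


def add_refract_entry(config: dict, name: str, target_command: str, verbose: bool = True) -> dict:
    """Return a copy of a config dict with one Refract-wrapped MCP server added."""
    args = ["--target", target_command]
    if verbose:
        args.append("--verbose")
    servers = dict(config.get("mcpServers", {}))  # copy so the input is not mutated
    servers[name] = {"command": "refract-proxy", "args": args}
    return {**config, "mcpServers": servers}


config = add_refract_entry(
    {},
    "my-tool-via-refract",
    "npx @modelcontextprotocol/server-filesystem /path/to/folder",
)
print(json.dumps(config, indent=2))
```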

From the command line

refract-proxy --target "npx @modelcontextprotocol/server-filesystem /tmp" --verbose

The --verbose flag shows live savings:

[Refract] Connected to npx @modelcontextprotocol/server-filesystem /tmp
  14 tools  |  1892 → 236 tokens  (88% reduction)

How it works, no jargon

Refract sends the agent a compact index of tool names instead of the full description of every tool. Once the agent picks a tool, Refract delivers the full details for that tool only, and automatically checks that no important content was lost during compression. The whole process runs without any AI model: it is fully automated, fast, and predictable.
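
To make the idea concrete, here is a minimal sketch of the three steps described above: index first, full details on demand, then a check that nothing required was lost. It is an illustration under stated assumptions, not Refract's actual implementation: the two schemas are invented, and `build_index`, `expand`, and `required_params_preserved` are hypothetical names.

```python
import json

# Hypothetical tool schemas, shaped like entries in an MCP tools/list response.
FULL_SCHEMAS = {
    "read_file": {
        "name": "read_file",
        "description": "Read the complete contents of a file.",
        "inputSchema": {
            "type": "object",
            "properties": {"path": {"type": "string", "description": "File to read"}},
            "required": ["path"],
        },
    },
    "list_directory": {
        "name": "list_directory",
        "description": "List the entries of a directory.",
        "inputSchema": {
            "type": "object",
            "properties": {"path": {"type": "string", "description": "Directory to list"}},
            "required": ["path"],
        },
    },
}


def build_index(schemas):
    """Step 1: the compact index the agent sees first (tool names only)."""
    return sorted(schemas)


def expand(schemas, name):
    """Step 2: full details for the one tool the agent picked."""
    return schemas[name]


def required_params_preserved(original, delivered):
    """Step 3: check that every required parameter is still declared."""
    required = set(original["inputSchema"].get("required", []))
    delivered_schema = delivered["inputSchema"]
    return (
        required <= set(delivered_schema.get("required", []))
        and required <= set(delivered_schema.get("properties", {}))
    )


index = build_index(FULL_SCHEMAS)
chosen = expand(FULL_SCHEMAS, "read_file")
index_size = len(json.dumps(index))
full_size = len(json.dumps(list(FULL_SCHEMAS.values())))
print(index)
print(f"{index_size} characters in the index vs {full_size} for the full catalogue")
print(required_params_preserved(FULL_SCHEMAS["read_file"], chosen))
```

The sketch compares character counts rather than tokens, since token counts depend on the model's tokenizer.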


Works with

  • Claude Desktop
  • Cursor
  • Any client that follows the MCP (Model Context Protocol) standard
  • Any existing MCP server — your internal tools, GitHub, Google Workspace, Slack, etc.

For developers

Python usage

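import asyncio

from refract_proxy import RefractProxy


async def main():
    # Wrap an existing MCP server; the target is the command that launches it
    proxy = RefractProxy(
        target_url="npx @modelcontextprotocol/server-filesystem /tmp",
        verbose=True,
    )
    await proxy.connect()

    # Use compressed tools directly with the Anthropic API
    tools = proxy.as_anthropic_tools(use_cache=True)

    # Or serve as a local MCP server (stdio)
    await proxy.serve()

    # Or expose it via HTTP/SSE (use instead of serve() above)
    # await proxy.serve_http()  # → http://localhost:8080/sse


# `await` only works inside a coroutine, so run everything through asyncio
asyncio.run(main())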

HTTP/SSE mode

refract-proxy --target "https://my-mcp-server.com" --mode http --port 8080

With a local schema file (for testing)

refract-proxy --target schemas/mcp_calendar_schemas.json --verbose

Built-in Anthropic caching

Refract integrates with Anthropic prompt caching: as_anthropic_tools() automatically marks the compressed catalogue as cacheable, cutting costs even further on repeated requests.
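
For readers unfamiliar with prompt caching: the Anthropic API lets you place a `cache_control` marker on the last tool definition, which makes the whole tool list up to that point cacheable. The sketch below shows what such marking looks like on a plain list of tool dicts. It is an assumption that `as_anthropic_tools()` does something along these lines; `mark_cacheable` and the two example tools are hypothetical.

```python
def mark_cacheable(tools):
    """Return a copy of `tools` with a cache breakpoint on the last entry."""
    if not tools:
        return []
    marked = [dict(tool) for tool in tools]  # shallow copies; the input stays untouched
    marked[-1]["cache_control"] = {"type": "ephemeral"}
    return marked


# Hypothetical tool definitions in the Anthropic tools format.
tools = [
    {"name": "read_file", "description": "Read a file.",
     "input_schema": {"type": "object", "properties": {"path": {"type": "string"}}, "required": ["path"]}},
    {"name": "list_directory", "description": "List a directory.",
     "input_schema": {"type": "object", "properties": {"path": {"type": "string"}}, "required": ["path"]}},
]

cacheable_tools = mark_cacheable(tools)
print(cacheable_tools[-1]["cache_control"])
```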

Example over 30 days, 100 requests/day, 5,000 tokens of schemas:

| Scenario | Cost |
|---|---|
| Without Refract, without cache | $45.00 |
| With Refract + cache | $1.49 |
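
The baseline figure can be reproduced with simple arithmetic. The sketch below assumes an input price of $3.00 per million tokens, which is not stated here but is consistent with the $45.00 figure; `schema_cost_usd` is a hypothetical helper. The $1.49 figure depends on the compression ratio and cache pricing used, which are not given, so it is not reproduced.

```python
def schema_cost_usd(days, requests_per_day, schema_tokens, usd_per_million_tokens):
    """Cost of sending the tool schemas with every request over a period."""
    total_tokens = days * requests_per_day * schema_tokens
    return total_tokens * usd_per_million_tokens / 1_000_000


# 30 days x 100 requests/day x 5,000 schema tokens = 15,000,000 tokens.
# The $3.00 per million tokens price is an assumption, not a figure from Refract.
baseline = schema_cost_usd(30, 100, 5_000, 3.00)
print(f"${baseline:.2f}")  # $45.00
```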

License

MIT: free to use, including commercially.

