Skip to main content

Transparent MCP proxy that reduces token cost of large tool responses via content-aware handles

Project description

jmunch-mcp

Transparent MCP proxy that reduces the token cost of large upstream tool responses for nearly every other MCP server imaginable. Wraps a single upstream MCP, forwards every call, and handle-ifies fat payloads into content-aware backends the agent can query with a small set of universal verbs (peek, slice, search, aggregate, describe, list_handles).

Benchmarks

Measured end-to-end against two popular real-world MCP servers. Each run fires a fixed script of tool calls twice — once direct, once through jmunch-mcp — with three follow-up jmunch.* verb calls on the proxied side to model an agent drilling into a large result rather than slurping it whole.

suite upstream direct tokens via jmunch-mcp saved
GitHub (facebook/react issues/PRs/commits) @modelcontextprotocol/server-github 379,878 44,328 335,550 (88.3%)
Firecrawl (Wikipedia scrapes + site map + search) firecrawl-mcp 259,574 2,928 256,646 (98.9%)

Wall-clock time was also faster with the proxy on both suites, despite the extra verb calls — the agent never has to page through the fat payload:

suite direct via jmunch-mcp delta
GitHub 8.4s 6.8s −1.6s (−19.0%)
Firecrawl 16.4s 9.2s −7.2s (−43.9%)

Tabular content (GitHub) routes to the SQLite backend and answers peek/slice/aggregate; JSON content (Firecrawl scrape/map) routes to the JSON-tree backend and answers peek/slice (JSONPath)/search. See bench/README.md to reproduce.

Install

pip install jmunch-mcp

From source:

git clone https://github.com/jgravelle/jmunch-mcp
cd jmunch-mcp
pip install -e .

Quickstart

jmunch-mcp init

init scans three sources — your MCP client configs (Claude Desktop, Claude Code, Cursor, Windsurf, Continue), running processes, and a small catalog of popular upstreams (GitHub, Firecrawl, filesystem, fetch, Brave Search, Slack) — and renders a checklist. Tick the upstreams you want wrapped, and it writes one <name>.toml per selection into ./configs/. Non-interactive flags: --yes (pick everything already registered in a client), --dry-run, --overwrite, --out <dir>, --no-running, --no-catalog.

Manual

jmunch-mcp --config examples/config.toml

Configure your MCP client to launch jmunch-mcp --config <path> instead of the upstream server directly. Add --report to print a session summary on shutdown.

Gateway mode (v2 — universal proxy)

The MCP proxy above saves tokens for MCP clients. The gateway saves tokens for any AI application that speaks the OpenAI or Anthropic HTTP API — LangChain, LlamaIndex, CrewAI, AutoGen, Continue, Cline, Aider, or a raw SDK. No code changes in the app; just point base_url at jmunch.

pip install 'jmunch-mcp[gateway]'
jmunch-mcp gateway --config configs/gateway.example.toml
# listening on http://127.0.0.1:7879

Point your app:

# OpenAI SDK, LangChain, Aider, Continue, Cline, Ollama-compat apps:
export OPENAI_API_BASE=http://127.0.0.1:7879/v1

# Native Anthropic SDK / Claude Code:
export ANTHROPIC_BASE_URL=http://127.0.0.1:7879

What it does, transparently:

  • Handle-ifies fat tool_results in outgoing requests — your app's tool returns 100KB of JSON, the model sees a 1KB summary + opaque handle.
  • Injects jmunch verbs (peek, slice, search, aggregate, describe, summarize, list_handles) into the request's tools array so the model can drill in.
  • Short-circuits verb calls — when the model calls jmunch_peek, the gateway resolves it locally against the handle registry and synthesizes the follow-up turn. The app never sees jmunch tool_calls; those completions cost zero upstream tokens.
  • Persists handles to ~/.jmunch/handles.db with a configurable TTL so they survive restarts and cross-session reads.
  • Streams both ways — OpenAI SSE and Anthropic event streams are buffer-then-replayed with correct verb resolution.

Per-request controls via headers:

  • X-Jmunch-Upstream: <name> — override the configured upstream.
  • X-Jmunch-Inject: false — disable tool injection for this call (pure pass-through + request-side handle-ify only).

Metrics flow into the same dashboard as the MCP proxy. Filter with ?surface=gateway or ?surface=mcp on /api/stats and /api/calls.

Dashboard

A read-only local web UI over the metrics DB each proxy writes to. Shows cumulative totals, per-upstream breakdowns, and a time series of forwarded calls.

jmunch-mcp dashboard              # http://127.0.0.1:7878
jmunch-mcp dashboard --open       # also open in your default browser

Flags: --port (default 7878), --host (default 127.0.0.1), --db <path> to point at a non-default metrics DB, --open to launch the browser. Metrics only populate once proxies have recorded calls, so run your client against a wrapped upstream first.

License

jmunch-mcp is released under the MIT License — free to use, modify, distribute, and embed in commercial products.

Note that licensing of upstream MCP services you proxy through jmunch-mcp is governed by those services' own terms. This applies to third-party MCP servers (GitHub, filesystem providers, vendor APIs) and to any sibling tools in the broader retrieval ecosystem you may compose with — check each upstream's license before redistribution.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jmunch_mcp-0.2.0.tar.gz (108.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

jmunch_mcp-0.2.0-py3-none-any.whl (86.8 kB view details)

Uploaded Python 3

File details

Details for the file jmunch_mcp-0.2.0.tar.gz.

File metadata

  • Download URL: jmunch_mcp-0.2.0.tar.gz
  • Upload date:
  • Size: 108.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for jmunch_mcp-0.2.0.tar.gz
Algorithm Hash digest
SHA256 153209f89f6136d26a2b514b83f7405e2809883759b88a90168633d9dd0d7a9d
MD5 e981cb27e5ddaf29f3e3346ef8b5c568
BLAKE2b-256 4a28d54b888c356ad7ed23f992aa1c414563f202a3793eee0c40395bc41b823d

See more details on using hashes here.

File details

Details for the file jmunch_mcp-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: jmunch_mcp-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 86.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for jmunch_mcp-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6b2355b895309922361f088ea50215ef53c9953f38a8cdfd003d5df9d7c948e6
MD5 5c57411bf54783cccaa3c2da3dff7b49
BLAKE2b-256 ea4d9e2f393d5b83d1dd9518c8ee2f854d45167cc323ffd4942d95a752b704dd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page