Local LLM router that cuts premium-model spend with adaptive 3-tier routing, OpenAI + Anthropic compatible
UncommonRoute
Cut your LLM costs by 82% with automatic model routing.
Most of your LLM budget goes to simple tasks that don't need a premium model. UncommonRoute picks the cheapest model that still gets the job done — automatically.
Quick Start
1. Install
pip install uncommon-route
2. Run the guided setup
uncommon-route init
The wizard walks you through:
- choosing a connection path: Commonstack, local/custom upstream, or BYOK
- saving upstream credentials locally
- configuring Claude Code, Codex, or OpenAI SDK / Cursor
- optionally starting the proxy in background
If you prefer to sanity-check before starting the proxy:
uncommon-route doctor
3. Point your client at the proxy
| Client | Change |
|---|---|
| Claude Code | export ANTHROPIC_BASE_URL="http://localhost:8403" |
| Codex / Cursor / OpenAI SDK | export OPENAI_BASE_URL="http://localhost:8403/v1" |
| OpenClaw | Plugin — see openclaw.ai |
Then use uncommon-route/auto as the model ID:
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8403/v1", api_key="unused")  # the proxy holds the real keys
resp = client.chat.completions.create(model="uncommon-route/auto", messages=msgs)
# → simple tasks → cheap model, complex tasks → premium model
Works with Claude Code, Codex, Cursor, the OpenAI SDK, and OpenClaw.
Manual setup (advanced)
Commonstack managed upstream
export UNCOMMON_ROUTE_UPSTREAM="https://api.commonstack.ai/v1"
export UNCOMMON_ROUTE_API_KEY="csk-your-key"
uncommon-route serve
One key gives you OpenAI, Anthropic, Google, xAI, MiniMax, Moonshot, DeepSeek, and more — consolidated billing, no per-provider setup.
Bring your own keys (BYOK)
uncommon-route provider add openai sk-...
uncommon-route provider add anthropic sk-ant-...
uncommon-route provider add google AIza...
# also supported: xai, minimax, moonshot, deepseek
uncommon-route serve
Auto-routing will only consider models backed by a registered provider.
Note: UncommonRoute does not auto-read OPENAI_API_KEY / ANTHROPIC_API_KEY. Use uncommon-route init, a saved connection, or one of the manual paths above.
How It Works
Every request is analyzed by three independent signals, then routed to the cheapest capable model:
"hello" → 🟢 nano $0.0008
"fix the typo on line 3" → 🟢 deepseek $0.0012
"refactor this 500-line module" → 🟠 sonnet $0.0337
"design a distributed scheduler" → 🔴 opus $0.0562
| Signal | What it does | Speed (CPU, warm) |
|---|---|---|
| Metadata | Conversation structure, tool usage, depth | <1ms |
| Embedding | Semantic similarity to known task patterns (bge-small) | ~20ms |
| Structural | Text complexity features (shadow mode) | <1ms |
End-to-end route() overhead on a warm process is ~20–25ms (dominated by the embedding signal). Cold start is a few hundred ms for the first request. GPU or a cached embedding path can bring this under 5ms; benchmark with scripts/bench_overhead.py.
Signals vote. The ensemble picks the tier. The router selects the cheapest model in that tier. If uncertain, it leans conservative — better to spend a little more than to fail the task.
It gets smarter over time. Signal weights adjust from routing outcomes. The embedding index grows with usage. Low-confidence predictions automatically escalate.
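The vote-then-escalate logic described above can be sketched roughly as follows. The signal names, weights, tier names, and confidence threshold here are illustrative assumptions, not the package's actual implementation:

```python
from collections import defaultdict

# Illustrative tier ladder, cheapest → most capable (names are ours, not the router's).
TIERS = ["nano", "cheap", "mid", "premium"]

def pick_tier(votes, weights, confidence_floor=0.55):
    """votes: {signal_name: tier}; weights: {signal_name: float}."""
    score = defaultdict(float)
    for signal, tier in votes.items():
        score[tier] += weights.get(signal, 0.0)
    total = sum(score.values())
    tier, best = max(score.items(), key=lambda kv: kv[1])
    # Low-confidence vote → escalate one tier: better to overspend than fail the task.
    if total and best / total < confidence_floor:
        tier = TIERS[min(TIERS.index(tier) + 1, len(TIERS) - 1)]
    return tier

print(pick_tier({"metadata": "cheap", "embedding": "cheap", "structural": "mid"},
                {"metadata": 0.3, "embedding": 0.5, "structural": 0.2}))
# → cheap
```

The "leans conservative" behavior is the escalation branch: when no tier wins a clear majority of the weighted vote, the sketch bumps the winner up one tier rather than risking a cheap model on an ambiguous task.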
Why v2
Our v1 classifier hit 88.5% accuracy on clean benchmark data. We shipped it.
Then we tested on real agent conversations — multi-turn, tool-calling, messy context — and accuracy dropped to 43%. More than half the routing decisions were wrong.
We didn't patch it. We rebuilt from scratch.
| | v1 | v2 |
|---|---|---|
| Accuracy | 43% | 78% |
| Task pass rate | 100% (cheated — always chose most expensive) | 93.4% (real routing) |
| Cost savings | 0% | 82% |
We're telling you this because we'd rather you trust our numbers than be impressed by them.
Benchmarks
Tested on CommonRouterBench — 970 real agent task traces across SWE-Bench, BFCL, MT-RAG, QMSum, and PinchBench. All numbers measured end-to-end through the production code path.
| Metric | Value |
|---|---|
| Cost savings | 82% vs always-premium |
| Task pass rate | 93.4% |
| Routing overhead | ~20–25ms (warm process, CPU, bge-small embedding) |
| Accuracy | 78% tier match |
python scripts/eval_v2.py # reproduce it yourself
Dashboard
uncommon-route serve
# → http://localhost:8403/dashboard/
Real-time monitoring, interactive playground, cost tracking, and model routing configuration — all in a Nothing Design-inspired interface.
Diagnostics
When a user hits a routing or upstream issue, you can export a local support bundle without guessing which logs to collect:
uncommon-route support bundle
uncommon-route support request <request_id>
The bundle includes recent request traces, recent errors, stats summaries, provider/config snapshots, and redacted local state. It stays on your machine until you choose to share it.
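Conceptually, the redaction step works like a pattern-based scrubber run over traces before they are written. The pattern and function below are a hypothetical sketch, not the package's actual redaction rules:

```python
import re

# Mask anything shaped like a provider API key before it lands in a bundle.
# Key prefixes mirror the ones shown in the BYOK setup; the shipped rules may differ.
KEY_PATTERN = re.compile(r"sk-ant-[A-Za-z0-9-]+|sk-[A-Za-z0-9-]{8,}|csk-[A-Za-z0-9-]+|AIza[A-Za-z0-9_-]+")

def redact(text: str) -> str:
    return KEY_PATTERN.sub("[REDACTED]", text)

print(redact("upstream auth failed for key sk-abc12345defgh"))
# → upstream auth failed for key [REDACTED]
```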
Configuration
Routing modes
| Mode | Model ID | Behavior |
|---|---|---|
| auto | uncommon-route/auto | Balanced — best quality-per-dollar |
| fast | uncommon-route/fast | Cost-first — cheapest acceptable |
| best | uncommon-route/best | Quality-first — strongest available |
Spend limits
uncommon-route spend set daily 20.00
uncommon-route spend status
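The daily cap acts as a pre-request gate: if an estimated cost would push the day's total over the limit, the request is refused. A minimal sketch of that idea (the class, reset behavior, and estimates are illustrative, not the actual implementation):

```python
from datetime import date

class SpendGuard:
    """Hypothetical daily-cap gate, in the spirit of `spend set daily`."""

    def __init__(self, daily_limit_usd: float):
        self.limit = daily_limit_usd
        self.day = date.today()
        self.spent = 0.0

    def allow(self, est_cost_usd: float) -> bool:
        today = date.today()
        if today != self.day:            # new day: reset the running total
            self.day, self.spent = today, 0.0
        if self.spent + est_cost_usd > self.limit:
            return False                 # would exceed the daily cap
        self.spent += est_cost_usd
        return True

guard = SpendGuard(20.00)
print(guard.allow(0.05))  # → True
```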
Managing providers
uncommon-route provider list
uncommon-route provider add <name> <api-key>
uncommon-route provider remove <name>
Supported names: commonstack, openai, anthropic, google, xai, minimax, moonshot, deepseek. See Quick Start for the two setup paths (managed upstream vs. BYOK).
All environment variables
| Variable | Meaning |
|---|---|
| UNCOMMON_ROUTE_UPSTREAM | Upstream base URL for the managed path (e.g. https://api.commonstack.ai/v1). Ignored in BYOK mode. |
| UNCOMMON_ROUTE_API_KEY | API key paired with UNCOMMON_ROUTE_UPSTREAM. Not a fallback for per-provider keys. |
| UNCOMMON_ROUTE_PORT | Local proxy port (default 8403). |
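Clients that build the proxy URL programmatically can resolve the port the same way the table describes: read UNCOMMON_ROUTE_PORT and fall back to 8403. The helper name below is ours, not part of the package:

```python
import os

def proxy_base_url() -> str:
    # UNCOMMON_ROUTE_PORT falls back to the documented default of 8403.
    port = int(os.environ.get("UNCOMMON_ROUTE_PORT", "8403"))
    return f"http://localhost:{port}/v1"

print(proxy_base_url())  # http://localhost:8403/v1 unless UNCOMMON_ROUTE_PORT is set
```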
Privacy
Runs entirely on your machine. No data leaves unless you opt in.
uncommon-route telemetry status
Diagnostics exports are also local-first: uncommon-route support bundle writes a redacted zip under ~/.uncommon-route/support/ by default.
Development
git clone https://github.com/CommonstackAI/UncommonRoute.git
cd UncommonRoute && pip install -e ".[dev]"
python -m pytest tests -v
License
MIT — see LICENSE.