unlimited-mcp

MCP orchestration server: delegate work from frontier orchestrators to cheaper or specialized workers.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

triumsebas

These details have not been verified by PyPI

Project description

unlimited-mcp — Tired of burning your limits? Go unlimited.

Why • Quick start • Use cases • Architecture • Roadmap • Contributing

The problem that built this

Subscriptions burn fast. Claude's 5-hour limit, Codex credits, API costs — anyone doing serious AI-assisted development hits walls constantly.

This project was built using itself. Here's what happened:

Same project, same plan:

Without this tool: Opus designed, Sonnet implemented — hitting the 5-hour limit repeatedly over several days, and eventually the weekly limit too.

With this tool: Sonnet orchestrated opencode Go, delegating the hard work and fixes to DeepSeek V3 Flash (with DeepSeek V3 Pro doing critical reviews). Finished in a single sitting. Not a single 5-hour limit hit. opencode Go never came close to its subscription limit either.

The insight: you don't need the best model for everything. You decide what gets delegated, to which agent, running which model — the orchestrator just executes your strategy. Design and supervision stay with the expensive model; everything else goes wherever you point it.

unlimited-mcp is the infrastructure that makes this delegation possible, safe, reproducible, and extremely easy to configure.

What it does

unlimited-mcp is an MCP server that sits between your frontier orchestrator (Claude Code, Codex, etc.) and a pool of cheaper workers (local agents, local LLMs, background processes). It exposes a clean set of MCP tools that let your AI:

Delegate coding tasks to aider, opencode, or other agent CLIs running cheaper models
Run any command, script, or automation on your local system or a remote server
Run background jobs that survive your session ending — even closing Claude Code
Use local or remote GPU via Ollama, MLX, or LM Studio at zero API cost
Manage git workspaces — worktrees, commits, and merges are handled automatically
Stay in control — safety checks, path allowlists, and an audit log keep things safe

In the terms people search for: your frontier model (Claude Code, Codex) is the architect / orchestrator; the delegated agents are the workers / contractors. The rest of this README uses orchestrator and worker (or agent).

  Claude Code          Codex
  Opus · Sonnet        gpt-5.5
       │                  │
       └────────┬─────────┘
                │ MCP tools
                ▼
  ┌─────────────────────────┐
  │    unlimited-mcp        │
  │  ┌───────────────────┐  │
  │  │   Safety Layer    │  │  path allowlists · argv checks · audit log
  │  ├───────────────────┤  │
  │  │ Workspace Manager │  │  git worktrees · branches · merges
  │  ├───────────────────┤  │
  │  │   Job Runners     │  │  local async · Task Spooler (durable)
  │  └───────────────────┘  │
  └──────────┬──────────────┘
             │
    ┌────────┼────────────────┐
    ▼        ▼                ▼
Agent CLIs  Local LLMs    System Tools
aider       Ollama · MLX  commands · scripts
opencode    LM Studio     shell · automation
claude      (zero cost)   sysops tasks
jcode
goose

Key features

Feature	Description
Non-blocking delegation	Submit a job and get a `job_id` immediately; poll when ready
Background durability	Jobs backed by Task Spooler (`ts`) survive MCP restarts and session closures
Local GPU support	Use Ollama or MLX models at zero API cost — today, not in a future release
Git worktree isolation	Each coding job runs in its own branch; you review and merge when satisfied
Self-configuring	Add agents, providers, and paths by talking to Claude — no config files, no restarts
Parallel execution	Multiple agents run concurrently — tell Claude "start phase 2 and parallelize whatever has no dependencies" and it figures out what can run in parallel and what has to wait
Safety enforcement	Path allowlists, dangerous command confirmation, shell-injection blocking, audit log
Agent clarification rounds	Agents can ask the orchestrator questions before starting, preventing costly wrong assumptions

Supported agents

Agent	Use case	Tested
opencode	Full-featured coding tasks, supports subscriptions	✅
aider	Git-aware code editing with SEARCH/REPLACE diffs	✅
claude (Claude Code CLI)	Delegate to Claude Code programmatically	✅
codex	OpenAI's Codex CLI	✅
goose	Block's agentic coding with model selection	✅
hermes	Multi-provider coding agent	✅
jcode	Jcode.ai agent built in Rust — fast tool-call execution	✅
smolagents	HuggingFace Python-based agentic execution	✅
gptme	Terminal-native agent with tool use	✅
pi	Pi coding agent	✅

Note on the claude worker: Claude as a worker is best used pointed at other providers. Anthropic subscriptions cap usage per command, so a Claude-Code agent doing the heavy delegated work will hit those limits fast — the whole point of this tool is to keep your Claude orchestrator free. Use claude as a worker only for occasional high-quality tasks; send the bulk of delegated work to other providers (OpenRouter, local GPU, etc.).

Any CLI agent can be added without restarting the server. The server ships with a knowledge base of common agents; for anything new, just ask Claude to register it.

Why unlimited-mcp?

Without it	With it
Claude hits the 5h limit mid-task	Long tasks run in background agents, Claude stays free
You pay frontier prices for every line of code	Design with Opus, delegate execution to a cheaper/free LLM — you set the strategy
Closing Claude cancels in-progress work	Background jobs keep running after you close your session
Adding a new agent means editing config files	Tell Claude "add opencode with DeepSeek Flash" — done
Local GPU sits idle while paying for API	Ollama/MLX/LM Studio agents work out of the box
Infrastructure tasks need separate tooling	Run commands, scripts, and automation on any accessible server

Quick start

1. Install

# From PyPI (recommended)
pip install unlimited-mcp
# …or, isolated, with uv:
uv tool install unlimited-mcp        # then run:  uvx unlimited-mcp

# Or straight from GitHub (latest, unreleased)
pip install git+https://github.com/triumsebas/unlimited-mcp.git

Prerequisites: Python 3.11+. For background jobs: ts (task-spooler) on your PATH.

Get everything, not just the server. Installing the package gives you the MCP server only — the opinionated /delegate skill and the orchestrator companion guides live in the repo. To pull them in, just ask your AI:

Claude Code: "add the /delegate skill from https://github.com/triumsebas/unlimited-mcp/tree/main/skills/delegate" (or, if you cloned the repo, give it the local path to skills/delegate/).

Codex / other orchestrators: "add this project's AGENTS.md from https://github.com/triumsebas/unlimited-mcp/blob/main/AGENTS.md to my config" (or, if you cloned the repo, give it the local path to AGENTS.md) so it has the full delegation reference.

2. Add to your MCP client

Or add this snippet to your client's config file:

{
  "mcpServers": {
    "unlimited-mcp": {
      "command": "unlimited-mcp",
      "args": []
    }
  }
}

Claude Code — ~/.claude/settings.json

Add the mcpServers block above to your settings file, or ask Claude Code directly: "add an MCP server called unlimited-mcp that runs the unlimited-mcp command".

Claude Desktop — claude_desktop_config.json

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

Add the mcpServers block above and restart Claude Desktop.

Codex — ~/.codex/config.json

Add the mcpServers block to your Codex config file, or ask Codex: "add this MCP server to my config" and paste the snippet.

Cursor / Windsurf / other MCP clients

Open the MCP settings panel in your client and add a new server with command unlimited-mcp and no arguments.

Tip: You can skip all of this and just ask your AI: "add an MCP server called unlimited-mcp that runs the unlimited-mcp command" — it will edit the config file for you.

3. First session — just talk to Claude

No config files, no manual setup. Connect the MCP and start a conversation:

You: "I just installed Opencode. Help me register it as a coding agent using DeepSeek Flash with OpenRouter."

Claude: "I'll set that up. You'll need an API key for the provider — either tell me now or drop it in ~/.config/unlimited-mcp/.env as OPENROUTER_API_KEY if you prefer to keep it out of the conversation."

You: [key or: "done, I added it to .env"]

Claude: "Registered opencode_flash — Opencode with DeepSeek Flash via OpenRouter. Which project do you want it to work on?"

You: "/home/user/myproject"

Claude: "Path allowed. Ready to go — what do you want Opencode to work on first?"

That's it. Every agent, provider, path, and safety rule is configured through conversation. Everything is persisted automatically. The creator of this project never manually edited a config file.

Use cases

AI-assisted software development

The pattern that motivated this project: an expensive model designs and reviews, a cheap model implements for hours burning no subscription. You decide which agent handles what and when the orchestrator steps back in — the full Design → Implementation → Review flow is in PROMPTS.md.

Some operations you can offload:

Writing boilerplate, tests, and docstrings
Refactoring with well-defined acceptance criteria
Fixing lint errors and type issues across large codebases
Generating and running database migrations
Reviewing patches and suggesting improvements

Because each task runs in an isolated git worktree, your main branch is never touched until you explicitly approve and merge.

Systems operations

Automate infrastructure tasks that require many calls or long runtimes — audit a fleet, roll out config changes, run long batch jobs in the background. Runnable examples are in PROMPTS.md.

Use cases:

Audit many servers for compliance or security issues
Roll out configuration changes across a fleet
Run long batch jobs (data processing, ML training pipelines)
Any automation that would block your terminal for minutes or hours

Local GPU — zero API cost

If you have a GPU — local or on a machine at home or work — you can run entire coding sessions at zero API cost. Any agent that accepts an OpenAI-compatible endpoint works out of the box.

Supported local inference backends:

Backend	Platform	Notes
Ollama	Mac · Linux · Windows	Easy setup, wide model library
MLX	Apple Silicon	Fast, native quantization
LM Studio	Mac · Windows	GUI + OpenAI-compat API server

Just tell Claude which backend you're running and on which machine, and it will configure the agent against it. No code, no config files.

Architecture

Component map, the full internal data flow, and the JobResult envelope are in ARCHITECTURE.md.

Configuration

You don't configure unlimited-mcp — you talk to your orchestrator and it configures itself.

Every agent, provider, safety rule, and path permission is managed through conversation. No config files to edit, no syntax to learn, no server restarts.

Some examples of what you can say:

"Add opencode using DeepSeek Flash via OpenRouter for cheap tasks"
"Add a local agent using Ollama with qwen2.5-coder:32b"
"Add aider with the claude-sonnet-4-6 model for careful refactoring"
"Allow /home/user/projects so workers can access it"
"Show me what agents and providers are configured"
"Change opencode_flash to use deepseek-r1 instead"
"Remove the openrouter provider"

Everything persists automatically across sessions. API keys and secrets are stored securely, never alongside the configuration.

Note for developers: configuration lives at ~/.config/unlimited-mcp/config.yaml and can be inspected or backed up. But you'll rarely need to open it.

Built with Claude · Tested with Claude and Codex

unlimited-mcp was designed and built using Claude as the primary orchestrator — including using the server itself to delegate implementation to cheaper models during development (see the origin story).

Tested orchestrators:

Claude Code (Claude Code CLI and Desktop) — primary development environment
Codex — tested as a drop-in orchestrator

Any MCP-compatible orchestrator should work with no or minor changes. If you test it with another one, open an issue — we'll add it to the list.

Roadmap

✅ Phase 1 — Core orchestration

MCP server with FastMCP
Agent delegation (aider, opencode, claude, jcode...)
Local async runner
Git worktree workspace isolation
Safety layer (path allowlists, argv blocking)
Live configuration (no restart)

✅ Phase 2 — Background execution

Task Spooler (ts) backend — durable jobs across MCP restarts
Multi-queue routing (local, ts, ts_serial) with parallel execution
run_shell with audit log
Worker clarification rounds (clarify_rounds)
Additional agent CLIs: goose, hermes, jcode, pi, gptme
query_logs for operational observability
Local inference: Ollama, MLX, LM Studio (any OpenAI-compatible endpoint)

✅ Phase 3 — Remote execution

SSH backend — delegate tasks to remote machines; run commands, agents, and prompt-via-file over SSH. Authentication: macOS Keychain (tested ✓) · ssh-agent (tested ✓) · Linux keyring (pending). In Claude Desktop, the server has no SSH agent socket — use Keychain. In Claude Code (terminal), both ssh-agent and Keychain work. See SSH.md for setup.
Remote GPU clusters — submit jobs to remote Ollama/MLX/LM Studio servers

🔜 Phase 4 — Notifications & observability

Anthropic direct provider — query Claude models directly as a provider for summarization, analysis, and review tasks without going through an agent CLI
Webhook / instant messaging notifications — get notified when long background jobs complete
Web dashboard — monitor active and completed jobs visually

Security

unlimited-mcp runs commands with your privileges over a local stdio transport — there is no network listener, but the safety layer is a best-effort guardrail, not an OS-level sandbox. Read the Security / Threat model before pointing it at anything you care about.

Documentation

File	What it's for
README.md	This file — overview, quick start, use cases, roadmap
ARCHITECTURE.md	Internal component map, data flow, and the `JobResult` envelope
PROMPTS.md	Copy-paste example prompts and delegation flows (dev, sysops)
SECURITY.md	Threat model — access model, privilege level, what the safety layer does and does not protect
AGENTS.md	Canonical orchestration reference — decision trees, queues, timeouts, safety, `clarify_rounds`, agent config. Read natively by Codex; imported by CLAUDE.md
CLAUDE.md	Claude Code entry point — imports AGENTS.md and adds only the Claude-Code-specific notes
CONTRIBUTING.md	Developer guide — how to add agents, run tests, contribute code
skills/delegate/SKILL.md	The `/delegate` skill for Claude Code — opinionated delegation patterns; defers to AGENTS.md for the tool reference
knowledge.yaml	Built-in catalog of agent CLIs and providers — what the server knows before you configure anything

Contributing

unlimited-mcp is early-stage and contributions are very welcome — especially:

Large project users — if you're working on a big codebase or running heavy automation, your feedback is the most valuable thing right now. Real-world usage exposes what toy examples don't.
New agent CLIs — if you use a coding agent not in the knowledge base, add it
SSH backend — keyring + agent auth shipped and tested on macOS; Linux keyring and Windows are still needed
Tests — the test suite grows with every feature, more coverage always helps
Documentation — walkthroughs, examples, and troubleshooting guides

See CONTRIBUTING.md to get started.

About

Built by Sebastián Fernández · CEO at Trium Sistemas Informáticos SL

A systems engineer who dove deep into AI tooling — and built the infrastructure he wished existed.

License

Apache 2.0 — see LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

triumsebas

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.9.4

May 23, 2026

0.9.3

May 22, 2026

This version

0.9.0

May 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unlimited_mcp-0.9.0.tar.gz (279.6 kB view details)

Uploaded May 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

unlimited_mcp-0.9.0-py3-none-any.whl (139.2 kB view details)

Uploaded May 18, 2026 Python 3

File details

Details for the file unlimited_mcp-0.9.0.tar.gz.

File metadata

Download URL: unlimited_mcp-0.9.0.tar.gz
Upload date: May 18, 2026
Size: 279.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for unlimited_mcp-0.9.0.tar.gz
Algorithm	Hash digest
SHA256	`fc96ef0158fd7f0d54dff861da559fd38fd32bce6e8e7659ea9e0433e2003e45`
MD5	`543e2e90a73eed99843e68df3bf9814a`
BLAKE2b-256	`9755f71ac153c7ec2504931333e1fc473b331ccbaf0d98f860bb5cfe57eaf7f1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for unlimited_mcp-0.9.0.tar.gz:

Publisher: publish.yml on triumsebas/unlimited-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: unlimited_mcp-0.9.0.tar.gz
- Subject digest: fc96ef0158fd7f0d54dff861da559fd38fd32bce6e8e7659ea9e0433e2003e45
- Sigstore transparency entry: 1568207648
- Sigstore integration time: May 18, 2026
Source repository:
- Permalink: triumsebas/unlimited-mcp@07cdf3a93621f59774af156e6cfddb18997e3fc9
- Branch / Tag: refs/tags/v0.9.0
- Owner: https://github.com/triumsebas
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@07cdf3a93621f59774af156e6cfddb18997e3fc9
- Trigger Event: push

File details

Details for the file unlimited_mcp-0.9.0-py3-none-any.whl.

File metadata

Download URL: unlimited_mcp-0.9.0-py3-none-any.whl
Upload date: May 18, 2026
Size: 139.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for unlimited_mcp-0.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b284b9a2156d466dfe2db10ca906ff75997b748c45b14286a2fd3f3273d44c53`
MD5	`14f33138cd5ed1559d14482672bfc5de`
BLAKE2b-256	`cbe0d2d6a13eb9374c5d812134e08d0b6519684aa643a166c8f63633c4749449`

See more details on using hashes here.

Provenance

The following attestation bundles were made for unlimited_mcp-0.9.0-py3-none-any.whl:

Publisher: publish.yml on triumsebas/unlimited-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: unlimited_mcp-0.9.0-py3-none-any.whl
- Subject digest: b284b9a2156d466dfe2db10ca906ff75997b748c45b14286a2fd3f3273d44c53
- Sigstore transparency entry: 1568207719
- Sigstore integration time: May 18, 2026
Source repository:
- Permalink: triumsebas/unlimited-mcp@07cdf3a93621f59774af156e6cfddb18997e3fc9
- Branch / Tag: refs/tags/v0.9.0
- Owner: https://github.com/triumsebas
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@07cdf3a93621f59774af156e6cfddb18997e3fc9
- Trigger Event: push

unlimited-mcp 0.9.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

The problem that built this

What it does

Key features

Supported agents

Why unlimited-mcp?

Quick start

1. Install

2. Add to your MCP client

3. First session — just talk to Claude

Use cases

AI-assisted software development

Systems operations

Local GPU — zero API cost

Architecture

Configuration

Built with Claude · Tested with Claude and Codex

Roadmap

✅ Phase 1 — Core orchestration

✅ Phase 2 — Background execution

✅ Phase 3 — Remote execution

🔜 Phase 4 — Notifications & observability

Security

Documentation

Contributing

About

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance