MCP executor for Claude Code or Codex that offloads repetitive coding work to cheaper local or flat-rate models.

These details have not been verified by PyPI

Project links

Project description

HermitAgent

Hermit is the executor layer for Claude Code or Codex: keep the premium orchestrator for planning and review, and offload the repetitive coding work to cheaper local or flat-rate models.

┌──────────────┐
│  Claude Code │──┐
│  (planner)   │  │    ┌──────────────┐   any OpenAI-compatible   ┌───────┐
└──────────────┘  ├───▶│  HermitAgent │ ────────────────────────▶ │  LLM  │
                  │    │  (executor)  │                           └───────┘
┌──────────────┐  │    └──────────────┘
│    Codex     │──┘         local / flat-rate by default
│  (planner)   │
└──────────────┘

Claude Code or Codex stays in charge of planning, interviewing, and review. Hermit takes the mechanical path: file edits, test runs, refactors, commits, and MCP-executed follow-through on cheaper execution models. The switch is one word in a slash command: /foo → /foo-hermit.

Why Hermit stands out:

Keep your best reasoning model on the work that needs judgment, not boilerplate execution.
Use MCP to turn planner decisions into concrete repo changes, tests, commits, and release operations.
Default to predictable local / flat-rate executor routing instead of silently drifting onto a paid hosted fallback.
Work with both Claude Code and Codex instead of forcing a single orchestrator stack.

Why not just use Claude Code or Codex directly?

Workflow shape	Claude Code / Codex alone	With Hermit
Planning and review	Strong	Still strong — keep the premium orchestrator where judgment matters
Repetitive repo work	Expensive or token-heavy	Offloaded to a cheaper MCP executor lane
Multi-step follow-through	Manual context handoff	MCP tasks can carry edits, tests, commits, and release ops through
Default execution cost	Can drift onto paid hosted models	Defaults to local / flat-rate executor routing
Team adoption	Tied to one orchestrator workflow	Works as a shared executor layer across Claude Code and Codex

Hermit is not trying to replace your orchestrator. It gives you a second lane: use the premium model for judgment, and use Hermit for the mechanical throughput that makes repositories expensive to operate at scale.

Who Hermit is for

Teams that already like Claude Code or Codex for planning, review, and decision-making, but want a cheaper execution lane for repo mechanics.
Developers who want MCP-driven follow-through on edits, tests, commits, and release chores without spending premium-model tokens on every step.
Repositories that need predictable default routing toward local or flat-rate models instead of surprising hosted fallback costs.
Maintainers who want one shared executor layer even if different contributors prefer different orchestrators.

Who Hermit is not for

People looking for a brand-new premium planner to replace Claude Code or Codex entirely.
Teams that want a single hosted model to do both judgment and execution with no planner/executor split.
Workflows where provider cost predictability, MCP task handoff, and execution-lane separation are not important.

If your pain is not "my orchestrator is smart enough, but too much of its time is spent on repetitive repo labor," Hermit is probably not the right abstraction.

Install

npm install -g @cafitac/hermit-agent
hermit

Requires Node.js 20+ and Python 3.11+. The npm package bootstraps a managed Python runtime under ~/.hermit/ on first run — no repo checkout needed. If Claude Code or Codex integration is still missing, hermit will offer guided setup automatically. You can still run hermit install directly when you want to force the full setup/repair flow.

To upgrade: hermit update

Quick start

hermit-mcp-server   # starts the gateway + MCP stdio server

Then in Claude Code:

/feature-develop-hermit <task>

Claude interviews, writes the plan, and delegates implementation to Hermit over MCP. Executor tokens never hit your orchestrator bill.

Reference skills

Four example skills ship under .claude/commands/. Fork these into your own workflow:

Command	Claude does	Hermit does
`/feature-develop-hermit`	interview + plan	implement + test
`/code-apply-hermit`	read PR review	apply every change
`/code-polish-hermit`	pick what to polish	lint/test loop
`/code-push-hermit`	write PR description	commit + push

See docs/hermit-variants.md to add your own.

Executor LLM

ollama (local, free):

brew install ollama && ollama pull qwen3-coder:30b

z.ai (flat-rate subscription) — add to ~/.hermit/settings.json:

{
  "providers": {
    "z.ai": {
      "base_url": "https://api.z.ai/api/coding/paas/v4",
      "api_key": "<your key>",
      "anthropic_base_url": "https://api.z.ai/api/anthropic"
    }
  }
}

Configuration

~/.hermit/settings.json (created by hermit install):

{
  "gateway_url": "http://localhost:8765",
  "gateway_api_key": "hermit-mcp-…",
  "model": "__auto__",
  "routing": {
    "priority_models": [
      {"model": "glm-5.1"},
      {"model": "qwen3-coder:30b"}
    ]
  }
}

model controls the default model for plain hermit. Set it to __auto__ if you want plain hermit to follow the routing.priority_models order. routing.priority_models is the ordered fallback chain for auto-routing in gateway / interactive flows, and providers that are not configured or installed are skipped automatically. If model is a concrete name like gpt-5.4, plain hermit stays pinned to that model even if you reorder priority_models.

By default, hermit install now keeps Codex out of routing.priority_models and treats it as an explicit opt-in executor path instead of an automatic fallback. This is intentional: local / flat-rate executor models stay the safe default, while Codex remains available when a user explicitly pins it or adds it back to routing. That separation makes billing behavior more predictable, keeps executor defaults aligned with Hermit's "cheap mechanical work" role, and avoids surprising auto-routing onto a paid hosted model.

Architecture

AgentLoop — LLM turn → tool call → result → compact on context fill
Gateway — FastAPI relay in front of the executor (routing, 429 failover, dashboard at :8765)
MCP server — run_task / reply_task / check_task / cancel_task
TUI — optional React+Ink terminal UI for standalone interactive sessions (hermit)

Tests

.venv/bin/pytest tests/

Status

Early, working, MIT. No release cadence guarantees.

License

MIT — see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.58

Apr 27, 2026

0.3.57

Apr 27, 2026

0.3.56

Apr 27, 2026

0.3.55

Apr 27, 2026

0.3.54

Apr 27, 2026

0.3.53

Apr 27, 2026

This version

0.3.52

Apr 27, 2026

0.3.51

Apr 27, 2026

0.3.50

Apr 27, 2026

0.3.49

Apr 27, 2026

0.3.48

Apr 27, 2026

0.3.47

Apr 27, 2026

0.3.46

Apr 27, 2026

0.3.45

Apr 27, 2026

0.3.44

Apr 27, 2026

0.3.43

Apr 27, 2026

0.3.42

Apr 27, 2026

0.3.41

Apr 27, 2026

0.3.40

Apr 27, 2026

0.3.39

Apr 27, 2026

0.3.38

Apr 27, 2026

0.3.37

Apr 27, 2026

0.3.36

Apr 27, 2026

0.3.35

Apr 27, 2026

0.3.34

Apr 27, 2026

0.3.33

Apr 27, 2026

0.3.32

Apr 27, 2026

0.3.31

Apr 26, 2026

0.3.30

Apr 26, 2026

0.3.29

Apr 26, 2026

0.3.28

Apr 26, 2026

0.3.27

Apr 26, 2026

0.3.26

Apr 26, 2026

0.3.25

Apr 26, 2026

0.3.24

Apr 26, 2026

0.3.23

Apr 26, 2026

0.3.22

Apr 26, 2026

0.3.21

Apr 26, 2026

0.3.20

Apr 26, 2026

0.3.18

Apr 26, 2026

0.3.17

Apr 26, 2026

0.3.16

Apr 26, 2026

0.3.15

Apr 26, 2026

0.3.14

Apr 26, 2026

0.3.11

Apr 26, 2026

0.3.10

Apr 24, 2026

0.3.9

Apr 23, 2026

0.3.7

Apr 23, 2026

0.3.6

Apr 23, 2026

0.3.5

Apr 23, 2026

0.3.4

Apr 23, 2026

0.3.3

Apr 23, 2026

0.3.2

Apr 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cafitac_hermit_agent-0.3.52.tar.gz (462.1 kB view details)

Uploaded Apr 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cafitac_hermit_agent-0.3.52-py3-none-any.whl (415.3 kB view details)

Uploaded Apr 27, 2026 Python 3

File details

Details for the file cafitac_hermit_agent-0.3.52.tar.gz.

File metadata

Download URL: cafitac_hermit_agent-0.3.52.tar.gz
Upload date: Apr 27, 2026
Size: 462.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cafitac_hermit_agent-0.3.52.tar.gz
Algorithm	Hash digest
SHA256	`a075b1012248724879d5596c6eb7478e8329a45006370ae9418e164dbc8ee4b4`
MD5	`e93da98730e0d246075f5c0fe084f5f5`
BLAKE2b-256	`2becbd4f93dcd0630a639792039d1b74b6b1d1aefbff34660459d7b1e314abb9`

See more details on using hashes here.

File details

Details for the file cafitac_hermit_agent-0.3.52-py3-none-any.whl.

File metadata

Download URL: cafitac_hermit_agent-0.3.52-py3-none-any.whl
Upload date: Apr 27, 2026
Size: 415.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cafitac_hermit_agent-0.3.52-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d52825d4845545bdc684c37fd632df7ffb11966fa95075422a477340c4fe752a`
MD5	`55ed26bdcc3534e4e3ce79864975affa`
BLAKE2b-256	`a31e27b23d21e60a1641b6d47464be3a24bf4ee8be5599d8b920ae91b2484428`

See more details on using hashes here.

cafitac-hermit-agent 0.3.52

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

HermitAgent

Why not just use Claude Code or Codex directly?

Who Hermit is for

Who Hermit is not for

Install

Quick start

Reference skills

Executor LLM

Configuration

Architecture

Tests

Status

License

See also

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes