Intent-classification preselector for agent runtimes — open the library, hide the cost

These details have not been verified by PyPI

Project links

Project description

mind-nerve

Intent-classification preselector for agent runtimes.

A small, fast classifier that sits between a user request and the host runtime. It reads the request, decides which subset of available tools/skills/agents is relevant, and hands the host a short list — so the downstream LLM never sees the full library in its system prompt.

The result: library size decouples from token cost. Hosting 4,400 skills costs the same prompt budget as hosting 44, because only the top-K are ever loaded per turn.

Status

Phase 1 — public alpha (v0.1.0-alpha.5, 2026-05-16). Python wheel on PyPI; weights on Hugging Face. The router runs end-to-end on PyTorch via BAAI/bge-small-en-v1.5 fine-tuned with MultipleNegativesRankingLoss; top-5 accuracy is 96.06 % against the v1.1-oss catalog of 11,922 routing candidates.

MIND Language Profile target: default (full tensor stdlib + Q16.16 + heap) — see mind Phase 10.6 for the --profile flag landing in mindc 0.2.6.

PyPI: https://pypi.org/project/mind-nerve/
Weights: https://huggingface.co/star-ga/mind-nerve-phase1

Phase 2 replaces the PyTorch path with a native MIND Q16.16 inference loop and adds the cross-architecture bit-identity gate + 4-core CPU p95 ≤ 30 ms latency budget. Phase 2 is gated on mindc 0.2.6 (pub fn → C symbol export) and 0.3.0 (cdylib emit). Until Phase 2 closes, the inference path uses external ML tooling — explicitly permitted by the ROADMAP Phase 1 exception.

Quickstart

pip install mind-nerve

from mind_nerve import route
result = route("git status", top_k=5)
for r in result.routes:
    print(r.score, r.name, r.kind)

The first call auto-downloads the Phase-1 weights (~150 MB) from star-ga/mind-nerve-phase1 into ~/.local/share/mind-nerve/runtime/. To pre-seed or use a custom location, set MIND_NERVE_RUNTIME_DIR=/path/to/your/runtime/.

Daemon mode (recommended for hooks)

For hot-path callers (CLI hooks, the MCP server, any tool that hits route() many times per minute) run the daemon and connect over the UNIX socket — it loads the runtime once and serves sub-30 ms round-trips after warmup.

mind-nerve-routed &       # listens on $XDG_RUNTIME_DIR/mind-nerve.sock

import json, socket, os
def route(prompt: str, top_k: int = 5) -> dict:
    sock = f"{os.environ.get('XDG_RUNTIME_DIR', f'/run/user/{os.getuid()}')}/mind-nerve.sock"
    with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as s:
        s.connect(sock)
        s.sendall(json.dumps({"prompt": prompt, "top_k": top_k}).encode() + b"\n")
        return json.loads(s.makefile("r").readline())

Skill preselection for Claude Code

If you use Claude Code with a large ~/.claude/skills/ directory, mind-nerve can rewrite that directory on every prompt to only the top-K most relevant skills:

pip install mind-nerve
mind-nerve-install install --cli claude-code --with-preselect

That wires two hooks into ~/.claude/settings.json:

SessionStart: spawns the mind-nerve-routed daemon if not already running (~7 s warmup; sub-30 ms responses afterwards).
UserPromptSubmit: asks the daemon for the top-K matching skills and atomically rewrites ~/.claude/skills/ as a directory of symlinks pointing into your real catalog.

The installer auto-detects two install layouts:

Regular: your existing ~/.claude/skills/ directory is renamed once to ~/.claude/skills.full/. After that the daemon projects a top-K subset back into ~/.claude/skills/ per turn.
Shared catalog (e.g. STARGA's ~/.agents/skills/ linked across multiple CLIs): the shared catalog stays put. mind-nerve projects from there into ~/.claude/skills/ per turn.

If you also use mind-mem for durable memory, add the companion MCP:

mind-nerve-install install --cli claude-code --with-preselect --with-mind-mem

mind-nerve handles intent routing; mind-mem provides search-backed memory. Together they bracket the prompt path.

Why this exists

Agent runtimes today load entire skill/tool/MCP libraries into the LLM's system prompt on every turn. At small scale this is fine. At hundreds of skills, the prompt-cache and per-call token cost become the binding constraint on library growth.

Standard responses to this problem all degrade either correctness or latency:

Vector-only retrieval over skill descriptions loses precise intent matching
LLM-based routing pays full inference cost just to decide what to load
Manual skill grouping shifts the problem onto the operator

mind-nerve takes the third option: a purpose-built sub-50M-parameter classifier that runs in tens of milliseconds on CPU, returns top-K relevant routes, and is small enough to call on every turn without paying real cost.

Integration surface

mind-nerve exposes a single contract across two host classes:

Claude Code, codex, gemini, vibe, and 13 other CLIs — preselects which agent skills load into the system prompt for a given turn
MCP servers — preselects which tools are surfaced as candidates before the calling LLM sees the full registry

Same model, same binary, same evidence chain — both host targets.

Design constraints (non-negotiable)

Latency p95 ≤ 30 ms on CPU. If we miss this, the preselector becomes the bottleneck instead of relieving it.
Cross-architecture bit-identity. Same request on x86, ARM, CUDA, WebGPU returns the same top-K. Q16.16 fixed-point throughout, no IEEE-754 fallback in the inference path.
No training data leakage at inference. The classifier reveals only route names, never the training corpora content.
Tamper detection. Every inference emits an attestation envelope tying the request hash, model hash, and result hash into the evidence chain.

Architecture (one paragraph)

Asymmetric encoder/decoder with a classifier head. Encoder reads the request, no feed-forward blocks (attention + gated residuals only) for compact representation. Decoder cross-attends to the encoder output and to a fixed embedding of every available route (skills/tools/agents). Classifier head emits per-route relevance scores. Top-K extraction is deterministic tie-breaking by route ID hash. Full spec in spec/architecture.md.

Repository structure

mind-nerve/
  README.md                       this file
  ROADMAP.md                      phased delivery plan
  LICENSE.md                      Apache-2.0 architecture, weights separate
  spec/                           authoritative design documents
    architecture.md
    quality_targets.md
    integration_surface.md
  src/                            pure MIND implementation
    lib.mind
    model.mind
    inference.mind
    evidence.mind
  cli/
    main.mind                     single-binary entrypoint
  integrations/
    claude-code/                  TypeScript hook shim
    codex/                        shell hook wrapper
    mcp/                          MCP server façade
  tests/
    bit_identity/                 cross-architecture reproducibility
    accuracy/                     classification benchmarks

License

mind-nerve ships under Apache-2.0 — repository, Python wheel, and the Phase-1 trained weights on Hugging Face all carry the same license. The wheel additionally bundles libmindnerve.so, a FORTRESS-protected runtime component whose source remains private under STARGA Commercial terms. The protected binary is the future Phase-2 native inference layer; the Phase-1 PyTorch path does not depend on it.

For commercial deployments needing per-customer FORTRESS-locked builds of the runtime layer, contact license@star.ga. See LICENSE.md for the full split.

Dependencies

numpy, sentence-transformers, torch — Phase-1 inference path
mind-runtime — Phase-2 native inference (gated on mindc 0.3.0)
mind-mem (optional) — consumes mind-nerve preselection for tool routing

The "no third-party ML framework" goal applies to Phase 2. Phase 1 (this release) deliberately uses sentence-transformers + PyTorch to ship the API, evaluation harness, and integration surface before the native runtime lands.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.0b3 pre-release

May 18, 2026

0.3.0b2 pre-release

May 18, 2026

0.3.0b1 pre-release

May 18, 2026

0.2.0

May 18, 2026

0.2.0b1 pre-release

May 18, 2026

0.1.0b2 pre-release

May 18, 2026

0.1.0a13 pre-release

May 17, 2026

0.1.0a12 pre-release

May 16, 2026

0.1.0a10 pre-release

May 16, 2026

0.1.0a8 pre-release

May 16, 2026

This version

0.1.0a7 pre-release

May 16, 2026

0.1.0a6 pre-release

May 16, 2026

0.1.0a5 pre-release

May 16, 2026

0.1.0a4 pre-release

May 16, 2026

0.1.0a3 pre-release

May 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mind_nerve-0.1.0a7.tar.gz (44.7 kB view details)

Uploaded May 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mind_nerve-0.1.0a7-py3-none-any.whl (44.9 kB view details)

Uploaded May 16, 2026 Python 3

File details

Details for the file mind_nerve-0.1.0a7.tar.gz.

File metadata

Download URL: mind_nerve-0.1.0a7.tar.gz
Upload date: May 16, 2026
Size: 44.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for mind_nerve-0.1.0a7.tar.gz
Algorithm	Hash digest
SHA256	`085bf94f0e61fe436f9bab9315c2abc1531dab82798820f5142427fca24ead56`
MD5	`e54dc7a57ac88e5e5f845f58ba6b3bd3`
BLAKE2b-256	`612e1abe4f6f9fbd23d5b326ac41d991122f3f335bfa40fac9fce7a75969152f`

See more details on using hashes here.

File details

Details for the file mind_nerve-0.1.0a7-py3-none-any.whl.

File metadata

Download URL: mind_nerve-0.1.0a7-py3-none-any.whl
Upload date: May 16, 2026
Size: 44.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for mind_nerve-0.1.0a7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ae4f97310c4023d270759e6ee6d32946dbce976b2f3fecd019f67de8def01382`
MD5	`09e667ebd27e6832d195b6ff857225a4`
BLAKE2b-256	`e09e34b1bec4df1562089bbe7d7d4a338e19e70064c9c66a6c6a62afa3cdab9c`

See more details on using hashes here.

mind-nerve 0.1.0a7

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

mind-nerve

Status

Quickstart

Daemon mode (recommended for hooks)

Skill preselection for Claude Code

Why this exists

Integration surface

Design constraints (non-negotiable)

Architecture (one paragraph)

Repository structure

License

Dependencies

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes