Skip to main content

Production-grade tool routing for AI agents

Project description

CLIbrary

Production-grade tool routing for AI agents.

PyPI License: MIT Python


The Problem

AI agents waste massive amounts of tokens on tool selection. When an agent has 100+ available tools, current approaches force LLMs to read every tool's documentation in the context window before making a decision:

  • Token waste: 50,000+ tokens per call just for tool descriptions
  • High latency: 2–5 seconds before any actual work happens
  • Poor accuracy: LLMs get "lost in the middle" with too many options
  • Unbounded cost: Linear cost growth as tool count increases

When you scale beyond 50 tools, current systems break down.


How CLIbrary Works

CLIbrary uses a two-stage retrieval architecture to route agent intents to the right tool in ~36ms using only ~150 tokens.

Traditional approach:
  Agent intent → Stuff all 500 tools into LLM → LLM picks
  ~50,000 tokens, ~3 seconds, ~70% accuracy

CLIbrary approach:
  Agent intent → CLIbrary router → Single tool + params
  ~150 tokens, ~36ms, ~93% top-3 accuracy

Architecture

Agent intent
    │
    ▼
Embed (multilingual-e5-base, ~10ms)
    │
    ▼
Stage 1: FAISS cli_index → top-3 candidates (~5ms)
    │
    ▼
MaxSim re-rank  (combined = 0.7×mean_sim + 0.3×max_sim)
    │
    ├── low confidence + small gap → Clarify (return top-3 to LLM)
    │
    └── high confidence
            │
            ▼
        Stage 2: example_index → best matching example
            │
            ├── sim ≥ 0.85 → Path A: template fill (no LLM, ~80% of calls)
            │
            └── sim < 0.85 → Path B: LLM param extraction (~20% of calls)
                    │
                    ▼
              Tool call JSON output

Quick Start

Install

pip install clibrary-hub

Build the FAISS indices from your manifests

clibrary-build-index --manifest-dir ./manifests
# Indices written to ~/.clibrary/indices by default

Route an intent

from clibrary_hub import CLIbrary

router = CLIbrary()
result = router.route("幫我查上週銷售總額")

# {
#   "action": "route",
#   "cli": "sql-runner",
#   "params": {"query": "SELECT SUM(amount) FROM orders WHERE ...", "output_format": "json"},
#   "confidence": 0.94,
#   "source": "A",
#   "latency_ms": 36
# }

Performance

Evaluated on 2,050 queries across 500 CLIs:

Metric Result
Top-1 accuracy 82.3%
Top-3 accuracy 92.5%
Path A hit rate (no LLM needed) 93.6%
Median latency 36ms
Token usage ~150

Compared to traditional "stuff all tools into LLM" approach:

Metric CLIbrary Traditional
Tokens ~150 ~50,000
Latency 36ms 2–5s
Accuracy (100+ tools) 82–92% 60–75%

Manifest Format

Each CLI tool has a manifest.json describing its purpose, inputs, and examples:

{
  "name": "sql-runner",
  "version": "1.0.0",
  "category": "data",
  "description": "Execute SQL queries against a database",
  "intent_triggers": [
    "query a database",
    "run a SQL statement",
    "查資料庫",
    "跑 SQL"
  ],
  "input_schema": {
    "type": "object",
    "properties": {
      "query": {"type": "string", "description": "SQL query to execute"},
      "output_format": {"type": "string", "enum": ["json", "csv", "table"], "default": "json"}
    },
    "required": ["query"]
  },
  "examples": [
    {
      "query": "查上週銷售總額",
      "params": {"query": "SELECT SUM(amount) FROM orders WHERE created_at > NOW() - INTERVAL 7 DAY", "output_format": "json"}
    }
  ],
  "tags": ["sql", "database", "query"]
}

Three FAISS Indices

Index Vectors Content Usage
cli_index 500 Mean-pooled intent_triggers per CLI Stage 1 candidate retrieval
trigger_index ~3,500 Individual trigger vectors MaxSim re-ranking
example_index ~1,500 Example query vectors Stage 2 template matching

Project Structure

CLIbrary/
├── src/clibrary_hub/
│   ├── __init__.py
│   ├── router.py          # Core routing logic
│   ├── build_index.py     # FAISS index builder (clibrary-build-index)
│   ├── manifest.py        # Manifest loader / validator
│   ├── matchers/
│   └── utils/
├── benchmark/
│   └── eval_sets/         # in_domain / paraphrase / cross_domain (2,050 queries)
├── tests/
└── README.md

Why It Works

  1. Embedding routing beats LLM selection at scale — geometric distance is not affected by "lost in the middle" attention dilution.
  2. Two-stage design improves accuracy: broad candidate retrieval in Stage 1, precise example matching in Stage 2.
  3. Example-based caching eliminates LLM calls for ~80% of queries.
  4. Multilingual (multilingual-e5-base) handles mixed Chinese/English queries natively.

Comparison

CLIbrary LangChain Tool Retrieval Function Calling MCP
Scales to 500+ tools ⚠️ ⚠️
No LLM for routing ✅ (80%)
Manifest standard
Multilingual ⚠️ ⚠️ ⚠️

CLIbrary complements MCP: CLIbrary handles routing, MCP handles protocol.


Status

  • ✅ Manifest schema v1.0
  • ✅ 500 CLI manifests (8 categories)
  • ✅ Reference router implementation (FAISS, multilingual-e5-base)
  • ✅ Evaluation dataset (2,050 queries)
  • ✅ pip package — pip install clibrary-hub

License

MIT License — free to use, including for commercial purposes.


Links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

clibrary_hub-0.1.2.tar.gz (17.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

clibrary_hub-0.1.2-py3-none-any.whl (14.3 kB view details)

Uploaded Python 3

File details

Details for the file clibrary_hub-0.1.2.tar.gz.

File metadata

  • Download URL: clibrary_hub-0.1.2.tar.gz
  • Upload date:
  • Size: 17.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for clibrary_hub-0.1.2.tar.gz
Algorithm Hash digest
SHA256 8e5e32c45fc0363d3b82d6fa936c0eab5fc41cf83fc4751a75e8144b405584c5
MD5 1120209ad51cad38f6bc665f1ef69989
BLAKE2b-256 8a3984f7b417f74b9662827266ebded7efde9e69ed6c5511a581820c1fa88186

See more details on using hashes here.

File details

Details for the file clibrary_hub-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: clibrary_hub-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 14.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for clibrary_hub-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 82f81829afa445ec0d8700584bdc6b71037746bdd4632e470eccaf400fc79ab5
MD5 5cd3b63c31fcbc8c3bd7e24acb4e37ee
BLAKE2b-256 d4d64a1c43a484b3cb0aad1ea32153bf44fb7c02769042f40e7ccd90997347bd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page