Intelligent LLM model router driven by real code metrics — successor to preLLM

These details have not been verified by PyPI

Project links

Project description

llx

Intelligent LLM model router driven by real code metrics.

Successor to preLLM — rebuilt with modular architecture, no god modules, and metric-driven routing.

llx analyzes your codebase with code2llm, redup, and vallm, then selects the optimal LLM model based on actual project metrics — file count, complexity, coupling, duplication — not abstract scores.

Principle: larger + more coupled + more complex → stronger (and more expensive) model.

Why llx? (Lessons from preLLM)

preLLM proved the concept but had architectural issues that llx resolves:

Problem in preLLM	llx Solution
`cli.py`: 999 lines, CC=30 (`main`), CC=27 (`query`)	CLI split into `app.py` + `formatters.py`, max CC ≤ 8
`core.py`: 893 lines god module	Config, analysis, routing in separate modules (≤250L each)
`trace.py`: 509 lines, CC=28 (`to_stdout`)	Output formatting as dedicated functions
Hardcoded model selection	Metric-driven thresholds from code2llm .toon data
No duplication/validation awareness	Integrates redup + vallm for richer metrics

Architecture

┌─────────────────────────────────────────────────────────────┐
│                    IDE / Agent Layer                        │
│  Roo Code │ Cline │ Continue.dev │ Aider │ Claude Code      │
│  (point at localhost:4000 as OpenAI-compatible API)         │
└─────────────────┬───────────────────────────────────────────┘
                  │
┌─────────────────▼───────────────────────────────────────────┐
│              LiteLLM Proxy (localhost:4000)                 │
│  ┌──────────┐  ┌──────────────┐  ┌────────────────────┐     │
│  │ Router   │  │ Semantic     │  │ Cost Tracking      │     │
│  │ (metrics)│  │ Cache (Redis)│  │ + Budget Limits    │     │
│  └────┬─────┘  └──────────────┘  └────────────────────┘     │
└───────┼─────────────────────────────────────────────────────┘
        │
   ┌────┼────────────────────────────────────────┐
   │    │           Model Tiers                   │
   │    ├── premium:  Claude Opus 4               │
   │    ├── balanced: Claude Sonnet 4 / GPT-5     │
   │    ├── cheap:    Claude Haiku 4.5            │
   │    ├── free:     Gemini 2.5 Pro              │
   │    ├── openrouter: 300+ models (fallback)    │
   │    └── local:    Ollama (Qwen2.5-Coder)      │
   └──────────────────────────────────────────────┘
        │
┌───────▼─────────────────────────────────────────────────────┐
│            Code Analysis Pipeline                           │
│  code2llm → redup → vallm → llx                             │
│  (metrics → duplication → validation → model selection)     │
└─────────────────────────────────────────────────────────────┘

MCP Server Integration

llx provides a complete MCP (Model Context Protocol) server that exposes analysis, preprocessing, and proxy-routing tools as MCP endpoints:

# Start MCP server for Claude Desktop
llx mcp start

# Generate Claude Desktop config
llx mcp config

# List available MCP tools
llx mcp tools

MCP Tools Available

Tool	Description
`llx_analyze`	Analyze a project and recommend the optimal model tier
`llx_select`	Quick model selection from existing analysis output
`llx_chat`	Analyze, select, and send a prompt with project context
`llx_preprocess`	Run the merged preLLM two-agent preprocessing pipeline
`llx_context`	Build shell, codebase, and sensitive-data context bundles
`code2llm_analyze`	Run code2llm static analysis and generate `.toon` files
`redup_scan`	Run duplication detection and emit a refactoring map
`vallm_validate`	Validate code or generated output with vallm
`llx_proxy_status`	Check LiteLLM proxy status
`llx_proxym_status`	Check Proxym routing status
`llx_proxym_chat`	Send a metrics-aware chat request through Proxym

Claude Desktop Setup

{
  "mcpServers": {
    "llx": {
      "command": "python3",
      "args": ["-m", "llx.mcp.server"]
    }
  }
}

Installation

pip install llx

# With integrations
pip install llx[all]         # Core integrations + MCP + Ollama
pip install llx[mcp]         # MCP server only
pip install llx[litellm]     # LiteLLM proxy
pip install llx[code2llm]    # Code analysis
pip install llx[redup]       # Duplication detection
pip install llx[vallm]       # Code validation
pip install llx[ollama]      # Local Ollama integration
pip install llx[prellm]      # Merged preLLM stack
pip install llx[prellm-full] # preLLM + optional context tooling

Quick Start

# Analyze project and get model recommendation
llx analyze ./my-project

# Run code2llm/redup/vallm before selection
llx analyze . --run

# Quick model selection
llx select .

# With task hint
llx select . --task refactor

# Point to pre-existing .toon files
llx analyze . --toon-dir ./project/

# JSON output for CI/CD
llx analyze . --json

# Chat with auto-selected model
llx chat . --prompt "Refactor the god modules"

# Force local model
llx select . --local

Model Selection Logic

Metric	Premium (≥)	Balanced (≥)	Cheap (≥)	Free
Files	50	10	3	<3
Lines	20,000	5,000	500	<500
Avg CC	6.0	4.0	2.0	<2.0
Max fan-out	30	10	—	—
Max CC	25	15	—	—
Dup groups	15	5	—	—
Dep cycles	any	—	—	—

Real-World Selection Examples

Project	Files	Lines	CC̄	Max CC	Fan-out	Tier
Single script	1	80	2.0	4	0	free
Small CLI	5	600	3.0	8	3	cheap
preLLM	31	8,900	5.0	28	30	premium
vallm	56	8,604	3.5	42	—	balanced
code2llm	113	21,128	4.6	65	45	premium
Monorepo	500+	100K+	5.0+	30+	50+	premium

LiteLLM Proxy

llx proxy config     # Generate litellm_config.yaml
llx proxy start      # Start proxy (default :4000; override with --port or LLX_PROXY_PORT)
llx proxy status     # Check if running

Configure IDE tools to point at http://localhost:4000:

Tool	Config
Roo Code / Cline	`"apiBaseUrl": "http://localhost:4000/v1"`
Continue.dev	`"apiBaseUrl": "http://localhost:4000/v1"`
Aider	`OPENAI_API_BASE=http://localhost:4000`
Claude Code	`ANTHROPIC_BASE_URL=http://localhost:4000`
Cursor / Windsurf	OpenAI-compatible endpoint

Configuration

llx init  # Creates llx.toml with defaults

Environment variables: LLX_LITELLM_URL, LLX_DEFAULT_TIER, LLX_PROXY_HOST, LLX_PROXY_PORT, LLX_PROXY_MASTER_KEY, LLX_VERBOSE.

Python API

from llx import analyze_project, select_model, LlxConfig

metrics = analyze_project("./my-project")
result = select_model(metrics)
print(result.model_id)   # Selected model ID
print(result.explain())   # Human-readable reasoning

Integration with wronai Toolchain

Tool	Role	llx Uses
code2llm	Static analysis	CC, fan-out, cycles, hotspots
redup	Duplication detection	Groups, recoverable lines
vallm	Code validation	Pass rate, issue count
llx	Model routing + MCP server	Consumes all above

Package Structure

llx/
├── __init__.py              # Public API
├── __main__.py              # `python -m llx`
├── analysis/                # Metrics collection and external tool runners
├── cli/                     # Typer CLI and output formatters
├── config.py                # Configuration loading and env overrides
├── integrations/            # Context builder, LiteLLM proxy, Proxym client
├── mcp/                     # MCP server and tool definitions
├── orchestration/           # Instances, routing, sessions, queues, VS Code
├── prellm/                  # Merged preLLM preprocessing pipeline
├── routing/                 # Model selection and LLM client wrappers
├── tools/                   # Utility CLIs and helper commands
└── litellm_config.py        # LiteLLM config helpers

The library stays split into small, composable modules. The merged preLLM code now lives under llx/prellm/, and orchestration / VS Code / instance-management code lives under llx/orchestration/.

Architecture Improvements (v0.1.7)

✅ Refactored 6 high-CC functions to meet targets (CC̄ ≤ 2.5, max CC ≤ 16)
✅ Added complete MCP server with 11 tools for Claude Desktop integration
✅ Added Proxym integration for metrics-aware routing and proxy status checks
✅ Fixed import resolution issues reported by vallm
✅ Enhanced test coverage for MCP functionality
✅ Modular design with single-responsibility functions

License

Apache License 2.0 - see LICENSE for details.

Author

Created by Tom Sapletta - tom@sapletta.com

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.84

May 13, 2026

0.1.83

May 13, 2026

0.1.82

Apr 26, 2026

0.1.81

Apr 26, 2026

0.1.80

Apr 26, 2026

0.1.79

Apr 26, 2026

0.1.78

Apr 26, 2026

0.1.77

Apr 26, 2026

0.1.76

Apr 26, 2026

0.1.75

Apr 26, 2026

0.1.74

Apr 26, 2026

0.1.73

Apr 26, 2026

0.1.72

Apr 26, 2026

0.1.71

Apr 26, 2026

0.1.70

Apr 26, 2026

0.1.69

Apr 26, 2026

0.1.68

Apr 26, 2026

0.1.67

Apr 26, 2026

0.1.66

Apr 26, 2026

0.1.65

Apr 26, 2026

0.1.64

Apr 26, 2026

0.1.63

Apr 26, 2026

0.1.62

Apr 26, 2026

0.1.61

Apr 26, 2026

0.1.60

Apr 25, 2026

0.1.59

Apr 25, 2026

0.1.58

Apr 20, 2026

0.1.57

Apr 9, 2026

0.1.56

Apr 9, 2026

0.1.54

Apr 3, 2026

0.1.53

Apr 3, 2026

0.1.52

Apr 3, 2026

0.1.51

Mar 31, 2026

0.1.50

Mar 30, 2026

0.1.49

Mar 30, 2026

0.1.48

Mar 30, 2026

0.1.47

Mar 30, 2026

0.1.46

Mar 30, 2026

0.1.45

Mar 29, 2026

0.1.44

Mar 29, 2026

0.1.43

Mar 29, 2026

0.1.42

Mar 29, 2026

0.1.41

Mar 29, 2026

0.1.39

Mar 27, 2026

0.1.38

Mar 27, 2026

0.1.37

Mar 27, 2026

0.1.36

Mar 26, 2026

0.1.35

Mar 26, 2026

0.1.34

Mar 26, 2026

0.1.33

Mar 26, 2026

0.1.32

Mar 26, 2026

0.1.31

Mar 26, 2026

0.1.30

Mar 26, 2026

0.1.29

Mar 26, 2026

0.1.28

Mar 26, 2026

0.1.27

Mar 26, 2026

0.1.26

Mar 26, 2026

0.1.25

Mar 26, 2026

0.1.24

Mar 26, 2026

0.1.23

Mar 26, 2026

0.1.22

Mar 26, 2026

0.1.21

Mar 26, 2026

0.1.20

Mar 26, 2026

0.1.19

Mar 26, 2026

0.1.18

Mar 26, 2026

0.1.17

Mar 26, 2026

0.1.16

Mar 26, 2026

0.1.15

Mar 26, 2026

0.1.14

Mar 26, 2026

0.1.13

Mar 26, 2026

0.1.12

Mar 26, 2026

0.1.11

Mar 26, 2026

0.1.10

Mar 26, 2026

0.1.9

Mar 26, 2026

This version

0.1.8

Mar 26, 2026

0.1.7

Mar 25, 2026

0.1.6

Mar 25, 2026

0.1.4

Mar 25, 2026

0.1.3

Mar 25, 2026

0.1.2

Mar 25, 2026

0.1.1

Mar 25, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llx-0.1.8.tar.gz (211.4 kB view details)

Uploaded Mar 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llx-0.1.8-py3-none-any.whl (248.7 kB view details)

Uploaded Mar 26, 2026 Python 3

File details

Details for the file llx-0.1.8.tar.gz.

File metadata

Download URL: llx-0.1.8.tar.gz
Upload date: Mar 26, 2026
Size: 211.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for llx-0.1.8.tar.gz
Algorithm	Hash digest
SHA256	`524691ab0aa75c90542e587e9d7f480e12410ddf65fc8c879e4dfd1088189bb0`
MD5	`f70e8f62e04b324a4eeb12f29c5064b1`
BLAKE2b-256	`124aca0edaf8a9a2c2f0c7f53e36f76bd72ab47676c4ffea0c18e1e02599d9e6`

See more details on using hashes here.

File details

Details for the file llx-0.1.8-py3-none-any.whl.

File metadata

Download URL: llx-0.1.8-py3-none-any.whl
Upload date: Mar 26, 2026
Size: 248.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for llx-0.1.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b9e6e790cb78fcf87558026677c8e981b76af9aa402ba37f2c7223e2542e6edb`
MD5	`fb42c55853fd8ed29b58a72c0c52db35`
BLAKE2b-256	`1a6545f5b8325c833b0a6437d7d512f2884d8f46cfe7d7159abd83d53a650985`

See more details on using hashes here.

llx 0.1.8

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

llx

Why llx? (Lessons from preLLM)

Architecture

MCP Server Integration

MCP Tools Available

Claude Desktop Setup

Installation

Quick Start

Model Selection Logic

Real-World Selection Examples

LiteLLM Proxy

Configuration

Python API

Integration with wronai Toolchain

Package Structure

Architecture Improvements (v0.1.7)

License

Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes