Skip to main content

Multi-model MCP routing fabric for local and cloud LLMs

Project description

AuraRouter: The AuraXLM-Lite Compute Fabric

Current Status: Production Prototype v3 (Feb 2026) Maintainer: Steven Siebert / AuraCore Dynamics

Overview

AuraRouter implements a role-based configurable xLM (SLM/TLM/LLM) prompt routing fabric. It acts as intelligent middleware that routes tasks across local and cloud models with automatic fallback. AuraRouter is content-agnostic -- it handles code generation, summarization, analysis, RAG-enabled Q&A, and any other prompt-based work. It can run as an MCP server, a desktop GUI application, or a managed service on AuraGrid.

It implements an Intent -> Plan -> Execute loop:

  1. Classifier: A fast local model classifies the task (Direct vs. Multi-Step).
  2. Planner: If multi-step, a reasoning model generates a sequential execution plan.
  3. Worker: An execution model carries out the plan step-by-step.

Architecture

graph TD
    User[MCP Client / GUI] -->|Task| Classifier{Intent Analysis}
    Classifier -->|Direct| Worker[Worker Node]
    Classifier -->|Multi-Step| Planner[Planner Node]
    Planner -->|Plan JSON| Worker

    subgraph Compute Fabric [auraconfig.yaml]
        Worker -->|Try| Node1[Local Model]
        Node1 -->|Fail| Node2[Cloud Fallback]
    end

Installation

PyPI (Recommended)

# Core install (MCP server + GUI + cloud providers + llamacpp-server HTTP provider)
pip install aurarouter

# With embedded llama.cpp + HuggingFace model downloading
pip install aurarouter[local]

# Everything (local + AuraGrid + dev tools)
pip install aurarouter[all]

Source Install

git clone https://github.com/auracoredynamics/aurarouter.git
cd aurarouter
pip install -r requirements.txt        # Core dependencies
pip install -r requirements-local.txt   # Optional: local inference deps
pip install -e .                        # Editable install

Conda

conda env create -f environment.yaml
conda activate aurarouter

See DEPLOYMENT.md for detailed deployment and configuration guide.

Quick Start

1. Configuration

Run the interactive installer to create a config template:

aurarouter --install

Or manually create ~/.auracore/aurarouter/auraconfig.yaml:

models:
  local_qwen:
    provider: ollama
    endpoint: http://localhost:11434/api/generate
    model_name: qwen2.5-coder:7b

  cloud_gemini:
    provider: google
    model_name: gemini-2.0-flash
    api_key: "AIzaSy..."

roles:
  router:   [local_qwen, cloud_gemini]
  reasoning: [cloud_gemini]
  coding:   [local_qwen, cloud_gemini]

2. Run

# MCP server (default)
aurarouter

# Desktop GUI
aurarouter gui

# With explicit config
aurarouter --config /path/to/auraconfig.yaml

Providers

Provider Type Config Key Dependencies
Ollama Local HTTP ollama None (uses httpx)
llama.cpp Server Local HTTP llamacpp-server None (uses httpx)
llama.cpp Embedded Local Native llamacpp pip install aurarouter[local]
Google Gemini Cloud google Included
Anthropic Claude Cloud claude Included

GUI

The desktop GUI (included in the base install) provides:

  • Environment selector — Switch between Local and AuraGrid deployments at runtime
  • Service controls — Start, stop, and pause the MCP server or AuraGrid MAS
  • Execute tab — Task input with file attachment, intent analysis, routing pipeline visualization, task output
  • Models tab — Local GGUF model browser, HuggingFace downloads, grid model listing (AuraGrid)
  • Configuration tab — Model CRUD, routing chain editor, YAML preview, cell-wide save warnings (AuraGrid)
  • Grid panels (AuraGrid) — Deployment strategy editor, cell node status, event log
  • Health dashboard — Per-model health status with clickable indicator
  • Prompt history — Last 20 tasks with results, restorable from dropdown
  • Keyboard shortcuts — Ctrl+Enter (execute), Ctrl+N (new), Escape (cancel)

All configuration changes are persisted to auraconfig.yaml. See GUI_GUIDE.md for the complete guide.

CLI Commands

Command Description
aurarouter Run MCP server (default)
aurarouter gui Launch desktop GUI
aurarouter download-model --repo REPO --file FILE Download GGUF model from HuggingFace
aurarouter list-models List locally downloaded GGUF models
aurarouter remove-model --file FILE Remove a downloaded model
aurarouter --install Interactive installer for MCP clients
aurarouter --install-gemini Register for Gemini CLI
aurarouter --install-claude Register for Claude

AuraGrid Integration (Optional)

AuraRouter can be deployed as a Managed Application Service (MAS) on AuraGrid for distributed access to routing services.

pip install aurarouter[auragrid]

See AURAGRID.md for the complete integration guide.

Scaling Guide

When you add new on-prem xLM resources:

  1. Open auraconfig.yaml (or use the GUI Configuration tab).
  2. Add the new model under models.
  3. Add it to the appropriate role chain under roles.
  4. Restart the router (or save from the GUI). No code changes required.

Troubleshooting

  • "Empty response received": The local model is likely OOMing or timing out. Check the timeout setting in auraconfig.yaml.
  • "Model not found": Ensure the model_name in YAML matches ollama list exactly.
  • "huggingface-hub is required": Run pip install aurarouter[local] to enable model downloading and embedded llama.cpp.
  • PySide6 issues on headless servers: PySide6 is a core dependency. On headless/server-only deployments, use the MCP server mode (aurarouter) which does not launch the GUI.

License

Copyright 2026 AuraCore Dynamics Inc.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aurarouter-0.3.0.tar.gz (123.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aurarouter-0.3.0-py3-none-any.whl (121.3 kB view details)

Uploaded Python 3

File details

Details for the file aurarouter-0.3.0.tar.gz.

File metadata

  • Download URL: aurarouter-0.3.0.tar.gz
  • Upload date:
  • Size: 123.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aurarouter-0.3.0.tar.gz
Algorithm Hash digest
SHA256 c678e8b9106ee2dd439772a800286ce81745879753b929da134848cf251bf098
MD5 5ae34b028936f30c3d2271ba9bb95932
BLAKE2b-256 d79a37b156423c976536d2ae87d5a86cd85396e05f4cfd7fbaf1c6cfe1210779

See more details on using hashes here.

Provenance

The following attestation bundles were made for aurarouter-0.3.0.tar.gz:

Publisher: publish.yml on AuraCoreDynamics/aurarouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aurarouter-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: aurarouter-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 121.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aurarouter-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4e1e62911f8948e47f961604000f2821631a3a73a99d297895de0c94860bb54e
MD5 665b86610f691d6afd55cfe9fedd4a37
BLAKE2b-256 606e18c83ab65072238fb3808abc1d901016b2d0f1a2d405899890ea07424f36

See more details on using hashes here.

Provenance

The following attestation bundles were made for aurarouter-0.3.0-py3-none-any.whl:

Publisher: publish.yml on AuraCoreDynamics/aurarouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page