Skip to main content

Multi-model MCP routing fabric for local and cloud LLMs

Project description

AuraRouter: The AuraXLM-Lite Compute Fabric

Current Status: Production Prototype v3 (Feb 2026)
Maintainer: Steven Siebert / AuraCore Dynamics

Overview

AuraRouter implements a simplified role-based configurable xLM (SLM/TLM/LLM) prompt routing as an MCP server. AuraRouter is designed to orchestrate local and cloud resources for AuraCore development. It acts as an intelligent middleware for an MCP Client (ie Gemini CLI), allowing you to route code generation tasks to local hardware while maintaining a cloud safety net.

It implements an Intent -> Plan -> Execute loop:

  1. Router: A fast local model classifies the task (Simple vs. Complex).
  2. Architect: If complex, a reasoning model generates a sequential execution plan.
  3. Worker: A coding model executes the plan step-by-step.

Architecture

graph TD
    User[Gemini CLI] -->|Task| Router{Intent Analysis}
    Router -->|Simple| Worker[Coding Node]
    Router -->|Complex| Architect[Reasoning Node]
    Architect -->|Plan JSON| Worker
    
    subgraph Compute Fabric [auraconfig.yaml]
        Worker -->|Try| Node1[Local RTX 3070]
        Node1 -->|Fail| Node2[Cloud Fallback]
    end

Prerequisites

  • Python 3.12+
  • Ollama (Running locally with qwen2.5-coder:7b or similar)
  • Google AI Studio Key (For cloud fallback/reasoning)

Installation

1. Environment Setup

Create the isolated environment with the required dependencies.

conda env create -f environment.yaml
conda activate aurarouter

2. Pull Local Models

We recommend the Qwen 2.5 series for consumer hardware speed/stability.

ollama pull qwen2.5-coder:7b

3. Configuration

Edit auraconfig.yaml to define your nodes and paste your API keys.

models:
  local_3070:
    provider: ollama
    endpoint: http://localhost:11434/api/generate
    model_name: qwen2.5-coder:7b

  cloud_gemini:
    provider: google
    model_name: gemini-2.0-flash
    api_key: "AIzaSy..." # Paste key here

4. Register AuraRouter CLI Tools

AuraRouter can integrate with your Gemini CLI to provide advanced routing capabilities. Choose from the following options to register the necessary tools:

  • Interactive Installation (Recommended): Run the interactive installer to register support for all available models (Gemini and Claude). You will be prompted to confirm or skip installation for each.

    python aurarouter.py --install
    
  • Gemini-only Installation: Register AuraRouter specifically for Gemini models.

    python aurarouter.py --install-gemini
    
  • Claude-only Installation: Register AuraRouter specifically for Claude models.

    python aurarouter.py --install-claude
    

Usage

Restart your Gemini CLI. You can now use natural language to trigger the fabric.

The "Fast Lane" (Local Only):

"Write a python function to calculate Fibonacci." (Routes directly to Local Qwen)

The "Heavy Lane" (Cloud Plan + Local Build):

"Create a distributed lock manager in C# with an interface and unit tests." (Routes to Cloud Architect for planning, then Local Qwen for execution)

Scaling Guide

When you add new on-prem xLM resources:

  1. Open auraconfig.yaml.
  2. Uncomment the local_3090_deepseek block under models.
  3. Add it to the top of the reasoning role list.
  4. Restart the router. No code changes required.

Troubleshooting

  • "Empty response received": The local model is likely OOMing or timing out. Check the timeout setting in auraconfig.yaml.
  • "Model not found": Ensure the model_name in YAML matches ollama list exactly.
  • Installer fails: Manually add the mcpServers block to your ~/.geminichat/settings.json pointing to the aurarouter.py absolute path.

License

Copyright 2026 AuraCore Dynamics Inc.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aurarouter-0.2.0.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aurarouter-0.2.0-py3-none-any.whl (23.4 kB view details)

Uploaded Python 3

File details

Details for the file aurarouter-0.2.0.tar.gz.

File metadata

  • Download URL: aurarouter-0.2.0.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aurarouter-0.2.0.tar.gz
Algorithm Hash digest
SHA256 07093ffeacac5cbe2d9c88e7d0ac695d57780693b73cf032836e4e12c873dbbb
MD5 17d26efa9455a390b1bc398809ea402b
BLAKE2b-256 fdc21f4155b5d51c4009c13fa9708d49e5488104860d56b9637cc783410f3c34

See more details on using hashes here.

Provenance

The following attestation bundles were made for aurarouter-0.2.0.tar.gz:

Publisher: publish.yml on AuraCoreDynamics/aurarouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aurarouter-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: aurarouter-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 23.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aurarouter-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 79a48e52f1398c8a897accf2dc7f50af6fa90a03ca589d4e763fd72e6c00707f
MD5 f5338cfaf645f76f2bb37771350e0c6f
BLAKE2b-256 01888e3ba76c13b83e6377c34bb52ddc0020f9a4a1a038539ffff3acd7ff7418

See more details on using hashes here.

Provenance

The following attestation bundles were made for aurarouter-0.2.0-py3-none-any.whl:

Publisher: publish.yml on AuraCoreDynamics/aurarouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page