Multi-model MCP routing fabric for local and cloud LLMs

Project description

AuraRouter: The AuraXLM-Lite Compute Fabric

Current Status: Production Prototype v3 (Feb 2026)
Maintainer: Steven Siebert / AuraCore Dynamics

Overview

AuraRouter is an MCP server that provides simplified, role-based, configurable prompt routing across xLMs (SLM/TLM/LLM). It is designed to orchestrate local and cloud resources for AuraCore development. It acts as intelligent middleware for an MCP client (e.g., Gemini CLI), letting you route code generation tasks to local hardware while keeping a cloud fallback as a safety net.

It implements an Intent -> Plan -> Execute loop:

Router: A fast local model classifies the task (Simple vs. Complex).
Architect: If complex, a reasoning model generates a sequential execution plan.
Worker: A coding model executes the plan step-by-step.
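
The three roles above can be sketched as plain Python callables. This is a minimal illustration, not AuraRouter's actual code: the `run_task` function, the stub models, and the assumption that the plan is a JSON array of step strings are all hypothetical.

```python
import json
from typing import Callable, List

# Hypothetical type: a model is any callable that maps a prompt to text.
Model = Callable[[str], str]


def run_task(task: str, router: Model, architect: Model, worker: Model) -> List[str]:
    """Intent -> Plan -> Execute, with each role supplied as a callable."""
    # Intent: the router labels the task; anything but "complex" is treated as simple.
    intent = router(task).strip().lower()

    # Plan: only complex tasks go to the architect.
    # Assumption: the plan comes back as a JSON array of step strings.
    if intent == "complex":
        steps = json.loads(architect(task))
    else:
        steps = [task]

    # Execute: the worker handles each step in order.
    return [worker(step) for step in steps]


# Stub models so the sketch runs without any LLM backend.
def stub_router(task: str) -> str:
    return "complex" if len(task.split()) > 8 else "simple"


def stub_architect(task: str) -> str:
    return '["define the interface", "implement the class", "write unit tests"]'


def stub_worker(step: str) -> str:
    return f"[code for: {step}]"


simple_out = run_task("Write a Fibonacci function", stub_router, stub_architect, stub_worker)
complex_out = run_task(
    "Create a distributed lock manager in C# with an interface and unit tests",
    stub_router, stub_architect, stub_worker,
)
```

With these stubs, the short task skips the architect and produces one worker result, while the longer task produces one result per planned step.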

Architecture

graph TD
    User[Gemini CLI] -->|Task| Router{Intent Analysis}
    Router -->|Simple| Worker[Coding Node]
    Router -->|Complex| Architect[Reasoning Node]
    Architect -->|Plan JSON| Worker
    
    subgraph Compute Fabric [auraconfig.yaml]
        Worker -->|Try| Node1[Local RTX 3070]
        Node1 -->|Fail| Node2[Cloud Fallback]
    end
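
The Try/Fail edges inside the compute fabric amount to an ordered fallback chain. A minimal sketch of that idea follows; the `run_with_fallback` helper and the stub nodes are hypothetical, and treating an empty response as a failure is an assumption rather than documented behavior.

```python
from typing import Callable, List, Optional, Tuple

# (node name, callable that runs a prompt on that node)
Node = Tuple[str, Callable[[str], str]]


def run_with_fallback(prompt: str, nodes: List[Node]) -> Tuple[str, str]:
    """Try each node in order; return (node_name, response) from the first success."""
    last_error: Optional[Exception] = None
    for name, call in nodes:
        try:
            response = call(prompt)
            if response.strip():  # assumption: an empty response counts as a failure
                return name, response
            last_error = RuntimeError(f"{name} returned an empty response")
        except Exception as exc:
            last_error = exc
    raise RuntimeError("all nodes failed") from last_error


# Stub nodes: the local one always times out, the cloud one answers.
def flaky_local(prompt: str) -> str:
    raise TimeoutError("local node timed out")


def cloud(prompt: str) -> str:
    return f"cloud answer to: {prompt}"


used, answer = run_with_fallback(
    "hello", [("local_3070", flaky_local), ("cloud_gemini", cloud)]
)
```

The order of the list is the priority order, so putting a node first makes it the preferred target.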

Prerequisites

Python 3.12+
Ollama (Running locally with qwen2.5-coder:7b or similar)
Google AI Studio Key (For cloud fallback/reasoning)

Installation

1. Environment Setup

Create the isolated environment with the required dependencies.

conda env create -f environment.yaml
conda activate aurarouter

2. Pull Local Models

We recommend the Qwen 2.5 series for its speed and stability on consumer hardware.

ollama pull qwen2.5-coder:7b

3. Configuration

Edit auraconfig.yaml to define your nodes and paste your API keys.

models:
  local_3070:
    provider: ollama
    endpoint: http://localhost:11434/api/generate
    model_name: qwen2.5-coder:7b

  cloud_gemini:
    provider: google
    model_name: gemini-2.0-flash
    api_key: "AIzaSy..." # Paste key here
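
To make the `local_3070` entry concrete, here is a sketch of how such a node could be turned into a request against Ollama's `/api/generate` endpoint using only the standard library. The helper names and the plain-dict view of the config are assumptions for illustration; whether AuraRouter builds its requests this way is not confirmed by this page.

```python
import json
import urllib.request

# Mirrors the local_3070 entry from the YAML above as a plain dict.
local_3070 = {
    "provider": "ollama",
    "endpoint": "http://localhost:11434/api/generate",
    "model_name": "qwen2.5-coder:7b",
}


def build_ollama_request(node: dict, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a non-streaming POST for Ollama's generate endpoint."""
    payload = {"model": node["model_name"], "prompt": prompt, "stream": False}
    return urllib.request.Request(
        node["endpoint"],
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(node: dict, prompt: str, timeout: float = 120.0) -> str:
    """Send the request and return the generated text; needs a running Ollama server."""
    with urllib.request.urlopen(build_ollama_request(node, prompt), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]


# Building the request does not touch the network.
request = build_ollama_request(local_3070, "Write a python function to calculate Fibonacci.")
```

Calling `generate(local_3070, "...")` would perform the actual round trip once Ollama is running and the model has been pulled.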

4. Register AuraRouter CLI Tools

AuraRouter registers its tools with your client CLI so that prompts can be routed through the fabric. Choose one of the following options:

Interactive Installation (Recommended): Run the interactive installer to register support for all available models (Gemini and Claude). You will be prompted to confirm or skip installation for each.
```
python aurarouter.py --install
```
Gemini-only Installation: Register AuraRouter specifically for Gemini models.
```
python aurarouter.py --install-gemini
```
Claude-only Installation: Register AuraRouter specifically for Claude models.
```
python aurarouter.py --install-claude
```

Usage

Restart your Gemini CLI. You can now use natural language to trigger the fabric.

The "Fast Lane" (Local Only):

"Write a python function to calculate Fibonacci." (Routes directly to Local Qwen)

The "Heavy Lane" (Cloud Plan + Local Build):

"Create a distributed lock manager in C# with an interface and unit tests." (Routes to Cloud Architect for planning, then Local Qwen for execution)

Scaling Guide

When you add new on-prem xLM resources:

Open auraconfig.yaml.
Uncomment the local_3090_deepseek block under models.
Add it to the top of the reasoning role list.
Restart the router. No code changes required.
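
The effect of putting a new node at the top of a role list can be sketched on an in-memory view of the config. The `roles` layout, the `add_node` helper, and the placeholder model name are assumptions; only the `models` entries and the `local_3090_deepseek` name come from this page.

```python
# Hypothetical in-memory view of auraconfig.yaml; the "roles" layout is assumed.
config = {
    "models": {
        "local_3070": {"provider": "ollama", "model_name": "qwen2.5-coder:7b"},
        "cloud_gemini": {"provider": "google", "model_name": "gemini-2.0-flash"},
    },
    "roles": {"reasoning": ["cloud_gemini"]},
}


def add_node(config: dict, name: str, spec: dict, role: str) -> None:
    """Register a model and give it top priority for the given role."""
    config["models"][name] = spec
    chain = config["roles"].setdefault(role, [])
    if name in chain:
        chain.remove(name)  # avoid listing the same node twice
    chain.insert(0, name)   # first entry is tried first


add_node(
    config,
    "local_3090_deepseek",
    {"provider": "ollama", "model_name": "<your reasoning model>"},  # placeholder
    role="reasoning",
)
```

After the call, the new node sits ahead of `cloud_gemini` in the reasoning list, so the cloud model becomes the fallback rather than the first choice.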

Troubleshooting

"Empty response received": The local model is likely OOMing or timing out. Check the timeout setting in auraconfig.yaml.
"Model not found": Ensure the model_name in YAML matches ollama list exactly.
Installer fails: Manually add an mcpServers block to your ~/.gemini/settings.json that points to the absolute path of aurarouter.py.
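
For the manual route, the block to add can be generated rather than typed by hand. This is a sketch under assumptions: the server key `aurarouter`, the `python` launch command, and the `mcp_server_entry` helper are illustrative, and the `command`/`args` shape is the usual form for a stdio MCP server entry.

```python
import json
from pathlib import Path


def mcp_server_entry(script_path: str, python_cmd: str = "python") -> dict:
    """Return an mcpServers block that launches aurarouter.py as a stdio server."""
    # The settings file needs an absolute path, so resolve it here.
    script = Path(script_path).expanduser().resolve()
    return {
        "mcpServers": {
            "aurarouter": {"command": python_cmd, "args": [str(script)]}
        }
    }


entry = mcp_server_entry("aurarouter.py")
print(json.dumps(entry, indent=2))
```

Merge the printed block into the existing settings file rather than overwriting it, since the file may already contain other servers or settings.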

License

Project details

Release history

0.5.0: Mar 3, 2026
0.4.0: Feb 26, 2026
0.3.0: Feb 13, 2026
0.2.0: Feb 6, 2026 (this version)
0.1.0: Feb 2, 2026

Download files

Source Distribution

aurarouter-0.2.0.tar.gz (23.2 kB), uploaded Feb 6, 2026, Source

Built Distribution

aurarouter-0.2.0-py3-none-any.whl (23.4 kB), uploaded Feb 6, 2026, Python 3

File details

Details for the file aurarouter-0.2.0.tar.gz.

File metadata

Download URL: aurarouter-0.2.0.tar.gz
Upload date: Feb 6, 2026
Size: 23.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aurarouter-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`07093ffeacac5cbe2d9c88e7d0ac695d57780693b73cf032836e4e12c873dbbb`
MD5	`17d26efa9455a390b1bc398809ea402b`
BLAKE2b-256	`fdc21f4155b5d51c4009c13fa9708d49e5488104860d56b9637cc783410f3c34`

Provenance

The following attestation bundles were made for aurarouter-0.2.0.tar.gz:

Publisher: publish.yml on AuraCoreDynamics/aurarouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: aurarouter-0.2.0.tar.gz
- Subject digest: 07093ffeacac5cbe2d9c88e7d0ac695d57780693b73cf032836e4e12c873dbbb
- Sigstore transparency entry: 924108260
- Sigstore integration time: Feb 6, 2026
Source repository:
- Permalink: AuraCoreDynamics/aurarouter@91a3665efc9f0c9acb3cac36d17f81955a085b44
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/AuraCoreDynamics
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@91a3665efc9f0c9acb3cac36d17f81955a085b44
- Trigger Event: release

File details

Details for the file aurarouter-0.2.0-py3-none-any.whl.

File metadata

Download URL: aurarouter-0.2.0-py3-none-any.whl
Upload date: Feb 6, 2026
Size: 23.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aurarouter-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`79a48e52f1398c8a897accf2dc7f50af6fa90a03ca589d4e763fd72e6c00707f`
MD5	`f5338cfaf645f76f2bb37771350e0c6f`
BLAKE2b-256	`01888e3ba76c13b83e6377c34bb52ddc0020f9a4a1a038539ffff3acd7ff7418`

Provenance

The following attestation bundles were made for aurarouter-0.2.0-py3-none-any.whl:

Publisher: publish.yml on AuraCoreDynamics/aurarouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: aurarouter-0.2.0-py3-none-any.whl
- Subject digest: 79a48e52f1398c8a897accf2dc7f50af6fa90a03ca589d4e763fd72e6c00707f
- Sigstore transparency entry: 924108270
- Sigstore integration time: Feb 6, 2026
Source repository:
- Permalink: AuraCoreDynamics/aurarouter@91a3665efc9f0c9acb3cac36d17f81955a085b44
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/AuraCoreDynamics
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@91a3665efc9f0c9acb3cac36d17f81955a085b44
- Trigger Event: release
