The Switchboard - Unified LLM API Gateway with fail-closed semantics

These details have not been verified by PyPI

Project description

The Switchboard 🔌

Unified LLM API Gateway with Fail-Closed Semantics

The Switchboard is a high-performance proxy service that provides intelligent routing to multiple LLM providers. It serves as the central nervous system for all Aperion agents (Sentinel, AR, Aether), ensuring they can access LLMs reliably and cost-effectively.

🎯 Core Features

OpenAI-Compatible API: Drop-in replacement - just change base_url
Intelligent Task Routing: Security tasks → Premium, Docs → Free tier
Fail-Closed Semantics: Never silently falls back to Echo in production
Cost Optimization: Target 75% savings by routing volume to free tiers
Telemetry Injection: X-Correlation-ID propagation for tracing
Structured Logging: JSON cost/latency metrics (Constitution D3)

🚀 Quick Start

Installation

# From source
pip install -e .

# With dev dependencies
pip install -e ".[dev]"

Configuration

Set environment variables for your providers:

# OpenAI (Premium tier)
export OPENAI_API_KEY=sk-...

# Google Gemini (Free tier)
export GEMINI_API_KEY=AIza...

# Cloudflare Workers AI (Low-cost tier)
export WORKERS_AI_API_KEY=your-cf-token
export WORKERS_AI_BASE_URL=https://api.cloudflare.com/client/v4/accounts/ACCT/ai/run

Running

# Development
python -m aperion_switchboard.main

# Production
uvicorn aperion_switchboard.main:app --host 0.0.0.0 --port 8080

# Docker
docker build -t switchboard .
docker run -p 8080:8080 \
  -e OPENAI_API_KEY=sk-... \
  -e GEMINI_API_KEY=AIza... \
  switchboard

📡 API Usage

OpenAI-Compatible Endpoint

curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "X-Aperion-Task-Type: security_audit" \
  -d '{
    "model": "gpt-4.1-mini",
    "messages": [{"role": "user", "content": "Analyze this code for vulnerabilities"}]
  }'

Task Types

Use the X-Aperion-Task-Type header to trigger intelligent routing:

Task Type	Routes To	Use Case
`security_audit`	OpenAI	Critical security analysis
`production_decision`	OpenAI	High-stakes decisions
`strategic_analysis`	OpenAI	Complex reasoning
`code_review`	OpenAI	Quality reviews
`doc_update`	Gemini	Documentation updates
`doc_generation`	Gemini	Batch doc creation
`lint_analysis`	Gemini	Fast batch processing
`test_generation`	Gemini	High-volume generation
`general`	Gemini	Default (cost-optimized)

🔒 Constitution Compliance

A6: Fail-Closed Semantics (Iron Rule)

The Switchboard MUST NEVER silently fall back to the Echo provider in production.

If no real providers are configured AND APERION_ALLOW_ECHO is not "true":
- Service crashes on startup
- Returns 503 for all requests
- Logs CRITICAL error with remediation steps

# Production mode (default) - will crash if no providers configured
export APERION_ALLOW_ECHO=false

# Development mode - allows echo fallback
export APERION_ALLOW_ECHO=true

B1: Secrets Management

All credentials are loaded from environment variables:

OPENAI_API_KEY
GEMINI_API_KEY
WORKERS_AI_API_KEY
SWITCHBOARD_API_KEY (optional - for Switchboard auth)

D1: Telemetry Injection

Extracts X-Correlation-ID from incoming requests
Generates one if missing (format: sw_{uuid})
Propagates to all upstream provider requests
Adds to response headers

D3: Structured Logging

All cost/latency metrics are logged as JSON:

{
  "event": "llm_request_cost",
  "correlation_id": "sw_abc123",
  "provider": "openai",
  "model": "gpt-4.1-mini",
  "estimated_cost_usd": 0.00015,
  "tokens": {"prompt": 100, "completion": 50, "total": 150},
  "latency_ms": 1234,
  "task_type": "security_audit"
}

🧪 Testing

# Run all tests
pytest

# Run safety tests (fail-closed verification)
pytest -m safety

# Run unit tests only
pytest -m unit

# Run integration tests (requires API keys)
pytest -m integration

# With coverage
pytest --cov=aperion_switchboard --cov-report=html

📊 Endpoints

Endpoint	Method	Description
`/v1/chat/completions`	POST	OpenAI-compatible chat
`/health`	GET	Health check
`/healthz`	GET	Kubernetes health probe
`/docs`	GET	OpenAPI documentation

🏗️ Architecture

src/aperion_switchboard/
├── core/
│   ├── router.py      # Task routing & fallback logic
│   ├── protocol.py    # LLMClient abstract base class
│   └── fail_closed.py # Constitution A6 enforcement
├── providers/
│   ├── openai.py      # OpenAI/compatible providers
│   ├── gemini.py      # Google Gemini
│   ├── workers.py     # Cloudflare Workers AI
│   └── echo.py        # Test-only echo provider
├── service/
│   ├── app.py         # FastAPI application
│   ├── middleware.py  # Auth, telemetry, cost logging
│   └── schemas.py     # OpenAI-compatible Pydantic models
└── main.py            # Entry point

📈 Cost Optimization

The Switchboard achieves ~75% cost savings by:

Routing 80% of requests (docs, linting, tests) to free tiers
Reserving premium providers for critical tasks only
Tracking and reporting cost per request

View cost summary:

from aperion_switchboard.core.router import get_router

router = get_router()
summary = router.get_cost_summary()
print(f"Savings: {summary['savings_percent']:.1f}%")

🔧 Development

# Install dev dependencies
pip install -e ".[dev]"

# Run linter
ruff check src tests

# Run type checker
mypy src

# Run tests with coverage
pytest --cov=aperion_switchboard --cov-report=term-missing

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.2.1

Feb 11, 2026

This version

0.1.0

Feb 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aperion_switchboard-0.1.0.tar.gz (72.8 kB view details)

Uploaded Feb 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

aperion_switchboard-0.1.0-py3-none-any.whl (47.1 kB view details)

Uploaded Feb 11, 2026 Python 3

File details

Details for the file aperion_switchboard-0.1.0.tar.gz.

File metadata

Download URL: aperion_switchboard-0.1.0.tar.gz
Upload date: Feb 11, 2026
Size: 72.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aperion_switchboard-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`4795deb4e5059cd2cd6e15c95da6a44a5abddc8453b260e086f4563893391a6b`
MD5	`2e1d6a4edee17666875eca3dbc834fab`
BLAKE2b-256	`595ddcfde5a08be4fe91b9e551c1ffe6728312bebc0eeabc52129d4398d306d2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for aperion_switchboard-0.1.0.tar.gz:

Publisher: release.yml on invictustitan2/aperion-llm-router

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: aperion_switchboard-0.1.0.tar.gz
- Subject digest: 4795deb4e5059cd2cd6e15c95da6a44a5abddc8453b260e086f4563893391a6b
- Sigstore transparency entry: 941827409
- Sigstore integration time: Feb 11, 2026
Source repository:
- Permalink: invictustitan2/aperion-llm-router@15c3c678cf02100bc76f7d9b3c2a1e3fdbfb7262
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/invictustitan2
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@15c3c678cf02100bc76f7d9b3c2a1e3fdbfb7262
- Trigger Event: push

File details

Details for the file aperion_switchboard-0.1.0-py3-none-any.whl.

File metadata

Download URL: aperion_switchboard-0.1.0-py3-none-any.whl
Upload date: Feb 11, 2026
Size: 47.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aperion_switchboard-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c10ece1cffeae844738f03f77c32e9e423cc2c25123df2838a4985b6ea6b7297`
MD5	`8d5bac57e35d1170beff8c0d0422d30e`
BLAKE2b-256	`2363649ea68f9a93c20606075d199f898cb6e71dba00276b61633d7435136509`

See more details on using hashes here.

Provenance

The following attestation bundles were made for aperion_switchboard-0.1.0-py3-none-any.whl:

Publisher: release.yml on invictustitan2/aperion-llm-router

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: aperion_switchboard-0.1.0-py3-none-any.whl
- Subject digest: c10ece1cffeae844738f03f77c32e9e423cc2c25123df2838a4985b6ea6b7297
- Sigstore transparency entry: 941827435
- Sigstore integration time: Feb 11, 2026
Source repository:
- Permalink: invictustitan2/aperion-llm-router@15c3c678cf02100bc76f7d9b3c2a1e3fdbfb7262
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/invictustitan2
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@15c3c678cf02100bc76f7d9b3c2a1e3fdbfb7262
- Trigger Event: push

aperion-switchboard 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

The Switchboard 🔌

🎯 Core Features

🚀 Quick Start

Installation

Configuration

Running

📡 API Usage

OpenAI-Compatible Endpoint

Task Types

🔒 Constitution Compliance

A6: Fail-Closed Semantics (Iron Rule)

B1: Secrets Management

D1: Telemetry Injection

D3: Structured Logging

🧪 Testing

📊 Endpoints

🏗️ Architecture

📈 Cost Optimization

🔧 Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance