Skip to main content

Production-ready runtime for building and orchestrating intelligent multi-agent AI systems

Project description

MUXI Runtime

License Python 3.10+

The execution engine for AI agent formations.

For most users: Install MUXI CLI for the complete experience. This repo is for contributors and developers embedding the runtime directly.

[!IMPORTANT]

MUXI Ecosystem

This repository is part of the larger MUXI ecosystem.

๐Ÿ“‹ Complete architectural overview: See muxi/ARCHITECTURE.md - explains how core repositories fit together, dependencies, status, and roadmap.

What is MUXI Runtime?

MUXI Runtime transforms declarative YAML configurations into running AI systems. It's the core engine that powers the MUXI Server.

Core responsibilities:

  • Formation execution - Loads and runs agent configurations from YAML
  • Overlord orchestration - Routes requests, manages clarifications, coordinates workflows
  • Memory systems - Three-tier memory (buffer, persistent, vector)
  • Tool integration - MCP protocol support for external tools
  • Multi-tenant isolation - User and session management

Architecture

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚     MUXI Server - Formation lifecycle management    โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                         โ”‚
                         โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚           MUXI Runtime โ—„โ”€โ”€ THIS REPO                โ”‚
โ”‚  โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”  โ”‚
โ”‚  โ”‚  Formation Engine (YAML loader & validator)   โ”‚  โ”‚
โ”‚  โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค  โ”‚
โ”‚  โ”‚  Overlord โ”‚ Agents โ”‚ Workflow โ”‚ Background    โ”‚  โ”‚
โ”‚  โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค  โ”‚
โ”‚  โ”‚  Memory โ”‚ MCP โ”‚ A2A โ”‚ LLM โ”‚ Observability     โ”‚  โ”‚
โ”‚  โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜  โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                         โ”‚
                         โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚   External Services (LLM APIs, MCP Servers, DBs)    โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Package Structure

The runtime uses src/muxi/runtime/ to share the muxi namespace with the Python SDK:

src/muxi/runtime/
โ”œโ”€โ”€ formation/          # Formation engine
โ”‚   โ”œโ”€โ”€ overlord/       # Central orchestration
โ”‚   โ”œโ”€โ”€ agents/         # Agent implementations
โ”‚   โ”œโ”€โ”€ workflow/       # Task decomposition, SOPs
โ”‚   โ”œโ”€โ”€ server/         # Formation API (FastAPI)
โ”‚   โ””โ”€โ”€ background/     # Webhooks, scheduling, async
โ”œโ”€โ”€ services/           # Runtime services
โ”‚   โ”œโ”€โ”€ memory/         # Memory systems
โ”‚   โ”œโ”€โ”€ mcp/            # MCP client
โ”‚   โ”œโ”€โ”€ a2a/            # Agent-to-agent
โ”‚   โ””โ”€โ”€ llm/            # LLM abstraction
โ””โ”€โ”€ datatypes/          # Type definitions

Quick Start

Using with MUXI Server (recommended)

# Install MUXI CLI
curl -fsSL https://muxi.ai/install | sh

# Create and run a formation
muxi new my-assistant
cd my-assistant
muxi dev

Embedding directly

# Base install (covers almost all use cases)
pip install muxi-runtime

# PyTorch variant โ€” for local embedding models without an ONNX export
pip install 'muxi-runtime[pytorch]'

# CUDA variant โ€” GPU-accelerated ONNX/PyTorch models + FAISS-GPU (NVIDIA only)
# Must uninstall CPU-only packages first to avoid conflicts:
pip uninstall -y faiss-cpu faissx onnxruntime
pip install 'muxi-runtime[cuda]'
from muxi.runtime import Formation
import asyncio

async def main():
    formation = Formation()
    await formation.load("formation.afs")
    overlord = await formation.start_overlord()

    response = await overlord.chat(
        "Hello!",
        user_id="user123"
    )
    print(response)

asyncio.run(main())

Docker Images

MUXI Runtime ships three image variants. All support linux/amd64 and linux/arm64 (except CUDA).

Most users should use the base variant. The PyTorch and CUDA variants exist for specific embedding workloads described below.

Variant SIF size Description Status
default (base) ~600 MB Lean runtime โ€” covers the vast majority of use cases Stable
pytorch larger Adds CPU-only PyTorch for local embedding models that lack ONNX exports Stable
cuda largest GPU-accelerated: ONNX and PyTorch local models + FAISS-GPU for faster vector ops Experimental

When to use each variant:

  • default โ€” the right choice for almost everyone. Uses ONNX-based local embedding models (fast, lightweight) and CPU FAISS.
  • pytorch โ€” only needed when you want to run a local embedding model that does not have an ONNX export and therefore requires the full PyTorch runtime.
  • cuda โ€” recommended for production workloads running on servers with NVIDIA GPUs. Supports both ONNX and PyTorch local models, and ships with FAISS-GPU for significantly faster vector similarity operations.
# Build the default (base) variant
./scripts/build/runtime.sh

# Build the PyTorch variant (requires default built first)
./scripts/build/runtime.sh --variant pytorch

# Build the CUDA variant (experimental, linux/amd64 + NVIDIA tooling required)
./scripts/build/runtime.sh --variant cuda

# Cross-compile for a specific platform
./scripts/build/runtime.sh --platform linux/amd64 --variant pytorch

SIF (Singularity/Apptainer)

Each variant can be converted to a .sif artifact for use with MUXI Server:

./scripts/build/sif.sh                        # default
./scripts/build/sif.sh --variant pytorch
./scripts/build/sif.sh --variant cuda         # experimental
./scripts/build/sif.sh --arch amd64           # force architecture

Note: On macOS and Windows the correct SIF architecture is always linux-amd64, regardless of host CPU. linux-arm64 SIFs only apply on native arm64 Linux hosts (e.g. AWS Graviton).

CUDA variant is experimental. It has not been end-to-end validated against live GPUs in CI and only builds on linux/amd64 hosts with NVIDIA tooling.

Development

git clone https://github.com/muxi-ai/runtime
cd runtime
pip install -e ".[dev]"

# Unit and integration tests
pytest tests/unit -v
pytest tests/integration -v

# E2E tests (standalone scripts, not pytest)
cd e2e && python run_all_tests.py          # full suite
cd e2e && python run_random_tests.py 10    # random sample
cd e2e/tests/<area> && python test_<name>.py  # single test

See contributing/README.md for contributor documentation.

Related Repositories

Repo Description
muxi-ai/muxi Main repo with architecture docs
muxi-ai/server Go server that hosts this runtime
muxi-ai/cli Command-line tool
muxi-ai/sdks Python, TypeScript, Go SDKs
muxi-ai/schemas API schemas

Documentation

License

Elastic License 2.0 - Free to use, modify, and embed in products. Cannot be offered as a hosted service.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

muxi_runtime-0.20260426.1.tar.gz (1.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

muxi_runtime-0.20260426.1-py3-none-any.whl (1.6 MB view details)

Uploaded Python 3

File details

Details for the file muxi_runtime-0.20260426.1.tar.gz.

File metadata

  • Download URL: muxi_runtime-0.20260426.1.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for muxi_runtime-0.20260426.1.tar.gz
Algorithm Hash digest
SHA256 1b1c3a71f51b47eccc6f3700128d774cdced191557314fe07dad383adc392621
MD5 ca734459e143a9769d682c14be9a6103
BLAKE2b-256 a82908035af0e0e961a0c1d710f0ff1e6ba6bf0c331c068a9cd1ca9a6836f440

See more details on using hashes here.

Provenance

The following attestation bundles were made for muxi_runtime-0.20260426.1.tar.gz:

Publisher: release.yml on muxi-ai/runtime

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file muxi_runtime-0.20260426.1-py3-none-any.whl.

File metadata

File hashes

Hashes for muxi_runtime-0.20260426.1-py3-none-any.whl
Algorithm Hash digest
SHA256 430c1bb5d8276cf75c549fdad1a13d503706916af20433ff3393739ddc010c4d
MD5 f5f1a4f9f9bfa1b7ced6f889090e4e91
BLAKE2b-256 543debc2aa8e834c4a1b10cd450a7bd6680f025a00062c6161709803865909dc

See more details on using hashes here.

Provenance

The following attestation bundles were made for muxi_runtime-0.20260426.1-py3-none-any.whl:

Publisher: release.yml on muxi-ai/runtime

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page