Skip to main content

Agentic Framework for Enterprise-Wide Execution with multi-LLM provider support, observability, and error tracking

Project description

Continuum
by ShyftLabs

The agent runtime for builders who ship.

Build, run, and deploy reliable AI agents at enterprise scale — multi-LLM routing, persistent memory, MCP-native tools, durable workflows, and full observability, out of the box.


Python 3.13+ License Version

CI Docs PRs Welcome Code of Conduct

📖 Documentation · ⚡ Quick start · ⚙️ Configuration · 🧩 Components · 🧪 Examples · 🤝 Contributing


Continuum is a production-grade Python framework for building, orchestrating, and shipping autonomous AI agents at enterprise scale. It unifies a clean, typed agent core with cost-aware multi-model inference, stateful long- and short-term memory, open standards-based tool calling, durable execution, and end-to-end observability — all behind one small, composable, type-safe API.

✨ Features

  • 🤖 Agentic core & orchestration — a strongly-typed agent primitive with full lifecycle hooks, schema-validated structured outputs, and nine composable multi-agent patterns (sequential, parallel, loop, routing, planning, reflection, debate, scatter, supervised).
  • 🔀 Smart Inference — cost-aware inference routing that classifies every request by complexity and dispatches it to the cheapest capable model, with seamless cross-provider failover and zero lock-in.
  • 🧠 Stateful memory — persistent semantic long-term recall plus low-latency working memory, with multi-tenant isolation scopes and built-in PII redaction for privacy-by-default agents.
  • 🔌 Open tool calling — plug into any standards-based tool ecosystem (Model Context Protocol) across multiple transports, with fine-grained capability scoping, context capture/injection, and rich generative-UI artifacts.
  • 🔁 Durable execution — long-running, crash- and restart-safe agent workflows with human-in-the-loop approval gates and exactly-once guarantees.
  • 🔭 Full observability — first-class distributed tracing, token/latency/error telemetry, and one-line function instrumentation for complete run transparency.
  • 🌐 Model-agnostic — target frontier or open-weight models through a single model string; swap providers without touching agent code.
  • 🤝 Multi-agent handoffs — context-preserving agent-to-agent delegation with history summarization, cycle detection, and depth control.
  • 📡 Real-time streaming — token-, tool-, handoff-, and memory-level events streamed the moment they happen.
  • Built-in evaluation — turn live production traces into golden datasets and regression-test agent quality with standard LLM-evaluation metrics.

🚀 Quick start

Requirements: Python 3.13+ and Docker (for Redis · Milvus/Qdrant · Langfuse).

git clone https://github.com/shyftlabs/continuum.git
cd continuum

python3.13 -m venv .venv && source .venv/bin/activate
pip install -e .

cp .env.template .env        # add your provider key(s) — see Configuration below
docker compose up -d         # Redis · Milvus/Qdrant · Langfuse

Your first agent:

import asyncio
from continuum.agent import BaseAgent, AgentRunner

async def main():
    agent = BaseAgent(
        name="hello-agent",
        instructions="You are a friendly assistant.",
        model="gpt-4o-mini",
    )
    runner = AgentRunner()
    response = await runner.run(agent, "Hi!")
    print(response.content)

asyncio.run(main())

AgentRunner.run() returns an AgentResponse with content, structured_output, usage, tool_calls, run_artifacts, latency_ms, and the full handoff chain. See the docs for streaming, tools/MCP, memory, handoffs, and workflows.

⚙️ Configuring Continuum

Continuum is configured through environment variables (copy .env.template.env). Set keys only for the providers and components you use — everything else has sensible defaults. The most common settings:

LLM providers & routing

Variable Description Example
OPENAI_API_KEY / ANTHROPIC_API_KEY / GEMINI_API_KEY Provider API keys — set the one(s) you use sk-…
DEFAULT_LLM_MODEL Default model (provider/model, or bare name for OpenAI) gemini/gemini-2.5-flash
FALLBACK_LLM_MODEL Model used if the default fails gpt-4o-mini
LLM_ENABLE_FALLBACK Automatically fall back on provider errors true
SMART_LAYER_ENABLED Enable cost-aware tier routing (Smart Inference) true

Memory (long-term) & embeddings

Variable Description Example
MEMORY_ENABLED Enable mem0-backed long-term memory true
VECTOR_STORE_PROVIDER Vector store backend qdrant / milvus
EMBEDDER_PROVIDER / EMBEDDER_MODEL Embedding provider & model openai / text-embedding-3-small
MEMORY_ISOLATION Scope of memory isolation user / agent / run / shared

Sessions (short-term)

Variable Description Example
SESSION_ENABLED Enable Redis-backed conversation sessions true
SESSION_REDIS_HOST / SESSION_REDIS_PORT Redis connection localhost / 6380
SESSION_TTL_SECONDS Session lifetime 172800

Observability (Langfuse)

Variable Description Example
LANGFUSE_ENABLED Enable tracing true
LANGFUSE_PUBLIC_KEY / LANGFUSE_SECRET_KEY Langfuse credentials pk-… / sk-…
LANGFUSE_HOST Langfuse endpoint http://localhost:3000

Temporal (optional, durable workflows)

Variable Description Example
TEMPORAL_ENABLED Enable durable workflow orchestration false
TEMPORAL_HOST Temporal frontend localhost:7233

Optional extras: pip install -e ".[temporal]" for Temporal, ".[eval]" for evaluation, ".[embeddings]" for local embeddings. See .env.template for the complete, annotated reference.

🧩 Components

Component What it does
Agents BaseAgent + AgentRunner — config, hooks, structured outputs, ReAct
Workflows Nine multi-agent patterns for chaining, branching, looping, and self-improvement
Smart Inference Request classifier + cost-aware model routing with fallback
Memory mem0 + Qdrant/Milvus (long-term) · Redis (sessions) · multi-tenant scopes
Tools / MCP MCP servers over Stdio/SSE/StreamableHTTP, tool filtering, widget artifacts
Temporal Durable, restart-safe workflows with human-in-the-loop gates
Observability Langfuse traces, metrics, @observe decorators
Evaluation Golden datasets + DeepEval / RAGAS metrics

📚 Documentation

Full documentation lives at docs.continuum.shyftlabs.io — guides for building & running agents, Smart Inference, memory, tools/MCP, workflows, handoffs, streaming, evaluation, and the research behind it.

Markdown sources are also in docs/ if you prefer reading on GitHub — e.g. agent.md, memory.md, tools.md, and the integration GUIDE.md.

🧪 Examples

Runnable demos live under playground/:

  • gateway-local-shop — an MCP server + agent + chat UI for a pet-shop assistant (end-to-end: server → agent → UI).
  • gateway-multi-agent-shop — a multi-agent workflow variant with routing and handoffs.
  • frontend/ — the demo web UIs (assortment, commerce-chat).

🤝 Contributing

Contributions are welcome! Please read CONTRIBUTING.md for the branch model, Conventional Commits, DCO sign-off, and local setup. By participating you agree to our Code of Conduct.

📄 License

Licensed under the Apache License, Version 2.0. Copyright © 2025–2026 ShyftLabs Inc.

For commercial / enterprise inquiries — SLAs, indemnification, hosted offerings, custom features — contact continuum@shyftlabs.io.


Built with ❤️ by ShyftLabs · continuum@shyftlabs.io

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shyftlabs_continuum-0.2.0.tar.gz (341.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

shyftlabs_continuum-0.2.0-py3-none-any.whl (419.9 kB view details)

Uploaded Python 3

File details

Details for the file shyftlabs_continuum-0.2.0.tar.gz.

File metadata

  • Download URL: shyftlabs_continuum-0.2.0.tar.gz
  • Upload date:
  • Size: 341.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for shyftlabs_continuum-0.2.0.tar.gz
Algorithm Hash digest
SHA256 b1615133c71150d8a872dc009325c048a86cbcc0c79acc84f7e77788f91a9102
MD5 ec5c4eefc130149d0ae6278028947811
BLAKE2b-256 4dd80f5770e336584dba752b069f71f3a8f3d712e278b71839769d0d18e72635

See more details on using hashes here.

File details

Details for the file shyftlabs_continuum-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for shyftlabs_continuum-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c15ef423a6f907e3ae09d6a967be8e0d6dfc22269de16234e662980ad6a21ada
MD5 28558b152794ef4c246ef6e99e710aec
BLAKE2b-256 89b3d9e72a9e2cd6800cf1830b592ef266c83541ac1df017d3cca410a4acdec5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page