Skip to main content

Agentic Framework for Enterprise-Wide Execution with multi-LLM provider support, observability, and error tracking

Project description

Continuum
by ShyftLabs

The agent runtime for builders who ship.

Build, run, and deploy reliable AI agents at enterprise scale — multi-LLM routing, persistent memory, MCP-native tools, durable workflows, and full observability, out of the box.


Python 3.13+ License Version

CI Docs PRs Welcome Code of Conduct

📖 Documentation · ⚡ Quick start · ⚙️ Configuration · 🧩 Components · 🧪 Examples · 🤝 Contributing


Continuum is a production-grade Python framework for building, orchestrating, and shipping autonomous AI agents at enterprise scale. It unifies a clean, typed agent core with cost-aware multi-model inference, stateful long- and short-term memory, open standards-based tool calling, durable execution, and end-to-end observability — all behind one small, composable, type-safe API.

✨ Features

  • 🤖 Agentic core & orchestration — a strongly-typed agent primitive with full lifecycle hooks, schema-validated structured outputs, and nine composable multi-agent patterns (sequential, parallel, loop, routing, planning, reflection, debate, scatter, supervised).
  • 🔀 Smart Inference — cost-aware inference routing that classifies every request by complexity and dispatches it to the cheapest capable model, with seamless cross-provider failover and zero lock-in.
  • 🧠 Stateful memory — persistent semantic long-term recall plus low-latency working memory, with multi-tenant isolation scopes and built-in PII redaction for privacy-by-default agents.
  • 🔌 Open tool calling — plug into any standards-based tool ecosystem (Model Context Protocol) across multiple transports, with fine-grained capability scoping, context capture/injection, and rich generative-UI artifacts.
  • 🔁 Durable execution — long-running, crash- and restart-safe agent workflows with human-in-the-loop approval gates and exactly-once guarantees.
  • 🔭 Full observability — first-class distributed tracing, token/latency/error telemetry, and one-line function instrumentation for complete run transparency.
  • 🌐 Model-agnostic — target frontier or open-weight models through a single model string; swap providers without touching agent code.
  • 🤝 Multi-agent handoffs — context-preserving agent-to-agent delegation with history summarization, cycle detection, and depth control.
  • 📡 Real-time streaming — token-, tool-, handoff-, and memory-level events streamed the moment they happen.
  • Built-in evaluation — turn live production traces into golden datasets and regression-test agent quality with standard LLM-evaluation metrics.

🚀 Quick start

Requirements: Python 3.13+ and Docker (for Redis · Milvus/Qdrant · Langfuse).

git clone https://github.com/shyftlabs/continuum.git
cd continuum

python3.13 -m venv .venv && source .venv/bin/activate
pip install -e .

cp .env.template .env        # add your provider key(s) — see Configuration below
docker compose up -d         # Redis · Milvus/Qdrant · Langfuse

Your first agent:

import asyncio
from continuum.agent import BaseAgent, AgentRunner

async def main():
    agent = BaseAgent(
        name="hello-agent",
        instructions="You are a friendly assistant.",
        model="gpt-4o-mini",
    )
    runner = AgentRunner()
    response = await runner.run(agent, "Hi!")
    print(response.content)

asyncio.run(main())

AgentRunner.run() returns an AgentResponse with content, structured_output, usage, tool_calls, run_artifacts, latency_ms, and the full handoff chain. See the docs for streaming, tools/MCP, memory, handoffs, and workflows.

⚙️ Configuring Continuum

Continuum is configured through environment variables (copy .env.template.env). Set keys only for the providers and components you use — everything else has sensible defaults. The most common settings:

LLM providers & routing

Variable Description Example
OPENAI_API_KEY / ANTHROPIC_API_KEY / GEMINI_API_KEY Provider API keys — set the one(s) you use sk-…
DEFAULT_LLM_MODEL Default model (provider/model, or bare name for OpenAI) gemini/gemini-2.5-flash
FALLBACK_LLM_MODEL Model used if the default fails gpt-4o-mini
LLM_ENABLE_FALLBACK Automatically fall back on provider errors true
SMART_LAYER_ENABLED Enable cost-aware tier routing (Smart Inference) true

Memory (long-term) & embeddings

Variable Description Example
MEMORY_ENABLED Enable mem0-backed long-term memory true
VECTOR_STORE_PROVIDER Vector store backend qdrant / milvus
EMBEDDER_PROVIDER / EMBEDDER_MODEL Embedding provider & model openai / text-embedding-3-small
MEMORY_ISOLATION Scope of memory isolation user / agent / run / shared

Sessions (short-term)

Variable Description Example
SESSION_ENABLED Enable Redis-backed conversation sessions true
SESSION_REDIS_HOST / SESSION_REDIS_PORT Redis connection localhost / 6380
SESSION_TTL_SECONDS Session lifetime 172800

Observability (Langfuse)

Variable Description Example
LANGFUSE_ENABLED Enable tracing true
LANGFUSE_PUBLIC_KEY / LANGFUSE_SECRET_KEY Langfuse credentials pk-… / sk-…
LANGFUSE_HOST Langfuse endpoint http://localhost:3000

Temporal (optional, durable workflows)

Variable Description Example
TEMPORAL_ENABLED Enable durable workflow orchestration false
TEMPORAL_HOST Temporal frontend localhost:7233

Optional extras: pip install -e ".[temporal]" for Temporal, ".[eval]" for evaluation, ".[embeddings]" for local embeddings. See .env.template for the complete, annotated reference.

🧩 Components

Component What it does
Agents BaseAgent + AgentRunner — config, hooks, structured outputs, ReAct
Workflows Nine multi-agent patterns for chaining, branching, looping, and self-improvement
Smart Inference Request classifier + cost-aware model routing with fallback
Memory mem0 + Qdrant/Milvus (long-term) · Redis (sessions) · multi-tenant scopes
Tools / MCP MCP servers over Stdio/SSE/StreamableHTTP, tool filtering, widget artifacts
Temporal Durable, restart-safe workflows with human-in-the-loop gates
Observability Langfuse traces, metrics, @observe decorators
Evaluation Golden datasets + DeepEval / RAGAS metrics

📚 Documentation

Full documentation lives at docs.continuum.shyftlabs.io — guides for building & running agents, Smart Inference, memory, tools/MCP, workflows, handoffs, streaming, evaluation, and the research behind it.

Markdown sources are also in docs/ if you prefer reading on GitHub — e.g. agent.md, memory.md, tools.md, and the integration GUIDE.md.

🧪 Examples

Runnable demos live under playground/:

  • gateway-local-shop — an MCP server + agent + chat UI for a pet-shop assistant (end-to-end: server → agent → UI).
  • gateway-multi-agent-shop — a multi-agent workflow variant with routing and handoffs.
  • frontend/ — the demo web UIs (assortment, commerce-chat).

🤝 Contributing

Contributions are welcome! Please read CONTRIBUTING.md for the branch model, Conventional Commits, DCO sign-off, and local setup. By participating you agree to our Code of Conduct.

📄 License

Licensed under the Apache License, Version 2.0. Copyright © 2025–2026 ShyftLabs Inc.

For commercial / enterprise inquiries — SLAs, indemnification, hosted offerings, custom features — contact continuum@shyftlabs.io.


Built with ❤️ by ShyftLabs · continuum@shyftlabs.io

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shyftlabs_continuum-0.2.1.tar.gz (341.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

shyftlabs_continuum-0.2.1-py3-none-any.whl (419.9 kB view details)

Uploaded Python 3

File details

Details for the file shyftlabs_continuum-0.2.1.tar.gz.

File metadata

  • Download URL: shyftlabs_continuum-0.2.1.tar.gz
  • Upload date:
  • Size: 341.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for shyftlabs_continuum-0.2.1.tar.gz
Algorithm Hash digest
SHA256 5d63ce75ccc89acb0d7ca136c55d6dc5b7f598d916c55ea167a34216055e9601
MD5 c6caa70573e62b478abe31774fe5c61a
BLAKE2b-256 a0d1cea6445dec8ad197a78fe1e92e7908cfdf7677a272b938160afa6a08a140

See more details on using hashes here.

File details

Details for the file shyftlabs_continuum-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for shyftlabs_continuum-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d4eb6d66e136d282a8d09449f7e156e2ef1bfa57ae0357f89465b709ab2b7b0a
MD5 8e9ce84e5b09f6aec8f50db078b12a98
BLAKE2b-256 395b85ee9c6a680505b9c1e6633eda4bd33954c0a027b969df8d0a56eb2608d5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page