Multi-agent orchestration system built with Microsoft Agent Framework's Magentic Fleet pattern

⚠️ Active Development Notice: APIs, signatures, and execution semantics can change between minor versions. Pin a version tag for production usage.

AgenticFleet – DSPy‑Enhanced Multi‑Agent Orchestration

AgenticFleet is a hybrid DSPy + Microsoft agent-framework runtime that delivers a self‑optimizing fleet of specialized AI agents. DSPy handles task analysis, routing, progress & quality assessment; agent-framework provides robust orchestration primitives, event streaming, and tool execution. Together they enable delegated, sequential, parallel, and handoff‑driven workflows with iterative refinement loops.

Key Features

Adaptive Routing – DSPy reasoner analyzes tasks and decides agent roster + execution mode (delegated / sequential / parallel).
Advanced Reasoning – Pluggable strategies per agent: ReAct for autonomous tool loops (Researcher) and Program of Thought for code-based logic (Analyst).
Quality Loops – Automatic Judge / Reviewer refinement when the quality score drops below a configurable threshold.
Tool‑Aware Decisions – Signatures include tool context; Reasoner recommends tool usage (code interpreter, search, browser, etc.).
Streaming Events – Emits OpenAI Responses‑compatible events for real‑time TUI / web UI updates.
Self‑Improvement – GEPA + BootstrapFewShot compilation refines routing from curated examples & execution history.
YAML‑Driven – Central workflow_config.yaml governs models, thresholds, agents, tracing, evaluation toggles.
Rich Ergonomics – Typer CLI (cli/console.py), dspy-fleet command, optional Vite frontend, history analytics scripts.
Safe Fallbacks – Graceful degradation when DSPy is unavailable (heuristic routing & quality scoring).
Extensible Toolkit – Add agents, tools, signatures, evaluation metrics with minimal boilerplate.
Azure Cosmos DB Persistence (optional) – Set one flag to mirror workflow runs, agent memories, DSPy datasets, and cache metadata into Cosmos NoSQL for durable, queryable telemetry.
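
The Safe Fallbacks item can be pictured with a small keyword heuristic. The sketch below is not AgenticFleet's actual fallback: `heuristic_route`, the keyword table, and the returned dictionary shape are assumptions made for illustration. Only the agent names and mode names come from this README.

```python
"""Hypothetical keyword-based routing fallback (illustrative only)."""

# Assumed keyword -> agent mapping; the real fallback rules may differ.
KEYWORDS = {
    "Researcher": ("research", "find", "latest", "search"),
    "Analyst": ("analyze", "calculate", "compare", "data"),
    "Writer": ("write", "draft", "summarize", "report"),
}


def heuristic_route(task: str) -> dict:
    """Pick agents by keyword match and derive an execution mode."""
    text = task.lower()
    agents = [
        name for name, words in KEYWORDS.items()
        if any(word in text for word in words)
    ]
    if not agents:
        # Nothing matched: hand the whole task to a single default agent.
        agents = ["Researcher"]
    # One agent -> delegated; several -> run them one after another.
    mode = "delegated" if len(agents) == 1 else "sequential"
    return {"assigned_to": agents, "mode": mode}


print(heuristic_route("Research the latest AI advances and write a report"))
```

A rule table like this is cheap and deterministic, which is what a fallback needs when no language model is available to do the routing.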

Architecture Overview

Four‑phase pipeline:

Task → [1] DSPy Analysis → [2] DSPy Routing → [3] Agent Execution → [4] Quality / Judge Assessment → (Optional Refinement)

Phase	Responsibility	Source
Analysis	Extract goals, complexity, constraints	`dspy_modules/reasoner.py` (`analyze_task`)
Routing	Pick agents + execution mode, tools	`dspy_modules/reasoner.py` (`route_task`)
Execution	Orchestrate agents & tools; stream events	`workflows/supervisor.py`
Quality	Score output, recommend improvements	`dspy_modules/reasoner.py` (`assess_quality` + Judge)

Workflow Diagram

graph TD
A[Task input] --> B[DSPy analysis]
B --> C[DSPy routing]
C --> D1[Agent execution delegated]
C --> D2[Agent execution sequential]
C --> D3[Agent execution parallel]
D1 --> E[Quality assessment]
D2 --> E
D3 --> E
E --> F[Final output]
E --> G[Refinement loop]
G --> F

Refinement triggers when score < threshold (default 8 or judge threshold ≥ 7). Handoffs coordinate multi‑agent chains via HandoffManager (in workflows/handoff.py).
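
The control flow of the four phases plus the refinement loop can be sketched in plain Python. Every function here is a stand-in (the real phases call DSPy and agent-framework), and `MAX_REFINEMENT_ROUNDS` and the toy scorer are assumptions; only the threshold of 8 comes from the text above.

```python
"""Toy four-phase pipeline with a bounded refinement loop (illustrative only)."""

QUALITY_THRESHOLD = 8       # the default quoted above
MAX_REFINEMENT_ROUNDS = 2   # assumed value for this sketch


def analyze(task: str) -> dict:
    # Phase 1: stand-in for DSPy task analysis.
    return {"task": task, "complexity": "simple" if len(task) < 40 else "complex"}


def route(analysis: dict) -> dict:
    # Phase 2: stand-in for DSPy routing.
    mode = "delegated" if analysis["complexity"] == "simple" else "sequential"
    return {"agents": ["Researcher"], "mode": mode}


def execute(task: str, routing: dict, round_no: int) -> str:
    # Phase 3: stand-in for agent execution; each round yields a new draft.
    return f"draft {round_no} for {task!r} via {routing['mode']}"


def assess(output: str, round_no: int) -> int:
    # Phase 4: stand-in scorer that improves by two points per round.
    return min(10, 5 + 2 * round_no)


def run_pipeline(task: str) -> dict:
    routing = route(analyze(task))
    round_no = 0
    output = execute(task, routing, round_no)
    score = assess(output, round_no)
    # Refine only while the score is below threshold and rounds remain.
    while score < QUALITY_THRESHOLD and round_no < MAX_REFINEMENT_ROUNDS:
        round_no += 1
        output = execute(task, routing, round_no)
        score = assess(output, round_no)
    return {"result": output, "quality": score, "rounds": round_no}


print(run_pipeline("What is DSPy?"))
```

Bounding the loop by both a score threshold and a round limit keeps a task that never reaches the threshold from refining forever.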

Consult: docs/developers/architecture.md & docs/guides/quick-reference.md.

Latency & Slow Phases

Common bottlenecks and how to mitigate:

DSPy compilation on first run
- Use cached compiled reasoner after first run; clear via uv run python -m agentic_fleet.scripts.manage_cache --clear
- Reduce GEPA effort in config/workflow_config.yaml (e.g., gepa_max_metric_calls, max_bootstrapped_demos)
- Set DSPY_COMPILE=false during rapid iteration
External tool calls (OpenAI, Tavily, Hosted Interpreter)
- Prefer a lighter Reasoner model (dspy.model: gpt-5-mini)
- Disable pre‑analysis tool usage for simple tasks
Judge/refinement loops
- Set quality.max_refinement_rounds: 1
- Use judge_reasoning_effort: minimal
Parallel fan‑out synthesis
- Cap execution.max_parallel_agents to a small number
- Enable streaming to surface progress early
History and tracing I/O
- Reduce verbosity in production; batch writes if needed
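
One way to picture the effect of capping parallel agents is a semaphore around each agent call. This is a sketch under assumptions: `run_agent`, `fan_out`, and the `stats` bookkeeping are hypothetical, and the constant only mimics what `execution.max_parallel_agents` is described as doing.

```python
import asyncio

MAX_PARALLEL_AGENTS = 2  # stand-in for execution.max_parallel_agents


async def run_agent(name: str, task: str, limit: asyncio.Semaphore, stats: dict) -> str:
    """Hypothetical agent call; the sleep stands in for model and tool latency."""
    async with limit:
        stats["running"] += 1
        stats["peak"] = max(stats["peak"], stats["running"])
        await asyncio.sleep(0.01)
        stats["running"] -= 1
        return f"{name}: done with {task!r}"


async def fan_out(task: str, agents: list[str]) -> tuple[list[str], int]:
    limit = asyncio.Semaphore(MAX_PARALLEL_AGENTS)
    stats = {"running": 0, "peak": 0}
    # gather() preserves input order, so results line up with `agents`.
    results = await asyncio.gather(
        *(run_agent(name, task, limit, stats) for name in agents)
    )
    return results, stats["peak"]


results, peak = asyncio.run(
    fan_out("Compare AWS vs Azure", ["Researcher", "Analyst", "Writer", "Reviewer"])
)
print(results, peak)
```

With four agents and a cap of two, no more than two calls are in flight at once, which bounds both cost and the size of the synthesis step's input burst.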

For timing analysis, run history analytics: uv run python src/agentic_fleet/scripts/analyze_history.py --timing.

Backend API & Performance

Recent changes target the responsiveness and scalability of the backend API:

Non-Blocking Architecture: Heavy background tasks, such as self-improvement analysis and DSPy compilation, are now offloaded to separate threads. This prevents blocking the main asyncio event loop, ensuring the API remains responsive to new requests even under load.
Job Store Abstraction: Background job state is no longer tied to a global variable. A pluggable JobStore interface (currently implemented as InMemoryJobStore) allows for easy future extension to persistent stores like Redis or Azure Cosmos DB.
Performance Benchmarking: A dedicated benchmark script (scripts/benchmark_api.py) is available to rigorously measure API latency, throughput, and error rates under concurrent load.
Real Models Only: DSPy now requires a real model ID (e.g., gpt-5-mini); the previous test-model/DummyLM path has been removed to avoid mock outputs during production runs.
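
The first two points can be illustrated together. In the sketch below, the method names on `JobStore` (`set_status`, `get_status`), the `heavy_compile` stand-in, and `run_job` are assumptions, not the project's actual interface; `asyncio.to_thread` is the standard-library call for moving blocking work off the event loop.

```python
import asyncio
import time
from typing import Optional, Protocol


class JobStore(Protocol):
    """Assumed minimal interface; the real JobStore may expose other methods."""

    def set_status(self, job_id: str, status: str) -> None: ...
    def get_status(self, job_id: str) -> Optional[str]: ...


class InMemoryJobStore:
    def __init__(self) -> None:
        self._jobs: dict[str, str] = {}

    def set_status(self, job_id: str, status: str) -> None:
        self._jobs[job_id] = status

    def get_status(self, job_id: str) -> Optional[str]:
        return self._jobs.get(job_id)


def heavy_compile() -> str:
    # Blocking stand-in for DSPy compilation or self-improvement analysis.
    time.sleep(0.05)
    return "compiled"


async def run_job(store: JobStore, job_id: str) -> str:
    store.set_status(job_id, "running")
    # to_thread keeps the event loop free while the blocking call runs.
    result = await asyncio.to_thread(heavy_compile)
    store.set_status(job_id, "done")
    return result


async def main() -> tuple[Optional[str], int]:
    store = InMemoryJobStore()
    job = asyncio.create_task(run_job(store, "job-1"))
    ticks = 0
    while not job.done():
        # The loop keeps serving other work while the job runs.
        ticks += 1
        await asyncio.sleep(0.01)
    await job
    return store.get_status("job-1"), ticks


status, ticks = asyncio.run(main())
print(status, ticks)
```

Because `run_job` depends only on the protocol, a Redis- or Cosmos-backed store could be swapped in without touching the job code.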

Directory Layout

Path	Purpose
`config/workflow_config.yaml`	Models, agents, thresholds, tracing, evaluation flags
`src/agentic_fleet/dspy_modules/`	DSPy Signatures & Reasoner implementation
`src/agentic_fleet/workflows/`	Flattened orchestration logic (`supervisor.py`, `handoff.py`, `strategies.py`)
`src/agentic_fleet/agents/`	Specialist configurations, factory, and prompts (`prompts.py`)
`src/agentic_fleet/api/`	FastAPI backend, DB models (`api/db`), settings (`api/settings.py`)
`src/agentic_fleet/tools/`	Tool adapters: Tavily, Browser, Hosted Interpreter, MCP
`src/agentic_fleet/utils/`	Compiler cache, GEPA optimizer, history, tracing, registry
`src/agentic_fleet/evaluation/`	Metrics & evaluator engine
`src/agentic_fleet/cli/console.py`	Rich / Typer CLI (dspy-fleet)
`examples/`	Minimal workflow samples
`scripts/`	Analysis, self-improvement, benchmarking, dataset gen
`logs/`	Execution history, compilation artifacts
`frontend/`	Optional Vite + React streaming UI

Installation

Python (uv recommended)

git clone https://github.com/Qredence/agentic-fleet.git
cd agentic-fleet

# Create and sync a local environment from pyproject.toml
uv sync

Standard pip

# From PyPI (library / CLI usage)
pip install agentic-fleet

# From a local clone (editable install)
pip install -e .

Optional Frontend

make frontend-install          # installs Node dependencies
make dev                       # runs backend + frontend dev servers

Playwright (Browser Tool)

playwright install chromium

Configuration & Environment

Create .env (or copy .env.example):

# Required for all model calls (validated at startup)
OPENAI_API_KEY=sk-...
# Optional: enables web search for the Researcher agent
TAVILY_API_KEY=tvly-...
# Toggle DSPy compilation (true/false)
DSPY_COMPILE=true
# Optional custom endpoint
OPENAI_BASE_URL=https://...
LANGFUSE_PUBLIC_KEY=...
LANGFUSE_SECRET_KEY=...

# Optional Azure Cosmos DB mirroring
AGENTICFLEET_USE_COSMOS=0
AZURE_COSMOS_ENDPOINT=https://<account>.documents.azure.com:443/
AZURE_COSMOS_USE_MANAGED_IDENTITY=0
AZURE_COSMOS_KEY=<primary-or-secondary-key>
AZURE_COSMOS_DATABASE=agentic-fleet
AGENTICFLEET_DEFAULT_USER_ID=local-dev
# Container overrides (use defaults unless you renamed them)
# AZURE_COSMOS_WORKFLOW_RUNS_CONTAINER=workflowRuns
# AZURE_COSMOS_AGENT_MEMORY_CONTAINER=agentMemory
# AZURE_COSMOS_DSPY_EXAMPLES_CONTAINER=dspyExamples
# AZURE_COSMOS_DSPY_OPTIMIZATION_RUNS_CONTAINER=dspyOptimizationRuns
# AZURE_COSMOS_CACHE_CONTAINER=cache

Note: The OPENAI_API_KEY environment variable is required and will be validated at startup. If missing, the application will fail with a clear error message.

Key YAML knobs (workflow_config.yaml):

dspy.model – Reasoner model (e.g. gpt-5-mini)
dspy.optimization.metric_threshold – Minimum routing accuracy
workflow.supervisor.max_rounds – Conversation turn limit
workflow.supervisor.enable_streaming – Event streaming toggle
agents.* – Per-agent model + temperature + tools
evaluation.* – Batch evaluation settings

Configuration Validation: The YAML configuration is automatically validated against a schema when loaded. Invalid values (e.g., out-of-range temperatures, invalid model names) raise ConfigurationError with a message indicating which field failed validation.

Cosmos mirroring is best-effort: if environment variables are missing or the containers are unreachable, workflows continue locally and a warning appears in the logs.
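
As a rough illustration of the schema validation described above, the sketch below checks two fields of an already-parsed config and names the failing field in the error. The `ConfigurationError` class here is a local stand-in, `validate_config` is hypothetical, and the 0.0 to 2.0 temperature range is an assumption based on the troubleshooting example.

```python
class ConfigurationError(ValueError):
    """Local stand-in for the ConfigurationError mentioned above."""


def validate_config(config: dict) -> dict:
    """Check a few fields and name the offending one on failure."""
    model = config.get("dspy", {}).get("model")
    if not isinstance(model, str) or not model.strip():
        raise ConfigurationError("dspy.model: expected a non-empty model ID")
    for name, agent in config.get("agents", {}).items():
        temperature = agent.get("temperature", 0.0)
        # Assumed range; the README cites temperature > 2.0 as invalid.
        if not 0.0 <= temperature <= 2.0:
            raise ConfigurationError(
                f"agents.{name}.temperature: {temperature} is outside [0.0, 2.0]"
            )
    return config


good = {"dspy": {"model": "gpt-5-mini"}, "agents": {"researcher": {"temperature": 0.3}}}
bad = {"dspy": {"model": "gpt-5-mini"}, "agents": {"writer": {"temperature": 3.5}}}

validate_config(good)
try:
    validate_config(bad)
except ConfigurationError as exc:
    print(f"rejected: {exc}")
```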

Quick Start

TUI / CLI

agentic-fleet                       # Launch interactive console (packaged entry point)

# Process a single task with streaming
agentic-fleet run -m "What is Gemini 3 Pro?" --verbose

# List available agents and their tools
agentic-fleet list-agents

# Run batch evaluation (uses config dataset by default)
agentic-fleet evaluate --max-tasks 5

# Module-style invocation (alternative)
python -m agentic_fleet.cli.console --help

Python API

import asyncio
from agentic_fleet.workflows import create_supervisor_workflow

async def main():
    workflow = await create_supervisor_workflow(compile_dspy=True)
    result = await workflow.run("Summarize transformer architecture evolution")
    print(result["result"])   # final output
    print(result["quality"])  # quality assessment details

asyncio.run(main())

Backend API

Start the FastAPI backend server:

./start_backend.sh
# Server runs at http://localhost:8000
# API Docs: http://localhost:8000/api/docs

Run the automated performance benchmark:

# Requires the backend to be running
python scripts/benchmark_api.py

Streaming

# Run inside an async function, with `workflow` created as in the Python API example
async for event in workflow.run_stream("Compare AWS vs Azure AI offerings"):
    # Handle MagenticAgentMessageEvent / WorkflowOutputEvent
    print(event)

Execution Modes

Mode	Description	Use Case
Delegated	Single agent manages entire task	Focused research, simple writeups
Sequential	Output of one feeds next	Research → Analyze → Write report
Parallel	Multiple agents concurrently; synthesis afterwards	Multi‑source comparisons
Handoff Chains	Explicit role transitions with artifacts	Complex coding + verification flows

The Reasoner chooses a mode based on task structure and training examples; the choice can be overridden via configuration, and explicit flags may be added in the future.
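
The data flow of the first three modes can be compared side by side with plain asyncio. This is a sketch, not the project's strategy code: `agent` is a hypothetical stand-in that wraps its input so the order of calls is visible in the output.

```python
import asyncio


async def agent(name: str, payload: str) -> str:
    """Hypothetical agent: tags the payload so the data flow is visible."""
    await asyncio.sleep(0)
    return f"{name}({payload})"


async def delegated(task: str) -> str:
    # One agent handles the whole task.
    return await agent("Researcher", task)


async def sequential(task: str) -> str:
    # Each agent receives the previous agent's output.
    result = task
    for name in ("Researcher", "Analyst", "Writer"):
        result = await agent(name, result)
    return result


async def parallel(task: str) -> str:
    # Agents run concurrently; a final step synthesizes their outputs.
    parts = await asyncio.gather(agent("Researcher", task), agent("Analyst", task))
    return await agent("Writer", " + ".join(parts))


async def main() -> dict:
    modes = {"delegated": delegated, "sequential": sequential, "parallel": parallel}
    return {mode: await run("task") for mode, run in modes.items()}


outputs = asyncio.run(main())
for mode, output in outputs.items():
    print(f"{mode}: {output}")
```

The nesting in the printed strings shows the difference: sequential output wraps each previous result, while parallel output joins sibling results before synthesis.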

Agents

Core specialists: Researcher, Analyst, Writer, Reviewer, Judge (quality). Extended handoff specialists: Planner, Executor, Coder, Verifier, Generator.

See AGENTS.md for detailed roles, tool usage, configuration examples, and selection guidelines.

DSPy Optimization

Training examples live in src/agentic_fleet/data/supervisor_examples.json:

{
  "task": "Research the latest AI advances",
  "team": "Researcher: web search\nAnalyst: code + data",
  "assigned_to": "Researcher,Analyst",
  "mode": "sequential"
}

Compilation (BootstrapFewShot + GEPA) occurs on first run (if DSPY_COMPILE=true). The cache is stored at logs/compiled_supervisor.pkl. Refresh via:

uv run python -m agentic_fleet.scripts.manage_cache --clear
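
The compile-once-then-cache behaviour can be sketched with the standard library. `load_or_compile` and `fake_compile` are hypothetical, the `enabled` flag stands in for the DSPY_COMPILE toggle, and the use of pickle is inferred only from the `.pkl` file name; this is not the project's cache code.

```python
import pickle
import tempfile
from pathlib import Path


def load_or_compile(cache_path: Path, compile_fn, enabled: bool = True):
    """Return (artifact, source); source is 'cache', 'compiled' or 'skipped'."""
    if not enabled:
        # Mirrors DSPY_COMPILE=false: skip compilation entirely.
        return None, "skipped"
    if cache_path.exists():
        # Only unpickle files your own process wrote.
        with cache_path.open("rb") as fh:
            return pickle.load(fh), "cache"
    artifact = compile_fn()
    cache_path.parent.mkdir(parents=True, exist_ok=True)
    with cache_path.open("wb") as fh:
        pickle.dump(artifact, fh)
    return artifact, "compiled"


def fake_compile() -> dict:
    # Stand-in for the expensive BootstrapFewShot + GEPA step.
    return {"demos": ["example-1", "example-2"]}


with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "logs" / "compiled_supervisor.pkl"
    first = load_or_compile(path, fake_compile)
    second = load_or_compile(path, fake_compile)
    path.unlink()  # same effect as clearing the cache
    third = load_or_compile(path, fake_compile)

print(first[1], second[1], third[1])
```

The first call pays the compilation cost, the second reads the cached artifact, and deleting the file forces a recompile on the next run.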

Observability & History

History: Structured events appended to logs/execution_history.jsonl.
Tracing: Enable OpenTelemetry in YAML; export to AI Toolkit / OTLP endpoint.
Logging: Adjustable log level via env (AGENTIC_FLEET_LOG_LEVEL=DEBUG).
Analysis: scripts/analyze_history.py --all surfaces aggregate metrics.
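
Because the history file is JSON Lines, ad hoc analysis needs only the standard library. The record fields below (`mode`, `quality`, `duration_s`) are hypothetical; inspect a real line of logs/execution_history.jsonl for the actual schema before adapting this sketch.

```python
import json
from collections import defaultdict
from statistics import mean

# Hypothetical records; the real execution_history.jsonl schema may differ.
SAMPLE_JSONL = """\
{"task": "a", "mode": "delegated", "quality": 9, "duration_s": 4.0}
{"task": "b", "mode": "sequential", "quality": 7, "duration_s": 12.0}
{"task": "c", "mode": "delegated", "quality": 8, "duration_s": 6.0}
"""


def summarize(lines) -> dict:
    """Group runs by execution mode and average quality and duration."""
    by_mode = defaultdict(list)
    for line in lines:
        line = line.strip()
        if line:  # skip blank lines
            record = json.loads(line)
            by_mode[record["mode"]].append(record)
    return {
        mode: {
            "runs": len(records),
            "avg_quality": mean(r["quality"] for r in records),
            "avg_duration_s": mean(r["duration_s"] for r in records),
        }
        for mode, records in by_mode.items()
    }


summary = summarize(SAMPLE_JSONL.splitlines())
print(summary)
```

To run it against a real file, pass an open file handle instead of the sample string's lines.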

Evaluation & Self-Improvement

Run batch evaluations against curated tasks:

uv run python -m agentic_fleet.cli.console analyze --dataset data/evaluation_tasks.jsonl

Generate evaluation datasets from history:

uv run python scripts/create_history_evaluation.py

Self‑improve routing by folding high‑quality history examples back into DSPy training:

uv run python scripts/self_improve.py --max 50

Testing & Quality

make check                 # lint + format (Ruff), type‑check (ty)
make test                  # run pytest suite
PYTHONPATH=. uv run pytest tests/workflows/test_execution_strategies.py -q

Key test domains: routing accuracy, tool registry integration, judge refinement, lazy compilation, tracing hooks.

Troubleshooting

Symptom	Cause	Fix
Missing web citations	`TAVILY_API_KEY` unset	Export key or set in `.env`
Workflow startup fails	Missing `OPENAI_API_KEY`	Set `OPENAI_API_KEY` in `.env` or environment
Configuration error	Invalid YAML values	Check `workflow_config.yaml` for invalid values (e.g., temperature > 2.0)
Slow first run	DSPy compilation	Enable cache; reduce `max_bootstrapped_demos`
No streaming output	`enable_streaming=false`	Toggle in YAML
Low quality score	Insufficient examples	Add training examples; rerun compilation
Tool warning	Name mismatch	Verify tool name & registry entry

Detailed guides: docs/users/troubleshooting.md, docs/guides/dspy-optimizer.md.

Contributing

Fork / branch (breaking-refactor for large changes)
Add or update tests (prefer focused unit tests over broad integration when possible)
Run make check and ensure no style / type errors
Update docs (README, AGENTS.md, or relevant guide) for user‑visible changes
Submit PR with clear rationale & architectural notes (link to docs/developers/architecture.md sections if modifying internals)

Please see: docs/developers/contributing.md.

License

MIT License – see LICENSE.

Acknowledgments

Microsoft agent-framework – Orchestration, events & tool interfaces
DSPy (Stanford NLP) – Prompt optimization & structured signatures
Tavily – Reliable, citation‑rich web search
OpenAI Responses – Event paradigm enabling unified CLI/TUI/frontend streaming

Release history

Version	Date
0.6.95	Dec 17, 2025
0.6.9	Dec 7, 2025
0.6.7	Dec 5, 2025
0.6.6	Dec 2, 2025
0.6.5	Nov 28, 2025
0.6.4	Nov 26, 2025
0.6.2 (this version)	Nov 22, 2025
0.6.0	Nov 13, 2025
0.5.5	Nov 5, 2025
0.5.4	Oct 23, 2025
0.5.3	Oct 20, 2025
0.5.2	Oct 16, 2025
0.5.1	Oct 16, 2025
0.5.0	Oct 12, 2025
0.4.90	Feb 26, 2025
0.4.81	Feb 20, 2025
0.4.80	Feb 16, 2025
0.4.78	Feb 10, 2025
0.4.73	Feb 8, 2025
0.4.72	Feb 3, 2025
0.4.71	Feb 2, 2025
0.4.65	Jan 31, 2025
0.4.61	Jan 25, 2025
0.4.60	Jan 23, 2025
0.4.50	Jan 19, 2025
0.4.31	Jan 16, 2025
0.4.22	Jan 13, 2025
0.4.21	Jan 13, 2025
0.4.20	Jan 13, 2025
0.4.12	Jan 13, 2025
0.4.9	Feb 24, 2025
0.4.3	Jan 15, 2025
0.4.1	Jan 9, 2025
0.4.0	Jan 8, 2025
0.3.6	Dec 29, 2024
0.3.2	Dec 27, 2024
0.3.1	Dec 27, 2024
0.3.0	Dec 27, 2024
0.2.0	Dec 14, 2024
0.1.6	Dec 8, 2024
0.1.5	Dec 8, 2024

Download files

Source Distribution

agentic_fleet-0.6.2.tar.gz (387.9 kB)

Uploaded Nov 22, 2025 Source

Built Distribution

agentic_fleet-0.6.2-py3-none-any.whl (228.3 kB)

Uploaded Nov 22, 2025 Python 3

File details

Details for the file agentic_fleet-0.6.2.tar.gz.

File metadata

Download URL: agentic_fleet-0.6.2.tar.gz
Upload date: Nov 22, 2025
Size: 387.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.10

File hashes

Hashes for agentic_fleet-0.6.2.tar.gz
Algorithm	Hash digest
SHA256	`e1cb585f6faba1a8c1fcdac255dde00c728bbbb46180ff6451798a3ba8c3ef0d`
MD5	`ce477a944ba03f6eb59b31717426109f`
BLAKE2b-256	`5a50d6684d7d8878a0f94eaaa9ad8c1c352d0e2c7908c94d6b8653397a7aa523`

File details

Details for the file agentic_fleet-0.6.2-py3-none-any.whl.

File metadata

Download URL: agentic_fleet-0.6.2-py3-none-any.whl
Upload date: Nov 22, 2025
Size: 228.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.10

File hashes

Hashes for agentic_fleet-0.6.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`607821657bd42a831178afbf37631625d5959b6df2910e01c3a4a4075060a2da`
MD5	`2e725d43b65a472ce15dce86bc9e4a64`
BLAKE2b-256	`90a3c5460d006a371060fe37b22511d52837856006aaf92951862be4487dd47f`
