Add your description here

Project description

Flock Banner

Python Version

Flock 0.5: Declarative Blackboard Multi-Agent Orchestration

Stop engineering prompts. Start declaring contracts.

Flock is a production-focused framework for orchestrating AI agents through declarative type contracts and blackboard architecture, proven patterns from distributed systems, decades of engaging with micro-service landscapes and classical AI, now applied to modern LLMs.

The Problem With Current Approaches

Building production multi-agent systems today means dealing with:

🔥 Prompt Engineering Hell

prompt = """You are an expert code reviewer. When you receive code, you should...
[498 more lines of instructions that the LLM ignores half the time]"""

# 500-line prompt that breaks when models update

# How do I know that there isn't an even better prompt (you don't) -> proof of 'best possible performane' impossible

🧪 Testing Nightmares

# How do you unit test this?
result = llm.invoke(prompt)  # Hope for valid JSON
data = json.loads(result.content)  # Crashes in production

📐 Rigid topology and tight coupling

# Want to add a new agent? Rewrite the entire graph.
workflow.add_edge("agent_a", "agent_b")
workflow.add_edge("agent_b", "agent_c")
# Add agent_d? Start rewiring...

💀 Single point of failure: Orchestrator dies? Everything dies.

# Orchestrator dies? Everything dies.

🧠 God object anti-pattern:

# One orchestrator needs domain knowledge of 20+ agents to route correctly
# Orchestrator 'guesses' next agent based on a natural language description. Hardly fit for critical systems.

These aren't framework limitations, they're architectural choices that don't scale.

Most issues are solvable, because decades of experience with micro services taught us hard lessons about decoupling, orchestration and reliability already. Let's use these learnings!

The Flock Approach

Flock takes a different path, combining two proven patterns:

1. Declarative Type Contracts (Not Prompts)

Traditional approach:

prompt = "Analyze this bug report and return JSON with severity, category, hypothesis..."
result = llm.invoke(prompt)  # Hope it works

The Flock way:

@flock_type
class BugDiagnosis(BaseModel):
    severity: str = Field(pattern="^(Critical|High|Medium|Low)$")
    category: str = Field(description="Bug category")
    root_cause_hypothesis: str = Field(min_length=50)
    confidence_score: float = Field(ge=0.0, le=1.0)

# The schema IS the instruction. No 500-line prompt needed.
agent.consumes(BugReport).publishes(BugDiagnosis)

Flock Banner

Why this matters:

✅ Survives model upgrades - GPT-6 will still understand Pydantic schemas
✅ Runtime validation - Errors caught at parse time, not in production
✅ Testable - Mock inputs/outputs with concrete types
✅ Self-documenting - The code tells you what agents do

2. Blackboard Architecture (Not Directed Graphs)

Graph-based approach:

# Explicit workflow with hardcoded edges
workflow.add_edge("radiologist", "diagnostician")
workflow.add_edge("lab_tech", "diagnostician")
# Add performance_analyzer? Rewrite the graph.

The Flock way (blackboard):

# Agents subscribe to types, workflows emerge
radiologist = flock.agent("radiologist").consumes(Scan).publishes(XRayAnalysis)
lab_tech = flock.agent("lab_tech").consumes(Scan).publishes(LabResults)
diagnostician = flock.agent("diagnostician").consumes(XRayAnalysis, LabResults).publishes(Diagnosis)

# Add performance_analyzer? Just subscribe it:
performance = flock.agent("perf").consumes(Scan).publishes(PerfAnalysis)
# Done. No graph rewiring. Diagnostician can optionally consume it.

What just happened:

✅ Parallel execution - Radiologist and lab_tech run concurrently (automatic)
✅ Dependency resolution - Diagnostician waits for both inputs (automatic)
✅ Loose coupling - Agents don't know about each other, just data types
✅ Scalable - O(n) complexity, not O(n²) edges

This is not a new idea. Blackboard architecture powered groundbreaking AI systems since the 1970s (Hearsay-II, HASP/SIAP, BB1). We're applying proven patterns to modern LLMs.

Quick Start (60 Seconds)

pip install flock-core
export OPENAI_API_KEY="sk-..."
# Optional: export DEFAULT_MODEL (falls back to hard-coded default if unset)
export DEFAULT_MODEL="openai/gpt-4.1"

import os
import asyncio
from pydantic import BaseModel, Field
from flock import Flock, flock_type

# 1. Define typed artifacts
@flock_type
class CodeSubmission(BaseModel):
    code: str
    language: str

@flock_type
class BugAnalysis(BaseModel):
    bugs_found: list[str]
    severity: str = Field(pattern="^(Critical|High|Medium|Low|None)$")
    confidence: float = Field(ge=0.0, le=1.0)

@flock_type
class SecurityAnalysis(BaseModel):
    vulnerabilities: list[str]
    risk_level: str = Field(pattern="^(Critical|High|Medium|Low|None)$")

@flock_type
class FinalReview(BaseModel):
    overall_assessment: str = Field(pattern="^(Approve|Approve with Changes|Reject)$")
    action_items: list[str]

# 2. Create the blackboard
flock = Flock(os.getenv("DEFAULT_MODEL", "openai/gpt-4.1"))

# 3. Agents subscribe to types (NO graph wiring!)
bug_detector = flock.agent("bug_detector").consumes(CodeSubmission).publishes(BugAnalysis)
security_auditor = flock.agent("security_auditor").consumes(CodeSubmission).publishes(SecurityAnalysis)

# This agent AUTOMATICALLY waits for both analyses
final_reviewer = flock.agent("final_reviewer").consumes(BugAnalysis, SecurityAnalysis).publishes(FinalReview)

# 4. Run with real-time dashboard
async def main():
    await flock.serve(dashboard=True)

asyncio.run(main())

What happened:

Bug detector and security auditor ran in parallel (both consume CodeSubmission)
Final reviewer automatically waited for both
Zero prompts written - types defined the behavior
Zero graph edges - subscriptions created the workflow
Full type safety - Pydantic validates all outputs

Core Concepts

Typed Artifacts (The Vocabulary)

Every piece of data on the blackboard is a validated Pydantic model:

@flock_type
class PatientDiagnosis(BaseModel):
    condition: str = Field(min_length=10)
    confidence: float = Field(ge=0.0, le=1.0)
    recommended_treatment: list[str] = Field(min_length=1)
    follow_up_required: bool

Benefits:

Runtime validation ensures quality
Field constraints prevent bad outputs
Self-documenting data structures
Version-safe (types survive model updates)

Agent Subscriptions (The Rules)

Agents declare what they consume and produce:

analyzer = (
    flock.agent("analyzer")
    .description("Analyzes patient scans")  # Optional: improves multi-agent coordination
    .consumes(PatientScan)                   # What triggers this agent
    .publishes(PatientDiagnosis)             # What it produces
)

Advanced subscriptions:

# Conditional consumption - only high-severity cases
urgent_care = flock.agent("urgent").consumes(
    Diagnosis,
    where=lambda d: d.severity in ["Critical", "High"]
)

# Batch processing - wait for 10 items
batch_processor = flock.agent("batch").consumes(
    Event,
    batch=BatchSpec(size=10, timeout=timedelta(seconds=30))
)

# Join operations - wait for multiple types within time window
correlator = flock.agent("correlator").consumes(
    SignalA,
    SignalB,
    join=JoinSpec(within=timedelta(minutes=5))
)

Visibility Controls (The Security)

Unlike other frameworks, Flock has zero-trust security built-in:

# Multi-tenancy (SaaS isolation)
agent.publishes(CustomerData, visibility=TenantVisibility(tenant_id="customer_123"))

# Explicit allowlist (HIPAA compliance)
agent.publishes(MedicalRecord, visibility=PrivateVisibility(agents={"physician", "nurse"}))

# Role-based access control
agent.identity(AgentIdentity(name="analyst", labels={"clearance:secret"}))
agent.publishes(IntelReport, visibility=LabelledVisibility(required_labels={"clearance:secret"}))

# Time-delayed release (embargo periods)
artifact.visibility = AfterVisibility(ttl=timedelta(hours=24), then=PublicVisibility())

# Public (default)
agent.publishes(PublicReport, visibility=PublicVisibility())

Why this matters: Financial services, healthcare, defense, SaaS platforms all need this for compliance. Other frameworks make you build it yourself.

Batching Pattern: Parallel Execution Control

A key differentiator: The separation of publish() and run_until_idle() enables parallel execution.

# ✅ EFFICIENT: Batch publish, then run in parallel
for review in customer_reviews:
    await flock.publish(review)  # Just scheduling work

await flock.run_until_idle()  # All sentiment_analyzer agents run concurrently!

# Get all results
analyses = await flock.store.get_by_type(SentimentAnalysis)
# 100 analyses completed in ~1x single review processing time!

Why this separation matters:

⚡ Parallel execution - Process 100 customer reviews concurrently
🎯 Batch control - Publish multiple artifacts, execute once
🔄 Multi-type workflows - Publish different types, trigger different agents in parallel
📊 Better performance - Process 1000 items in the time it takes to process 1

Comparison to other patterns:

# ❌ If run_until_idle() was automatic (like most frameworks):
for review in customer_reviews:
    await flock.publish(review)  # Would wait for completion each time!
# Total time: 100x single execution (sequential)

# ✅ With explicit batching:
for review in customer_reviews:
    await flock.publish(review)  # Fast: just queuing
await flock.run_until_idle()
# Total time: ~1x single execution (parallel)

Production Safety Features

Built-in safeguards prevent common production failures:

# Circuit breakers prevent runaway costs
flock = Flock("openai/gpt-4.1", max_agent_iterations=1000)

# Feedback loop protection
critic = (
    flock.agent("critic")
    .consumes(Essay)
    .publishes(Critique)
    .prevent_self_trigger(True)  # Won't trigger itself infinitely
)

# Best-of-N execution (run 5x, pick best)
agent.best_of(5, score=lambda result: result.metrics["confidence"])

# Configuration validation
agent.best_of(150, ...)  # ⚠️ Warns: "best_of(150) is very high - high LLM costs"

Production-Ready Observability

Real-Time Dashboard

Start the dashboard with one line:

await flock.serve(dashboard=True)

The dashboard provides comprehensive real-time visibility into your agent system with professional UI/UX:

Flock Agent View Agent View: See agent communication patterns and message flows in real-time

Key Features:

Dual Visualization Modes:
- Agent View - Agents as nodes with message flows as edges
- Blackboard View - Messages as nodes with data transformations as edges

Flock Blackboard View Blackboard View: Track data lineage and transformations across the system

Real-Time Updates:
- WebSocket streaming with 2-minute heartbeat
- Live agent activation and message publication
- Auto-layout with Dagre algorithm
Interactive Graph:
- Drag nodes, zoom, pan, and explore topology
- Double-click nodes to open detail windows
- Right-click for context menu with auto-layout options:
  - 5 Layout Algorithms: Hierarchical (Vertical/Horizontal), Circular, Grid, and Random
  - Smart Spacing: Dynamic 200px minimum clearance based on node dimensions
  - Viewport Centering: Layouts always center around current viewport
- Add modules dynamically from context menu
Advanced Filtering:
- Correlation ID tracking for workflow tracing
- Time range filtering (last 5/10/60 minutes or custom)
- Active filter pills with one-click removal
- Autocomplete search with metadata preview
Control Panel:
- Publish artifacts from the UI
- Invoke agents manually
- Monitor system health
Keyboard Shortcuts:
- Ctrl+M - Toggle view mode
- Ctrl+F - Focus filter
- Ctrl+/ - Show shortcuts help
- WCAG 2.1 AA compliant accessibility

Production-Grade Trace Viewer

The dashboard includes a Jaeger-style trace viewer with 7 powerful visualization modes:

Trace Viewer: Timeline view showing span hierarchies and execution flow

7 Trace Viewer Modes:

Timeline - Waterfall visualization with parent-child relationships
Statistics - Sortable table view with durations and error tracking
RED Metrics - Rate, Errors, Duration monitoring for service health
Dependencies - Service-to-service communication analysis
DuckDB SQL - Interactive SQL query editor with CSV export
Configuration - Real-time service/operation filtering
Guide - Built-in documentation and query examples

Additional Features:

Full I/O Capture - Complete input/output data for every operation
JSON Viewer - Collapsible tree structure with expand all/collapse all
Multi-Trace Support - Open and compare multiple traces simultaneously
Smart Sorting - Sort by date, span count, or duration
CSV Export - Download query results for offline analysis

Trace Viewer: Dependency Analysis

OpenTelemetry + DuckDB Tracing

One environment variable enables comprehensive tracing:

export FLOCK_AUTO_TRACE=true
export FLOCK_TRACE_FILE=true

python your_app.py
# Traces stored in .flock/traces.duckdb

AI-queryable debugging:

import duckdb
conn = duckdb.connect('.flock/traces.duckdb', read_only=True)

# Find bottlenecks
slow_ops = conn.execute("""
    SELECT name, AVG(duration_ms) as avg_ms, COUNT(*) as count
    FROM spans
    WHERE duration_ms > 1000
    GROUP BY name
    ORDER BY avg_ms DESC
""").fetchall()

# Find errors with full context
errors = conn.execute("""
    SELECT name, status_description,
           json_extract(attributes, '$.input') as input,
           json_extract(attributes, '$.output') as output
    FROM spans
    WHERE status_code = 'ERROR'
""").fetchall()

Real debugging session:

You: "My pizza agent is slow"
AI: [queries DuckDB]
    "DSPyEngine.evaluate takes 23s on average.
     Input size: 50KB of conversation history.
     Recommendation: Limit context to last 5 messages."

Why DuckDB? 10-100x faster than SQLite for analytical queries. Zero configuration. AI agents can debug your AI agents.

Trace Viewer: DuckDB Query

Framework Comparison

Architectural Differences

Flock uses a fundamentally different coordination pattern than most multi-agent frameworks:

Dimension	Graph-Based Frameworks	Chat-Based Frameworks	Flock (Blackboard)
Core Pattern	Directed graph with explicit edges	Round-robin conversation	Blackboard subscriptions
Coordination	Manual edge wiring	Message passing	Type-based subscriptions
Parallelism	Manual (split/join nodes)	Sequential turn-taking	Automatic (concurrent consumers)
Type Safety	Varies (often TypedDict)	Text-based messages	Pydantic + runtime validation
Coupling	Tight (hardcoded successors)	Medium (conversation context)	Loose (type subscriptions only)
Adding Agents	Rewrite graph topology	Update conversation flow	Just subscribe to types
Testing	Requires full graph	Requires full group	Individual agent isolation
Security Model	DIY implementation	DIY implementation	Built-in (5 visibility types)
Scalability	O(n²) edge complexity	Limited by turn-taking	O(n) subscription complexity

When Flock Wins

✅ Use Flock when you need:

Parallel agent execution - Agents consuming the same type run concurrently automatically
Type-safe outputs - Pydantic validation catches errors at runtime
Minimal prompt engineering - Schemas define behavior, not natural language
Dynamic agent addition - Subscribe new agents without rewiring existing workflows
Testing in isolation - Unit test individual agents with mock inputs
Built-in security - 5 visibility types for compliance (HIPAA, SOC2, multi-tenancy)
10+ agents - Linear complexity stays manageable at scale

When Alternatives Win

⚠️ Consider graph-based frameworks when:

You need extensive ecosystem integration with existing tools
Your workflow is inherently sequential (no parallelism needed)
You want battle-tested maturity (larger communities, more documentation)
Your team has existing expertise with those frameworks

⚠️ Consider chat-based frameworks when:

You prefer conversation-based development patterns
Your use case maps naturally to turn-taking dialogue
You need features specific to those ecosystems

Honest Trade-offs

You trade:

Ecosystem maturity (established frameworks have larger communities)
Extensive documentation (we're catching up)
Battle-tested age (newer architecture means less production history)

You gain:

Better scalability (O(n) vs O(n²) complexity)
Type safety (runtime validation vs hope)
Cleaner architecture (loose coupling vs tight graphs)
Production safety (circuit breakers, feedback prevention built-in)
Security model (5 visibility types vs DIY)

Different frameworks for different priorities. Choose based on what matters to your team.

Production Readiness

What Works Today (v0.5.0)

✅ Production-ready core:

more than 700 tests, with >75% coverage (>90% on critical paths)
Blackboard orchestrator with typed artifacts
Parallel + sequential execution (automatic)
Zero-trust security (5 visibility types)
Circuit breakers and feedback loop prevention
OpenTelemetry distributed tracing with DuckDB storage
Real-time dashboard with 7-mode trace viewer
MCP integration (Model Context Protocol)
Best-of-N execution, batch processing, join operations
Type-safe retrieval API (get_by_type())

⚠️ What's missing for large-scale production:

Persistent blackboard - Currently in-memory only
Advanced retry logic - Basic only
Event replay - No Kafka integration yet
Kubernetes-native deployment - No Helm chart yet
OAuth/RBAC - Dashboard has no auth

All planned for v1.0

Recommended Use Cases Today

✅ Good fit right now:

Startups/MVPs - Fast iteration, type safety, built-in observability
Internal tools - Where in-memory blackboard is acceptable
Research/prototyping - Rapid experimentation with clean architecture
Medium-scale systems (10-50 agents, 1000s of artifacts)

⚠️ Wait for 1.0 if you need:

Enterprise persistence (multi-region, high availability)
Compliance auditing (immutable event logs)
Multi-tenancy SaaS (with OAuth/SSO)
Mission-critical systems with 99.99% uptime requirements

Flock 0.5.0 is production-ready for the right use cases. Know your requirements.

Roadmap to 1.0

We're not building a toy framework. We're building enterprise infrastructure for AI agents.

See ROADMAP.md for the complete roadmap with detailed code examples.

Flock 1.0 - Q4 2025 Release

We're confident to deliver all enterprise features by Q4 2025:

🏢 Enterprise Persistence

Redis and PostgreSQL backends for durable blackboard state
Agent crashes? State persists, agents resume automatically
Multi-region deployments with shared blackboard
SQL queries on artifact history for analytics and compliance

🔄 Advanced Error Handling

Exponential backoff with jitter for transient failures
Dead letter queues for poison messages
Per-agent circuit breakers with auto-recovery
Full observability of all failure modes

🤝 Aggregation Patterns

Map-reduce pattern for parallel processing → aggregation
Voting/consensus for multi-agent decision making
Best-result selection with custom scoring functions

📨 Kafka Event Backbone

Event replay for debugging production issues in development
Time-travel debugging with checkpoint restoration
Immutable audit logs for regulatory compliance
Backfill new agents with historical data

☸️ Kubernetes-Native Deployment

Helm charts for production deployments
Horizontal auto-scaling based on blackboard queue depth
Zero-downtime deployments with health checks
Production-grade readiness probes

🔐 OAuth/RBAC

OAuth2/OIDC authentication for multi-tenant SaaS
API key authentication for programmatic access
Role-based access control with agent-level permissions
Complete audit trails for compliance (SOC2, HIPAA)

👤 Human-in-the-Loop

Approval patterns for high-value transactions
Dashboard integration for pending approvals
Slack/email notifications with audit trails
Training mode with review-before-automation

🔀 Fan-Out/Fan-In Patterns

Dynamic work distribution based on runtime data
Result collection and aggregation
Map-reduce over LLM operations
Sharding for horizontal scale

⏰ Time-Based Scheduling

Cron-like triggers for periodic workflows
Sliding window patterns for real-time analytics
Hybrid event+time based triggers
SLA monitoring and data freshness checks

Release Criteria for v1.0

v1.0 will ship when all of these are complete:

✅ Production persistence (Redis + Postgres backends stable)
✅ Advanced error handling (retry, circuit breakers, DLQ working)
✅ Aggregation patterns (map-reduce, voting, consensus implemented)
✅ Kafka event backbone (replay and time-travel debugging)
✅ Kubernetes native (Helm chart with auto-scaling)
✅ Authentication (OAuth/OIDC + API key auth)
✅ Human-in-the-loop (approval patterns implemented)
✅ Fan-out/fan-in (distributed processing patterns)
✅ Time-based scheduling (cron + sliding windows)
✅ 85%+ test coverage (1000+ tests passing)
✅ Production validation (deployed at 3+ companies)

Target Date: Q4 2025

Example: Multi-Modal Clinical Decision Support

import os
from flock import Flock, flock_type
from flock.visibility import PrivateVisibility, TenantVisibility, LabelledVisibility
from flock.identity import AgentIdentity
from pydantic import BaseModel

@flock_type
class PatientScan(BaseModel):
    patient_id: str
    scan_type: str
    image_data: bytes

@flock_type
class XRayAnalysis(BaseModel):
    findings: list[str]
    confidence: float

@flock_type
class LabResults(BaseModel):
    markers: dict[str, float]

@flock_type
class Diagnosis(BaseModel):
    condition: str
    reasoning: str
    confidence: float

# Create HIPAA-compliant blackboard
flock = Flock(os.getenv("DEFAULT_MODEL", "openai/gpt-4.1"))

# Radiologist with privacy controls
radiologist = (
    flock.agent("radiologist")
    .consumes(PatientScan)
    .publishes(
        XRayAnalysis,
        visibility=PrivateVisibility(agents={"diagnostician"})  # HIPAA!
    )
)

# Lab tech with multi-tenancy
lab_tech = (
    flock.agent("lab_tech")
    .consumes(PatientScan)
    .publishes(
        LabResults,
        visibility=TenantVisibility(tenant_id="patient_123")  # Isolation!
    )
)

# Diagnostician with explicit access
diagnostician = (
    flock.agent("diagnostician")
    .identity(AgentIdentity(name="diagnostician", labels={"role:physician"}))
    .consumes(XRayAnalysis, LabResults)  # Waits for BOTH
    .publishes(
        Diagnosis,
        visibility=LabelledVisibility(required_labels={"role:physician"})
    )
)

# Run with tracing
async with flock.traced_run("patient_123_diagnosis"):
    await flock.publish(PatientScan(patient_id="123", ...))
    await flock.run_until_idle()

    # Get diagnosis (type-safe retrieval)
    diagnoses = await flock.store.get_by_type(Diagnosis)
    # Returns list[Diagnosis] directly - no .data access, no casting

What this demonstrates:

Multi-modal data fusion (images + labs + history)
Built-in access controls (HIPAA compliance)
Parallel agent execution (radiology + labs run concurrently)
Automatic dependency resolution (diagnostician waits for both)
Full audit trail (traced_run + DuckDB storage)
Type-safe data retrieval (no Artifact wrappers)

Production Use Cases

Flock's architecture shines in production scenarios requiring parallel execution, security, and observability. Here are common patterns:

Financial Services: Multi-Signal Trading

The Challenge: Analyze multiple market signals in parallel, correlate them within time windows, maintain SEC-compliant audit trails.

The Solution: 20+ signal analyzers run concurrently, join operations correlate signals, DuckDB provides complete audit trails.

# Parallel signal analyzers
volatility = flock.agent("volatility").consumes(MarketData).publishes(VolatilityAlert)
sentiment = flock.agent("sentiment").consumes(NewsArticle).publishes(SentimentAlert)

# Trade execution waits for CORRELATED signals (within 5min window)
trader = flock.agent("trader").consumes(
    VolatilityAlert, SentimentAlert,
    join=JoinSpec(within=timedelta(minutes=5))
).publishes(TradeOrder)

Healthcare: HIPAA-Compliant Diagnostics

The Challenge: Multi-modal data fusion with strict access controls, complete audit trails, zero-trust security.

The Solution: Built-in visibility controls for HIPAA compliance, automatic parallel execution, full data lineage tracking.

# Privacy controls built-in
radiology.publishes(XRayAnalysis, visibility=PrivateVisibility(agents={"diagnostician"}))
lab.publishes(LabResults, visibility=TenantVisibility(tenant_id="patient_123"))

# Diagnostician waits for BOTH inputs with role-based access
diagnostician = flock.agent("diagnostician").consumes(XRayAnalysis, LabResults).publishes(Diagnosis)

E-Commerce: 50-Agent Personalization

The Challenge: Analyze dozens of independent signals, support dynamic signal addition, process millions of events daily.

The Solution: O(n) scaling to 50+ analyzers, batch processing for efficiency, zero graph rewiring when adding signals.

# 50+ signal analyzers (all run in parallel!)
for signal in ["browsing", "purchase", "cart", "reviews", "email", "social"]:
    flock.agent(f"{signal}_analyzer").consumes(UserEvent).publishes(Signal)

# Recommender batches signals for efficient LLM calls
recommender = flock.agent("recommender").consumes(Signal, batch=BatchSpec(size=50))

Multi-Tenant SaaS: Content Moderation

The Challenge: Complete data isolation between tenants, multi-agent consensus, full audit trails.

The Solution: Tenant visibility ensures zero cross-tenant leakage, parallel checks provide diverse signals, traces show complete reasoning.

See USECASES.md for complete code examples and production metrics.

Getting Started

# Install
pip install flock-core

# Set API key
export OPENAI_API_KEY="sk-..."

# Try the workshop
git clone https://github.com/whiteducksoftware/flock-flow.git
cd flock-flow
uv run python examples/05-claudes-workshop/lesson_01_code_detective.py

Learn by doing:

📚 7-Lesson Workshop - Progressive lessons from basics to advanced
🆚 The Blackboard - See data-driven orchestration without graphs
🎯 Declarative Basics - Understanding declarative programming
📖 Documentation - Complete development guide

Contributing

We're building Flock in the open. See CONTRIBUTING.md and AGENTS.md for development setup.

We welcome:

Bug reports and feature requests
Documentation improvements
Example contributions
Architecture discussions

Quality standards:

All tests must pass
Coverage requirements met
Code formatted with Ruff

Why "0.5"?

We're calling this 0.5 to signal:

Core is production-ready - real-world client deployments, comprehensive features
Ecosystem is evolving - Documentation growing, community building, features maturing
Architecture is proven - Blackboard pattern is 50+ years old, declarative contracts are sound
Enterprise features are coming - Persistence, auth, Kubernetes deployment in roadmap

1.0 will arrive when we've delivered persistence, advanced error handling, and enterprise deployment patterns (targeting Q4 2025).

The Bottom Line

Flock is different because it makes different architectural choices:

Instead of:

❌ Prompt engineering → ✅ Declarative type contracts
❌ Workflow graphs → ✅ Blackboard subscriptions
❌ Manual parallelization → ✅ Automatic concurrent execution
❌ Bolt-on security → ✅ Zero-trust visibility controls
❌ Hope-based debugging → ✅ AI-queryable distributed traces

These aren't marketing slogans. They're architectural decisions with real tradeoffs.

You trade:

Ecosystem maturity (established frameworks have larger communities)
Extensive documentation (we're catching up)
Battle-tested age (newer architecture means less production history)

You gain:

Better scalability (O(n) vs O(n²) complexity)
Type safety (runtime validation vs hope)
Cleaner architecture (loose coupling vs tight graphs)
Production safety (circuit breakers, feedback prevention built-in)
Security model (5 visibility types vs DIY)

Different frameworks for different priorities. Choose based on what matters to your team.

Built with ❤️ by white duck GmbH

"Declarative contracts eliminate prompt hell. Blackboard architecture eliminates graph spaghetti. Proven patterns applied to modern LLMs."

⭐ Star on GitHub | 📖 Read the Docs | 🚀 Try Examples | 💼 Enterprise Support

Last Updated: October 8, 2025 Version: Flock 0.5.0 (Blackboard Edition) Status: Production-Ready Core, Enterprise Features Roadmapped

Project details

Release history Release notifications | RSS feed

0.5.500

May 19, 2026

0.5.400

Feb 23, 2026

0.5.318

Dec 4, 2025

0.5.312

Dec 4, 2025

0.5.311

Dec 1, 2025

0.5.310

Nov 28, 2025

0.5.39

Nov 25, 2025

0.5.38

Nov 17, 2025

0.5.37

Nov 15, 2025

0.5.36

Nov 15, 2025

0.5.35

Nov 15, 2025

0.5.34

Nov 14, 2025

0.5.33

Nov 14, 2025

0.5.31

Nov 8, 2025

0.5.30

Nov 7, 2025

0.5.25

Oct 24, 2025

0.5.24

Oct 19, 2025

0.5.23

Oct 19, 2025

0.5.22

Oct 19, 2025

0.5.21

Oct 19, 2025

0.5.20

Oct 19, 2025

0.5.11

Oct 18, 2025

0.5.10

Oct 17, 2025

0.5.9

Oct 16, 2025

0.5.8

Oct 16, 2025

0.5.7

Oct 14, 2025

0.5.6

Oct 14, 2025

0.5.5

Oct 14, 2025

0.5.4

Oct 14, 2025

0.5.3

Oct 13, 2025

0.5.2

Oct 12, 2025

0.5.1

Oct 12, 2025

0.5.0

Oct 12, 2025

0.5.0b75 pre-release

Oct 12, 2025

0.5.0b71 pre-release

Oct 10, 2025

0.5.0b70 pre-release

Oct 10, 2025

0.5.0b65 pre-release

Oct 9, 2025

0.5.0b63 pre-release

Oct 8, 2025

0.5.0b62 pre-release

Oct 8, 2025

0.5.0b61 pre-release

Oct 8, 2025

0.5.0b60 pre-release

Oct 8, 2025

0.5.0b59 pre-release

Oct 8, 2025

0.5.0b58 pre-release

Oct 8, 2025

This version

0.5.0b57 pre-release

Oct 8, 2025

0.5.0b56 pre-release

Oct 8, 2025

0.5.0b55 pre-release

Oct 8, 2025

0.5.0b54 pre-release

Oct 7, 2025

0.5.0b53 pre-release

Oct 7, 2025

0.5.0b52 pre-release

Oct 6, 2025

0.5.0b51 pre-release

Oct 6, 2025

0.5.0b50 pre-release

Oct 6, 2025

0.5.0b28 pre-release

Oct 3, 2025

0.5.0b27 pre-release

Sep 30, 2025

0.5.0b26 pre-release

Sep 29, 2025

0.5.0b25 pre-release

Sep 29, 2025

0.5.0b24 pre-release

Sep 29, 2025

0.5.0b23 pre-release

Sep 29, 2025

0.5.0b22 pre-release

Sep 26, 2025

0.5.0b21 pre-release

Sep 26, 2025

0.5.0b19 pre-release

Sep 25, 2025

0.5.0b18 pre-release

Sep 25, 2025

0.5.0b17 pre-release

Sep 25, 2025

0.5.0b16 pre-release

Sep 1, 2025

0.5.0b15 pre-release

Aug 30, 2025

0.5.0b14 pre-release

Aug 8, 2025

0.5.0b13 pre-release

Aug 8, 2025

0.5.0b12 pre-release

Aug 7, 2025

0.5.0b11 pre-release

Aug 6, 2025

0.5.0b10 pre-release

Jul 16, 2025

0.5.0b9 pre-release

Jul 16, 2025

0.5.0b8 pre-release

Jun 2, 2025

0.5.0b7 pre-release

Jun 1, 2025

0.5.0b6 pre-release

May 31, 2025

0.5.0b5 pre-release

May 31, 2025

0.5.0b3 pre-release

May 30, 2025

0.5.0b2 pre-release

May 30, 2025

0.5.0b1 pre-release

May 30, 2025

0.5.0b0 pre-release

Aug 5, 2025

0.4.543

Sep 23, 2025

0.4.542

Sep 23, 2025

0.4.541

Sep 23, 2025

0.4.540

Sep 23, 2025

0.4.539

Sep 23, 2025

0.4.538

Sep 23, 2025

0.4.537

Sep 23, 2025

0.4.536

Sep 17, 2025

0.4.535

Sep 17, 2025

0.4.534

Sep 16, 2025

0.4.533

Sep 16, 2025

0.4.532

Sep 16, 2025

0.4.531

Aug 31, 2025

0.4.529

Aug 6, 2025

0.4.528

Aug 1, 2025

0.4.527

Aug 1, 2025

0.4.526

Jul 17, 2025

0.4.525

Jul 3, 2025

0.4.524

Jun 1, 2025

0.4.523

May 31, 2025

0.4.522

May 31, 2025

0.4.521

May 31, 2025

0.4.520

May 30, 2025

0.4.519

May 27, 2025

0.4.518

May 27, 2025

0.4.517

May 27, 2025

0.4.516

May 26, 2025

0.4.515

May 26, 2025

0.4.514

May 26, 2025

0.4.513

May 26, 2025

0.4.512

May 26, 2025

0.4.511

May 24, 2025

0.4.510

May 24, 2025

0.4.509

May 23, 2025

0.4.508

May 23, 2025

0.4.506

May 22, 2025

0.4.505

May 22, 2025

0.4.504

May 22, 2025

0.4.503

May 22, 2025

0.4.5

May 21, 2025

0.4.3

May 21, 2025

0.4.2

May 21, 2025

0.4.1

May 21, 2025

0.4.0b50 pre-release

May 20, 2025

0.4.0b49 pre-release

May 20, 2025

0.4.0b48 pre-release

May 19, 2025

0.4.0b46 pre-release

May 15, 2025

0.4.0b45 pre-release

May 15, 2025

0.4.0b44 pre-release

May 13, 2025

0.4.0b43 pre-release

May 12, 2025

0.4.0b42 pre-release

May 12, 2025

0.4.0b40 pre-release

May 12, 2025

0.4.0b39 pre-release

May 9, 2025

0.4.0b38 pre-release

May 8, 2025

0.4.0b37 pre-release

May 8, 2025

0.4.0b36 pre-release

May 8, 2025

0.4.0b35 pre-release

May 8, 2025

0.4.0b34 pre-release

May 7, 2025

0.4.0b33 pre-release

May 6, 2025

0.4.0b32 pre-release

May 6, 2025

0.4.0b31 pre-release

May 6, 2025

0.4.0b30 pre-release

May 6, 2025

0.4.0b29 pre-release

May 6, 2025

0.4.0b28 pre-release

May 6, 2025

0.4.0b27 pre-release

Apr 19, 2025

0.4.0b26 pre-release

Apr 19, 2025

0.4.0b25 pre-release

Apr 18, 2025

0.4.0b24 pre-release

Apr 17, 2025

0.4.0b23 pre-release

Apr 16, 2025

0.4.0b22 pre-release

Apr 14, 2025

0.4.0b21 pre-release

Apr 14, 2025

0.4.0b20 pre-release

Apr 14, 2025

0.4.0b19 pre-release

Apr 13, 2025

0.4.0b18 pre-release

Apr 11, 2025

0.4.0b17 pre-release

Apr 11, 2025

0.4.0b16 pre-release

Apr 11, 2025

0.4.0b15 pre-release

Apr 10, 2025

0.4.0b14 pre-release

Apr 10, 2025

0.4.0b13 pre-release

Apr 10, 2025

0.4.0b12 pre-release

Apr 10, 2025

0.4.0b11 pre-release

Apr 10, 2025

0.4.0b10 pre-release

Apr 10, 2025

0.4.0b9 pre-release

Apr 10, 2025

0.4.0b8 pre-release

Apr 10, 2025

0.4.0b7 pre-release

Apr 10, 2025

0.4.0b6 pre-release

Apr 8, 2025

0.4.0b5 pre-release

Apr 7, 2025

0.4.0b4 pre-release

Apr 6, 2025

0.4.0b3 pre-release

Apr 6, 2025

0.4.0b2 pre-release

Apr 6, 2025

0.4.0b1 pre-release

Apr 6, 2025

0.3.41

Apr 6, 2025

0.3.40

Apr 4, 2025

0.3.39

Apr 4, 2025

0.3.38

Apr 4, 2025

0.3.37

Apr 3, 2025

0.3.36

Apr 3, 2025

0.3.35

Apr 3, 2025

0.3.34

Apr 3, 2025

0.3.33

Apr 3, 2025

0.3.32

Apr 3, 2025

0.3.31

Apr 2, 2025

0.3.30

Apr 2, 2025

0.3.23

Mar 29, 2025

0.3.22

Mar 29, 2025

0.3.21

Mar 29, 2025

0.3.20

Mar 27, 2025

0.3.18

Mar 10, 2025

0.3.17

Mar 3, 2025

0.3.16

Mar 3, 2025

0.3.15

Feb 28, 2025

0.3.14

Feb 28, 2025

0.3.13

Feb 28, 2025

0.3.11

Feb 26, 2025

0.3.10

Feb 26, 2025

0.3.8

Feb 26, 2025

0.3.6

Feb 24, 2025

0.3.5

Feb 24, 2025

0.3.4

Feb 24, 2025

0.3.3

Feb 24, 2025

0.3.2

Feb 24, 2025

0.3.1

Feb 24, 2025

0.2.18

Feb 19, 2025

0.2.17

Feb 19, 2025

0.2.16

Feb 17, 2025

0.2.15

Feb 17, 2025

0.2.14

Feb 16, 2025

0.2.13

Feb 14, 2025

0.2.12

Feb 13, 2025

0.2.11

Feb 13, 2025

0.2.10

Feb 13, 2025

0.2.9

Feb 13, 2025

0.2.8

Feb 13, 2025

0.2.7

Feb 13, 2025

0.2.6

Feb 12, 2025

0.2.5

Feb 12, 2025

0.2.4

Feb 11, 2025

0.2.3

Feb 11, 2025

0.2.2

Feb 11, 2025

0.2.1

Feb 10, 2025

0.1.2

Feb 6, 2025

0.1.1

Feb 6, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flock_core-0.5.0b57.tar.gz (2.8 MB view details)

Uploaded Oct 8, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

flock_core-0.5.0b57-py3-none-any.whl (644.1 kB view details)

Uploaded Oct 8, 2025 Python 3

File details

Details for the file flock_core-0.5.0b57.tar.gz.

File metadata

Download URL: flock_core-0.5.0b57.tar.gz
Upload date: Oct 8, 2025
Size: 2.8 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.0

File hashes

Hashes for flock_core-0.5.0b57.tar.gz
Algorithm	Hash digest
SHA256	`c40df2619fb1aac57d438283bf39dde063a4f7b670be796b2dacf0933fa9e335`
MD5	`4842a26dabb451c220423d8e7a3cc09a`
BLAKE2b-256	`1d742afa9aff66cedb50788c7f551102c05d008cc29babca19523087bc91d6a1`

See more details on using hashes here.

File details

Details for the file flock_core-0.5.0b57-py3-none-any.whl.

File metadata

Download URL: flock_core-0.5.0b57-py3-none-any.whl
Upload date: Oct 8, 2025
Size: 644.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.0

File hashes

Hashes for flock_core-0.5.0b57-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c6fc8510e5a8ec57e9eac86f2d675ab506e35d272081c56ca5ccb91534f6553e`
MD5	`4591e3d1a700defa6467a54b65d1ecfc`
BLAKE2b-256	`edc50a1390c0aa61b047d0db84d2ab6aa0025acedb2a0cc7ab33a34a9af0f288`

See more details on using hashes here.

flock-core 0.5.0b57

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Flock 0.5: Declarative Blackboard Multi-Agent Orchestration

The Problem With Current Approaches

The Flock Approach

1. Declarative Type Contracts (Not Prompts)

2. Blackboard Architecture (Not Directed Graphs)

Quick Start (60 Seconds)

Core Concepts

Typed Artifacts (The Vocabulary)

Agent Subscriptions (The Rules)

Visibility Controls (The Security)

Batching Pattern: Parallel Execution Control

Production Safety Features

Production-Ready Observability

Real-Time Dashboard

Production-Grade Trace Viewer

OpenTelemetry + DuckDB Tracing

Framework Comparison

Architectural Differences

When Flock Wins

When Alternatives Win

Honest Trade-offs

Production Readiness

What Works Today (v0.5.0)

Recommended Use Cases Today

Roadmap to 1.0

Flock 1.0 - Q4 2025 Release

Release Criteria for v1.0

Example: Multi-Modal Clinical Decision Support

Production Use Cases

Financial Services: Multi-Signal Trading

Healthcare: HIPAA-Compliant Diagnostics

E-Commerce: 50-Agent Personalization

Multi-Tenant SaaS: Content Moderation

Getting Started

Contributing

Why "0.5"?

The Bottom Line

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes