Skip to main content

A lightweight framework for building LLM-powered agents and composable state machines with pluggable backends.

Project description

FlatAgents

Define LLM agents in YAML. Run them anywhere.

For LLM/machine readers: see MACHINES.md for comprehensive reference.

Why?

  • Composition over inheritance — compose stateless agents and checkpointable machines
  • Compact structure — easy for LLMs to read and generate
  • Simple hook interfaces — escape hatches without complexity; webhook ready
  • Inspectable — every agent and machine is readable config
  • Language-agnostic — reduce code in any particular runtime
  • Common TypeScript interface — single schema for agents, single schema for machines
  • Limitations — machine topologies can get complex at scale

Inspired by Kubernetes manifests and character card specifications.

Core Concepts

Use machines to write flatagents and flatmachines, they are designed for LLMs.

Term What it is
FlatAgent A single LLM call: model + prompts + output schema
FlatMachine A state machine that orchestrates multiple agents, actions, and state machines

Use FlatAgent alone for simple tasks. Use FlatMachine when you need multi-step workflows, branching, or error handling.

Examples

Example What it demonstrates
helloworld Minimal setup — single agent, single state machine
writer_critic Multi-agent loop — writer drafts, critic reviews, iterates
story_writer Multi-step creative workflow with chapter generation
human_in_loop Pause execution for human approval via hooks
error_handling Error recovery and retry patterns at state machine level
dynamic_agent On-the-fly agent generation from runtime context
character_card Loading agent config from character card format
mdap MDAP voting execution — multi-sample consensus
gepa_self_optimizer Self-optimizing prompts via reflection and critique
research_paper_analysis Document analysis with structured extraction
multi_paper_synthesizer Cross-document synthesis with dynamic machine launching
support_triage_json JSON input/output with classification pipeline
parallelism Parallel machines, dynamic foreach, fire-and-forget launches

Quick Start

pip install flatagents[all]
from flatagents import FlatAgent

agent = FlatAgent(config_file="reviewer.yml")
result = await agent.call(query="Review this code...")
print(result.output)

Example Agent

reviewer.yml

spec: flatagent
spec_version: "0.0.0"

data:
  name: code-reviewer

  model:
    provider: openai
    name: gpt-4
    temperature: 0.3

  system: |
    You are a senior code reviewer. Analyze code for bugs, 
    style issues, and potential improvements.

  user: |
    Review this code:
    {{ input.code }}

  output:
    issues:
      type: list
      items:
        type: str
      description: "List of issues found"
    rating:
      type: str
      enum: ["good", "needs_work", "critical"]
      description: "Overall code quality"

What the fields mean:

  • spec/spec_version — Format identifier and version
  • data.name — Agent identifier
  • data.model — LLM provider, model name, and parameters
  • data.system — System prompt (sets behavior)
  • data.user — User prompt template (uses Jinja2, {{ input.* }} for runtime values)
  • data.output — Structured output schema (the runtime extracts these fields)

Output Types

output:
  answer:      { type: str }
  count:       { type: int }
  score:       { type: float }
  valid:       { type: bool }
  raw:         { type: json }
  items:       { type: list, items: { type: str } }
  metadata:    { type: object, properties: { key: { type: str } } }

Use enum: [...] to constrain string values.

Multi-Agent Workflows

For orchestration, use FlatMachine (full docs in MACHINES.md):

from flatagents import FlatMachine

machine = FlatMachine(config_file="workflow.yml")
result = await machine.execute(input={"query": "..."})

FlatMachine provides: state transitions, conditional branching, loops, retry with backoff, and error recovery—all in YAML.

Features

  • Checkpoint and restore
  • Python SDK (TypeScript SDK in progress)
  • MACHINES.md — LLM-optimized reference docs
  • Decider agents and machines
  • On-the-fly agent and machine definitions
  • Webhook hooks for remote state machine handling
  • Metrics and logging
  • Error recovery and exception handling at the state machine level
  • Parallel machine execution (machine: [a, b, c])
  • Dynamic parallelism with foreach
  • Fire-and-forget launches for background tasks

Planned

  • Distributed execution — cross-network machine peering, inter-machine strategies
  • SQL persistence backend
  • TypeScript SDK
  • max_depth config to limit machine launch nesting
  • Checkpoint pruning to prevent storage explosion
  • $root/ path prefix — resolve agent/machine refs from workspace root, not config dir
  • Input size validation — warn when prompt exceeds model context window
  • Serialization warnings — flag non-JSON-serializable context values before checkpoint

Specs

TypeScript definitions are the source of truth:

Python SDK

pip install flatagents[litellm]

LLM Backends

from flatagents import LiteLLMBackend, AISuiteBackend

# LiteLLM (default)
agent = FlatAgent(config_file="agent.yml")

# AISuite
backend = AISuiteBackend(model="openai:gpt-4o")
agent = FlatAgent(config_file="agent.yml", backend=backend)

Hooks

Extend machine behavior with Python hooks:

from flatagents import FlatMachine, MachineHooks

class CustomHooks(MachineHooks):
    def on_state_enter(self, state: str, context: dict) -> dict:
        context["entered_at"] = time.time()
        return context

    def on_action(self, action: str, context: dict) -> dict:
        if action == "fetch_data":
            context["data"] = fetch_from_api()
        return context

machine = FlatMachine(config_file="machine.yml", hooks=CustomHooks())

Available hooks: on_machine_start, on_machine_end, on_state_enter, on_state_exit, on_transition, on_error, on_action

Execution Types

execution:
  type: retry              # retry | parallel | mdap_voting
  backoffs: [2, 8, 16, 35] # Seconds between retries
  jitter: 0.1              # ±10% random variation
Type Use Case
default Single call
retry Rate limit handling with backoff
parallel Multiple samples (n_samples)
mdap_voting Consensus voting (k_margin, max_candidates)

Schema Validation

from flatagents import validate_flatagent_config, validate_flatmachine_config

warnings = validate_flatagent_config(config)
warnings = validate_flatmachine_config(config)

Logging & Metrics

from flatagents import setup_logging, get_logger

setup_logging(level="INFO")  # Respects FLATAGENTS_LOG_LEVEL env var
logger = get_logger(__name__)

Env vars: FLATAGENTS_LOG_LEVEL (DEBUG/INFO/WARNING/ERROR), FLATAGENTS_LOG_FORMAT (standard/json/simple)

For OpenTelemetry metrics:

pip install flatagents[metrics]
export FLATAGENTS_METRICS_ENABLED=true

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flatagents-0.4.1.tar.gz (7.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

flatagents-0.4.1-py3-none-any.whl (65.8 kB view details)

Uploaded Python 3

File details

Details for the file flatagents-0.4.1.tar.gz.

File metadata

  • Download URL: flatagents-0.4.1.tar.gz
  • Upload date:
  • Size: 7.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for flatagents-0.4.1.tar.gz
Algorithm Hash digest
SHA256 6f4c3dd7c27cf766e99f7393440dd0e303dc031d56ca4f05a26b5516ff1caa8f
MD5 083bf6d22e860ddb434d359f115fbbae
BLAKE2b-256 3f63c58e2f928f07811c10fbb8bf9399452433131f38143158f8d9e6fe168cb7

See more details on using hashes here.

File details

Details for the file flatagents-0.4.1-py3-none-any.whl.

File metadata

  • Download URL: flatagents-0.4.1-py3-none-any.whl
  • Upload date:
  • Size: 65.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for flatagents-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ca449ba1f39e1d35be1292f5a1e7d89558465aa0424fb6348dbb3d06323c05ea
MD5 df12c7366eba7d0893ea6076cae14031
BLAKE2b-256 357496e17767fa8f88157745f6d2d01c80f556f759b874b112793c07e7c2098e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page