Skip to main content

A lightweight framework for building LLM-powered agents and hierarchical state machines with pluggable backends.

Project description

FlatAgents

Define LLM agents in YAML. Run them anywhere.

For LLM/machine readers: see MACHINES.md for comprehensive reference.

Why?

  • Composition over inheritance — compose stateless agents and checkpointable machines
  • Compact structure — easy for LLMs to read and generate
  • Simple hook interfaces — escape hatches without complexity; webhook ready
  • Inspectable — every agent and machine is readable config
  • Language-agnostic — reduce code in any particular runtime
  • Common TypeScript interface — single schema for agents, single schema for machines
  • Limitations — hierarchical state machine networks can get complex at scale

Inspired by Kubernetes manifests and character card specifications.

Core Concepts

Use machines to write flatagents and flatmachines, they are designed for LLMs.

Term What it is
FlatAgent A single LLM call: model + prompts + output schema
FlatMachine A state machine that orchestrates multiple agents, actions, and state machines

Use FlatAgent alone for simple tasks. Use FlatMachine when you need multi-step workflows, branching, or error handling.

Examples

Example What it demonstrates
helloworld Minimal setup — single agent, single state machine
writer_critic Multi-agent loop — writer drafts, critic reviews, iterates
story_writer Multi-step creative workflow with chapter generation
human_in_loop Pause execution for human approval via hooks
error_handling Error recovery and retry patterns at state machine level
dynamic_agent On-the-fly agent generation from runtime context
character_card Loading agent config from character card format
mdap MDAP voting execution — multi-sample consensus
gepa_self_optimizer Self-optimizing prompts via reflection and critique
research_paper_analysis Document analysis with structured extraction
multi_paper_synthesizer Cross-document synthesis with dynamic child machines
support_triage_json JSON input/output with classification pipeline

Quick Start

pip install flatagents[all]
from flatagents import FlatAgent

agent = FlatAgent(config_file="reviewer.yml")
result = await agent.call(query="Review this code...")
print(result.output)

Example Agent

reviewer.yml

spec: flatagent
spec_version: "0.0.0"

data:
  name: code-reviewer

  model:
    provider: openai
    name: gpt-4
    temperature: 0.3

  system: |
    You are a senior code reviewer. Analyze code for bugs, 
    style issues, and potential improvements.

  user: |
    Review this code:
    {{ input.code }}

  output:
    issues:
      type: list
      items:
        type: str
      description: "List of issues found"
    rating:
      type: str
      enum: ["good", "needs_work", "critical"]
      description: "Overall code quality"

What the fields mean:

  • spec/spec_version — Format identifier and version
  • data.name — Agent identifier
  • data.model — LLM provider, model name, and parameters
  • data.system — System prompt (sets behavior)
  • data.user — User prompt template (uses Jinja2, {{ input.* }} for runtime values)
  • data.output — Structured output schema (the runtime extracts these fields)

Output Types

output:
  answer:      { type: str }
  count:       { type: int }
  score:       { type: float }
  valid:       { type: bool }
  raw:         { type: json }
  items:       { type: list, items: { type: str } }
  metadata:    { type: object, properties: { key: { type: str } } }

Use enum: [...] to constrain string values.

Multi-Agent Workflows

For orchestration, use FlatMachine (full docs in MACHINES.md):

from flatagents import FlatMachine

machine = FlatMachine(config_file="workflow.yml")
result = await machine.execute(input={"query": "..."})

FlatMachine provides: state transitions, conditional branching, loops, retry with backoff, and error recovery—all in YAML.

Features

  • Checkpoint and restore
  • Python SDK (TypeScript SDK in progress)
  • MACHINES.md — LLM-optimized reference docs
  • Decider agents and machines
  • On-the-fly agent and machine definitions
  • Webhook hooks for remote state machine handling
  • Metrics and logging
  • Error recovery and exception handling at the state machine level

Planned

  • Distributed execution — cross-network HSMs, inter-machine strategies
  • Parallel state execution
  • SQL persistence backend
  • TypeScript SDK
  • max_depth config to limit child machine nesting
  • Checkpoint pruning to prevent storage explosion
  • $root/ path prefix — resolve agent/machine refs from workspace root, not config dir
  • Input size validation — warn when prompt exceeds model context window
  • Serialization warnings — flag non-JSON-serializable context values before checkpoint

Specs

TypeScript definitions are the source of truth:

Python SDK

pip install flatagents[litellm]

LLM Backends

from flatagents import LiteLLMBackend, AISuiteBackend

# LiteLLM (default)
agent = FlatAgent(config_file="agent.yml")

# AISuite
backend = AISuiteBackend(model="openai:gpt-4o")
agent = FlatAgent(config_file="agent.yml", backend=backend)

Hooks

Extend machine behavior with Python hooks:

from flatagents import FlatMachine, MachineHooks

class CustomHooks(MachineHooks):
    def on_state_enter(self, state: str, context: dict) -> dict:
        context["entered_at"] = time.time()
        return context

    def on_action(self, action: str, context: dict) -> dict:
        if action == "fetch_data":
            context["data"] = fetch_from_api()
        return context

machine = FlatMachine(config_file="machine.yml", hooks=CustomHooks())

Available hooks: on_machine_start, on_machine_end, on_state_enter, on_state_exit, on_transition, on_error, on_action

Execution Types

execution:
  type: retry              # retry | parallel | mdap_voting
  backoffs: [2, 8, 16, 35] # Seconds between retries
  jitter: 0.1              # ±10% random variation
Type Use Case
default Single call
retry Rate limit handling with backoff
parallel Multiple samples (n_samples)
mdap_voting Consensus voting (k_margin, max_candidates)

Schema Validation

from flatagents import validate_flatagent_config, validate_flatmachine_config

warnings = validate_flatagent_config(config)
warnings = validate_flatmachine_config(config)

Logging & Metrics

from flatagents import setup_logging, get_logger

setup_logging(level="INFO")  # Respects FLATAGENTS_LOG_LEVEL env var
logger = get_logger(__name__)

Env vars: FLATAGENTS_LOG_LEVEL (DEBUG/INFO/WARNING/ERROR), FLATAGENTS_LOG_FORMAT (standard/json/simple)

For OpenTelemetry metrics:

pip install flatagents[metrics]
export FLATAGENTS_METRICS_ENABLED=true

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flatagents-0.3.4.tar.gz (7.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

flatagents-0.3.4-py3-none-any.whl (54.5 kB view details)

Uploaded Python 3

File details

Details for the file flatagents-0.3.4.tar.gz.

File metadata

  • Download URL: flatagents-0.3.4.tar.gz
  • Upload date:
  • Size: 7.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for flatagents-0.3.4.tar.gz
Algorithm Hash digest
SHA256 c639c265901c74b785a62f0348394a6266f856dd1781fa5f052eebc72faa8022
MD5 9f73a2e04eb9dab7b60d291e4abb5730
BLAKE2b-256 d5e53364bc38e764eb19206d48ae062153fd5a84c16f0eb2bb400a483bf0497d

See more details on using hashes here.

File details

Details for the file flatagents-0.3.4-py3-none-any.whl.

File metadata

  • Download URL: flatagents-0.3.4-py3-none-any.whl
  • Upload date:
  • Size: 54.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for flatagents-0.3.4-py3-none-any.whl
Algorithm Hash digest
SHA256 57a401908d24799cce30677a7d3c1bd6b88805c658c93cf284e4719db0ecfc04
MD5 5fdf43dfc6bd393ade7e3eb94e654137
BLAKE2b-256 861db120fa236713854635135fbe55a3dd361a52d602f43d6597774831d8bb9c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page