A lightweight framework for building LLM-powered agents and composable state machines with pluggable backends.

These details have not been verified by PyPI

Project links

Project description

FlatAgents

Define LLM agents in YAML. Run them anywhere.

For LLM/machine readers: see MACHINES.md for comprehensive reference.

Why?

Composition over inheritance — compose stateless agents and checkpointable machines
Compact structure — easy for LLMs to read and generate
Simple hook interfaces — escape hatches without complexity; webhook ready
Inspectable — every agent and machine is readable config
Language-agnostic — reduce code in any particular runtime
Common TypeScript interface — single schema for agents, single schema for machines
Limitations — machine topologies can get complex at scale

Inspired by Kubernetes manifests and character card specifications.

Versioning

All specs (flatagent.d.ts, flatmachine.d.ts, profiles.d.ts) and SDKs (Python, JS) use lockstep versioning. A single version number applies across the entire repository.

Core Concepts

Use machines to write flatagents and flatmachines, they are designed for LLMs.

Term	What it is
FlatAgent	A single LLM call: model + prompts + output schema
FlatMachine	A state machine that orchestrates multiple agents, actions, and state machines

Use FlatAgent alone for simple tasks. Use FlatMachine when you need multi-step workflows, branching, or error handling.

Examples

Example	What it demonstrates
helloworld	Minimal setup — single agent, single state machine
writer_critic	Multi-agent loop — writer drafts, critic reviews, iterates
story_writer	Multi-step creative workflow with chapter generation
human-in-the-loop	Pause execution for human approval via hooks
error_handling	Error recovery and retry patterns at state machine level
dynamic_agent	On-the-fly agent generation from runtime context
character_card	Loading agent config from character card format
mdap	MDAP voting execution — multi-sample consensus
gepa_self_optimizer	Self-optimizing prompts via reflection and critique
research_paper_analysis	Document analysis with structured extraction
multi_paper_synthesizer	Cross-document synthesis with dynamic machine launching
support_triage_json	JSON input/output with classification pipeline
distributed_worker	Worker pool with registration + work backends, scaling, stale worker reaping
parallelism	Parallel machines, dynamic foreach, fire-and-forget launches

Quick Start

pip install flatagents[all]

from flatagents import FlatAgent

agent = FlatAgent(config_file="reviewer.yml")
result = await agent.call(query="Review this code...")
print(result.output)

Example Agent

reviewer.yml

spec: flatagent
spec_version: "0.10.0"

data:
  name: code-reviewer

  model: "smart-expensive"  # Reference profile from profiles.yml

  system: |
    You are a senior code reviewer. Analyze code for bugs,
    style issues, and potential improvements.

  user: |
    Review this code:
    {{ input.code }}

  output:
    issues:
      type: list
      items:
        type: str
      description: "List of issues found"
    rating:
      type: str
      enum: ["good", "needs_work", "critical"]
      description: "Overall code quality"

What the fields mean:

spec/spec_version — Format identifier and version
data.name — Agent identifier
data.model — Profile name, inline config, or profile with overrides
data.system — System prompt (sets behavior)
data.user — User prompt template (uses Jinja2, {{ input.* }} for runtime values)
data.output — Structured output schema (the runtime extracts these fields)

Model Profiles

Centralize model configurations in profiles.yml and reference them by name:

profiles.yml

spec: flatprofiles
spec_version: "0.10.0"

data:
  model_profiles:
    fast-cheap:
      provider: cerebras
      name: zai-glm-4.6
      temperature: 0.6
      max_tokens: 2048

    smart-expensive:
      provider: anthropic
      name: claude-3-opus-20240229
      temperature: 0.3
      max_tokens: 4096

  default: fast-cheap      # Fallback when agent has no model
  # override: smart-expensive  # Uncomment to force all agents

Agent usage:

# String shorthand — profile lookup
model: "fast-cheap"

# Profile with overrides
model:
  profile: "fast-cheap"
  temperature: 0.9

# Inline config (no profile)
model:
  provider: openai
  name: gpt-4
  temperature: 0.3

Resolution order (low → high): default profile → named profile → inline overrides → override profile

Output Types

output:
  answer:      { type: str }
  count:       { type: int }
  score:       { type: float }
  valid:       { type: bool }
  raw:         { type: json }
  items:       { type: list, items: { type: str } }
  metadata:    { type: object, properties: { key: { type: str } } }

Use enum: [...] to constrain string values.

Multi-Agent Workflows

For orchestration, use FlatMachine (full docs in MACHINES.md):

from flatagents import FlatMachine

machine = FlatMachine(config_file="workflow.yml")
result = await machine.execute(input={"query": "..."})

FlatMachine provides: state transitions, conditional branching, loops, retry with backoff, and error recovery—all in YAML.

Distributed Worker Pattern

FlatAgents includes DistributedWorkerHooks plus RegistrationBackend/WorkBackend implementations (SQLite in the reference SDK) to build worker pools.

Typical topology:

Checker: get_pool_state → calculate_spawn → spawn_workers (requires worker_config_path in context or override in hooks)
Worker: register_worker → claim_job → process → complete_job/fail_job → deregister_worker
Reaper: list_stale_workers → reap_stale_workers

See distributed_worker for a runnable demo.

Features

Checkpoint and restore
Python SDK (TypeScript SDK in progress)
MACHINES.md — LLM-optimized reference docs
Decider agents and machines
On-the-fly agent and machine definitions
Webhook hooks for remote state machine handling
Metrics and logging
Error recovery and exception handling at the state machine level
Parallel machine execution (machine: [a, b, c])
Dynamic parallelism with foreach
Fire-and-forget launches for background tasks
Distributed worker orchestration (registration/work backends, scaling hooks, stale worker reaping)

Planned

Distributed execution backends (Redis/Postgres) + cross-network peering strategies
SQL persistence backend
TypeScript SDK
max_depth config to limit machine launch nesting
Checkpoint pruning to prevent storage explosion
$root/ path prefix — resolve agent/machine refs from workspace root, not config dir
Input size validation — warn when prompt exceeds model context window
Serialization warnings — flag non-JSON-serializable context values before checkpoint

Specs

TypeScript definitions are the source of truth:

Python SDK

pip install flatagents[litellm]

LLM Backends

from flatagents import LiteLLMBackend, AISuiteBackend

# LiteLLM (default)
agent = FlatAgent(config_file="agent.yml")

# AISuite
backend = AISuiteBackend(model="openai:gpt-4o")
agent = FlatAgent(config_file="agent.yml", backend=backend)

Hooks

Extend machine behavior with Python hooks:

from flatagents import FlatMachine, MachineHooks

class CustomHooks(MachineHooks):
    def on_state_enter(self, state: str, context: dict) -> dict:
        context["entered_at"] = time.time()
        return context

    def on_action(self, action: str, context: dict) -> dict:
        if action == "fetch_data":
            context["data"] = fetch_from_api()
        return context

machine = FlatMachine(config_file="machine.yml", hooks=CustomHooks())

Available hooks: on_machine_start, on_machine_end, on_state_enter, on_state_exit, on_transition, on_error, on_action

Execution Types

execution:
  type: retry              # retry | parallel | mdap_voting
  backoffs: [2, 8, 16, 35] # Seconds between retries
  jitter: 0.1              # ±10% random variation

Type	Use Case
`default`	Single call
`retry`	Rate limit handling with backoff
`parallel`	Multiple samples (`n_samples`)
`mdap_voting`	Consensus voting (`k_margin`, `max_candidates`)

Schema Validation

from flatagents import validate_flatagent_config, validate_flatmachine_config

warnings = validate_flatagent_config(config)
warnings = validate_flatmachine_config(config)

Logging & Metrics

from flatagents import setup_logging, get_logger

setup_logging(level="INFO")  # Respects FLATAGENTS_LOG_LEVEL env var
logger = get_logger(__name__)

Env vars: FLATAGENTS_LOG_LEVEL (DEBUG/INFO/WARNING/ERROR), FLATAGENTS_LOG_FORMAT (standard/json/simple)

For OpenTelemetry metrics:

pip install flatagents[metrics]
export FLATAGENTS_METRICS_ENABLED=true

Metrics are enabled by default and print to stdout every 5s. Redirect to file or use OTLP for production:

# Metrics print to stdout by default
python your_script.py

# Save to file
python your_script.py >> metrics.log 2>&1

# Disable if needed
FLATAGENTS_METRICS_ENABLED=false python your_script.py

# Send to OTLP collector for production
OTEL_METRICS_EXPORTER=otlp \
OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4317 \
python your_script.py

Env vars for metrics:

Variable	Default	Purpose
`FLATAGENTS_METRICS_ENABLED`	`true`	Enable OpenTelemetry metrics
`OTEL_METRICS_EXPORTER`	`console`	`console` (stdout) or `otlp` (production)
`OTEL_EXPORTER_OTLP_ENDPOINT`	—	OTLP collector endpoint
`OTEL_METRIC_EXPORT_INTERVAL`	`5000` / `60000`	Export interval in ms (5s for console, 60s for otlp)
`OTEL_SERVICE_NAME`	`flatagents`	Service name in metrics

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.0.0

Apr 21, 2026

2.7.0

Apr 19, 2026

2.6.0

Apr 11, 2026

2.5.0

Mar 27, 2026

2.4.4

Mar 22, 2026

2.4.3

Mar 21, 2026

2.4.2

Mar 21, 2026

2.4.1

Mar 21, 2026

2.4.0

Mar 19, 2026

2.3.0

Mar 10, 2026

2.2.2

Mar 9, 2026

2.2.1

Mar 8, 2026

2.2.0

Mar 8, 2026

2.1.0

Mar 7, 2026

2.0.0

Mar 3, 2026

1.2.0

Mar 1, 2026

1.1.1

Feb 10, 2026

1.1.0

Feb 7, 2026

1.0.0

Feb 4, 2026

This version

0.10.0

Feb 3, 2026

0.9.0

Jan 31, 2026

0.8.3

Jan 28, 2026

0.8.2

Jan 27, 2026

0.8.1

Jan 22, 2026

0.8.0

Jan 21, 2026

0.7.7

Jan 19, 2026

0.7.6

Jan 19, 2026

0.7.5

Jan 19, 2026

0.7.4

Jan 19, 2026

0.7.3

Jan 18, 2026

0.7.2

Jan 18, 2026

0.7.1

Jan 18, 2026

0.7.0

Jan 18, 2026

0.5.1

Jan 18, 2026

0.5.0

Jan 17, 2026

0.4.1

Jan 17, 2026

0.4.0

Jan 15, 2026

0.3.4

Jan 7, 2026

0.3.3

Jan 7, 2026

0.3.2

Jan 6, 2026

0.3.1

Jan 6, 2026

0.3.0

Jan 6, 2026

0.2.2

Jan 4, 2026

0.2.1

Jan 4, 2026

0.2.0

Jan 4, 2026

0.1.10

Jan 2, 2026

0.1.9

Jan 2, 2026

0.1.8

Jan 2, 2026

0.1.7

Jan 1, 2026

0.1.6

Jan 1, 2026

0.1.5

Dec 30, 2025

0.1.3

Dec 30, 2025

0.1.2

Dec 29, 2025

0.1.1

Dec 29, 2025

0.1.0

Dec 29, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flatagents-0.10.0.tar.gz (98.9 kB view details)

Uploaded Feb 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

flatagents-0.10.0-py3-none-any.whl (99.2 kB view details)

Uploaded Feb 3, 2026 Python 3

File details

Details for the file flatagents-0.10.0.tar.gz.

File metadata

Download URL: flatagents-0.10.0.tar.gz
Upload date: Feb 3, 2026
Size: 98.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for flatagents-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`ca1618afe19294251413df9fa08806c836ce8a640e27975a3eafe088f1e8230d`
MD5	`d5247ebb4c9e30380efe0cae6bd68b7f`
BLAKE2b-256	`1ab3d8674b0f06f613d8fa6a41046f2ec2b22cbaeb7d069152d7e40e0def3cc3`

See more details on using hashes here.

File details

Details for the file flatagents-0.10.0-py3-none-any.whl.

File metadata

Download URL: flatagents-0.10.0-py3-none-any.whl
Upload date: Feb 3, 2026
Size: 99.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for flatagents-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`da8445b14fcc92a6a39db21e43d05fd5685e5e220d3b34ff00702183313db4d4`
MD5	`d21181ed27442160d8b22a0840adde36`
BLAKE2b-256	`a726660cc45262546d4690fda3311f92fb067ad4014b5425ba71914e439aa02b`

See more details on using hashes here.

flatagents 0.10.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

FlatAgents

Why?

Versioning

Core Concepts

Examples

Quick Start

Example Agent

Model Profiles

Output Types

Multi-Agent Workflows

Distributed Worker Pattern

Features

Planned

Specs

Python SDK

LLM Backends

Hooks

Execution Types

Schema Validation

Logging & Metrics

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes