Serverless Posttraining for Agents - Core AI functionality and tracing

These details have not been verified by PyPI

Project links

Project description

Synth-AI SDK

Coverage Tests Blacksmith CI

Synth-AI — Serverless Posttraining for Agents.
Docs: Get Started →

🚀 Install latest version (0.2.25.dev1)

pip install synth-ai
# or
uv add synth-ai

Import:

import synth_ai

CLI (with uvx):

uvx synth-ai setup
uvx synth-ai demo
uvx synth-ai deploy
uvx synth-ai run
uvx synth-ai baseline  # For coding agents: get baseline scores

Full quickstart: https://docs.usesynth.ai/sdk/get-started

When you run uvx synth-ai setup, the SDK opens your browser to the Synth dashboard for a one‑time pairing (handshake) with your signed‑in session. The SDK will automatically:

Fast and effective serverless posttraining for agents, via an API.
Easily scale GPU topologies, train multi-node, and integrate with existing agent software.

Highlights

Scale GPU topologies (A10Gs, H100s, multi-node available on request)
Thin FastAPI wrapper integration
Supports OSS models like Qwen3 (GPT-OSS GA soon)
Own your trained models

🧭 Examples & Cookbooks

The old examples/ directory now lives in the Synth Cookbooks repo.

Docs: https://docs.usesynth.ai/cookbooks
Code: https://github.com/synth-laboratories/cookbooks (dev/ holds the migrated examples for now, e.g., dev/polyglot/)
Commands that referenced examples/... should now use the matching paths under cookbooks/dev/

⚙️ Getting Started

Synth-AI ships with a built-in RL example: training Qwen3-0.6B on math reasoning.

Create accounts at Synth and Modal

Then run:

uvx synth-ai demo
uvx synth-ai setup
uvx synth-ai deploy
uvx synth-ai run

To walk through your first RL run, see
👉 Synth-AI SDK Docs

🤖 For Coding Agents: Get Started with Baselines

Baselines are the fastest way for coding agents to evaluate changes and measure improvement on Synth tasks.

Why Use Baselines?

Baselines provide a self-contained evaluation system that:

✅ No infrastructure required — runs locally, no deployed task app needed
✅ Quick feedback loop — get task-by-task results in seconds
✅ Compare changes — establish a baseline score before making modifications
✅ Auto-discoverable — finds baseline files automatically in your codebase

Quick Start for Coding Agents

# 1. List available baselines
uvx synth-ai baseline list

# 2. Run a quick 3-task baseline to get started
uvx synth-ai baseline banking77 --split train --seeds 0,1,2

# 3. Get your baseline score (full train split)
uvx synth-ai baseline banking77 --split train

# 4. Make your changes to the code...

# 5. Re-run to compare performance
uvx synth-ai baseline banking77 --split train --output results_after.json

Available Baselines

# Filter by task type
uvx synth-ai baseline list --tag rl          # RL tasks
uvx synth-ai baseline list --tag nlp         # NLP tasks
uvx synth-ai baseline list --tag vision      # Vision tasks

# Run specific baselines
uvx synth-ai baseline warming_up_to_rl       # Crafter survival game
uvx synth-ai baseline pokemon_vl             # Pokemon Red (vision)
uvx synth-ai baseline gepa                   # Banking77 classification

Baseline Results

Each baseline run provides:

Task-by-task results — see exactly which seeds succeed/fail
Aggregate metrics — success rate, mean/std rewards, total tasks
Serializable output — save to JSON with --output results.json
Model comparison — test different models with --model

Example output:

============================================================
Baseline Evaluation: Banking77 Intent Classification
============================================================
Split(s): train
Tasks: 10
Success: 8/10
Execution time: 12.34s

Aggregate Metrics:
  mean_outcome_reward: 0.8000
  success_rate: 0.8000
  total_tasks: 10

Creating Custom Baselines

Coding agents can create new baseline files to test custom tasks:

# my_task_baseline.py
from synth_ai.baseline import BaselineConfig, BaselineTaskRunner, DataSplit, TaskResult

class MyTaskRunner(BaselineTaskRunner):
    async def run_task(self, seed: int) -> TaskResult:
        # Your task logic here
        return TaskResult(...)

my_baseline = BaselineConfig(
    baseline_id="my_task",
    name="My Custom Task",
    description="Evaluate my custom task",
    task_runner=MyTaskRunner,
    splits={
        "train": DataSplit(name="train", seeds=list(range(10))),
    },
)

Place this file in your project (for example under cookbooks/dev/baseline/) or name it *_baseline.py for auto-discovery. Official baseline examples now live in the Synth Cookbooks repo.

🔐 SDK → Dashboard Pairing

When you run uvx synth-ai setup (or legacy uvx synth-ai rl_demo setup):

The SDK opens your browser to the Synth dashboard to pair your SDK with your signed-in session.
Automatically detects your user + organization
Ensures both API keys exist
Writes them to your project’s .env as:
```
SYNTH_API_KEY=
ENVIRONMENT_API_KEY=
```

✅ No keys printed or requested interactively — all handled via browser pairing.

Environment overrides

SYNTH_CANONICAL_ORIGIN → override dashboard base URL (default: https://www.usesynth.ai/dashboard)
SYNTH_CANONICAL_DEV → 1|true|on to use local dashboard (http://localhost:3000)

🌍 Language-Agnostic: Build Task Apps in Any Language

Synth works with any programming language. You don't need Python to build Task Apps or run prompt optimization. Implement the OpenAPI contract in your preferred language and start optimizing.

Supported Languages

We provide complete, tested examples in the Synth Cookbooks repo (cookbooks/dev/polyglot/):

Rust - Fast, type-safe implementation with Axum
Go - Zero dependencies, single static binary
TypeScript - Works with Node.js, Deno, Bun, and Cloudflare Workers
Zig - Minimal binaries, trivial cross-compilation

👉 See all examples: Synth Cookbooks (polyglot) — see dev/polyglot/

How It Works

Task Apps implement a simple HTTP contract:

GET /health - Health check
POST /rollout - Evaluate prompts and return rewards
GET /task_info - (Optional) Dataset metadata

The optimizer calls your endpoints with candidate prompts, and you return rewards. That's it—no Python required!

📖 Full guide: Polyglot Task Apps Documentation
📋 OpenAPI Contract: synth_ai/contracts/task_app.yaml
🔧 CLI Access: synth contracts show task-app or synth contracts path task-app

🎯 Prompt Optimization

Automatically optimize prompts for classification, reasoning, and instruction-following tasks using evolutionary algorithms. Synth supports two state-of-the-art algorithms: GEPA (Genetic Evolution of Prompt Architectures) and MIPRO (Meta-Instruction PROposer).

References:

GEPA: Agrawal et al. (2025). "GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning." arXiv:2507.19457
MIPRO: Opsahl-Ong et al. (2024). "Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs." arXiv:2406.11695

How It Works

Prompt optimization uses an interceptor pattern that ensures optimized prompts never reach task apps. All prompt modifications happen in the backend via an inference interceptor that substitutes prompts before they reach the LLM.

✅ CORRECT FLOW:
Backend → register_prompt → Interceptor → substitutes → LLM

❌ WRONG FLOW:
Backend → prompt_template in payload → Task App (NEVER DO THIS)

Algorithms

GEPA (Genetic Evolution of Prompt Architectures)

Population-based evolutionary search
LLM-guided mutations for intelligent prompt modifications
Pareto optimization balancing performance and prompt length
Best for: Broad exploration, diverse prompt variants, classification tasks
Results: Improves accuracy from 60-75% (baseline) to 85-90%+ over 15 generations

MIPRO (Meta-Instruction PROposer)

Meta-LLM (e.g., GPT-4o-mini) generates instruction variants
TPE (Tree-structured Parzen Estimator) guides Bayesian search
Bootstrap phase collects few-shot examples from high-scoring seeds
Best for: Efficient optimization, task-specific improvements, faster convergence
Results: Achieves similar accuracy gains with fewer evaluations (~96 rollouts vs ~1000 for GEPA)

Quick Start

Build a prompt evaluation task app

# Task app evaluates prompt performance (classification accuracy, QA correctness, etc.)

Create a prompt learning config

[prompt_learning]
algorithm = "gepa"  # or "mipro"
task_app_url = "https://my-task-app.modal.run"

[prompt_learning.initial_prompt]
messages = [
  { role = "system", content = "You are a banking assistant..." },
  { role = "user", pattern = "Customer Query: {query}..." }
]

[prompt_learning.gepa]
initial_population_size = 20
num_generations = 15

Launch optimization

uvx synth-ai train --type prompt_learning --config config.toml

Query results

from synth_ai.learning import get_prompt_text
best_prompt = get_prompt_text(job_id="pl_abc123", rank=1)

Full documentation: Prompt Learning Guide →

📚 Documentation

SDK Docs: https://docs.usesynth.ai/sdk/get-started
Prompt Learning: https://docs.usesynth.ai/prompt-learning/overview
CLI Reference: https://docs.usesynth.ai/cli
API Reference: https://docs.usesynth.ai/api
Changelog: https://docs.usesynth.ai/changelog

🧠 Meta

Package: synth-ai
Import: synth_ai
Source: github.com/synth-laboratories/synth-ai
License: MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.11.0

Apr 14, 2026

0.10.0

Apr 7, 2026

0.9.11

Mar 24, 2026

0.9.10

Mar 24, 2026

0.9.9

Mar 24, 2026

0.9.8

Mar 21, 2026

0.9.7

Mar 17, 2026

0.9.6

Mar 16, 2026

0.9.5

Mar 12, 2026

0.9.4

Mar 5, 2026

0.9.3

Mar 3, 2026

0.9.2

Mar 3, 2026

0.9.1

Feb 25, 2026

0.9.0

Feb 24, 2026

0.8.2

Feb 21, 2026

0.8.0

Feb 13, 2026

0.7.15

Feb 3, 2026

0.7.14

Feb 1, 2026

0.7.13

Feb 1, 2026

0.7.12

Feb 1, 2026

0.7.11

Feb 1, 2026

0.7.10

Feb 1, 2026

0.7.9

Feb 1, 2026

0.7.8

Jan 31, 2026

0.7.7

Jan 30, 2026

0.7.6

Jan 30, 2026

0.7.3

Jan 27, 2026

0.7.2

Jan 27, 2026

0.7.1

Jan 24, 2026

0.7.0

Jan 24, 2026

0.6.6

Jan 24, 2026

0.6.5

Jan 24, 2026

0.6.4

Jan 23, 2026

0.6.3

Jan 23, 2026

0.6.2

Jan 23, 2026

0.6.1

Jan 23, 2026

0.6.1.dev0 pre-release

Jan 21, 2026

0.6.0

Jan 21, 2026

0.5.4.dev20260120 pre-release

Jan 19, 2026

0.5.4.dev20260119 pre-release

Jan 19, 2026

0.5.4.dev20260117 pre-release

Jan 17, 2026

0.5.0

Jan 14, 2026

0.4.12

Jan 10, 2026

0.4.11

Jan 9, 2026

0.4.10

Jan 9, 2026

0.4.9

Jan 9, 2026

0.4.8

Jan 9, 2026

0.4.7

Jan 8, 2026

0.4.6

Jan 5, 2026

0.4.5

Jan 4, 2026

0.4.4

Dec 31, 2025

0.4.3

Dec 27, 2025

0.4.2.dev0 pre-release

Dec 20, 2025

0.4.1

Dec 19, 2025

0.4.0

Dec 18, 2025

0.3.9

Dec 18, 2025

0.3.8.dev0 pre-release

Dec 17, 2025

0.3.7

Dec 17, 2025

0.3.6

Dec 15, 2025

0.3.5

Dec 13, 2025

0.3.4

Dec 8, 2025

0.3.4.dev2 pre-release

Dec 11, 2025

0.3.4.dev1 pre-release

Dec 6, 2025

0.3.3

Dec 4, 2025

0.3.3.dev20251205 pre-release

Dec 6, 2025

0.3.2.dev3 pre-release

Dec 5, 2025

0.3.2.dev2 pre-release

Dec 4, 2025

0.3.1

Dec 2, 2025

0.3.1.dev5 pre-release

Nov 27, 2025

0.3.1.dev3 pre-release

Nov 27, 2025

This version

0.3.1.dev1 pre-release

Nov 25, 2025

0.3.0

Nov 25, 2025

0.2.27.dev3 pre-release

Nov 23, 2025

0.2.26.dev1 pre-release

Nov 20, 2025

0.2.25.dev1 pre-release

Nov 17, 2025

0.2.24

Nov 17, 2025

0.2.23

Nov 15, 2025

0.2.23.dev5 pre-release

Nov 14, 2025

0.2.23.dev4 pre-release

Nov 14, 2025

0.2.23.dev3 pre-release

Nov 11, 2025

0.2.23.dev2 pre-release

Nov 11, 2025

0.2.23.dev1 pre-release

Nov 11, 2025

0.2.22

Nov 7, 2025

0.2.21.dev1 pre-release

Nov 5, 2025

0.2.19

Nov 5, 2025

0.2.17

Oct 29, 2025

0.2.16

Oct 28, 2025

0.2.14

Oct 25, 2025

0.2.13.dev2 pre-release

Oct 22, 2025

0.2.13.dev1 pre-release

Oct 21, 2025

0.2.12

Oct 21, 2025

0.2.10

Oct 17, 2025

0.2.9.dev17 pre-release

Oct 21, 2025

0.2.9.dev16 pre-release

Oct 21, 2025

0.2.9.dev15 pre-release

Oct 20, 2025

0.2.9.dev14 pre-release

Oct 20, 2025

0.2.9.dev13 pre-release

Oct 20, 2025

0.2.9.dev12 pre-release

Oct 20, 2025

0.2.9.dev11 pre-release

Oct 20, 2025

0.2.9.dev10 pre-release

Oct 20, 2025

0.2.9.dev9 pre-release

Oct 20, 2025

0.2.9.dev8 pre-release

Oct 20, 2025

0.2.9.dev7 pre-release

Oct 8, 2025

0.2.9.dev6 pre-release

Oct 20, 2025

0.2.9.dev5 pre-release

Oct 6, 2025

0.2.9.dev4 pre-release

Oct 6, 2025

0.2.9.dev3 pre-release

Oct 6, 2025

0.2.9.dev2 pre-release

Oct 6, 2025

0.2.9.dev1 pre-release

Oct 6, 2025

0.2.9.dev0 pre-release

Oct 6, 2025

0.2.8

Oct 6, 2025

0.2.8.dev13 pre-release

Oct 6, 2025

0.2.8.dev12 pre-release

Oct 4, 2025

0.2.8.dev11 pre-release

Oct 4, 2025

0.2.8.dev10 pre-release

Oct 4, 2025

0.2.8.dev9 pre-release

Oct 4, 2025

0.2.8.dev8 pre-release

Oct 3, 2025

0.2.8.dev7 pre-release

Oct 3, 2025

0.2.8.dev6 pre-release

Oct 3, 2025

0.2.8.dev5 pre-release

Oct 3, 2025

0.2.8.dev4 pre-release

Oct 2, 2025

0.2.8.dev3 pre-release

Oct 2, 2025

0.2.8.dev2 pre-release

Oct 2, 2025

0.2.8.dev1 pre-release

Oct 2, 2025

0.2.7

Oct 2, 2025

0.2.6

Sep 24, 2025

0.2.6.dev6 pre-release

Oct 2, 2025

0.2.6.dev5 pre-release

Oct 2, 2025

0.2.6.dev4 pre-release

Oct 2, 2025

0.2.6.dev3 pre-release

Oct 1, 2025

0.2.6.dev2 pre-release

Oct 1, 2025

0.2.6.dev1 pre-release

Sep 26, 2025

0.2.5

Sep 24, 2025

0.2.4.dev9 pre-release

Sep 24, 2025

0.2.4.dev8 pre-release

Sep 17, 2025

0.2.4.dev7 pre-release

Aug 20, 2025

0.2.4.dev6 pre-release

Aug 18, 2025

0.2.4.dev5 pre-release

Aug 15, 2025

0.2.4.dev4 pre-release

Aug 15, 2025

0.2.4.dev3 pre-release

Aug 15, 2025

0.2.4.dev2 pre-release

Aug 10, 2025

0.2.3

Aug 9, 2025

0.2.2.dev0 pre-release

Jul 30, 2025

0.2.1.dev0 pre-release

Jul 21, 2025

0.2.0

Jul 19, 2025

0.1.9

Jul 19, 2025

0.1.8

Jul 15, 2025

0.1.7

Jul 14, 2025

0.1.6

Jul 7, 2025

0.1.5

Jul 7, 2025

0.1.4

Jul 7, 2025

0.1.3

Jul 7, 2025

0.1.2

Jul 7, 2025

0.1.1

Jun 13, 2025

0.1.0.dev53 pre-release

May 19, 2025

0.1.0.dev52 pre-release

May 18, 2025

0.1.0.dev51 pre-release

May 18, 2025

0.1.0.dev50 pre-release

May 18, 2025

0.1.0.dev49 pre-release

May 18, 2025

0.1.0.dev39 pre-release

Apr 17, 2025

0.1.0.dev38 pre-release

Apr 17, 2025

0.1.0.dev37 pre-release

Apr 17, 2025

0.1.0.dev36 pre-release

Apr 16, 2025

0.1.0.dev35 pre-release

Apr 16, 2025

0.1.0.dev34 pre-release

Apr 16, 2025

0.1.0.dev33 pre-release

Apr 16, 2025

0.1.0.dev32 pre-release

Apr 2, 2025

0.1.0.dev31 pre-release

Apr 1, 2025

0.1.0.dev30 pre-release

Mar 31, 2025

0.1.0.dev29 pre-release

Mar 30, 2025

0.1.0.dev28 pre-release

Mar 26, 2025

0.1.0.dev27 pre-release

Mar 26, 2025

0.1.0.dev26 pre-release

Mar 26, 2025

0.1.0.dev25 pre-release

Mar 26, 2025

0.1.0.dev24 pre-release

Mar 26, 2025

0.1.0.dev23 pre-release

Mar 26, 2025

0.1.0.dev22 pre-release

Mar 26, 2025

0.1.0.dev21 pre-release

Mar 26, 2025

0.1.0.dev20 pre-release

Mar 25, 2025

0.1.0.dev19 pre-release

Mar 25, 2025

0.1.0.dev18 pre-release

Mar 25, 2025

0.1.0.dev17 pre-release

Mar 25, 2025

0.1.0.dev16 pre-release

Mar 25, 2025

0.1.0.dev15 pre-release

Mar 8, 2025

0.1.0.dev14 pre-release

Feb 24, 2025

0.1.0.dev13 pre-release

Feb 23, 2025

0.1.0.dev12 pre-release

Feb 17, 2025

0.1.0.dev11 pre-release

Feb 16, 2025

0.1.0.dev10 pre-release

Feb 16, 2025

0.1.0.dev9 pre-release

Feb 16, 2025

0.1.0.dev8 pre-release

Feb 16, 2025

0.1.0.dev7 pre-release

Feb 8, 2025

0.1.0.dev6 pre-release

Feb 5, 2025

0.1.0.dev4 pre-release

Feb 5, 2025

0.0.0.dev4 pre-release

Feb 16, 2025

0.0.0.dev3 pre-release

Feb 5, 2025

0.0.0.dev2 pre-release

Feb 5, 2025

0.0.0.dev1 pre-release

Feb 4, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

synth_ai-0.3.1.dev1.tar.gz (620.2 kB view details)

Uploaded Nov 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

synth_ai-0.3.1.dev1-py3-none-any.whl (753.7 kB view details)

Uploaded Nov 25, 2025 Python 3

File details

Details for the file synth_ai-0.3.1.dev1.tar.gz.

File metadata

Download URL: synth_ai-0.3.1.dev1.tar.gz
Upload date: Nov 25, 2025
Size: 620.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.15

File hashes

Hashes for synth_ai-0.3.1.dev1.tar.gz
Algorithm	Hash digest
SHA256	`f8f3cc547f1dddb6131972e02003c05f1f91da019c071ecd0fd8518368acf6a1`
MD5	`84422447b4d898afc86c444e51233a93`
BLAKE2b-256	`8cfd56a44f564d37bdee0af120b66545f1c87f3e59e198b98fbbecf43426b1b5`

See more details on using hashes here.

File details

Details for the file synth_ai-0.3.1.dev1-py3-none-any.whl.

File metadata

Download URL: synth_ai-0.3.1.dev1-py3-none-any.whl
Upload date: Nov 25, 2025
Size: 753.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.15

File hashes

Hashes for synth_ai-0.3.1.dev1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`32ecf2715970f07c9262e9294c2552eb6bf95047ca60c4bef70282cbb6d19cba`
MD5	`a816c11c4af14b0e040e755f6aeb3157`
BLAKE2b-256	`7eb4da5b82baadd44eb63d2d44328f6f7bc9492101960f86acedee0227820e9e`

See more details on using hashes here.

synth-ai 0.3.1.dev1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Synth-AI SDK

🚀 Install latest version (0.2.25.dev1)

Highlights

🧭 Examples & Cookbooks

⚙️ Getting Started

🤖 For Coding Agents: Get Started with Baselines

Why Use Baselines?

Quick Start for Coding Agents

Available Baselines

Baseline Results

Creating Custom Baselines

🔐 SDK → Dashboard Pairing

Environment overrides

🌍 Language-Agnostic: Build Task Apps in Any Language

Supported Languages

How It Works

🎯 Prompt Optimization

How It Works

Algorithms

Quick Start

📚 Documentation

🧠 Meta

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes