dspy-auto-gepa

Thin orchestration package around DSPy's GEPA optimizer

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

thememium

These details have not been verified by PyPI

Project description

AutoGEPA

Thin orchestration package around DSPy's GEPA optimizer.
Explore the Documentation »
Report Bug Request Feature

Table of Contents

About
Quick Start
Usage
API
Contributing
License

About

AutoGEPA automates the tedious parts of setting up a DSPy optimization pipeline: converting raw data into dspy.Examples, generating a metric file with an LLM, splitting datasets, running baselines, and training with GEPA.

Automatic field inference — Maps row columns to DSPy Signature fields automatically
LLM-generated metrics — Drafts evaluation metrics automatically, saving them as reproducible .py files for human review
End-to-end pipeline — Datasets → Baseline → GEPA optimization → Compare & promote
Zero-config defaults — Sensible defaults for all hyperparameters and LMs

Requires DSPy 3.1+.

(back to top)

Quick Start

Install

Install AutoGEPA with uv (recommended):

uv add dspy-auto-gepa

Or with pip:

pip install dspy-auto-gepa

Basic Usage

import dspy
from dspy_auto_gepa import AutoGEPA

# Configure models
lm = dspy.LM("openrouter/openai/gpt-oss-120b")
large_lm = dspy.LM("openrouter/moonshotai/kimi-k2.5")
dspy.configure(lm=lm)

class TicketSignature(dspy.Signature):
    """Classify support tickets."""
    message: str = dspy.InputField()
    urgency: str = dspy.OutputField()
    sentiment: str = dspy.OutputField()

program = dspy.ChainOfThought(TicketSignature)

rows = [
    {"message": "The server room AC is out and equipment is overheating.", "urgency": "high", "sentiment": "negative"},
    {"message": "Can someone clean conference room B next week?", "urgency": "low", "sentiment": "neutral"},
]

# Fields are automatically inferred from the module's signature
auto = AutoGEPA(
    name="TicketSignature-v1_0_0",
    rows=rows,
    module=program,
    metric_lm=large_lm,
    reflection_lm=large_lm,
)

results = auto.run(force=False)  # Set True to re-run even if a saved model exists

# Check if a cached model was loaded
if results.loaded_from:
    print(f"Loaded existing model from {results.loaded_from}")
else:
    print(f"Baseline score: {results.baseline:.4f}")
    print(f"Optimized score: {results.optimized:.4f}")
    print(f"Improvement: {results.improvement:.4f}")
    print(f"Saved optimized program to {results.saved_to}")

(back to top)

Usage

Automatic field inference (recommended)

When your row columns match your module's signature fields, you don't need to specify any field mappings. AutoGEPA infers input and output fields directly from the dspy.Signature attached to your module.

If your row columns don't match the module's signature fields, AutoGEPA will raise a clear error telling you which fields are missing and suggesting you use dict mappings:

ValueError: Row columns do not match module signature fields. Missing from rows: ['message', 'sentiment', 'urgency']. Extra in rows: ['msg_text', 'sent', 'urg']. Pass input_fields/output_fields to map row columns to signature fields, or ensure row columns match exactly.

With dict field mappings

When your row columns have different names than your module's signature fields:

# Row columns: msg_text, urg, sent
# Signature fields: message, urgency, sentiment

auto = AutoGEPA(
    rows=rows,
    module=program,
    name="TicketSignature",
    input_fields={"msg_text": "message"},    # row_col → sig_field
    output_fields={"urg": "urgency", "sent": "sentiment"},
    metric_lm=large_lm,
    reflection_lm=large_lm,
)

results = auto.run()

Advanced: step-by-step control

If you prefer fine-grained control over each stage, you can call the individual methods that run() orchestrates under the hood:

# Optional: generate the metric file first for human inspection
metric_file = auto.build_metric()
print(f"Metric written to {metric_file}")
# After reviewing, proceed:

ds = auto.datasets()

baseline = auto.run_baseline(datasets=ds)

optimized = auto.train(datasets=ds)

final = auto.run_baseline(module=optimized, datasets=ds)

# Or compare and promote
comparison = auto.compare(
    optimized_module=optimized,
    datasets=ds,
)
auto.promote(
    optimized_module=optimized,
    destination=auto._run_dir / "optimized_ticket_classifier.json",
)

(back to top)

API

Constructor

AutoGEPA(...) accepts all configuration fields directly:

Parameter	Type	Description
`rows`	`list[dict] \| DataFrame \| None`	Training data. Accepts `list[dict]`, pandas DataFrame, polars DataFrame/LazyFrame, or any object with `.to_dicts()` or `.to_pandas()`
`module`	`dspy.Module \| None`	The DSPy module to optimize
`name`	`str \| None`	Task name for artifact subdirectory
`input_fields`	`list[str] \| dict[str, str] \| None`	Input field names. List for exact match, dict for `{row_column: signature_field}` mapping. Inferred from signature if omitted
`output_fields`	`list[str] \| dict[str, str] \| None`	Output field names. Same format as `input_fields`. Inferred from signature if omitted
`metric`	`Path \| str \| None`	Path to a custom metric `.py` file (skips generation)
`split`	`tuple[float, ...]`	Train/val/test split ratios. Default `(0.7, 0.2, 0.1)`
`seed`	`int`	Random seed for reproducibility. Default `42`
`artifact_dir`	`Path \| str`	Root directory for artifacts. Default `".auto_gepa"`
`metric_lm`	`dspy.LM \| None`	LM used for metric generation. Defaults to `dspy.LM("openrouter/openai/gpt-oss-120b")`
`reflection_lm`	`dspy.LM \| None`	LM used for GEPA reflection. Defaults to `dspy.LM("openrouter/moonshotai/kimi-k2.5")`
`gepa_auto`	`Literal["light", "medium", "heavy"]`	GEPA optimization intensity. Default `"light"`
`num_threads`	`int`	Parallel threads for evaluation. Default `16`
`metric_generator_signature`	`Type[dspy.Signature] \| None`	Custom signature class for metric generation
`metric_generator_module`	`Type[dspy.Module] \| None`	Custom module class for metric generation

Methods

Method	Signature	Description
`build_metric`	`(rows, module, name, metric, out_path, force=False) → Path`	Generates the metric `.py` file explicitly. Skips if a custom `metric` path is provided. `out_path` overrides the default save location. Use `force=True` to overwrite an existing generated metric
`run`	`(rows, module, name, metric, force=False) → RunResult`	Orchestrates the full pipeline: datasets → baseline → train → compare → promote. If `force=False` and a saved model exists at `.auto_gepa/<name>/optimized_<name>.json`, loads it and skips training. Returns a `RunResult` with `baseline`, `optimized`, `improvement`, `saved_to` (or `loaded_from` if cached)
`datasets`	`(rows, module, name, metric, force=False) → Datasets`	Converts rows to `dspy.Example`s and splits into train/val/test. Uses constructor defaults if args omitted. `name` sets the artifact subdirectory. `force=True` overwrites an existing metric file
`run_baseline`	`(module=None, datasets) → float`	Evaluates the unoptimized module. Uses `module` from constructor if not overridden
`train`	`(module=None, datasets) → dspy.Module`	Runs GEPA optimization. Uses `module` from constructor if not overridden
`compare`	`(optimized_module, datasets, baseline_module=None) → dict`	Side-by-side score comparison. Uses constructor `module` as `baseline_module` if not overridden
`promote`	`(optimized_module, destination) → Path`	Saves the optimized program to the given destination
`load_metric`	`() → callable`	Lazily loads the generated metric function

Result Types

RunResult — returned by run():
- baseline: float — score before optimization
- optimized: float — score after optimization
- improvement: float — absolute difference
- saved_to: Path \| None — where the optimized program was saved
- loaded_from: Path \| None — if a cached model was loaded instead of training
Datasets — returned by datasets():
- train: list[dspy.Example]
- val: list[dspy.Example]
- test: list[dspy.Example]

Field Resolution Behaviour

AutoGEPA resolves fields in this order:

Both provided explicitly (list[str] | dict[str, str]) — uses exactly what you gave it. Lists mean exact column names. Dicts mean {row_column: signature_field} mapping.
Neither provided — infers both from the module's DSPy Signature. Raises a clear error if row columns don't match signature fields, listing what's missing and what's extra.
Only one provided — if the other can be inferred from module signature or remaining row keys, great. If not, raises an error.

(back to top)

Contributing

Quick workflow:

Fork and branch: git checkout -b feature/name
Make changes
Commit and push
Open a Pull Request

(back to top)

License

MIT (as declared in pyproject.toml).

_{Built by thememium}

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

thememium

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.8

May 30, 2026

0.1.7

May 21, 2026

0.1.6

May 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dspy_auto_gepa-0.1.8.tar.gz (14.1 kB view details)

Uploaded May 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

dspy_auto_gepa-0.1.8-py3-none-any.whl (16.5 kB view details)

Uploaded May 30, 2026 Python 3

File details

Details for the file dspy_auto_gepa-0.1.8.tar.gz.

File metadata

Download URL: dspy_auto_gepa-0.1.8.tar.gz
Upload date: May 30, 2026
Size: 14.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.17 {"installer":{"name":"uv","version":"0.11.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for dspy_auto_gepa-0.1.8.tar.gz
Algorithm	Hash digest
SHA256	`07529cfd11825d4dfa9fc274b939358e38d57fa131fa72c9a1c1908bc4cc80dc`
MD5	`2966ed636e07b191392eee57fbebd18d`
BLAKE2b-256	`e7c7731d6f8eb6b99690fdf587bced4ab94dcf6016f52dec50d8d6c14a46312e`

See more details on using hashes here.

File details

Details for the file dspy_auto_gepa-0.1.8-py3-none-any.whl.

File metadata

Download URL: dspy_auto_gepa-0.1.8-py3-none-any.whl
Upload date: May 30, 2026
Size: 16.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.17 {"installer":{"name":"uv","version":"0.11.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for dspy_auto_gepa-0.1.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a2b396bae099281af9cf64149282cb6f9f0e6cc2350a30d2f09e2f4dd6c9156d`
MD5	`a2968623411d2ec5c1ae8707df0c54c5`
BLAKE2b-256	`b440868dd1d46770bf801a0c232fbb2f18544286e9eee8da9c8aa16629e49faf`

See more details on using hashes here.

dspy-auto-gepa 0.1.8

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

AutoGEPA

About

Quick Start

Install

Basic Usage

Usage

Automatic field inference (recommended)

With dict field mappings

Advanced: step-by-step control

API

Constructor

Methods

Result Types

Field Resolution Behaviour

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes