AlphaEvolve with fuzzy evaluation. Evolve anything, not just code.

fuzzyevolve

Evolve text with LLM mutation + LLM judging, using TrueSkill for noisy multi-metric feedback and MAP-Elites for diversity.

Inspired by AlphaEvolve, but designed for “fuzzy” criteria such as prose quality, coherence, originality, humor, and how interesting a text is.

Quick start

export GOOGLE_API_KEY=... # default config uses google-gla:* models
uv sync

# Uses ./config.toml if present (or defaults)
uv run fuzzyevolve "This is my starting prompt."

Input can be a string, a file path, or stdin:

uv run fuzzyevolve seed.txt
cat seed.txt | uv run fuzzyevolve

Output goes to best_by_cell.md by default (override with --output). By default it includes the top 20 best-per-cell champions (override with --top-cells).

By default, each run is recorded under .fuzzyevolve/runs/<run_id>/ (checkpoints, events, and raw LLM prompts/outputs). Resume with:

uv run fuzzyevolve --resume .fuzzyevolve/runs/<run_id> --iterations 100

Browse runs in the TUI:

uv run fuzzyevolve tui
# or open a specific run/checkpoint:
uv run fuzzyevolve tui --run .fuzzyevolve/runs/<run_id>

Disable recording with --no-store.

Note: the repo’s config.toml uses semantic embeddings via sentence-transformers. Either install the extra or switch to hash/length descriptors:

uv sync --extra semantic

What it does

Critiques the selected parent once per iteration (structured: preserve / issues / rewrite routes).
Generates children via a set of LLM-backed mutation operators (e.g. “exploit” vs “explore” full rewrites).
Judges parent/children by ranking them per metric (tiered rankings, ties allowed).
Updates per-metric TrueSkill ratings (μ/σ) from those rankings (with uncertainty-aware scoring).
Keeps diversity with a MAP‑Elites archive (top‑k per descriptor cell), optionally with multiple islands + migration.
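
As a rough illustration of the "top-k per descriptor cell" idea, here is a minimal sketch of a MAP-Elites-style archive keyed by a length descriptor. The bin edges, the `Archive` class, and the `length_cell` helper are hypothetical names chosen for this example; they are not taken from the fuzzyevolve source.

```python
from collections import defaultdict

# Hypothetical bin edges for a length descriptor (in characters).
LENGTH_BINS = [0, 200, 500, 1000, 2000]


def length_cell(text: str) -> int:
    """Return the index of the last bin edge that len(text) reaches."""
    n = len(text)
    cell = 0
    for i, edge in enumerate(LENGTH_BINS):
        if n >= edge:
            cell = i
    return cell


class Archive:
    """Keeps the top-k scoring texts in each descriptor cell (assumed design)."""

    def __init__(self, k: int = 3) -> None:
        self.k = k
        self.cells = defaultdict(list)  # cell index -> list of (score, text)

    def add(self, text: str, score: float) -> bool:
        """Insert a text; return True if it survives the top-k cut."""
        cell = self.cells[length_cell(text)]
        cell.append((score, text))
        cell.sort(key=lambda item: item[0], reverse=True)
        del cell[self.k:]
        return (score, text) in cell

    def champions(self) -> dict:
        """Best text per occupied cell."""
        return {c: items[0][1] for c, items in self.cells.items() if items}


archive = Archive(k=2)
archive.add("short text", 1.0)
archive.add("x" * 300, 0.5)
archive.add("another short", 2.0)
archive.add("third short one", 0.1)  # dropped: its cell already holds two better texts
```

A weak text can still survive if it lands in an otherwise empty cell, which is what keeps the population diverse instead of collapsing onto one style.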

Mental model

A text is a “player” with a TrueSkill rating per metric (e.g. one rating for prose, one for coherence).
The judge doesn’t assign absolute scores; it ranks candidates relative to each other for each metric.
The archive is a grid of “niches” (cells) defined by a descriptor (length or a 2D embedding projection).
Each iteration is: pick a parent → critique → propose children → rank a battle → update ratings → insert children into niches.
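
To make the "ranks, not absolute scores" point concrete, the sketch below converts a tiered ranking into per-candidate ranks in which tied candidates share a rank. The shape of `judge_output` and the function name `tiers_to_ranks` are assumptions for illustration; the actual judge schema is not documented here.

```python
def tiers_to_ranks(tiers):
    """Map each candidate id to its tier index (0 = best); ties share a rank."""
    ranks = {}
    for rank, tier in enumerate(tiers):
        for candidate in tier:
            if candidate in ranks:
                raise ValueError(f"{candidate!r} appears in more than one tier")
            ranks[candidate] = rank
    return ranks


# Hypothetical judge output: one tiered ranking per metric, best tier first.
judge_output = {
    "prose": [["child_1"], ["parent", "child_2"]],
    "coherence": [["parent"], ["child_1"], ["child_2"]],
}
ranks_by_metric = {metric: tiers_to_ranks(t) for metric, t in judge_output.items()}
```

Rank lists of this form, where equal values mean a draw, are what rank-based rating updates typically consume, so each metric can be updated independently from its own ranking.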

How it works (core loop)

Descriptor: compute descriptor = describe(text) to place texts into MAP‑Elites cells (length or embedding_2d).
Select parent: choose an elite from a random island archive (uniform_cell or an optimistic UCB-ish policy).
Critique (optional): ask an LLM for actionable guidance (issues + distinct rewrite routes).
Mutate: allocate a per-iteration job budget across operators; each job proposes one rewritten child.
Assemble battle: parent + children (+ optional frozen anchors/opponent), sized by mutation.max_children and anchor/opponent settings.
Judge: ask an LLM to return tiered rankings for each metric (with validation + optional repair retries).
Update ratings: apply per-metric TrueSkill updates; the score is a conservative lower confidence bound (LCB, mu - c*sigma) averaged across metrics.
Archive: add children into MAP‑Elites (top‑k per cell), optionally gating “new cell” inserts.
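
The conservative score from the rating step can be sketched as follows. The `Rating` dataclass, the default `c = 2.0`, and the example numbers are illustrative assumptions, not values taken from the project's defaults.

```python
from dataclasses import dataclass


@dataclass
class Rating:
    mu: float     # estimated skill
    sigma: float  # uncertainty of that estimate


def lcb_score(ratings, c=2.0):
    """Average of mu - c*sigma over all metrics (c is an assumed constant)."""
    if not ratings:
        raise ValueError("need at least one metric rating")
    return sum(r.mu - c * r.sigma for r in ratings.values()) / len(ratings)


# A fresh text with high uncertainty versus one judged many times.
fresh = {"prose": Rating(25.0, 8.0), "coherence": Rating(25.0, 8.0)}
proven = {"prose": Rating(24.0, 2.0), "coherence": Rating(26.0, 2.0)}

print(lcb_score(fresh))   # 9.0
print(lcb_score(proven))  # 21.0
```

Both texts have the same mean skill, but the repeatedly judged one scores higher because its uncertainty has shrunk; a text cannot top the archive on one lucky ranking.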

Configuration

Config is a single TOML/JSON file. If config.toml or config.json exists in the current directory it’s auto-detected; pass an explicit file with --config.

See config.toml for a complete example. The structure is intentionally nested:

[task] and [metrics] define what “good” means (goal + metric names/descriptions).
[mutation] defines the operator set, job budget, and per-operator uncertainty.
[judging] controls judge retries + optional opponents.
[rating] controls TrueSkill parameters and the score’s LCB constant.
[descriptor] defines the MAP‑Elites “diversity axis” (length bins or 2D embedding bins).
[anchors] optionally injects frozen reference anchors (seed + periodic “ghosts”) into battles.
[population] / [maintenance] enable multiple islands, migration, and global sparring.
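
The auto-detection rule described above can be sketched like this. The function name `find_config` is hypothetical, and checking `config.toml` before `config.json` is an assumption, since the text does not say which wins when both exist.

```python
from pathlib import Path
from typing import Optional

# Assumed lookup order; the README does not state which file takes precedence.
CANDIDATES = ("config.toml", "config.json")


def find_config(cwd: Path, explicit: Optional[str] = None) -> Optional[Path]:
    """Return the config path to use, or None to fall back to defaults."""
    if explicit is not None:
        # An explicit --config path always wins and must exist.
        path = Path(explicit)
        if not path.is_file():
            raise FileNotFoundError(path)
        return path
    for name in CANDIDATES:
        candidate = cwd / name
        if candidate.is_file():
            return candidate
    return None
```

Calling `find_config(Path.cwd())` with neither file present returns `None`, which corresponds to running on built-in defaults.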

CLI

run is the default command, so these are equivalent:

uv run fuzzyevolve "Seed text..."
uv run fuzzyevolve run "Seed text..."

To open the run browser:

uv run fuzzyevolve tui

`run` options

--config / -c: Path to TOML/JSON config
--output / -o: Output path (default best_by_cell.md)
--top-cells: How many best-per-cell champions to include (default 20; 0 = all)
--iterations / -i: Override run.iterations
--goal / -g: Override task.goal
--metric / -m: Override metrics.names (repeatable)
--resume: Resume from a previous run directory (or checkpoint file)
--store/--no-store: Enable/disable recording under .fuzzyevolve/
--log-level / -l: Logging level (debug|info|warning|error|critical or a number)
--log-file: Write logs to a specific file
--quiet / -q: Hide the progress bar and non-essential logging
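
The override flags map onto nested config keys (run.iterations, task.goal, metrics.names). A minimal sketch of that merge, assuming the config is held as a plain nested dict; `apply_overrides` is a hypothetical helper, not the project's API.

```python
import copy


def apply_overrides(config, iterations=None, goal=None, metrics=None):
    """Return a copy of config with CLI-style overrides applied."""
    merged = copy.deepcopy(config)
    if iterations is not None:   # --iterations / -i
        merged.setdefault("run", {})["iterations"] = iterations
    if goal is not None:         # --goal / -g
        merged.setdefault("task", {})["goal"] = goal
    if metrics:                  # --metric / -m (repeatable)
        merged.setdefault("metrics", {})["names"] = list(metrics)
    return merged


base = {"run": {"iterations": 50}, "metrics": {"names": ["prose"]}}
merged = apply_overrides(base, iterations=100, metrics=["prose", "coherence"])
```

Flags that are not passed leave the file's values untouched, and the original dict is not mutated.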

Requirements

Python 3.10+
uv (recommended)
Any model supported by pydantic-ai (configure via [llm].judge_model and [[llm.ensemble]].model)
An API key for the provider you choose

export GOOGLE_API_KEY=...     # e.g. google-gla:*
export OPENAI_API_KEY=...     # e.g. openai:*
export ANTHROPIC_API_KEY=...  # e.g. anthropic:*
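
A small pre-flight check can catch a missing key before a run starts. The sketch below takes the provider prefix of a model string and looks up the matching environment variable from the three listed above; the helper names and the example model strings are hypothetical.

```python
import os

# Provider prefix -> environment variable, as listed above.
PROVIDER_ENV = {
    "google-gla": "GOOGLE_API_KEY",
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
}


def required_env_var(model):
    """Return the env var for a 'provider:model' string, or None if unknown."""
    provider, _, _ = model.partition(":")
    return PROVIDER_ENV.get(provider)


def missing_keys(models, env=None):
    """List the env vars that the given models need but that are unset."""
    env = os.environ if env is None else env
    missing = []
    for model in models:
        var = required_env_var(model)
        if var is not None and not env.get(var) and var not in missing:
            missing.append(var)
    return missing
```

For example, `missing_keys(["openai:some-model"])` returns `["OPENAI_API_KEY"]` when that variable is not exported, and an empty list otherwise.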

Semantic embeddings require:

uv sync --extra semantic

Development

uv sync --extra dev
uv run ruff format .
uv run ruff check .
uv run pytest -q

License

Apache 2.0 — see LICENSE.

Project details

Maintainers

caesarnine

Release history

0.2.2 (Jan 25, 2026)
0.2.1 (Jan 21, 2026)
0.2.0 (Jan 21, 2026)
0.1.1 (Jan 20, 2026, this version)

Download files

Source Distributions

No source distribution files available for this release.

Built Distribution

fuzzyevolve-0.1.1-py3-none-any.whl (57.4 kB)

Uploaded Jan 20, 2026 Python 3

File details

Details for the file fuzzyevolve-0.1.1-py3-none-any.whl.

File metadata

Download URL: fuzzyevolve-0.1.1-py3-none-any.whl
Upload date: Jan 20, 2026
Size: 57.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.9.26 (uv publish, run in CI on Ubuntu 24.04)

File hashes

Hashes for fuzzyevolve-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e42574d688dc292719d5adcb5a99159457d013ee3af325a4721fdca5588472e6`
MD5	`3695d6ef782f07786e90be50e6dd097f`
BLAKE2b-256	`d1ff28324876c6b44db144ffa78ac167c23346c0a61315590b7a1769e8db783d`
