A simple automatic algorithm design tool

autoad

A simple automated algorithm design (AAD) tool.

Overview

This tool optimizes code by iteratively maximizing multiple measurable objectives. The core concepts are:

Prompt-driven Optimization: Accepts improvement instructions and evaluation criteria as prompts to guide the optimization process
Coding Agent Delegation: Delegates code improvement tasks to a coding agent within the optimization loop
Git-based Progress Tracking: Stores evaluation scores in Git tags to inform future optimization decisions
Evolutionary Approach: Simulates genetic and evolutionary algorithms by growing, merging, and selecting branches based on their performance scores

The optimization process starts when you provide improvement goals and evaluation metrics. The system then creates new branches where a coding agent implements suggested improvements. Each variant is evaluated using your specified metrics, with scores stored in Git tags. Based on these scores, the system selects high-performing branches for further improvement or merging, continuously evolving your codebase towards better solutions.
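The selection step can be illustrated with a small sketch. It assumes each branch carries one score per objective and that higher is better. The Pareto-front rule used here is one common way to choose among multiple objectives; it is an assumption for illustration, not necessarily the rule autoad applies. Branch names and scores are invented.

```python
from typing import Dict, List

Scores = Dict[str, float]  # objective name -> score (higher is better)


def dominates(a: Scores, b: Scores) -> bool:
    """Return True if `a` is at least as good as `b` everywhere and better somewhere."""
    return all(a[k] >= b[k] for k in b) and any(a[k] > b[k] for k in b)


def pareto_front(branches: Dict[str, Scores]) -> List[str]:
    """Return the names of branches that no other branch dominates."""
    return sorted(
        name
        for name, scores in branches.items()
        if not any(
            dominates(other, scores)
            for other_name, other in branches.items()
            if other_name != name
        )
    )


# Hypothetical branch names and scores, for illustration only.
example = {
    "optimize/a": {"accuracy": 0.91, "speed": 0.50},
    "optimize/b": {"accuracy": 0.95, "speed": 0.40},
    "optimize/c": {"accuracy": 0.90, "speed": 0.45},
}
print(pareto_front(example))
```

In this toy data, "optimize/c" is worse than "optimize/a" on both objectives, so only the other two branches survive as candidates for further improvement or merging.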

Usage

The tool requires:

--improvement-prompt: Describes what you want to improve
--objective NAME "PROMPT": Defines evaluation criteria (can be used multiple times)

Optional parameters:

--optional-prompt: Supplementary instructions for the optimization process
--sync-remote: Automatically sync with remote repository (fetches at start, pushes at end)
--log-dir PATH: Directory to save execution logs (default: ~/.autoad/logs)
--no-logging: Disable logging to files
--iterations N: Number of optimization iterations (default: 10)
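--branch-prefix PREFIX: Prefix for optimization branches (default: 'optimize')
--dry-run: Preview the commands that would be executed without running them (forces iterations to 1)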

uvx autoad \
  --improvement-prompt "Improve accuracy of milwrap/countbase.py by increasing the higher value of the two iter 9 MIL instance unit accuracy metrics obtained from running 'uv run pytest -s .'" \
  --objective accuracy-auto-init "Run 'uv run pytest -s .' and use the first iter 9 MIL instance unit accuracy value as the score" \
  --objective accuracy-external-init "Run 'uv run pytest -s .' and use the second iter 9 MIL instance unit accuracy value as the score" \
  --iterations 300 \
  --branch-prefix optim-mil \
  --optional-prompt "Please report progress in Japanese."
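When the command is launched from a script, the same invocation can be assembled programmatically. The helper below is a minimal sketch that uses only the flags documented above; the function name `build_autoad_command` is hypothetical and is not part of autoad.

```python
import shlex
from typing import Dict, List, Optional


def build_autoad_command(
    improvement_prompt: str,
    objectives: Dict[str, str],
    iterations: Optional[int] = None,
    branch_prefix: Optional[str] = None,
) -> List[str]:
    """Assemble an argument list using only the flags documented above."""
    cmd = ["uvx", "autoad", "--improvement-prompt", improvement_prompt]
    for name, prompt in objectives.items():
        # --objective takes two values: NAME and "PROMPT".
        cmd += ["--objective", name, prompt]
    if iterations is not None:
        cmd += ["--iterations", str(iterations)]
    if branch_prefix is not None:
        cmd += ["--branch-prefix", branch_prefix]
    return cmd


cmd = build_autoad_command(
    "Optimize performance",
    {"speed": "Measure execution time"},
    iterations=5,
)
# shlex.join quotes each argument so the line can be pasted into a shell.
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`, which avoids shell quoting problems with long prompts.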

The tool follows these steps to evolve your codebase:

User Actions
- Define optimization goals by providing:
  - Improvement prompt describing desired changes
  - Evaluation prompts specifying metrics

System Actions - Code Generation
- Generates improved code versions by:
  - Creating new branches
  - Delegating improvements to coding agent
  - Implementing suggested changes

System Actions - Evaluation
- Evaluates each variant by:
  - Running specified evaluation metrics
  - Calculating objective scores
  - Recording results in Git tags

System Actions - Evolution
- Evolves solution space through:
  - Selecting high-performing branches
  - Merging promising variants
  - Continuing optimization process
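To make the "recording results in Git tags" step concrete, here is one possible way to attach scores to a commit with an annotated tag. The tag name and the JSON message layout are assumptions made for this sketch; the actual format autoad uses is not described here.

```python
import json
from typing import Dict, List


def score_tag_command(tag_name: str, commit: str, scores: Dict[str, float]) -> List[str]:
    """Build a `git tag` command that stores scores as JSON in the tag message."""
    message = json.dumps(scores, sort_keys=True)
    return ["git", "tag", "-a", tag_name, "-m", message, commit]


def parse_score_message(message: str) -> Dict[str, float]:
    """Recover the scores from a tag message written by score_tag_command."""
    return {name: float(value) for name, value in json.loads(message).items()}


# Hypothetical tag name; the command is only built here, not executed.
cmd = score_tag_command("autoad-score-example", "HEAD", {"accuracy": 0.956})
print(cmd)
print(parse_score_message(cmd[5]))
```

Keeping scores in tag messages means they travel with the repository, so a later iteration can read them back without a separate database.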

Example Application

As a practical example, this tool was applied to improve algorithm performance in a multiple instance learning framework (inoueakimitsu/milwrap).

Optimization Progress

The optimization process ran for 2 days, focusing on enhancing the algorithm's performance on test data. The accuracy improved from 0.914 to 0.956 (with a theoretical maximum of 0.970). The graph shows the evaluation results of various algorithm variants generated during the optimization process.

Custom Iterations and Branch Prefix

You can specify the maximum number of iterations and customize the branch prefix using the following parameters:

--iterations N: Set the maximum number of optimization iterations (default: 100)
--branch-prefix PREFIX: Set custom prefix for optimization branches (default: "optim")

Remote Synchronization

The --sync-remote option enables automatic synchronization with a remote Git repository:

Before optimization: Fetches all branches and tags from the remote repository to ensure you're working with the latest state
After optimization: Force pushes all branches and tags to the remote repository to share your optimization results

This is particularly useful for:

Distributed optimization: Run optimization on multiple machines and combine results
Collaborative workflows: Share optimization progress with team members
Backup and persistence: Ensure optimization results are saved to remote repository

Example:

uvx autoad \
  --improvement-prompt "Optimize performance" \
  --objective speed "Measure execution time" \
  --sync-remote

Note: Pushing uses the --force flag, which overwrites remote branches and tags. Make sure you have the necessary permissions and understand the implications before using this option.
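For readers who want to see what this pattern looks like in plain Git, the sketch below lists commands that fetch branches and tags before a run and force-push them afterwards. The remote name `origin` and the exact commands are assumptions; autoad may issue different ones internally.

```python
from typing import Dict, List

Command = List[str]


def sync_remote_commands(remote: str = "origin") -> Dict[str, List[Command]]:
    """Return Git commands for the fetch-before / force-push-after pattern."""
    return {
        # Fetch the remote's branches plus all of its tags.
        "before": [["git", "fetch", "--tags", remote]],
        # Force-push every local branch, then every tag.
        "after": [
            ["git", "push", "--force", "--all", remote],
            ["git", "push", "--force", "--tags", remote],
        ],
    }


for phase, commands in sync_remote_commands().items():
    for command in commands:
        print(phase, " ".join(command))
```

Branches and tags are pushed in two separate commands because older Git versions reject combining --all and --tags in a single push.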

Dry-Run Mode

The --dry-run option allows you to preview what commands would be executed without actually running them:

Command preview: Displays the exact Claude CLI commands that would be executed
Interactive mode hints: Shows how to run the same commands interactively (without -p option)
Safe validation: Verify your prompts and tool permissions before actual execution
No side effects: Skips all Git operations and Claude CLI execution
Automatic iteration limit: Forces iterations to 1 (with warning if different value specified)

This is particularly useful for:

Prompt validation: Check that your improvement and objective prompts are correctly formatted
Permission testing: Verify Claude has necessary tool permissions before long-running optimization
Command debugging: See exact commands that will be executed
Learning: Understand how autoad constructs Claude CLI commands

Example:

uvx autoad \
  --improvement-prompt "Optimize algorithm performance" \
  --objective accuracy "Run tests and extract accuracy score" \
  --dry-run
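The automatic iteration limit can be pictured as a small guard applied to the parsed options. This is a sketch of the described behavior only; the function name and warning text are hypothetical.

```python
import warnings


def effective_iterations(requested: int, dry_run: bool) -> int:
    """Return the iteration count to use, clamping to 1 in dry-run mode."""
    if dry_run and requested != 1:
        warnings.warn(
            f"dry-run mode: ignoring --iterations {requested} and running 1 iteration"
        )
        return 1
    return requested


print(effective_iterations(300, dry_run=True))   # clamped, with a warning
print(effective_iterations(300, dry_run=False))  # unchanged
```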

Logging and Output Management

Autoad automatically logs all execution output to help with debugging and analysis:

Default location: ~/.autoad/logs/
Directory structure: YYYY-MM-DD-HH-MM-SS-microseconds/ for each iteration (timestamp with microsecond precision)
Log files:
- stdout.log: Standard output from the iteration
- stderr.log: Error output from the iteration
- metadata.json: Execution metadata (session_id, iteration_start_time, branch name, timestamps, etc.)
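Because every iteration writes a metadata.json, the logs can be summarized with a few lines of Python. The sketch below walks a log root and loads each file, oldest first. The `branch` key used in the demonstration is an assumed field name, and the added `log_dir` key is introduced by this helper, not by autoad.

```python
import json
import tempfile
from pathlib import Path
from typing import Dict, List


def load_iteration_metadata(log_root: Path) -> List[Dict]:
    """Load metadata.json from each iteration directory, oldest first."""
    records = []
    # Zero-padded timestamp names sort chronologically when sorted by name.
    for run_dir in sorted(p for p in log_root.iterdir() if p.is_dir()):
        meta_path = run_dir / "metadata.json"
        if meta_path.is_file():
            record = json.loads(meta_path.read_text(encoding="utf-8"))
            record["log_dir"] = run_dir.name
            records.append(record)
    return records


# Demonstration against a throwaway directory with made-up metadata.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for name, branch in [
        ("2025-07-21-13-45-01-789012", "optimize/b"),
        ("2025-07-21-13-45-00-123456", "optimize/a"),
    ]:
        (root / name).mkdir()
        (root / name / "metadata.json").write_text(
            json.dumps({"branch": branch}), encoding="utf-8"
        )
    records = load_iteration_metadata(root)
    print([r["branch"] for r in records])
```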

Logging Options

# Specify custom log directory
uvx autoad --log-dir /path/to/logs ...

# Set via environment variable
export AUTOAD_LOG_DIR=/path/to/logs
uvx autoad ...

# Disable logging entirely
uvx autoad --no-logging ...
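The three options above interact, and one plausible way to resolve them is shown below. The precedence order (command-line flag, then AUTOAD_LOG_DIR, then the default) is an assumption made for this sketch and should be checked against the tool's actual behavior; `resolve_log_dir` is a hypothetical name.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_log_dir(
    cli_value: Optional[str],
    no_logging: bool = False,
    env: Mapping[str, str] = os.environ,
) -> Optional[Path]:
    """Pick the log directory, or None when logging is disabled."""
    if no_logging:
        return None
    if cli_value:  # assumed to win over the environment variable
        return Path(cli_value).expanduser()
    if env.get("AUTOAD_LOG_DIR"):
        return Path(env["AUTOAD_LOG_DIR"]).expanduser()
    return Path("~/.autoad/logs").expanduser()


print(resolve_log_dir("/path/to/logs", env={}))
print(resolve_log_dir(None, env={"AUTOAD_LOG_DIR": "/from/env"}))
print(resolve_log_dir(None, no_logging=True, env={}))
```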

Log Directory Structure Example

~/.autoad/logs/
├── 2025-07-21-13-45-00-123456/     # Iteration 1 (with microseconds)
│   ├── stdout.log
│   ├── stderr.log
│   └── metadata.json
├── 2025-07-21-13-45-01-789012/     # Iteration 2
│   ├── stdout.log
│   ├── stderr.log
│   └── metadata.json
└── 2025-07-21-13-45-02-345678/     # Iteration 3
    ├── stdout.log
    ├── stderr.log
    └── metadata.json

Note: Each iteration now creates its own directory based on the iteration start timestamp with microsecond precision. This ensures unique directories even when iterations run in parallel, eliminating the need for session IDs and iteration numbers in the directory names.
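A directory name in this format can be produced with the standard library alone. The sketch below shows one way to do it; the function name is hypothetical.

```python
from datetime import datetime


def iteration_dir_name(start: datetime) -> str:
    """Format an iteration start time as YYYY-MM-DD-HH-MM-SS-microseconds."""
    # %f expands to the microsecond field, zero-padded to six digits.
    return start.strftime("%Y-%m-%d-%H-%M-%S-%f")


print(iteration_dir_name(datetime(2025, 7, 21, 13, 45, 0, 123456)))
# prints 2025-07-21-13-45-00-123456
```

Because every field is zero-padded, sorting the directory names alphabetically also sorts them chronologically.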

The logging system:

Preserves real-time console output while saving to files
Captures subprocess output (Git, Claude CLI, etc.)
Prevents accidental commits of log files to Git
Includes error handling with fallback directories
Protects against path traversal attacks
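As an illustration of the last point, a common way to guard against path traversal is to resolve the requested path and confirm that it still lies under the log root. This is a generic sketch of that technique, not autoad's actual implementation; `safe_log_path` is a hypothetical helper.

```python
from pathlib import Path


def safe_log_path(log_root: Path, relative_name: str) -> Path:
    """Resolve `relative_name` under `log_root`, rejecting escapes such as '../'."""
    root = log_root.resolve()
    candidate = (root / relative_name).resolve()
    if not candidate.is_relative_to(root):  # Path.is_relative_to needs Python 3.9+
        raise ValueError(f"{relative_name!r} escapes the log directory")
    return candidate


print(safe_log_path(Path("logs"), "run/stdout.log"))
try:
    safe_log_path(Path("logs"), "../../etc/passwd")
except ValueError as exc:
    print("rejected:", exc)
```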

Requirements

Python 3.10+
macOS, Linux or WSL
Claude Code installed and configured. Due to intensive usage of the coding agent, we strongly recommend subscribing to the Claude MAX plan for optimal performance and to avoid rate limiting.
Git repository (for tracking optimization history)

Running Development Version

To run a development version of autoad from a local repository without using uvx, you can clone the repository and use uv run with the --project option:

# Clone the autoad repository
git clone https://github.com/inoueakimitsu/autoad

# Run autoad from a different project directory
uv run --project ../autoad python -m autoad.main --help

In this example:

../autoad is the path to your cloned autoad repository
The command runs autoad using the development code from that repository

Release history

0.1.5 (this version): Jul 26, 2025
0.1.4: Jul 26, 2025
0.1.3: Jul 26, 2025
0.1.2: Jul 21, 2025
0.1.1: Jul 7, 2025
0.1.0: Jul 7, 2025

Download files

Source Distribution

autoad-0.1.5.tar.gz (15.5 kB), uploaded Jul 26, 2025, Source

Built Distribution

autoad-0.1.5-py3-none-any.whl (17.5 kB), uploaded Jul 26, 2025, Python 3

File details

Details for the file autoad-0.1.5.tar.gz.

File metadata

Download URL: autoad-0.1.5.tar.gz
Upload date: Jul 26, 2025
Size: 15.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for autoad-0.1.5.tar.gz
Algorithm	Hash digest
SHA256	`be86e291a68c2cd6c3a9c81cf539ffc2a9f4a50dd26061eae150e107b2bcb458`
MD5	`f8c697b8f6fc12a88d879703eb20a4f3`
BLAKE2b-256	`df8c1e45d1a505eef846634e8544b0021c8b284d14de65da7dc7d79ef1bbaa5d`

File details

Details for the file autoad-0.1.5-py3-none-any.whl.

File metadata

Download URL: autoad-0.1.5-py3-none-any.whl
Upload date: Jul 26, 2025
Size: 17.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for autoad-0.1.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`be66266930c65aff88c31c2dba8cadf167854153141b029cfef25392ecb6dd25`
MD5	`3a11bdec74ecc4979ba49d6ea6d897c9`
BLAKE2b-256	`ba256b81fdbf18ce41f599bda9fa6c6013d3eebdcf3ac9f6626fc21a67d9de0a`
