No project description provided

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 2 - Pre-Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Programming Language
- Python :: 3.9

Project description

REVOLVE: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization for More Stable and Effective Progress

cuda 11.2 python 3.9

About

This is the code for paper: REVOLVE: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization for More Stable and Effective Progress.
REVOLVE is an optimization framework that enhances the stability and efficiency of AI system optimization by tracking the evolution of model responses across iterations. Building on textual feedback from LLMs, Revolve simulates higher-order optimization effects, ensuring that adjustments are guided not only by immediate feedback but also by the model’s performance trajectory, leading to faster and more stable optimization without relying on traditional derivative-based methods.
REVOLVE offers an intuitive API, built upon the foundation of TextGrad, that allows users to define custom optimization tasks and loss functions. This makes it an adaptable and effective tool for optimizing LLM-based systems across a range of applications, including prompt optimization, solution refinement, and code optimization.

Analogy with Second-order Optimization

Installation

pip install revolve

Method Evaluation

Evaluating Solution Optimization

To evaluate solution optimization, you can use various LLMs as the evaluation engine. For example, we use the gpt-4o as the evaluation engine.

For GPQA_diamond dataset:

python evaluation/solution_optimization.py --task GPQA_diamond --engine gpt-4o --num_threads 10 --optimizer_version v2

For MMLU_machine_learning dataset:

python evaluation/solution_optimization.py --task MMLU_machine_learning --engine gpt-4o --num_threads 10 --optimizer_version v2

For MMLU_college_physics dataset:

python evaluation/solution_optimization.py --task MMLU_college_physics --engine gpt-4o --num_threads 10 --optimizer_version v2

Available Optimization Methods:

We provide multiple optimization methods for testing:

v1: Original TextGrad that optimizes based on textual feedback.
v1_momentum: Momentum-TextGrad which adjusts optimization steps using feedback trends across iterations.
v2: Our REVOLVE method that tracks response evolution over time for more stable and efficient optimization. You can use the --optimizer_version flag to select the desired method.

Evaluating Prompt Optimization

To evaluate prompt optimization, two LLMs need to be specified:

--backbone_engine: This is the LLM used by Revolve (or other optimizers) to perform the optimization process.
--model: This is the LLM on which the prompt is being optimized. For example, we use the gpt-4o as the backbone_engine, using gpt-3.5-turbo as the model:
For BBH_object_counting dataset:

python evaluation/prompt_optimization.py --task BBH_object_counting --backbone_engine gpt-4o --model gpt-3.5-turbo --num_threads 10 --optimizer_version v2

For GSM8K dataset:

python evaluation/prompt_optimization.py --task GSM8K_DSPy --backbone_engine gpt-4o --model gpt-3.5-turbo --num_threads 10 --optimizer_version v2

Evaluating Code Optimization

To evaluate code optimization, follow these steps:

Clone the leetcode-hard-gym repository:

git clone https://github.com/GammaTauAI/leetcode-hard-gym.git && cd leetcode-hard-gym

Install the package in editable mode:

python -m pip install -e .

Run the evaluation script:

python ./evaluation/code_optimization/leetcode_testtime_with_supervision.py --engine meta-llama/Meta-Llama-3.1-70B-Instruct --optimizer_version v1 (for TextGrad) / v1_momentum (for Momentum-TextGrad) / v2 (for Revolve) --size 200

BibTeX citation

If you find our work useful, please consider citing us:

@misc{zhang2024revolveoptimizingaisystems,
      title={Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization}, 
      author={Peiyan Zhang and Haibo Jin and Leyang Hu and Xinnuo Li and Liying Kang and Man Luo and Yangqiu Song and Haohan Wang},
      year={2024},
      eprint={2412.03092},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2412.03092}, 
}

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 2 - Pre-Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Programming Language
- Python :: 3.9

Release history Release notifications | RSS feed

This version

0.1.9

Dec 6, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

revolve-0.1.9.tar.gz (48.6 kB view details)

Uploaded Dec 6, 2024 Source

File details

Details for the file revolve-0.1.9.tar.gz.

File metadata

Download URL: revolve-0.1.9.tar.gz
Upload date: Dec 6, 2024
Size: 48.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.9.20

File hashes

Hashes for revolve-0.1.9.tar.gz
Algorithm	Hash digest
SHA256	`fb6d1bfb401da0f4adaa1413ff34b114f8648bb826b1b4a582c739caa6707180`
MD5	`9ea42476187652599fb2b97b97995f5a`
BLAKE2b-256	`cec17ab14c68d7340c699aaf451a8a1e536d4b74808c39422d1feaaa369ac24c`

See more details on using hashes here.

revolve 0.1.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

REVOLVE: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization for More Stable and Effective Progress

About

Installation

Method Evaluation

Evaluating Solution Optimization

Available Optimization Methods:

Evaluating Prompt Optimization

Evaluating Code Optimization

Related Links

BibTeX citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes