Skip to main content

An AutoDiff-like tool for training AI systems end-to-end with general feedback

Project description

Trace: End-to-end Generative Optimization for AI Systems and Agents

PyPI PyPI - Python Version GitHub license arXiv

Trace is a new AutoDiff-like tool for training AI systems end-to-end with general feedback (like numerical rewards or losses, natural language text, compiler errors, etc.). Trace generalizes the back-propagation algorithm by capturing and propagating an AI system's execution trace. Trace is implemented as a PyTorch-like Python library. Users write Python code directly and can use Trace primitives to optimize certain parts, just like training neural networks!

Paper | Project website | Documentation | Blogpost

BlogPost

Setup

Simply run

pip install trace-opt

Or for development, clone the repo and run the following.

pip install -e .

The library requires Python >= 3.9. The installation script will git clone AutoGen. You may require Git Large File Storage if git is unable to clone the repository otherwise.

Citation

If you use this code in your research please cite the following publication:

@article{cheng2024trace,
  title={Trace is the New AutoDiff--Unlocking Efficient Optimization of Computational Workflows},
  author={Cheng, Ching-An and Nie, Allen and Swaminathan, Adith},
  journal={arXiv preprint arXiv:2406.16218},
  year={2024}
}

Updates

  • 2024.10.21 New paper by Nvidia, Stanford, Visa, & Intel applies Trace to optimize for mapper code of parallel programming. Trace (OptoPrime) learns code achieving 1.3X speed up under 10 minutes, compared with the code optimized by domain expert.
  • 2024.9.25 Trace Paper is accepted to NeurIPS 2024.
  • 2024.9.14 TextGrad is available as a Trace optimizer.

Evaluation

A previous version of Trace was tested with gpt-4-0125-preview on numerical optimization, simulated traffic control, big-bench-hard, and llf-metaworld tasks, which demonstrated good optimization performance on multiple random seeds; please see the paper for details.

Note For gpt-4o, please use the version gpt-4o-2024-08-06 (onwards), which fixes the structured output issue of gpt-4o-2024-05-13. While gpt-4 works reliably most of the time, we've found gpt-4o-2024-05-13 often hallucinates even in very basic optimization problems and does not follow instructions. This might be due to the current implementation of optimizers rely on outputing in json format. Issues of gpt-4o with json have been reported in the communities (see example).

Disclaimers

  • Trace is an LLM-based optimization framework for research purpose only.
  • The current release is a beta version of the library. Features and more documentation will be added, and some functionalities may be changed in the future.
  • System performance may vary by workflow, dataset, query, and response, and users are responsible for determining the accuracy of generated content.
  • System outputs do not represent the opinions of Microsoft.
  • All decisions leveraging outputs of the system should be made with human oversight and not be based solely on system outputs.
  • Use of the system must comply with all applicable laws, regulations, and policies, including those pertaining to privacy and security.
  • The system should not be used in highly regulated domains where inaccurate outputs could suggest actions that lead to injury or negatively impact an individual's legal, financial, or life opportunities.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Privacy

See Microsoft Privacy Statement.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trace_opt-0.1.2.tar.gz (57.5 kB view details)

Uploaded Source

Built Distribution

trace_opt-0.1.2-py3-none-any.whl (62.8 kB view details)

Uploaded Python 3

File details

Details for the file trace_opt-0.1.2.tar.gz.

File metadata

  • Download URL: trace_opt-0.1.2.tar.gz
  • Upload date:
  • Size: 57.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for trace_opt-0.1.2.tar.gz
Algorithm Hash digest
SHA256 f54aec021a1efd1df3be8eeb5979e0ac63bfb2a25cf979588bf18e4bd54482c1
MD5 ad5979cbd3850395377545987f87692d
BLAKE2b-256 0fc5ca49158151d492f4792a7ae314e4132b1c957258c903f56b8f1b760366f1

See more details on using hashes here.

File details

Details for the file trace_opt-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: trace_opt-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 62.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for trace_opt-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 89827bc7b0dabfe35bbe69da68cae100db86d2bcb1bfa22e6dae722adad19201
MD5 59c606b5f86fcf4aa09e59288dedefd0
BLAKE2b-256 53baf57ae030689d2d69649c26a759f078861df78e9f6b8beb31f241b934d399

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page