AI Performance Engineer
LatencyAI - AI Performance Engineer
Introduction
LatencyAI is an AI agent that optimizes Python code for performance using reasoning LLMs. It iteratively profiles, optimizes, and benchmarks the code. Optimization techniques include GPU offloading, data and task parallelism, and latency hiding, among others.
Note: this is an experimental library. Subscribe for updates at latency.ai.
Installation
- (Optional) Deploy a CUDA-enabled GPU instance
- Install the package: pip install --upgrade latencyai
Usage
- Set the OPENAI_API_KEY environment variable
- Run python -m latencyai --runs=3 script-to-optimize.py. The optional --runs flag sets the number of optimization attempts, i.e. optimize-benchmark-profile iterations. The default is 2.
The provided script should have a main function. The benchmark runner calls it multiple times, depending on its execution time.
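As an illustration of the expected input, here is a minimal sketch of a script that exposes a main function. The file contents, the compute_norms helper, and the workload are hypothetical. It also assumes that main takes no arguments and is safe to call repeatedly, which the benchmark runner's behavior suggests but which is not stated explicitly.

```python
# script-to-optimize.py (hypothetical example input)
import math


def compute_norms(n: int) -> list[float]:
    """Deliberately naive pure-Python loop: a candidate for optimization."""
    return [math.sqrt(i * i + 1.0) for i in range(n)]


def main() -> float:
    # Assumption: main() takes no arguments and has no side effects,
    # so the benchmark runner can call it several times in a row.
    norms = compute_norms(100_000)
    return sum(norms)


if __name__ == "__main__":
    print(main())
```

Keeping main deterministic and free of side effects makes repeated benchmark calls comparable between the original and the optimized version.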
If optimization is successful, a file named <original-script>_optimized.py is written to the original script's directory.
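The following sketch shows one way to drive the documented command from Python and locate the result. The helper names build_command, expected_output_path, and optimize are hypothetical and not part of latencyai. The output path follows the <original-script>_optimized.py naming described above, and the key is assumed to be read from the standard OPENAI_API_KEY variable.

```python
import os
import subprocess
import sys
from pathlib import Path
from typing import Optional


def build_command(script: str, runs: int = 2) -> list[str]:
    """Build the documented CLI invocation as an argument list."""
    return [sys.executable, "-m", "latencyai", f"--runs={runs}", script]


def expected_output_path(script: str) -> Path:
    """Assumed naming: <original-script>_optimized.py next to the original."""
    src = Path(script)
    return src.with_name(f"{src.stem}_optimized.py")


def optimize(script: str, runs: int = 2) -> Optional[Path]:
    """Run the optimizer; return the output path only if the file was written."""
    if not os.environ.get("OPENAI_API_KEY"):
        raise RuntimeError("OPENAI_API_KEY is not set")
    subprocess.run(build_command(script, runs), check=True)
    out = expected_output_path(script)
    return out if out.exists() else None
```

Calling optimize("script-to-optimize.py", runs=3) would return the path script-to-optimize_optimized.py if the run produced an optimized file, and None otherwise.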
Tracking optimizations
After integrating optimized code into your application, you can verify and track end-to-end performance improvements in deployed applications using Graphsignal.
Download files
File details
Details for the file latencyai-0.1.1.tar.gz.
File metadata
- Download URL: latencyai-0.1.1.tar.gz
- Upload date:
- Size: 6.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.10.16 Linux/6.8.0-1021-azure
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | dc57a2e2956854c7a5bd98e75be585e441015f462116c03538af605733150f35 |
| MD5 | cc8498f82a12621f725e407673d348d8 |
| BLAKE2b-256 | cd00db6fa7c8c7122d4bced214435daecd51dc3884745c256f79396e74fd1fa9 |
File details
Details for the file latencyai-0.1.1-py3-none-any.whl.
File metadata
- Download URL: latencyai-0.1.1-py3-none-any.whl
- Upload date:
- Size: 8.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.10.16 Linux/6.8.0-1021-azure
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 28d50388e3958720c526adb3c5d683289c481ce5f0698aa5dd64c20ccf1927ca |
| MD5 | ceb6b81e82eb2ae8ecc17d94567f6574 |
| BLAKE2b-256 | 352fb48911373bce9fb83b8f4ba78dd86ca0f054b2e924c3a66adb8a5fede3f6 |