LatencyAI - AI Performance Engineer

Introduction

LatencyAI is an AI agent that uses reasoning LLMs to optimize Python code for performance. It iteratively profiles, optimizes, and benchmarks the code, applying techniques such as GPU offloading, data and task parallelism, and latency hiding.

Note: this is an experimental library. Subscribe for updates at latency.ai.

Installation

  • (Optional) Deploy a CUDA-enabled GPU instance
  • pip install --upgrade latencyai

Usage

  • Set the OPENAI_API_KEY environment variable
  • Run python -m latencyai --runs=3 script-to-optimize.py. The optional --runs flag sets the number of optimization attempts (profile-optimize-benchmark iterations); the default is 2.
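
The two steps above can also be scripted. The following is a minimal sketch, assuming the API key is read from the standard OPENAI_API_KEY variable; build_command and run_latencyai are hypothetical helper names introduced here for illustration, not part of the latencyai package. Only the command line itself comes from the usage notes above.

```python
import os
import subprocess
import sys


def build_command(script, runs=2):
    """Build the documented command line: python -m latencyai --runs=N script."""
    # sys.executable keeps the call inside the current Python environment.
    return [sys.executable, "-m", "latencyai", f"--runs={runs}", str(script)]


def run_latencyai(script, runs=2):
    """Hypothetical wrapper: check for the API key, then launch the CLI."""
    # Assumption: the key is expected in the standard OPENAI_API_KEY variable.
    if not os.environ.get("OPENAI_API_KEY"):
        raise RuntimeError("Set OPENAI_API_KEY before running LatencyAI")
    # check=True raises CalledProcessError if the command exits with a non-zero status.
    return subprocess.run(build_command(script, runs), check=True)


# Example (not executed here): run_latencyai("script-to-optimize.py", runs=3)
```

Failing early on a missing key avoids starting a run that cannot reach the LLM.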

The provided script should define a main function. The benchmark runner calls it multiple times, depending on its execution time.
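
As an illustration, a script in the expected shape might look like the sketch below. The workload in main is an arbitrary example. The benchmark helper is hypothetical and only shows one way a runner could repeat main until a time budget is used up; it is not LatencyAI's actual runner, and target_seconds and max_calls are assumed parameters.

```python
import time


def main():
    # Deliberately unoptimized CPU-bound work: sum of squares in a pure-Python loop.
    total = 0
    for i in range(200_000):
        total += i * i
    return total


def benchmark(func, target_seconds=0.2, max_calls=1000):
    """Hypothetical runner: call func until a time budget or call limit is reached."""
    timings = []
    elapsed = 0.0
    while elapsed < target_seconds and len(timings) < max_calls:
        start = time.perf_counter()
        func()
        duration = time.perf_counter() - start
        timings.append(duration)
        elapsed += duration
    return timings


if __name__ == "__main__":
    timings = benchmark(main)
    print(f"{len(timings)} calls, best {min(timings) * 1e3:.3f} ms")
```

With a fixed time budget, a fast main is called many times and a slow one only a few, which matches the idea of a call count that depends on execution time.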

If optimization is successful, a file named <original-script>_optimized.py is written to the original script's directory.
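
To locate the output programmatically, the expected path can be derived from the input path. This is a minimal sketch, assuming <original-script> means the script's file name without the .py extension; optimized_path is a hypothetical helper, not part of the package.

```python
from pathlib import Path


def optimized_path(script):
    """Return the assumed output path, e.g. jobs/train.py -> jobs/train_optimized.py."""
    script = Path(script)
    # Same directory as the original script, with "_optimized" appended to the stem.
    return script.with_name(f"{script.stem}_optimized{script.suffix}")
```

After a run, optimized_path("script-to-optimize.py").exists() indicates whether an optimized file was produced under this naming assumption.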

Tracking optimizations

After integrating optimized code into your application, you can verify and track end-to-end performance improvements in deployed applications using Graphsignal.
