Skip to main content

No project description provided

Project description

uv PyPI - Version Python Version from PEP 621 TOML Static Badge

Unit Testing E2E Testing Coverage Status

🦙 LlamaDeploy 🤖

LlamaDeploy (formerly llama-agents) is an async-first framework for deploying, scaling, and productionizing agentic multi-service systems based on workflows from llama_index. With LlamaDeploy, you can build any number of workflows in llama_index and then run them as services, accessible through a HTTP API by a user interface or other services part of your system.

The goal of LlamaDeploy is to easily transition something that you built in a notebook to something running on the cloud with the minimum amount of changes to the original code, possibly zero. In order to make this transition a pleasant one, you can interact with LlamaDeploy in two ways:

Both the SDK and the CLI are part of the LlamaDeploy Python package. To install, just run:

pip install -U llama-deploy

[!TIP] For a comprehensive guide to LlamaDeploy's architecture and detailed descriptions of its components, visit our official documentation.

Why LlamaDeploy?

  1. Seamless Deployment: It bridges the gap between development and production, allowing you to deploy llama_index workflows with minimal changes to your code.
  2. Flexibility: By using a hub-and-spoke architecture, you can easily swap out components (like message queues) or add new services without disrupting the entire system.
  3. Fault Tolerance: With built-in retry mechanisms and failure handling, LlamaDeploy adds robustness in production environments.
  4. Async-First: Designed for high-concurrency scenarios, making it suitable for real-time and high-throughput applications.

[!NOTE] This project was initially released under the name llama-agents, but the introduction of Workflows in llama_index turned out to be the most intuitive way for our users to develop agentic applications. We then decided to add new agentic features in llama_index directly, and focus LlamaDeploy on closing the gap between local development and remote execution of agents as services.

Quick Start with llamactl

Spin up a running deployment in minutes using the interactive CLI wizard:

# 1. Install the package & CLI
pip install -U llama-deploy

# 2. Scaffold a new project (interactive)
llamactl init

#    or non-interactive
llamactl init --name project-name --template basic

# 3. Enter the project
cd project-name

# 4. Start the control-plane API server (new terminal)
python -m llama_deploy.apiserver

# 5. Deploy the generated workflow (another terminal)
llamactl deploy deployment.yml

# 6. Call it!
llamactl run --deployment hello-deploy --arg message "Hello world!"

Looking for more templates or integrations? Check the examples directory for end-to-end demos (message queues, web UIs, etc.) or read the full documentation.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_deploy-0.9.0.tar.gz (2.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_deploy-0.9.0-py3-none-any.whl (44.9 kB view details)

Uploaded Python 3

File details

Details for the file llama_deploy-0.9.0.tar.gz.

File metadata

  • Download URL: llama_deploy-0.9.0.tar.gz
  • Upload date:
  • Size: 2.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.0

File hashes

Hashes for llama_deploy-0.9.0.tar.gz
Algorithm Hash digest
SHA256 ec3eabf02d8460899fecaf99072155040e01687a84a547c4439b3bae3fb2e589
MD5 2329dfb72804eb41aa7bb19f5bce5481
BLAKE2b-256 884ebcce0e38c4365827973715e9be7b5a77600394fadbb0a4192f6c72fe0e41

See more details on using hashes here.

File details

Details for the file llama_deploy-0.9.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_deploy-0.9.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f603fd711196ae5c6d27ce3cf8c425efd85ed95358f865641dbc76230253e2b5
MD5 c0da504082a7031c4c2232b3847e023d
BLAKE2b-256 820a7d52fe4b3e99f1ddf3463246b5bbe3bf1cd2b4be5a7d5f7d784c91890f4f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page