Skip to main content

No project description provided

Project description

uv PyPI - Version PyPI - Python Version Static Badge

Unit Testing E2E Testing Coverage Status

🦙 LlamaDeploy 🤖

LlamaDeploy (formerly llama-agents) is an async-first framework for deploying, scaling, and productionizing agentic multi-service systems based on workflows from llama_index. With LlamaDeploy, you can build any number of workflows in llama_index and then run them as services, accessible through a HTTP API by a user interface or other services part of your system.

The goal of LlamaDeploy is to easily transition something that you built in a notebook to something running on the cloud with the minimum amount of changes to the original code, possibly zero. In order to make this transition a pleasant one, you can interact with LlamaDeploy in two ways:

Both the SDK and the CLI are part of the LlamaDeploy Python package. To install, just run:

pip install llama_deploy

[!TIP] For a comprehensive guide to LlamaDeploy's architecture and detailed descriptions of its components, visit our official documentation.

Why LlamaDeploy?

  1. Seamless Deployment: It bridges the gap between development and production, allowing you to deploy llama_index workflows with minimal changes to your code.
  2. Scalability: The microservices architecture enables easy scaling of individual components as your system grows.
  3. Flexibility: By using a hub-and-spoke architecture, you can easily swap out components (like message queues) or add new services without disrupting the entire system.
  4. Fault Tolerance: With built-in retry mechanisms and failure handling, LlamaDeploy adds robustness in production environments.
  5. State Management: The control plane manages state across services, simplifying complex multi-step processes.
  6. Async-First: Designed for high-concurrency scenarios, making it suitable for real-time and high-throughput applications.

[!NOTE] This project was initially released under the name llama-agents, but the introduction of Workflows in llama_index turned out to be the most intuitive way for our users to develop agentic applications. We then decided to add new agentic features in llama_index directly, and focus LlamaDeploy on closing the gap between local development and remote execution of agents as services.

Getting Started

The fastest way to start using LlamaDeploy is playing with a practical example. This repository contains a few applications you can use as a reference:

We recommend to start from the Quick start example and move to Use a deployment from a web-based user interface immediately after. Each folder contains a README file that will guide you through the process.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_deploy-0.8.0.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_deploy-0.8.0-py3-none-any.whl (90.5 kB view details)

Uploaded Python 3

File details

Details for the file llama_deploy-0.8.0.tar.gz.

File metadata

  • Download URL: llama_deploy-0.8.0.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.6

File hashes

Hashes for llama_deploy-0.8.0.tar.gz
Algorithm Hash digest
SHA256 7e82483f1df258b1a0f3098d9eab23da9e89eeb08fd22bed838fde7879c68b0c
MD5 6cb4143d3d17787f2016fdeb85940ab1
BLAKE2b-256 67e46fbb08a770455ab62b87e46a5ba4d90117f29c030af42c4b88827f5696b7

See more details on using hashes here.

File details

Details for the file llama_deploy-0.8.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_deploy-0.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 20d93cf67fd0256b5c7c9ee9381d3c5155edee0f8cebb0516336e879aae8efa3
MD5 fb9a10e4c02f151b0b2e5fe9710319ae
BLAKE2b-256 aaa426c84fc0fd3df0609705e5d871a1ed35dd5fe8080fa6c86e4ef2e9f53ae8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page