Skip to main content

No project description provided

Project description

PyPI - Version PyPI - Python Version Static Badge

Unit Testing E2E Testing Coverage Status

🦙 LlamaDeploy 🤖

LlamaDeploy (formerly llama-agents) is an async-first framework for deploying, scaling, and productionizing agentic multi-service systems based on workflows from llama_index. With LlamaDeploy, you can build any number of workflows in llama_index and then run them as services, accessible through a HTTP API by a user interface or other services part of your system.

The goal of LlamaDeploy is to easily transition something that you built in a notebook to something running on the cloud with the minimum amount of changes to the original code, possibly zero. In order to make this transition a pleasant one, you can interact with LlamaDeploy in two ways:

Both the SDK and the CLI are part of the LlamaDeploy Python package. To install, just run:

pip install llama_deploy

[!TIP] For a comprehensive guide to LlamaDeploy's architecture and detailed descriptions of its components, visit our official documentation.

Why LlamaDeploy?

  1. Seamless Deployment: It bridges the gap between development and production, allowing you to deploy llama_index workflows with minimal changes to your code.
  2. Scalability: The microservices architecture enables easy scaling of individual components as your system grows.
  3. Flexibility: By using a hub-and-spoke architecture, you can easily swap out components (like message queues) or add new services without disrupting the entire system.
  4. Fault Tolerance: With built-in retry mechanisms and failure handling, LlamaDeploy adds robustness in production environments.
  5. State Management: The control plane manages state across services, simplifying complex multi-step processes.
  6. Async-First: Designed for high-concurrency scenarios, making it suitable for real-time and high-throughput applications.

[!NOTE] This project was initially released under the name llama-agents, but the introduction of Workflows in llama_index turned out to be the most intuitive way for our users to develop agentic applications. We then decided to add new agentic features in llama_index directly, and focus LlamaDeploy on closing the gap between local development and remote execution of agents as services.

Getting Started

The fastest way to start using LlamaDeploy is playing with a practical example. This repository contains a few applications you can use as a reference:

We recommend to start from the Quick start example and move to Use a deployment from a web-based user interface immediately after. Each folder contains a README file that will guide you through the process.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_deploy-0.7.2.tar.gz (57.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_deploy-0.7.2-py3-none-any.whl (87.6 kB view details)

Uploaded Python 3

File details

Details for the file llama_deploy-0.7.2.tar.gz.

File metadata

  • Download URL: llama_deploy-0.7.2.tar.gz
  • Upload date:
  • Size: 57.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.1 CPython/3.11.7 Linux/6.8.0-1021-azure

File hashes

Hashes for llama_deploy-0.7.2.tar.gz
Algorithm Hash digest
SHA256 499097da9d4e965c1ff17bafaefefe76a5f5a81623b75b2cf5cfbca02dda3b55
MD5 cd646e34ebb2a737306ab2abceb4a8f2
BLAKE2b-256 859c8dff1550395a5166186638fd581e3e380a76aad2b35d7cc8e53268a7afb3

See more details on using hashes here.

File details

Details for the file llama_deploy-0.7.2-py3-none-any.whl.

File metadata

  • Download URL: llama_deploy-0.7.2-py3-none-any.whl
  • Upload date:
  • Size: 87.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.1 CPython/3.11.7 Linux/6.8.0-1021-azure

File hashes

Hashes for llama_deploy-0.7.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6d863f377562a0f6e8c0915f6ca9af86cc9b776cf5bc1b1dc3e42e0f9e981a34
MD5 eb4672eb12b0b4011f336f62eac5a174
BLAKE2b-256 9bc62284b7f462057b868f02e5030d754bb0b5cf1b54d19420e64e322e4abfca

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page