
llama-index-llms-heroku

Heroku Managed Inference

The llama-index-llms-heroku package provides a LlamaIndex integration for Heroku's Managed Inference platform, making it easy to connect to and query AI models deployed on Heroku's infrastructure.

Installation

pip install llama-index-llms-heroku
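
You can verify the installation by importing the class used throughout the examples below:

# Quick import check; the Heroku class is the integration's entry point
from llama_index.llms.heroku import Heroku

print(Heroku)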

Setup

1. Create a Heroku App

First, create an app in Heroku:

heroku create $APP_NAME

2. Create and Attach AI Models

Create and attach a chat model to your app:

heroku ai:models:create -a $APP_NAME claude-3-5-haiku

3. Export Configuration Variables

Export the required configuration variables:

export INFERENCE_KEY=$(heroku config:get INFERENCE_KEY -a $APP_NAME)
export INFERENCE_MODEL_ID=$(heroku config:get INFERENCE_MODEL_ID -a $APP_NAME)
export INFERENCE_URL=$(heroku config:get INFERENCE_URL -a $APP_NAME)

Usage

Basic Usage

from llama_index.llms.heroku import Heroku
from llama_index.core.llms import ChatMessage, MessageRole

# Initialize the Heroku LLM
llm = Heroku()

# Create chat messages
messages = [
    ChatMessage(
        role=MessageRole.SYSTEM, content="You are a helpful assistant."
    ),
    ChatMessage(
        role=MessageRole.USER,
        content="What are the most popular house pets in North America?",
    ),
]

# Get response
response = llm.chat(messages)
print(response)
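
Streaming is not shown in the package examples, but LlamaIndex LLMs generally expose it through the shared interface; a minimal sketch, assuming Heroku inherits stream_chat from the base LLM class:

# Stream the reply chunk-by-chunk (assumes stream_chat is inherited
# from the standard LlamaIndex LLM interface)
for chunk in llm.stream_chat(messages):
    print(chunk.delta, end="", flush=True)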

Using Environment Variables

The integration automatically reads from environment variables:

import os

# Set environment variables
os.environ["INFERENCE_KEY"] = "your-inference-key"
os.environ["INFERENCE_URL"] = "https://us.inference.heroku.com"
os.environ["INFERENCE_MODEL_ID"] = "claude-3-5-haiku"

# Initialize without parameters
llm = Heroku()

Using Parameters

You can also pass parameters directly:

import os

llm = Heroku(
    model=os.getenv("INFERENCE_MODEL_ID", "claude-3-5-haiku"),
    api_key=os.getenv("INFERENCE_KEY", "your-inference-key"),
    inference_url=os.getenv(
        "INFERENCE_URL", "https://us.inference.heroku.com"
    ),
    max_tokens=1024,
)

Text Completion

# Simple text completion
response = llm.complete("Explain the importance of open source LLMs")
print(response.text)
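
For asynchronous use, LlamaIndex's base LLM interface also defines async variants; a minimal sketch, assuming Heroku inherits acomplete from the base class:

import asyncio

# Async completion (assumes the standard acomplete method from the
# LlamaIndex LLM interface)
async def main():
    response = await llm.acomplete("Explain the importance of open source LLMs")
    print(response.text)

asyncio.run(main())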

Available Models

For a complete list of available models, see the Heroku Managed Inference documentation.

Error Handling

The integration raises descriptive errors for common configuration issues (see the sketch after this list):

  • Missing API key
  • Invalid inference URL
  • Missing model configuration
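
A minimal sketch of guarding against a missing key, assuming the integration raises ValueError for absent configuration (check the package source for the exact exception types):

from llama_index.llms.heroku import Heroku

try:
    # No api_key argument and no INFERENCE_KEY in the environment
    llm = Heroku(api_key=None)
except ValueError as e:
    print(f"Configuration error: {e}")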

Additional Information

For more information about Heroku Managed Inference, visit the official documentation.
