LLM plugin for AWS Bedrock Converse API with tool calling and embeddings support
Project description
llm-bedrock-converse
A powerful LLM plugin that provides AWS Bedrock integration using the Converse API, enabling advanced tool calling, streaming responses, and embeddings for Claude models.
✨ Features
- 🔧 Full Tool Calling Support - Native function calling with the Converse API
- 🌊 Streaming Responses - Real-time token streaming for interactive experiences
- 🎯 MCP Server Integration - Works seamlessly with llm-tools-mcp
- 📎 Multi-Modal Support - Handle images, PDFs, and videos
- 🔄 Conversation History - Maintain context across multiple turns
- 🚀 All Claude Models - Support for Claude 3, 3.5, 4, and cross-region profiles
- 📊 Titan Embeddings - Generate embeddings with Amazon Titan models
- ⚡ Automatic Retry Logic - Built-in throttling and error handling
📦 Installation
llm install llm-bedrock-converse
🔑 Configuration
The plugin uses standard AWS credentials via boto3. Configure your credentials using any of these methods:
# Environment variables
export AWS_ACCESS_KEY_ID=your_access_key
export AWS_SECRET_ACCESS_KEY=your_secret_key
export AWS_REGION=us-east-1
# Or use AWS CLI
aws configure
# Or use IAM roles (recommended for EC2/Lambda)
🚀 Usage
Basic Prompting
# Simple prompt
llm -m bedrock-converse/claude-3-haiku "Explain quantum computing"
# Using short alias
llm -m bc-haiku "What is the capital of France?"
# With system prompt
llm -m bc-sonnet-3.5 -s "You are a Python expert" "Write a decorator"
Tool Calling
# Built-in tools
llm -m bc-haiku -T llm_time "What time is it?"
# MCP servers (requires llm-tools-mcp)
llm -m bc-sonnet-3.5 -T MCP "What's the weather in Tokyo?"
# Custom functions
llm -m bc-haiku --functions 'def multiply(a: int, b: int) -> int: return a * b' \
"What is 23 times 47?"
Chat Mode
# Interactive chat with conversation history
llm chat -m bc-sonnet-3.5
# Chat with system prompt
llm chat -m bc-haiku -s "You are a helpful coding assistant"
Multi-Modal Inputs
# Analyze an image
llm -m bc-sonnet-3.5 "Describe this image" -a image.jpg
# Process a PDF document
llm -m bc-opus "Summarize this document" -a report.pdf
# Multiple attachments
llm -m bc-sonnet-3.5 "Compare these images" -a img1.jpg -a img2.jpg
Embeddings
# Generate embeddings with Titan v2
llm embed -m titan-v2 -c "Hello world"
# Embed multiple texts
llm embed -m titan-v2 -c "Text 1" -c "Text 2" -c "Text 3"
# Use Titan v1
llm embed -m titan-v1 -c "Your text here"
🤖 Available Models
Claude Models
| Model ID | Alias | Description |
|---|---|---|
anthropic.claude-3-haiku-20240307-v1:0 |
bc-haiku |
Fast, cost-effective |
anthropic.claude-3-5-haiku-20241022-v1:0 |
bc-haiku-3.5 |
Enhanced Haiku |
anthropic.claude-haiku-4-5-20251001-v1:0 |
bc-haiku-4.5 |
Latest Haiku |
anthropic.claude-3-sonnet-20240229-v1:0 |
bc-sonnet |
Balanced performance |
anthropic.claude-3-5-sonnet-20240620-v1:0 |
bc-sonnet-3.5 |
Most popular |
anthropic.claude-3-7-sonnet-20250219-v1:0 |
bc-sonnet-3.7 |
Advanced Sonnet |
anthropic.claude-sonnet-4-20250514-v1:0 |
bc-sonnet-4 |
Claude 4 Sonnet |
anthropic.claude-sonnet-4-5-20250929-v1:0 |
bc-sonnet-4.5 |
Latest Sonnet |
anthropic.claude-3-opus-20240229-v1:0 |
bc-opus |
Most capable |
anthropic.claude-opus-4-20250514-v1:0 |
bc-opus-4 |
Claude 4 Opus |
anthropic.claude-opus-4-1-20250805-v1:0 |
bc-opus-4.1 |
Latest Opus |
Cross-Region Models
| Model ID | Alias | Description |
|---|---|---|
us.anthropic.claude-3-5-sonnet-20241022-v2:0 |
bc-sonnet-3.5-v2 |
US inference profile |
us.anthropic.claude-sonnet-4-5-20250929-v1:0 |
bc-sonnet-4.5-us |
US Sonnet 4.5 |
us.anthropic.claude-opus-4-1-20250805-v1:0 |
bc-opus-4.1-us |
US Opus 4.1 |
Global Models
| Model ID | Alias | Description |
|---|---|---|
global.anthropic.claude-sonnet-4-20250514-v1:0 |
bc-sonnet-4-global |
Global Sonnet 4 |
global.anthropic.claude-sonnet-4-5-20250929-v1:0 |
bc-sonnet-4.5-global |
Global Sonnet 4.5 |
Embedding Models
| Model ID | Alias | Dimensions |
|---|---|---|
amazon.titan-embed-text-v2:0 |
titan-v2 |
1024 |
amazon.titan-embed-text-v1 |
titan-v1 |
1536 |
🛠️ Advanced Features
Retry Logic
The plugin automatically handles throttling with exponential backoff:
# Automatically retries on ThrottlingException
# Respects retry-after headers
# Up to 5 retry attempts with exponential backoff
Rate Limiting
Embeddings include built-in rate limiting (100 RPM for Titan models):
# Automatically enforces 0.6s minimum interval between requests
# Prevents throttling errors
Conversation Context
# Maintains full conversation history
llm chat -m bc-sonnet-3.5
> What is Python?
> Can you show me an example? # Context maintained
📚 Documentation
🐛 Troubleshooting
No AWS credentials found
# Set credentials explicitly
export AWS_ACCESS_KEY_ID=your_key
export AWS_SECRET_ACCESS_KEY=your_secret
export AWS_REGION=us-east-1
Model not available in region
# Check model availability in your region
aws bedrock list-foundation-models --region us-east-1
# Use cross-region inference profiles (us. or global. prefix)
llm -m bc-sonnet-4.5-us "Your prompt"
Throttling errors
The plugin automatically retries with exponential backoff. If issues persist:
- Use cross-region profiles for better availability
- Implement request batching
- Request quota increases via AWS Support
📄 License
Apache License 2.0 - see LICENSE for details.
🤝 Contributing
Contributions welcome! Please feel free to submit issues or pull requests.
🔗 Links
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llm_bedrock_converse-0.1.0.tar.gz.
File metadata
- Download URL: llm_bedrock_converse-0.1.0.tar.gz
- Upload date:
- Size: 14.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8db7e62134ef3b000534c1685e85ff27a751ccb8b3ac0e998006de5948ebf77b
|
|
| MD5 |
cec44bdc4f4b2939c09f16dbba325a0a
|
|
| BLAKE2b-256 |
9dd775a963ed569ef0a04c1c33d29c625e58e2d7de8b8bbae6b432ad0ac1f493
|
File details
Details for the file llm_bedrock_converse-0.1.0-py3-none-any.whl.
File metadata
- Download URL: llm_bedrock_converse-0.1.0-py3-none-any.whl
- Upload date:
- Size: 12.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4977bd32975f32b3ed05263c9df33fac0ff9002a52380efe91d96427612efaa3
|
|
| MD5 |
7752e2ec9a3836f3004051d6bd29df28
|
|
| BLAKE2b-256 |
4348b16822ea9c9c3234075c073bcbe9a0dd77da1e5a7223ebdf7b05af8eefbd
|