A comprehensive Python library for managing fallback mechanisms for Large Language Model (LLM) API calls using LiteLLM
Project description
LLM Fallbacks
A comprehensive Python library for managing fallback mechanisms for Large Language Model (LLM) API calls using the LiteLLM library. This library helps you handle API failures gracefully by providing alternative models to try when a primary model fails.
Features
- Comprehensive Model Management: Access to thousands of LLM models across multiple providers
- Intelligent Fallback Strategies: Automatic fallback configuration based on model capabilities and cost
- Cost Optimization: Built-in cost calculation and model sorting by price and performance
- Multi-Modal Support: Support for chat, completion, embedding, vision, audio, and more model types
- Provider Management: Custom provider configuration and API key management
- LiteLLM Integration: Seamless integration with LiteLLM proxy for production deployments
- GUI Interface: Interactive model filtering and selection interface
- Configuration Export: Generate ready-to-use LiteLLM YAML configurations
Installation
pip install llm-fallbacks
Development Installation
git clone https://github.com/bodencrouch/llm-fallbacks.git
cd llm-fallbacks
pip install -e .
Quick Start
Download the repo
git clone https://github.com/th3w1zard1/llm_fallbacks.git
cd llm_fallbacks
uv sync
uv run src/tests/test_core.py
Basic Usage
from llm_fallbacks import get_chat_models, get_fallback_list
# Get all available chat models
chat_models = get_chat_models()
# Get fallback models for a specific model
fallbacks = get_fallback_list("gpt-4", model_type="chat")
print(f"Fallback models for GPT-4: {fallbacks}")
Model Filtering
from llm_fallbacks import filter_models
# Get free chat models only
free_models = filter_models(
model_type="chat",
free_only=True
)
# Get models with specific capabilities
vision_models = filter_models(
model_type="chat",
vision=True
)
Cost Analysis Examples
Sort the models by cost:
from llm_fallbacks import get_litellm_models, sort_models_by_cost_and_limits
# Get all models with cost information
models = get_litellm_models()
# Sort models by cost (cheapest first)
sorted_models = sort_models_by_cost_and_limits(models, free_only=True)
print(repr(sorted_models))
Calculate cost for a specific model:
from llm_fallbacks import get_litellm_models, calculate_cost_per_token
model_spec = get_litellm_models()["gpt-5"]
cost_per_token = calculate_cost_per_token(model_spec)
print(f"Cost per token: ${cost_per_token:.6f}")
Configuration Generation
from llm_fallbacks.generate_configs import to_litellm_config_yaml
# Generate LiteLLM configuration
config = to_litellm_config_yaml(
providers=[], # Your custom providers
free_only=True
)
# Save to YAML file
import yaml
with open("litellm_config.yaml", "w") as f:
yaml.dump(config, f)
or run generate_configs.py:
uv run src/generate_configs.py
Core Components
1. Model Management (core.py)
get_litellm_models(): Retrieve all available LiteLLM models with specificationsget_chat_models(): Get models supporting chat completionget_completion_models(): Get models supporting text completionget_embedding_models(): Get models supporting text embeddingsget_vision_models(): Get models supporting vision tasksget_audio_models(): Get models supporting audio processing
2. Configuration (config.py)
- Model Specifications: Comprehensive model metadata including capabilities, costs, and limits
- Provider Configuration: Custom provider setup for private or specialized models
- Fallback Strategies: Intelligent fallback configuration based on model compatibility
3. Configuration Generation (generate_configs.py)
- LiteLLM YAML Export: Generate production-ready LiteLLM proxy configurations
- Fallback Mapping: Automatic fallback model assignment based on capabilities
- Cost Optimization: Prioritize models by cost and performance
4. Interactive Interface (__main__.py)
- GUI Application: Tkinter-based interface for model exploration (experimental)
- Advanced Filtering: Multiple filtering methods (regex, quantile, outlier detection)
- Data Export: Export filtered results to various formats
Configuration Files
The library generates several configuration files that are stored in the configs/ directory:
litellm_config.yaml: Full LiteLLM configuration with all modelslitellm_config_free.yaml: Configuration with free models onlyall_models.json: Complete model database in JSON formatfree_chat_models.json: Free chat models onlycustom_providers.json: Custom provider configurations
These files are automatically updated daily at 12:00 AM UTC via GitHub Actions to ensure you always have the latest model information and configurations.
Advanced Features
Custom Provider Configuration
from llm_fallbacks.config import CustomProviderConfig
custom_provider = CustomProviderConfig(
name="my-custom-provider",
base_url="https://api.myprovider.com",
api_key="your-api-key",
models=["custom-model-1", "custom-model-2"]
)
Fallback Strategy Customization
from llm_fallbacks.config import RouterSettings
router_settings = RouterSettings(
allowed_fails=3,
cooldown_time=30,
fallbacks=[{"gpt-4": ["gpt-3.5-turbo", "claude-3-sonnet"]}]
)
CLI Usage
Interactive GUI
python -m llm_fallbacks
Generate Configurations
python -m llm_fallbacks.generate_configs
System Testing
python test_system.py
Development
Prerequisites
- Python 3.12+
- Poetry or pip for dependency management
Setup Development Environment
# Install development dependencies
pip install -r requirements-dev.txt
# Install pre-commit hooks
pre-commit install
# Run tests
pytest tests/
CI/CD
The project includes automated workflows:
- Python Package: Runs on every push/PR with linting, testing, and building
- Python Publish: Automatically publishes to PyPI on releases
- Daily Config Update: Updates model configurations daily at 12:00 AM UTC
Automated Configuration Updates
The library automatically maintains up-to-date model configurations through:
- Daily Updates: GitHub Actions workflow runs every day at 12:00 AM UTC
- Model Database: Fetches latest model information from LiteLLM
- Fallback Strategies: Generates intelligent fallback configurations
- Version Control: All changes are automatically committed and tracked
This ensures your applications always have access to the latest models, pricing, and capabilities without manual intervention.
Contributing
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
License
This project is licensed under the Business Source License 1.1 (BSL 1.1) - see the LICENSE file for details.
Support
For support and questions:
- Open an issue on GitHub
- Check the documentation
- Contact: boden.crouch@gmail.com
Acknowledgments
- Built on top of LiteLLM
- Inspired by the need for robust LLM fallback strategies
- Community contributions and feedback
Note: This library is designed to work with the LiteLLM ecosystem and provides fallback mechanisms for production LLM applications. Always test fallback configurations in your specific environment before deploying to production.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llm_fallbacks-0.1.0.tar.gz.
File metadata
- Download URL: llm_fallbacks-0.1.0.tar.gz
- Upload date:
- Size: 28.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
30f161598a0fe4627dbeab5d7ebfe55c70676aef2dcc41f775997697c3a39997
|
|
| MD5 |
24bf60bb53fff969bebb53fdb35f91c4
|
|
| BLAKE2b-256 |
87c9fba9f3dde01190dc1da99183c0e6c69bdeade96e8bf5b97655efcb125352
|
File details
Details for the file llm_fallbacks-0.1.0-py3-none-any.whl.
File metadata
- Download URL: llm_fallbacks-0.1.0-py3-none-any.whl
- Upload date:
- Size: 26.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
922eb74c73f35dedd71e925da1da8a247998988d8cf25a2ec51eef161a0f68e6
|
|
| MD5 |
596541f0f7831763149efc3cfcff8970
|
|
| BLAKE2b-256 |
0c5984a1ca12d158b66bd395027ec4004d54272e34fdf86b4b482e176aa6794c
|