OpenDistillery

Advanced Compound AI Systems for Enterprise Workflow Transformation

Repository: llamasearchai/OpenDistillery
Documentation: OpenDistillery Docs
PyPI Package: opendistillery
Author: Nik Jois (nikjois@llamasearch.ai)
License: MIT

A comprehensive enterprise-grade platform for advanced AI agent orchestration, multi-provider model integration, and intelligent workflow automation.

Repository Overview

OpenDistillery represents the next evolution in enterprise AI systems, providing a unified platform for advanced compound AI workflows. Built with enterprise-grade security, scalability, and reliability in mind, it serves as the foundation for intelligent automation across diverse business domains.

Key Repository Features

Complete Source Code: Full access to all components including agents, APIs, and infrastructure
Production Ready: Battle-tested codebase with comprehensive test coverage and CI/CD
Enterprise Security: Advanced authentication, authorization, and audit capabilities
Extensible Architecture: Plugin-based system for custom integrations and workflows
Professional Documentation: Complete API references, deployment guides, and examples

OpenDistillery is built with Python, FastAPI, and Docker, and is organized as a set of containerized services that follow distributed-systems and microservices patterns for integrating AI/ML models at scale.

Key Highlights

42+ Latest AI Models including OpenAI GPT-4.1, o3, Claude-3.5 Sonnet, Grok-2
Multi-Agent Orchestration with intelligent task routing and coordination
Enterprise Security with JWT, MFA, RBAC, and audit logging
Production-Ready with Docker, Kubernetes, and auto-scaling
Comprehensive Testing with 95%+ code coverage and CI/CD pipelines
Professional Documentation with complete API reference and examples

Quick Start

Installation

pip install opendistillery

Basic Usage

import asyncio
from opendistillery import get_completion, get_reasoning_completion

async def main():
    # Simple completion with latest models
    response = await get_completion(
        "Analyze quarterly financial performance trends",
        model="gpt-4-turbo",
        temperature=0.1
    )
    print(response)

    # Advanced reasoning with o1-preview
    reasoning_response = await get_reasoning_completion(
        "Solve this complex mathematical proof step by step",
        model="o1-preview"
    )
    print(reasoning_response)

asyncio.run(main())
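When several prompts are independent, they can be issued concurrently instead of being awaited one at a time. The following is a minimal sketch using only asyncio; `fake_completion` is a hypothetical stand-in for `get_completion` (it performs no network I/O), and the concurrency cap is an assumption, since provider rate limits are not described here.

```python
import asyncio


async def fake_completion(prompt: str, model: str = "gpt-4-turbo") -> str:
    """Hypothetical stand-in for an async completion call (no network I/O)."""
    await asyncio.sleep(0.01)  # simulate request latency
    return f"[{model}] {prompt}"


async def complete_many(prompts, max_concurrency: int = 4):
    """Run completions concurrently, capped by a semaphore; results keep input order."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def bounded(prompt):
        async with semaphore:
            return await fake_completion(prompt)

    return await asyncio.gather(*(bounded(p) for p in prompts))


results = asyncio.run(complete_many(["Summarize Q1", "Summarize Q2", "Summarize Q3"]))
for line in results:
    print(line)
```

`asyncio.gather` returns results in the order the coroutines were passed, so the output lines up with the input prompts even when individual calls finish out of order.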

Multi-Provider Integration

import asyncio

from opendistillery import MultiProviderAPI, OpenAIModel, AnthropicModel, XAIModel

async def multi_provider_example():
    async with MultiProviderAPI(
        openai_api_key="your-openai-key",
        anthropic_api_key="your-anthropic-key",
        xai_api_key="your-xai-key"
    ) as api:
        # Claude-3.5 Sonnet for complex reasoning
        claude_response = await api.chat_completion(
            messages=[{"role": "user", "content": "Analyze this business strategy"}],
            model=AnthropicModel.CLAUDE_35_SONNET.value,
            extended_thinking=True
        )

        # Grok-2 for real-time information
        grok_response = await api.chat_completion(
            messages=[{"role": "user", "content": "What's trending on X today?"}],
            model=XAIModel.GROK_2.value,
            mode="think",
            real_time_info=True
        )

        # GPT-4 Turbo for large document analysis (128K-token context window)
        gpt_response = await api.chat_completion(
            messages=[{"role": "user", "content": "Summarize this 1000-page report"}],
            model=OpenAIModel.GPT_4_TURBO.value,
            max_tokens=4096  # GPT-4 Turbo caps completion output at 4,096 tokens
        )

asyncio.run(multi_provider_example())

Architecture Overview

OpenDistillery implements a sophisticated compound AI architecture designed for enterprise scale:

┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
│ Load Balancer   │ │ API Gateway     │ │ Authentication  │
│ (NGINX)         │─│ (FastAPI)       │─│ (JWT/MFA)       │
└─────────────────┘ └─────────────────┘ └─────────────────┘
        │                   │                   │
        ▼                   ▼                   ▼
┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
│ Agent Orchestra │ │ Compound AI     │ │ Monitoring      │
│ (Multi-Agent)   │─│ Systems         │─│ (Prometheus)    │
└─────────────────┘ └─────────────────┘ └─────────────────┘
        │                   │                   │
        ▼                   ▼                   ▼
┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
│ PostgreSQL      │ │ Redis           │ │ Elasticsearch   │
│ (Database)      │ │ (Cache)         │ │ (Logging)       │
└─────────────────┘ └─────────────────┘ └─────────────────┘

Core Components

Multi-Provider API Engine: Unified interface for OpenAI, Anthropic, xAI, and Google models
Compound AI System: Multi-agent coordination with advanced reasoning chains
Enterprise Features: Production-ready FastAPI server with PostgreSQL and Redis
Monitoring & Security: Comprehensive observability and enterprise-grade security
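One way to picture a "unified interface" over several providers is a small dispatcher that picks a backend from the model name. The sketch below is an illustration only, not OpenDistillery's implementation: `Provider`, `UnifiedClient`, `make_stub_provider`, and the prefix table are all hypothetical, and the stub providers echo their input rather than calling any API.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Message = Dict[str, str]


@dataclass(frozen=True)
class Provider:
    """One backend plus the model-name prefixes it serves."""
    name: str
    prefixes: Tuple[str, ...]
    send: Callable[[str, List[Message]], str]


def make_stub_provider(name: str, prefixes: Tuple[str, ...]) -> Provider:
    """Build a provider whose send() echoes instead of calling a real API."""
    def send(model: str, messages: List[Message]) -> str:
        return f"{name}:{model}:{messages[-1]['content']}"
    return Provider(name=name, prefixes=prefixes, send=send)


class UnifiedClient:
    """Route chat_completion calls to the first provider whose prefix matches."""

    def __init__(self, providers):
        self._providers = list(providers)

    def resolve(self, model: str) -> Provider:
        for provider in self._providers:
            if model.startswith(provider.prefixes):  # startswith accepts a tuple
                return provider
        raise ValueError(f"no provider registered for model {model!r}")

    def chat_completion(self, messages: List[Message], model: str) -> str:
        return self.resolve(model).send(model, messages)


client = UnifiedClient([
    make_stub_provider("openai", ("gpt-", "o1", "o3")),
    make_stub_provider("anthropic", ("claude-",)),
    make_stub_provider("xai", ("grok-",)),
])
reply = client.chat_completion(
    messages=[{"role": "user", "content": "ping"}],
    model="claude-3-5-sonnet",
)
print(reply)  # anthropic:claude-3-5-sonnet:ping
```

Keeping the routing table separate from the call sites means a new provider can be added by registering one more entry, without touching the code that submits messages.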

Latest AI Model Support (2025)

OpenAI Models

GPT-4.1 - Latest flagship model with enhanced capabilities
o3 & o3-pro - Advanced reasoning models with chain-of-thought processing
o4-mini - Cost-effective model for high-volume tasks
GPT-4 Turbo - 128K context window with advanced reasoning
GPT-4o - Multimodal omni model with real-time processing

Anthropic Claude Models

Claude-3.5 Sonnet - Superior reasoning and analysis capabilities
Claude-3 Opus - Most capable model for complex tasks
Claude-3 Haiku - Fast and efficient for structured tasks

xAI Grok Models

Grok-2 - Real-time information with advanced reasoning
Grok-2 Beta - Enhanced responses with X platform integration
Grok-1.5V - Vision-enabled multimodal processing

Professional Skills Demonstrated

Software Engineering Excellence

Clean Architecture: SOLID principles, dependency injection, modular design
API Development: RESTful APIs with FastAPI and comprehensive OpenAPI docs
Database Design: PostgreSQL with advanced schema and optimization
Testing Strategy: Unit, integration, and performance testing with 95%+ coverage
DevOps Practices: CI/CD pipelines, containerization, infrastructure as code

Advanced Technical Implementation

Microservices Architecture: Scalable, fault-tolerant distributed systems
Security Engineering: JWT authentication, RBAC, encryption, audit logging
Performance Optimization: Caching, load balancing, auto-scaling
Monitoring & Observability: Prometheus metrics, structured logging, alerting
Cloud-Native Development: Kubernetes orchestration and multi-cloud deployment

Production Deployment

Docker Deployment

# Quick start with Docker Compose
git clone https://github.com/llamasearchai/OpenDistillery.git
cd OpenDistillery

# Configure environment
cp config/production.env.example .env
# Edit .env with your API keys and configuration

# Deploy full stack
docker-compose -f docker-compose.production.yml up -d

# Verify deployment
curl http://localhost:8000/health

Kubernetes Deployment

# Deploy to Kubernetes cluster
kubectl apply -f deployment/k8s/namespace.yml
kubectl apply -f deployment/k8s/configmap.yml
kubectl apply -f deployment/k8s/secrets.yml
kubectl apply -f deployment/k8s/deployment.yml
kubectl apply -f deployment/k8s/service.yml
kubectl apply -f deployment/k8s/ingress.yml

# Scale deployment
kubectl scale deployment opendistillery-api --replicas=5

Cloud Deployments

AWS ECS/Fargate:

aws ecs create-cluster --cluster-name opendistillery-prod
aws ecs register-task-definition --cli-input-json file://aws/task-definition.json
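# <task-family> is a placeholder for the "family" value in aws/task-definition.json;
# Fargate services also need --launch-type FARGATE and --network-configuration.
aws ecs create-service --cluster opendistillery-prod --service-name opendistillery-api \
  --task-definition <task-family> --desired-count 1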

Google Cloud Run:

gcloud run deploy opendistillery \
  --image gcr.io/your-project/opendistillery:latest \
  --platform managed \
  --region us-central1 \
  --cpu 4 --memory 8Gi

Azure Container Instances:

az container create \
  --resource-group opendistillery-rg \
  --name opendistillery-prod \
  --image opendistillery:latest \
  --cpu 4 --memory 8

Configuration

Environment Variables

# Core Configuration
SECRET_KEY=your-256-bit-secret-key
DATABASE_URL=postgresql://user:pass@localhost/opendistillery
REDIS_URL=redis://localhost:6379

# AI Model API Keys
OPENAI_API_KEY=your-openai-api-key
ANTHROPIC_API_KEY=your-anthropic-api-key
XAI_API_KEY=your-xai-api-key
GOOGLE_API_KEY=your-google-api-key

# Security Configuration
REQUIRE_MFA=true
JWT_EXPIRY_HOURS=24
API_RATE_LIMIT=100
ALLOWED_ORIGINS=https://yourdomain.com

# Monitoring
PROMETHEUS_ENABLED=true
GRAFANA_ENABLED=true
LOG_LEVEL=INFO
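Variables like these arrive as strings, so an application typically parses and validates them once at startup. The following is a minimal sketch of that step using only the standard library. The `Settings` class and `load_settings` function are hypothetical, and the choice of which variables are required, the default values, and the comma-separated format for ALLOWED_ORIGINS are assumptions rather than documented OpenDistillery behaviour.

```python
import os
from dataclasses import dataclass
from typing import Tuple


def _as_bool(value: str) -> bool:
    """Interpret common truthy spellings; anything else is False."""
    return value.strip().lower() in {"1", "true", "yes", "on"}


@dataclass(frozen=True)
class Settings:
    secret_key: str
    database_url: str
    redis_url: str
    require_mfa: bool
    jwt_expiry_hours: int
    api_rate_limit: int
    allowed_origins: Tuple[str, ...]
    log_level: str


def load_settings(env=None) -> Settings:
    """Parse settings from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env
    # Assumption: only these two variables are treated as mandatory.
    missing = [name for name in ("SECRET_KEY", "DATABASE_URL") if not env.get(name)]
    if missing:
        raise RuntimeError("missing required settings: " + ", ".join(missing))
    origins = tuple(
        origin.strip()
        for origin in env.get("ALLOWED_ORIGINS", "").split(",")
        if origin.strip()
    )
    return Settings(
        secret_key=env["SECRET_KEY"],
        database_url=env["DATABASE_URL"],
        redis_url=env.get("REDIS_URL", "redis://localhost:6379"),
        require_mfa=_as_bool(env.get("REQUIRE_MFA", "false")),
        jwt_expiry_hours=int(env.get("JWT_EXPIRY_HOURS", "24")),
        api_rate_limit=int(env.get("API_RATE_LIMIT", "100")),
        allowed_origins=origins,
        log_level=env.get("LOG_LEVEL", "INFO").upper(),
    )


example_env = {
    "SECRET_KEY": "your-256-bit-secret-key",
    "DATABASE_URL": "postgresql://user:pass@localhost/opendistillery",
    "REQUIRE_MFA": "true",
    "ALLOWED_ORIGINS": "https://yourdomain.com",
}
settings = load_settings(example_env)
print(settings.require_mfa, settings.jwt_expiry_hours, settings.allowed_origins)
```

Passing the mapping in as a parameter keeps the parser testable without modifying the real process environment.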

Model Selection Strategies

import asyncio

from opendistillery import ModelHub

# Configure intelligent model selection
hub = ModelHub()
hub.set_preference_strategy({
    "reasoning_tasks": ["o1-preview", "claude-3-5-sonnet", "claude-3-opus"],
    "creative_tasks": ["gpt-4-turbo", "claude-3-5-sonnet"],
    "real_time_tasks": ["grok-2", "grok-2-beta"],
    "multimodal_tasks": ["gpt-4o", "claude-3-5-sonnet", "grok-1.5v"],
    "code_tasks": ["gpt-4-turbo", "claude-3-5-sonnet", "o1-preview"]
})

async def main():
    # Automatic model selection (await is only valid inside a coroutine)
    result = await hub.complete_task(
        task="Analyze financial data and create visualizations",
        task_type="multimodal_analysis",
        fallback_models=True
    )
    print(result)

asyncio.run(main())
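The idea behind an ordered preference list with fallback can be sketched without the library: try each preferred model in turn and return the first successful result. This is an illustration of the general technique, not ModelHub's actual logic; `complete_with_fallback`, `candidates`, and `flaky_call` are hypothetical names, and the simulated outage exists only to exercise the fallback path.

```python
PREFERENCES = {
    "reasoning_tasks": ["o1-preview", "claude-3-5-sonnet", "claude-3-opus"],
    "real_time_tasks": ["grok-2", "grok-2-beta"],
}


def candidates(task_type, preferences=PREFERENCES):
    """Return the ordered model list for a task type (empty if unknown)."""
    return list(preferences.get(task_type, []))


def complete_with_fallback(task, task_type, call_model, preferences=PREFERENCES):
    """Try each preferred model in order; return (model, result) from the first success."""
    errors = {}
    for model in candidates(task_type, preferences):
        try:
            return model, call_model(model, task)
        except Exception as exc:  # a real client would catch narrower error types
            errors[model] = exc
    raise RuntimeError(f"no model succeeded for {task_type!r}: {errors}")


def flaky_call(model, task):
    """Hypothetical model call: the first choice is 'down', the others answer."""
    if model == "o1-preview":
        raise TimeoutError("simulated outage")
    return f"{model} handled: {task}"


model, answer = complete_with_fallback("Prove the lemma", "reasoning_tasks", flaky_call)
print(model)   # claude-3-5-sonnet
print(answer)  # claude-3-5-sonnet handled: Prove the lemma
```

Collecting the per-model errors before raising makes it easier to see why every candidate failed, rather than surfacing only the last exception.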

API Reference

Authentication

# Login and get JWT token
curl -X POST http://localhost:8000/auth/login \
  -H "Content-Type: application/json" \
  -d '{"username": "admin", "password": "password"}'

# Use token in requests
curl -H "Authorization: Bearer YOUR_TOKEN" \
  http://localhost:8000/systems
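The same two calls can be prepared from Python with the standard library. The sketch below only builds the request objects and does not send them, so it runs without a server. The helper names are hypothetical, and the shape of the login response (for example, which field carries the token) is not specified above, so extracting the token is left out.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"


def build_login_request(username: str, password: str, base_url: str = BASE_URL):
    """Prepare the POST /auth/login request with a JSON body."""
    body = json.dumps({"username": username, "password": password}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/auth/login",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_authed_request(path: str, token: str, base_url: str = BASE_URL):
    """Prepare a GET request carrying the JWT as a Bearer token."""
    return urllib.request.Request(
        f"{base_url}{path}",
        headers={"Authorization": f"Bearer {token}"},
    )


login = build_login_request("admin", "password")
systems = build_authed_request("/systems", "YOUR_TOKEN")
print(login.get_method(), login.full_url)
print(systems.get_method(), systems.full_url)
# To actually send a request: urllib.request.urlopen(login)
```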

Task Submission

# Submit a compound AI task
curl -X POST http://localhost:8000/tasks \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "task_type": "financial_analysis",
    "description": "Analyze Q4 financial performance with forecasting",
    "input_data": {
      "revenue": 1000000,
      "expenses": 800000,
      "historical_data": "..."
    },
    "priority": "high",
    "models": ["claude-3-5-sonnet", "gpt-4-turbo"],
    "reasoning_required": true
  }'
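When tasks are submitted from code rather than curl, it can help to build the body from a typed object so that missing fields are caught before the request is sent. The sketch below mirrors the field names in the JSON above; the `TaskRequest` class, its defaults, and the rule that `task_type` and `description` must be non-empty are assumptions for illustration, not the server's documented schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, List


@dataclass
class TaskRequest:
    """Client-side mirror of the task submission body (hypothetical helper)."""
    task_type: str
    description: str
    input_data: Dict[str, Any] = field(default_factory=dict)
    priority: str = "normal"  # assumed default; the example above uses "high"
    models: List[str] = field(default_factory=list)
    reasoning_required: bool = False

    def to_json(self) -> str:
        """Serialize to JSON, rejecting obviously incomplete requests."""
        if not self.task_type or not self.description:
            raise ValueError("task_type and description are required")
        return json.dumps(asdict(self))


request = TaskRequest(
    task_type="financial_analysis",
    description="Analyze Q4 financial performance with forecasting",
    input_data={"revenue": 1000000, "expenses": 800000},
    priority="high",
    models=["claude-3-5-sonnet", "gpt-4-turbo"],
    reasoning_required=True,
)
payload = request.to_json()
print(payload)
```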

System Management

# Create a new compound AI system
curl -X POST http://localhost:8000/systems \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "system_id": "financial_system",
    "domain": "finance",
    "use_case": "risk_analysis",
    "architecture": "multi_agent",
    "models": [
      {"name": "claude-3-5-sonnet", "role": "primary_analyst"},
      {"name": "gpt-4-turbo", "role": "data_processor"},
      {"name": "o1-preview", "role": "risk_evaluator"}
    ]
  }'

Monitoring and Observability

Health Monitoring

# Health check endpoint
curl http://localhost:8000/health

# Detailed system status
curl -H "Authorization: Bearer YOUR_TOKEN" \
  http://localhost:8000/metrics

# Model-specific metrics
curl http://localhost:8000/models/gpt-4-turbo/metrics

Prometheus Metrics

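# Custom metrics in your application.
# This fragment assumes it runs inside an async function, with `api` an open
# MultiProviderAPI session and prompt_tokens / completion_tokens taken from the response.
from opendistillery.monitoring import metrics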

# Track model usage
metrics.model_requests.labels(model="claude-3-5-sonnet", task_type="analysis").inc()

# Track response times
with metrics.request_duration.labels(model="gpt-4-turbo").time():
    result = await api.chat_completion(...)

# Track token usage
metrics.tokens_used.labels(model="o1-preview", type="input").inc(prompt_tokens)
metrics.tokens_used.labels(model="o1-preview", type="output").inc(completion_tokens)
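The pattern above (labelled counters plus a timing context manager) can be illustrated with a small in-memory stand-in. This sketch deliberately does not use the Prometheus client or `opendistillery.monitoring`; `LabeledCounter` and `timed` are hypothetical helpers that only show how label sets partition a metric and how a `with` block records a duration.

```python
import time
from contextlib import contextmanager


class LabeledCounter:
    """In-memory counter keyed by a set of label values (stand-in, not Prometheus)."""

    def __init__(self):
        self._values = {}

    @staticmethod
    def _key(labels):
        return tuple(sorted(labels.items()))

    def inc(self, amount=1, **labels):
        key = self._key(labels)
        self._values[key] = self._values.get(key, 0) + amount

    def value(self, **labels):
        return self._values.get(self._key(labels), 0)


@contextmanager
def timed(samples, **labels):
    """Append the elapsed seconds of the with-block to samples[label key]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        key = tuple(sorted(labels.items()))
        samples.setdefault(key, []).append(time.perf_counter() - start)


model_requests = LabeledCounter()
tokens_used = LabeledCounter()
durations = {}

with timed(durations, model="gpt-4-turbo"):
    model_requests.inc(model="gpt-4-turbo", task_type="analysis")
    tokens_used.inc(120, model="gpt-4-turbo", type="input")
    tokens_used.inc(45, model="gpt-4-turbo", type="output")

print(tokens_used.value(model="gpt-4-turbo", type="input"))  # 120
```

Recording the duration in a `finally` clause means failed requests are timed as well, which keeps latency figures from looking better than they are.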

Pre-built Grafana Dashboards

Model performance and usage statistics
Token consumption and cost tracking
Response time percentiles and error rates
System resource utilization
Real-time request monitoring

Testing

Running Tests

# Install development dependencies
pip install -e ".[dev]"

# Run all tests
pytest tests/ -v

# Run with coverage
pytest --cov=src/opendistillery tests/ --cov-report=html

# Run specific test categories
pytest tests/test_multi_provider_api.py -v
pytest tests/test_compound_system.py -v
pytest tests/test_integrations.py -v

# Run integration tests
python test_integration.py

Performance Testing

# Load testing with realistic scenarios
pytest tests/performance/ -v --benchmark-only

# Concurrent request testing
python tests/load_test.py --concurrent-users=100 --requests-per-user=50
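For readers who want to see what a concurrent-user load test measures, here is a minimal sketch of the idea using asyncio and the statistics module. It is not the repository's `tests/load_test.py`; `fake_request` is a hypothetical stand-in that sleeps instead of calling the API, and the user and request counts are scaled down so the example finishes quickly.

```python
import asyncio
import statistics
import time


async def fake_request(delay: float = 0.001) -> None:
    """Hypothetical stand-in for one API call (sleeps instead of doing I/O)."""
    await asyncio.sleep(delay)


async def run_load(concurrent_users: int, requests_per_user: int):
    """Simulate users issuing requests sequentially, with all users running concurrently."""
    latencies = []

    async def user():
        for _ in range(requests_per_user):
            start = time.perf_counter()
            await fake_request()
            latencies.append(time.perf_counter() - start)

    await asyncio.gather(*(user() for _ in range(concurrent_users)))
    return latencies


def summarize(latencies):
    """Report count plus median and 95th-percentile latency in seconds."""
    cuts = statistics.quantiles(latencies, n=100)  # 99 cut points
    return {"count": len(latencies), "p50": cuts[49], "p95": cuts[94]}


latencies = asyncio.run(run_load(concurrent_users=10, requests_per_user=5))
report = summarize(latencies)
print(report)
```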

Security

Authentication Methods

JWT Authentication:

from opendistillery.auth import JWTAuthenticator

auth = JWTAuthenticator(secret_key="your-secret")
token = auth.create_token(user_id="user123", permissions=["read", "write"])
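To make the mechanics of a signed token concrete, the following standard-library sketch creates and verifies an HS256 JWT. It is an illustration of the JWT format, not the internals of `JWTAuthenticator`; the function names and the `sub` and `permissions` claim names are assumptions, and production code would normally rely on a maintained JWT library.

```python
import base64
import hashlib
import hmac
import json
import time


def _b64url(data: bytes) -> str:
    """Base64url-encode without padding, as JWT requires."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def _sign(secret: str, signing_input: str) -> str:
    digest = hmac.new(secret.encode("utf-8"), signing_input.encode("utf-8"), hashlib.sha256).digest()
    return _b64url(digest)


def create_token(secret, user_id, permissions, expiry_hours=24, now=None):
    """Build header.payload.signature with an HMAC-SHA256 signature."""
    issued_at = int(time.time() if now is None else now)
    header = {"alg": "HS256", "typ": "JWT"}
    payload = {
        "sub": user_id,
        "permissions": list(permissions),
        "iat": issued_at,
        "exp": issued_at + expiry_hours * 3600,
    }
    parts = [
        _b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, payload)
    ]
    signing_input = ".".join(parts)
    return signing_input + "." + _sign(secret, signing_input)


def verify_token(secret, token, now=None):
    """Return the claims if the signature and expiry check out, else raise ValueError."""
    try:
        header_b64, payload_b64, signature = token.split(".")
    except ValueError:
        raise ValueError("malformed token") from None
    expected = _sign(secret, header_b64 + "." + payload_b64)
    # Constant-time comparison avoids leaking how many characters matched.
    if not hmac.compare_digest(expected.encode("ascii"), signature.encode("utf-8")):
        raise ValueError("signature mismatch")
    if json.loads(_b64url_decode(header_b64)).get("alg") != "HS256":
        raise ValueError("unexpected algorithm")
    payload = json.loads(_b64url_decode(payload_b64))
    if payload["exp"] <= (time.time() if now is None else now):
        raise ValueError("token expired")
    return payload


token = create_token("your-secret", "user123", ["read", "write"])
claims = verify_token("your-secret", token)
print(claims["sub"], claims["permissions"])
```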

API Key Management:

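import asyncio

from opendistillery.auth import APIKeyManager

async def create_production_key():
    # create_key is awaited, so it has to run inside a coroutine
    key_manager = APIKeyManager()
    return await key_manager.create_key(
        name="Production API Key",
        permissions=["model_access", "task_submission"],
        expires_in_days=90
    )

api_key = asyncio.run(create_production_key())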

Multi-Factor Authentication:

from opendistillery.auth import MFAManager

mfa = MFAManager()
secret = mfa.generate_secret(user_id="user123")
qr_code = mfa.generate_qr_code(secret, "user@company.com")
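Authenticator-app MFA of this kind is usually based on time-based one-time passwords (TOTP, RFC 6238). Whether `MFAManager` uses exactly this scheme is an assumption; the sketch below simply shows how a TOTP code is derived from a shared base32 secret using the standard library, with hypothetical function names.

```python
import base64
import hashlib
import hmac
import struct
import time


def hotp(key: bytes, counter: int, digits: int = 6) -> str:
    """HMAC-based one-time password (RFC 4226) using SHA-1."""
    digest = hmac.new(key, struct.pack(">Q", counter), hashlib.sha1).digest()
    offset = digest[-1] & 0x0F  # dynamic truncation offset
    code = struct.unpack(">I", digest[offset:offset + 4])[0] & 0x7FFFFFFF
    return str(code % 10 ** digits).zfill(digits)


def totp(secret_b32: str, at=None, step: int = 30, digits: int = 6) -> str:
    """Time-based OTP: HOTP over the number of elapsed time steps."""
    key = base64.b32decode(secret_b32, casefold=True)
    timestamp = time.time() if at is None else at
    return hotp(key, int(timestamp // step), digits)


def verify_totp(secret_b32: str, code: str, at=None, window: int = 1, step: int = 30) -> bool:
    """Accept codes from the current step and `window` steps either side (clock drift)."""
    timestamp = time.time() if at is None else at
    return any(
        hmac.compare_digest(totp(secret_b32, timestamp + drift * step, step), code)
        for drift in range(-window, window + 1)
    )


# Shared secret from the RFC test vectors, base32-encoded as authenticator apps expect.
secret = base64.b32encode(b"12345678901234567890").decode("ascii")
print(totp(secret, at=59))                 # 287082
print(verify_totp(secret, "287082", at=59))  # True
```

The value at `at=59` matches the published RFC test vector, which is a convenient sanity check when implementing or integrating TOTP.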

Data Protection

Encryption: All data encrypted at rest and in transit using AES-256-GCM
Key Rotation: Automatic key rotation every 90 days
Access Control: Granular RBAC with least privilege principle
Audit Logging: Complete audit trail with correlation IDs
Compliance: Designed to support SOC 2, ISO 27001, GDPR, and HIPAA requirements
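Two of the items above, role-based access control and audit logging with correlation IDs, can be sketched in a few lines. The role names, the permission table, and the record fields below are hypothetical and chosen only for illustration; they are not OpenDistillery's actual schema.

```python
import json
import uuid
from datetime import datetime, timezone

# Hypothetical role table: each role gets only the permissions it needs.
ROLE_PERMISSIONS = {
    "viewer": {"read"},
    "analyst": {"read", "task_submission"},
    "admin": {"read", "write", "task_submission", "model_access"},
}


def is_allowed(roles, permission: str) -> bool:
    """Least privilege: allow only if some assigned role grants the permission."""
    return any(permission in ROLE_PERMISSIONS.get(role, set()) for role in roles)


def audit_record(user_id: str, action: str, allowed: bool, correlation_id=None) -> str:
    """Return one JSON audit line; the correlation ID ties it to a single request."""
    return json.dumps(
        {
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "correlation_id": correlation_id or str(uuid.uuid4()),
            "user_id": user_id,
            "action": action,
            "allowed": allowed,
        },
        sort_keys=True,
    )


decision = is_allowed(["analyst"], "write")
print(decision)  # False
print(audit_record("user123", "write", decision, correlation_id="req-42"))
```

Logging denied attempts as well as granted ones is what makes the trail useful for later review.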

Enterprise Use Cases

Financial Services

Real-time fraud detection with multi-model analysis
Regulatory compliance document processing
Risk assessment and portfolio optimization
Automated financial report generation

Healthcare

Medical record analysis and summarization
Drug discovery research assistance
Clinical trial data processing
Diagnostic support systems

Legal & Compliance

Contract analysis and risk identification
Legal document drafting assistance
Compliance monitoring and reporting
Case law research and analysis

Technology

Code review and quality assessment
Architecture design and optimization
Technical documentation generation
Security vulnerability analysis

Contributing

We welcome contributions to OpenDistillery! Please read our Contributing Guidelines for details on how to submit pull requests, report issues, and contribute to the project.

Development Setup

# Clone the repository
git clone https://github.com/llamasearchai/OpenDistillery.git
cd OpenDistillery

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install in development mode
pip install -e ".[dev]"

# Install pre-commit hooks
pre-commit install

# Run tests
pytest tests/ -v

Support & Community

Documentation

Full Documentation: https://docs.opendistillery.ai
API Reference: https://api-docs.opendistillery.ai
Examples Repository: https://github.com/nikjois/opendistillery-examples

Community

Discord: Join our community
GitHub Discussions: Community discussions
Stack Overflow: Tag questions with opendistillery

Enterprise Support

Email: support@opendistillery.ai
Enterprise Licensing: enterprise@opendistillery.ai
Professional Services: consulting@opendistillery.ai

Technical Achievements

Core Capabilities Implemented

Multi-Modal AI Processing: Vision, text, and audio analysis with advanced reasoning
Enterprise-Grade Security: Complete authentication, authorization, and audit systems
Distributed Architecture: Microservices with Docker and Kubernetes orchestration
Advanced Monitoring: Real-time metrics, alerting, and performance analytics
Production-Ready APIs: RESTful endpoints with comprehensive error handling

Advanced Features

Multi-Agent Orchestration: Coordinated AI agent workflows with task decomposition
Compound AI Systems: Integration of multiple models for enhanced reasoning
Performance Optimization: Intelligent caching, load balancing, and auto-scaling
Enterprise Integrations: Extensible plugin architecture for third-party systems
Comprehensive Testing: Unit, integration, and performance test suites

License

This project is licensed under the MIT License - see the LICENSE file for details.

Changelog

See CHANGELOG.md for a complete list of changes and version history.

OpenDistillery - Advancing Enterprise AI with Cutting-Edge Technology

Author: Nik Jois (nikjois@llamasearch.ai)
GitHub: https://github.com/llamasearchai/OpenDistillery

Release History

2.1.1 - Jul 21, 2025 (this version)
2.1.0 - Jul 21, 2025
2.0.3 - Jun 3, 2025
2.0.2 - Jun 3, 2025
2.0.1 - Jun 2, 2025

Download Files

Source Distribution: opendistillery-2.1.1.tar.gz (2.9 MB), uploaded Jul 21, 2025
Built Distribution: opendistillery-2.1.1-py3-none-any.whl (110.4 kB), uploaded Jul 21, 2025, Python 3

File details

Details for the file opendistillery-2.1.1.tar.gz.

File metadata

Download URL: opendistillery-2.1.1.tar.gz
Upload date: Jul 21, 2025
Size: 2.9 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for opendistillery-2.1.1.tar.gz
Algorithm	Hash digest
SHA256	`3264960351eb7afb9c84de694e1acbc057573bac2b47ee95c2fe17c2b4ae0e93`
MD5	`197d64b7aa613fd999b015ccfb5f3362`
BLAKE2b-256	`f662b11de4e4e849b10e8210abab94e000994f050a8bc6aa8d8b3045ea528d5f`

File details

Details for the file opendistillery-2.1.1-py3-none-any.whl.

File metadata

Download URL: opendistillery-2.1.1-py3-none-any.whl
Upload date: Jul 21, 2025
Size: 110.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for opendistillery-2.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b39081398e993d092118a9674456c2dae4ce962dc01b6a96cb2f036d72b2dc98`
MD5	`9e737d8f53cd326d9f8b0334291dbda7`
BLAKE2b-256	`ebdf2d90a66589ee3847d589b050f3e521e3f7bd459bc1ff57ba45de2edc3b46`
