Python framework for multiple GGUF language models to collaborate on tasks

These details have not been verified by PyPI

Project links

Project description

AI Ensemble Suite

Alpha Release (v0.5): This library is in active development. While already functional and useful for real-world applications, some APIs may change, and additional cleanup and feature development are in progress. Feedback and contributions are welcome!

Python framework for multiple GGUF language models to collaborate on tasks using structured communication patterns, aggregating their outputs into coherent responses. While I chose to support GGUF to begin with, I plan to add OpenAI compatible server support allowing local LLM server and Internet APIs to be called.

Installation
Quick Start
Key Features
Ensembles Explained
Why Ensembles Work
Why This Library?
Examples
Decision Guide: Choosing Collaboration & Aggregation Patterns
Collaboration Phases
Aggregation Strategies
Architecture Diagram
Class Hierarchy and Interaction Diagram
Project File Structure
API Reference
Requirements
Contributing
Running Tests
Roadmap
License
Special Thanks

Installation

Prerequisites

Python >= 3.10
Sufficient RAM for running multiple language models
Preferably a GPU with sufficient VRAM to load multiple models, otherwise your llama-cpp will store models in RAM and use your CPU for inference

Install via pip

For GPU acceleration note the export/set See optional dependencies below before installing

export CMAKE_ARGS="-DGGML_CUDA=on"
export FORCE_CMAKE=1
pip install ai-ensemble-suite

SET CMAKE_ARGS="-DGGML_CUDA=on"
SET FORCE_CMAKE=1
pip install ai-ensemble-suite

Optional Dependencies

The library has optional dependencies that can be installed with:

pip install "ai-ensemble-suite[rich]"  # For enhanced console output
pip install "ai-ensemble-suite[dev]"   # For development tools

Installation with GPU acceleration

For NVIDIA GPU acceleration:

CMAKE_ARGS="-DGGML_CUDA=on" FORCE_CMAKE=1 pip install llama-cpp-python
pip install ai-ensemble-suite

For Apple Metal (macOS):

CMAKE_ARGS="-DLLAMA_METAL=on" FORCE_CMAKE=1 pip install llama-cpp-python
pip install ai-ensemble-suite

Development Installation

To install the package with development dependencies:

git clone https://github.com/StephenGenusa/ai-ensemble-suite.git
cd ai-ensemble-suite
pip install -e ".[dev]"

Quick Start

You will want to determine which GGUF models you wish to work with. Download them and place in the root folder of the project under a /models directory... or change the YAML to point to the location of your models.

Here's a minimal example to get started with AI Ensemble Suite:

from ai_ensemble_suite import Ensemble

# Initialize the ensemble with a configuration
ensemble = Ensemble(config_path="path/to/config.yaml")

# Initialize models (loads them into memory)
await ensemble.initialize()

# Ask a question
try:
    response = await ensemble.ask("What are the key considerations for sustainable urban planning?")
    print(response)
finally:
    # Release resources when done
    await ensemble.shutdown()

# Alternatively, use with async context manager
async with Ensemble(config_path="path/to/config.yaml") as ensemble:
    response = await ensemble.ask("What are the key considerations for sustainable urban planning?")
    print(response)

Key Features

✅ Multiple Collaboration Phases: Choose from over a dozen different collaboration methods to design your ensemble approach
✅ Specialized Model Roles: Assign different roles (critic, synthesizer, researcher, etc.) to models, allowing them to specialize in different aspects of task completion
✅ Flexible Aggregation Strategies: Use different methods for combining model outputs into coherent, high-quality responses
✅ Advanced Tracing: Get detailed traces of model interactions and decision-making processes
✅ Extensible Design: Easily add custom collaboration phases or aggregation strategies
✅ Built for GGUF Models: Optimized for running multiple smaller GGUF language models locally
✅ Confidence Estimation: Token probability analysis and self-evaluation capabilities
✅ Concurrent Model Management: Efficient loading and execution of multiple models
✅ Async-First Design: Native async/await support with context manager
✅ YAML Configuration for Models: YAML configuration files to hold model parameters and queries
✅ Jinja2 Templating: The (query) "templates" portion of the the YAML configuration file support Jinja2

AI Ensembles Explained

Think of AI ensembles like getting advice from a group of experts instead of just one person. Just as you might ask several friends for opinions before making an important decision, AI ensembles combine the strengths of multiple AI systems or approaches to produce better results than any single AI could achieve alone. Instead of relying on one AI that might have blind spots or make certain types of mistakes, ensembles bring together diverse AI perspectives that can check each other's work, complement each other's strengths, and collectively arrive at more reliable answers. It's similar to how a team of doctors with different specialties might collaborate on a difficult medical case—the combined expertise leads to better outcomes than what any individual doctor could provide on their own.

Brief Explanations of AI Ensemble Techniques

AsyncThinking: Like a rapid brainstorming session where multiple people jot down ideas independently before sharing, this technique generates diverse initial thoughts quickly without influence from other perspectives.
StructuredCritique: Similar to having your work reviewed by a tough but fair editor, this approach systematically evaluates ideas, identifies logical flaws, and improves the rigor of thinking.
SynthesisOriented: Acts like a skilled mediator who finds common ground between opposing viewpoints, integrating different perspectives into a balanced, comprehensive analysis.
RoleBasedDebate: Resembles a panel discussion with experts from different fields, each contributing specialized knowledge to address complex topics from multiple angles.
HierarchicalReview: Works like a multi-stage editing process, where content is refined layer by layer, with each review focusing on different aspects for progressive improvement.
CompetitiveEvaluation: Functions like a contest where multiple solutions compete, and the strongest approach wins based on objective criteria.
PerspectiveRotation: Similar to walking around a sculpture to view it from all sides, this technique examines issues from different stakeholder perspectives, ethical frameworks, or creative angles.
ChainOfThoughtBranching: Like mapping out a complex maze with multiple possible paths, this method explores different reasoning routes for problems with multiple decision points.
AdversarialImprovement: Acts as a stress-test or devil's advocate, actively looking for weaknesses in a solution to strengthen it against potential problems.
RoleBasedWorkflow: Operates like a production line with specialized stations, creating a structured process where different roles handle specific aspects of a multi-stage analysis.
Bagging: Works like taking the average of multiple poll results to get a more stable prediction, reducing the impact of outliers or unusual patterns.
UncertaintyBasedCollaboration: Similar to how a group might work together on a puzzle with missing pieces, this approach handles ambiguous questions by combining different levels of confidence.
StackedGeneralization: Functions like a team of specialists with a coordinator, where outputs from different AI models are combined to leverage their unique strengths and minimize weaknesses.

Why Ensembles Work

Ensemble methods have a strong theoretical and empirical foundation in machine learning, and they're particularly effective with language models for several reasons:

Diverse Knowledge and Perspectives: Different language models, even when trained on similar data, develop slightly different internal representations and "expertise areas." By combining multiple models, you access a broader knowledge base than any single model contains.
Error Reduction Through Aggregation: Models tend to make different errors. When their outputs are combined intelligently, errors from one model can be corrected by others, leading to more accurate results.
Specialization Through Roles: When models adopt specialized roles (like critic, researcher, or synthesizer), they can focus on specific aspects of a task. This division of cognitive labor mirrors effective human teams and leads to more thorough analysis.
Iterative Refinement: Multi-step collaboration allows initial ideas to be critiqued, refined, and expanded. This resembles human drafting and editing processes, typically producing higher quality results than single-pass generation.
Confidence Calibration: Ensemble techniques help identify areas of uncertainty or disagreement between models, leading to better-calibrated confidence in the final output.

Research consistently shows that properly designed ensembles outperform even the strongest individual models, often by significant margins. AI Ensemble Suite provides the infrastructure to easily tap into these powerful techniques.

Why This Library?

AI Ensemble Suite was created to address two key challenges:

The Need for Human-Friendly Ensemble AI: After extensive searching for a comprehensive yet easy-to-use library for ensemble AI work that followed a "for humans" philosophy, nothing quite fit the bill. This library makes it easy to harness multiple smaller language models on local machines to produce enhanced AI responses.
Structured Collaboration Patterns: Rather than just averaging model outputs, AI Ensemble Suite implements sophisticated collaboration patterns where models can critique, refine, and extend each other's work - resulting in higher quality responses that benefit from diverse model strengths.

The framework is designed with simplicity in mind for common use cases while providing comprehensive customization for advanced users.

Additionally, this project served as a meta-challenge to build a medium sized Python library using AI assistance despite context window limitations, developing techniques to work around these constraints.

Examples

The library includes several example scripts demonstrating different ensemble techniques:

Basic Usage: Non-ensembled simple usage with default configurations
Structured Debate: Models present opposing viewpoints to refine conclusion
Expert Committee: Specialized models contribute domain expertise
Hierarchical Review: Progressive refinement through layers of specialized models
Competitive Evaluation: Multiple solutions generated and evaluated against criteria
Perspective Rotation: Problem analyzed through different framing lenses
Chain-of-Thought Branching: Reasoning paths that branch and reconverge
Adversarial Improvement: One model finds flaws in another's reasoning
Role-based Workflow: Models adopt complementary roles in a structured process
Bagging: Combines models to reduce prediction variance

Note on Examples: Some example files require additional libraries not included in requirements.txt. Several examples also generate graphic charts to disk, which were implemented to help visualize how the ensemble builds the final result. These visualization features are optional but helpful for testing and understanding the process.

Decision Guide: Choosing Collaboration & Aggregation Patterns

Collaboration Patterns

When you need...	Use this collaboration pattern	Best for
Quick independent analyses	AsyncThinking	Simple questions, brainstorming, diverse initial ideas
Critical evaluation of ideas	StructuredCritique	Evaluating arguments, finding flaws, improving rigor
Balanced perspectives	SynthesisOriented	Finding common ground, integrating viewpoints, balanced analysis
Multiple specialist perspectives	RoleBasedDebate	Complex topics requiring multiple forms of expertise
Progressive improvement	HierarchicalReview	Content requiring layer-by-layer refinement or fact-checking
Competition between solutions	CompetitiveEvaluation	Generating multiple solutions and selecting the best one
Examining from different angles	PerspectiveRotation	Ethical analysis, stakeholder considerations, creative ideation
Complex reasoning paths	ChainOfThoughtBranching	Mathematical problems, logic puzzles, decision trees
Stress-testing solutions	AdversarialImprovement	Finding edge cases, improving robustness, anticipating objections
Structured workflow process	RoleBasedWorkflow	Research projects, content creation, multi-stage analysis
Stabilizing volatile outputs	Bagging	Reducing variance, improving prediction stability
Handling uncertainty	UncertaintyBasedCollaboration	Questions with ambiguity, calibrating confidence
Model stacking	StackedGeneralization	Leveraging strengths of different model types, boosting performance

Aggregation Strategies

When you need...	Use this aggregation strategy	Best for
To prioritize some models over others	WeightedVoting	When certain models perform better for specific tasks
To use the final result of a sequence	SequentialRefinement	When using phases that progressively refine content
To choose the most confident output	ConfidenceBased	When models have reliable confidence estimation
To evaluate along multiple criteria	MultidimensionalVoting	Complex evaluation requiring different quality dimensions
To blend multiple perspectives	EnsembleFusion	Creating a coherent synthesis from diverse inputs
Dynamic strategy selection	AdaptiveSelection	When different queries benefit from different aggregation approaches

Collaboration Phases

AI Ensemble Suite implements various collaboration phases that can be combined or used individually:

Core Phases

✅ AsyncThinking: Models work independently on a problem before combining insights
✅ Integration/Refinement: Models refine responses based on feedback and insights
✅ ExpertCommittee: Final processing/structuring of model outputs before aggregation

Structured Debate Patterns

✅ StructuredCritique: Models evaluate others' responses using structured formats
✅ SynthesisOriented: Models focus on finding common ground and integrating perspectives
✅ RoleBasedDebate: Models interact according to assigned specialized roles

Advanced Collaboration Methods

✅ HierarchicalReview: Content is progressively reviewed by models in a hierarchical structure
✅ CompetitiveEvaluation: Models are pitted against each other in a competition
✅ PerspectiveRotation: Models iterate on a problem by assuming different perspectives
✅ ChainOfThoughtBranching: Models trace through multiple reasoning paths
✅ AdversarialImprovement: Models improve a solution by seeking its weaknesses
✅ RoleBasedWorkflow: Models function in specialized roles like researcher, analyst, and writer
✅ Bagging: Models process different variations of the same input
✅ UncertaintyBasedCollaboration: Uncertainty measurements guide model interactions
✅ StackedGeneralization: Base models process input, then a meta-model combines their outputs

Aggregation Strategies

The library provides several strategies for aggregating the outputs from multiple models:

✅ WeightedVoting: Models are assigned different weights based on performance or expertise
✅ SequentialRefinement: Assumes phases run in a sequence where later phases refine earlier ones
✅ ConfidenceBased: Selects output with the highest confidence score
✅ MultidimensionalVoting: Evaluates outputs along multiple dimensions
✅ EnsembleFusion: Uses a model to synthesize multiple outputs into one
✅ AdaptiveSelection: Dynamically selects and executes another aggregation strategy

Architecture Diagram

┌─────────────────────────────────────────────────────────────────────────────────────────────────────────┐
│                                              Ensemble                                                   │
│                                    (Main user-facing interface)                                         │
│ ┌─────────────────────────┐ ┌───────────────────────────┐ ┌────────────────────────────────────────────┐│
│ │ Attributes:             │ │ Core Methods:             │ │ Interaction Methods:                       ││
│ │ ● config_manager        │ │ ● __init__()              │ │ ● initialize() → ModelManager.initialize() ││
│ │ ● model_manager         │ │ ● ask()                   │ │ ● shutdown() → ModelManager.shutdown()     ││
│ │ ● template_manager      │ │ ● configure()             │ │ ● _execute_collaboration_phases()          ││
│ │ ● _initialized          │ │ ● __aenter__(),           │ │ ● _aggregate_results()                     ││
│ │ ● _initialization_lock  │ │   __aexit__()             │ │                                            ││
│ └─────────────────────────┘ └───────────────────────────┘ └────────────────────────────────────────────┘│
└─────────────────┬───────────────────────┬────────────────────────────┬──────────────────────────────────┘
                  │                       │                            │
                  │                       │                            │
          ┌───────▼──────┐       ┌────────▼─────────┐        ┌─────────▼────────┐
          │ ConfigManager│       │   ModelManager   │        │  TemplateManager │
          │ (YAML Config │       │   (GGUF Models)  │        │                  │
          └───────┬──────┘       └────────┬─────────┘        └──────────────────┘
                  │                       │
                  │                       │
                  │                       ▼
                  │            ┌────────────────────┐
                  │            │   Model Registry   │
                  │            └───────────┬────────┘
                  │                        │
         ┌────────▼──────────┐      ┌─────▼───────────────────────────────────┐
         │                   │      │                                         │
         ▼                   ▼      ▼                                         ▼
┌─────────────────┐  ┌──────────────────┐                   ┌─────────────────────────────────┐
│ BaseAggregator  │  │BaseCollaboration │                   │          TraceCollector         │
│  (Abstract)     │  │Phase (Abstract)  │                   │          Tracing System         │
└───┬──────┬──────┘  └────┬──────┬──────┘                   │(Records all collaboration steps)│
    │      │              │      │                          └─────────────────────────────────┘ 
    ▼      ▼              ▼      ▼                                       
┌─────────────────────────────┐  ┌────────────────────────────────────────────────────────────┐
│                             │  │                                                            │
│ Aggregation Implementations │  │              Collaboration Phase Implementations           │
│ ┌─────────────────────────┐ │  │ ┌────────────────┐ ┌─────────────────┐ ┌──────────────────┐│
│ │ WeightedVoting          │ │  │ │ AsyncThinking  │ │ChainOfThought   │ │BaseDebate        ││
│ ├─────────────────────────┤ │  │ ├────────────────┤ ├─────────────────┤ ├──────────────────┤│
│ │ SequentialRefinement    │ │  │ │ Integration    │ │CompetitiveEval  │ │StructuredCritique││
│ ├─────────────────────────┤ │  │ ├────────────────┤ ├─────────────────┤ ├──────────────────┤│
│ │ ConfidenceBased         │ │  │ │ ExpertCommittee│ │PerspectiveRot   │ │SynthesisOriented ││
│ ├─────────────────────────┤ │  │ ├────────────────┤ ├─────────────────┤ ├──────────────────┤│
│ │ MultidimensionalVoting  │ │  │ │ HierarchicalRev│ │AdversarialImp   │ │RoleBasedDebate   ││
│ ├─────────────────────────┤ │  │ └────────────────┘ ├─────────────────┤ └──────────────────┘│
│ │ EnsembleFusion          │ │  │                    │RoleBasedWorkflow│                     │
│ ├─────────────────────────┤ │  │                    └─────────────────┘                     │
│ │ AdaptiveSelection       │ │  │ ┌────────────────┐ ┌─────────────────┐ ┌────────────────┐  │
│ └─────────────────────────┘ │  │ │ Bagging        │ │UncertaintyBased │ │StackedGen      │  │
│                             │  │ └────────────────┘ └─────────────────┘ └────────────────┘  │
└─────────────────────────────┘  └────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────────────────────────────────────────────┐
│                                    Exception Hierarchy                                                  │
│                                                                                                         │
│          ┌─────────────────────────────────────────────────────────────────────────────┐                │
│          │                        AiEnsembleSuiteError (Base)                          │                │
│          └──────┬──────────────┬─────────────────┬────────────────┬────────────────────┘                │
│                 │              │                 │                │                                     │
│                 ▼              ▼                 ▼                ▼                                     │
│          ┌────────────┐ ┌─────────────┐ ┌───────────────┐ ┌─────────────┐                               │
│          │ModelError  │ │ConfigError  │ │CollabError    │ │AggrError    │                               │
│          └────────────┘ └──────┬──────┘ └───────────────┘ └─────────────┘                               │
│                                │                                                                        │
│                                ▼                                                                        │
│                         ┌────────────────┐                                                              │
│                         │ValidationError │                                                              │
│                         └────────────────┘                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────────────────┘

Class Hierarchy and Interaction Diagram

┌─────────────────────────────────────────────────────────────────────────────────────────┐
│                                                                                         │
│                                     ENSEMBLE CLASS                                      │
│                        (Main user-facing interface & orchestrator)                      │
│                                                                                         │
├─────────────────┬─────────────────────────────────────────┬─────────────────────────────┤
│                 │                                         │                             │
│   Public API:   │           Core Control Flow:            │      Context Management:    │
│   ● initialize()│   ● _execute_collaboration_phases()     │      ● __aenter__()         │
│   ● shutdown()  │   ● _aggregate_results()                │      ● __aexit__()          │
│   ● ask()       │   ● _get_phase_class()                  │      ● _initialization_lock │
│   ● configure() │                                         │                             │
│                 │                                         │                             │
└─────────┬───────┴─────────────────────┬───────────────────┴─────────────┬───────────────┘
          │                             │                                 │
          ▼                             ▼                                 ▼
┌─────────────────────────────┐  ┌─────────────────────────┐   ┌─────────────────────────────┐
│                             │  │                         │   │                             │
│       ConfigManager         │  │     ModelManager        │   │     TemplateManager         │
│      (Config handling)      │  │(Model loading/inference)│   │    (Prompt template         │
│                             │  │                         │   │     management)             │
│ ● load()                    │  │  ● initialize()         │   │  ● get_template()           │
│ ● update()                  │  │  ● shutdown()           │   │  ● render_template()        │
│ ● validate()                │  │  ● get_model()          │   │                             │
│ ● get_collaboration_mode()  │  │  ● run_inference()      │   └─────────────────────────────┘
│ ● get_aggregation_strategy()│  │                         │
│                             │  │                         │
└─────────┬───────────────────┘  └───────────┬─────────────┘
          │                                  │
          │                                  ▼
          │              ┌────────────────────────────────────────────────┐
          │              │                                                │
          │              │              Model Registry                    │
          │              │         (Loaded GGUF Models)                   │
          │              │                                                │
          └──────────────┼━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┿━━━━━━━━━┓
                         └────────────────────────────────────────────────┘         │
                                                                                    │
┌────────────────────────────────────────────────────────────────────┐              │
│                                                                    │              │
│                    COLLABORATION PHASES                            │              │
│                                                                    │              │
├──────────────────────┬─────────────────────┬───────────────────────┤              │
│                      │                     │                       │              │
│   Simple Phases:     │   Complex Phases:   │     Debate Types:     │              │
│   ● AsyncThinking    │   ● CompetitiveEval │   ● StructuredCritique│              │
│   ● Integration      │   ● PerspectiveRot  │   ● SynthesisOriented │              │
│   ● ExpertCommittee  │   ● ChainOfThought  │   ● RoleBasedDebate   │              │
│   ● HierarchicalRev  │   ● AdversarialImp  │                       │              │
│                      │   ● RoleBasedWork   │                       │              │
│                      │                     │                       │              │
├──────────────────────┴─────────────────────┴───────────────────────┤              │
│                                                                    │              │
│   Machine Learning-Oriented Phases:                                │              │
│   ● Bagging                                                        │              │
│   ● UncertaintyBasedCollaboration                                  │              │
│   ● StackedGeneralization                                          │              │
│                                                                    │              │
└────────────────────────────────┬───────────────────────────────────┘              │
                                 │                                                  │
                                 │                                                  │
                                 ▼                                                  │
┌────────────────────────────────────────────────────────────────────┐              │
│                                                                    │              │
│                     AGGREGATION STRATEGIES                         │◀─────────────┘
│                                                                    │
├──────────────────────┬─────────────────────┬───────────────────────┤
│                      │                     │                       │
│   ● WeightedVoting   │   ● ConfidenceBased │   ● EnsembleFusion    │
│   ● SequentialRef    │   ● MultiDimVoting  │   ● AdaptiveSelection │
│                      │                     │                       │
└──────────────────────┴─────────────────────┴───────────────────────┘

┌────────────────────────────────────────────────────────────────────┐
│                                                                    │
│                    UTILITY COMPONENTS                              │
│                                                                    │
├──────────────────────────────────┬─────────────────────────────────┤
│                                  │                                 │
│   ● TraceCollector               │   ● Exceptions:                 │
│     (Execution tracing)          │     - AiEnsembleSuiteError      │
│                                  │     - ConfigurationError        │
│   ● Logger                       │     - ModelError                │
│     (Structured logging)         │     - CollaborationError        │
│                                  │     - AggregationError          │
│                                  │     - ValidationError           │
│                                  │                                 │
└──────────────────────────────────┴─────────────────────────────────┘

Project File Structure

ai_ensemble_suite/
├── 📄 __init__.py             # Package exports (Ensemble class)
├── 📄 ensemble.py             # Main Ensemble class implementation
│
├── 📁 config/
│   ├── 📄 __init__.py         # Configuration package exports
│   ├── 📄 config_manager.py   # Core configuration handling
│   ├── 📄 defaults.py         # Default config values
│   ├── 📄 schema.py           # JSON Schema for config validation
│   ├── 📄 template_manager.py # Manages prompt templates
│   └── 📄 utils.py            # Configuration utilities
│
├── 📁 models/
│   ├── 📄 __init__.py         # Model package exports
│   ├── 📄 model_manager.py    # Core model management
│   ├── 📄 llm_interface.py    # Common LLM interface
│   ├── 📄 gguf_model.py       # GGUF format model implementation
│   ├── 📄 llama_cpp.py        # llama.cpp specific implementation
│   └── 📄 metadata.py         # Model metadata handling
│
├── 📁 collaboration/
│   ├── 📄 __init__.py                 # Collaboration package exports
│   ├── 📄 base.py                     # BaseCollaborationPhase abstract class
│   ├── 📄 async_thinking.py           # Independent parallel thinking
│   ├── 📄 integration.py              # Result integration approach
│   ├── 📄 expert_committee.py         # Expert committee pattern
│   ├── 📄 hierarchical_review.py      # Hierarchical review pattern
│   ├── 📄 competitive_evaluation.py   # Competitive evaluation pattern
│   ├── 📄 perspective_rotation.py     # Perspective rotation pattern
│   ├── 📄 chain_of_thought.py         # Chain-of-thought implementation
│   ├── 📄 adversarial_improvement.py  # Adversarial improvement pattern
│   ├── 📄 role_based_workflow.py      # Role-based workflow pattern
│   ├── 📄 structured_debate.py        # Structured debate with subtypes
│   ├── 📄 bagging.py                  # ML-inspired bagging approach
│   ├── 📄 uncertaintybased.py         # Uncertainty-based collaboration
│   └── 📄 stackedgeneralization.py    # Stacked generalization approach
│
├── 📁 aggregation/
│   ├── 📄 __init__.py                 # Aggregation package exports
│   ├── 📄 base.py                     # BaseAggregator abstract class
│   ├── 📄 weighted_voting.py          # Weighted voting implementation
│   ├── 📄 sequential_refinement.py    # Sequential refinement pattern
│   ├── 📄 confidence_based.py         # Confidence-based aggregation
│   ├── 📄 multidimensional_voting.py  # Multi-dimensional voting
│   ├── 📄 ensemble_fusion.py          # Ensemble fusion approach
│   └── 📄 adaptive_selection.py       # Adaptive selection strategy
│
├── 📁 exceptions/
│   ├── 📄 __init__.py         # Exception exports
│   └── 📄 errors.py           # Custom exception definitions
│
└── 📁 utils/
    ├── 📄 __init__.py         # Utilities package exports
    ├── 📄 logging.py          # Logging configuration
    ├── 📄 tracing.py          # Execution tracing (TraceCollector)
    ├── 📄 concurrency.py      # Concurrency utilities
    ├── 📄 prompt_tools.py     # Prompt manipulation utilities
    └── 📄 validators.py       # Validation utilities

API Reference

Core Classes

Ensemble

class Ensemble:
    """Coordinates the collaboration of multiple AI models for complex tasks."""
    
    def __init__(self, config_path: Optional[str] = None, config_dict: Optional[Dict[str, Any]] = None) -> None:
        """Initialize the Ensemble orchestration layer."""
        
    async def initialize(self) -> None:
        """Load models and prepare the ensemble for processing queries."""
        
    async def shutdown(self) -> None:
        """Release resources used by the ensemble."""
        
    async def ask(self, query: str, **kwargs: Any) -> Union[str, Dict[str, Any]]:
        """Process a query through the configured collaboration and aggregation pipeline."""
        
    def configure(self, config_dict: Dict[str, Any]) -> None:
        """Update the ensemble's configuration dynamically."""

ModelManager

class ModelManager:
    """Manages the loading, execution, and lifecycle of GGUF models."""
    
    def __init__(self, config_manager: ConfigProvider, max_workers: Optional[int] = None) -> None:
        """Initialize the ModelManager."""
        
    async def initialize(self) -> None:
        """Initialize the ModelManager: Instantiates models and loads them asynchronously."""
        
    async def shutdown(self) -> None:
        """Shutdown the ModelManager: Unloads models and shuts down the executor."""
        
    async def run_inference(self, model_id: str, prompt: str, **kwargs: Any) -> Dict[str, Any]:
        """Run inference on a specific model using its generate method via thread pool."""

BaseCollaborationPhase

class BaseCollaborationPhase(ABC):
    """Abstract base class for collaboration phases."""
    
    def __init__(self, model_manager: "ModelManager", config_manager: "ConfigManager", phase_name: str) -> None:
        """Initialize the collaboration phase."""
        
    @abstractmethod
    async def execute(self, query: str, context: Dict[str, Any], trace_collector: Optional[TraceCollector] = None) -> Dict[str, Any]:
        """Execute the collaboration phase."""

BaseAggregator

class BaseAggregator(ABC):
    """Abstract base class for aggregation strategies."""
    
    def __init__(self, config_manager: "ConfigManager", strategy_name: str, model_manager: Optional["ModelManager"] = None, strategy_config_override: Optional[Dict[str, Any]] = None) -> None:
        """Initialize the aggregator."""
        
    @abstractmethod
    async def aggregate(self, outputs: Dict[str, Dict[str, Any]], context: Dict[str, Any], trace_collector: Optional[TraceCollector] = None) -> Dict[str, Any]:
        """Aggregate the outputs from collaboration phases."""

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Guidelines

Follow PEP 8 and PEP 484 (Type Hinting)
Use Google Python Style Guide for formatting and documentation
Apply Google Docstrings for all modules, classes, and functions
Write tests for new features
Format code using black (configured in pyproject.toml)
Run type checking with mypy

Running Tests

👉 Note: Tests are currently broken. They were originally developed early on in the development process and are now out of date. I will be updating these later.

# Install test dependencies
pip install ".[dev]"

# Run tests
pytest

# Run with coverage
pytest --cov=ai_ensemble_suite

Roadmap

Refinement and cleanup of the current codebase
Rewriting the existing tests due to API changes from early code
Adding OpenAI API compatibility for local LLM servers like LM Studio, Ollama
Support for Internet API providers

License

This project is licensed under the MIT License - see the LICENSE file for details.

Special Thanks

🙏 Special thanks to Georgi Gerganov and the whole team working on llama.cpp for making all of this possible.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.5.1

Apr 20, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_ensemble_suite-0.5.1.tar.gz (187.6 kB view details)

Uploaded Apr 20, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ai_ensemble_suite-0.5.1-py3-none-any.whl (171.3 kB view details)

Uploaded Apr 20, 2025 Python 3

File details

Details for the file ai_ensemble_suite-0.5.1.tar.gz.

File metadata

Download URL: ai_ensemble_suite-0.5.1.tar.gz
Upload date: Apr 20, 2025
Size: 187.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.4

File hashes

Hashes for ai_ensemble_suite-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`6af2cdb59cea5f6a7637a542193dff0a3f9c4db039d5c0edb7141b4b6c0ca0a2`
MD5	`c726495ae95f05433f9da505c469bc14`
BLAKE2b-256	`5387682b8f1b240e9b6f7eccdd1d2a464a52f7fde974dc80cb37b1e8a1a8dc0c`

See more details on using hashes here.

File details

Details for the file ai_ensemble_suite-0.5.1-py3-none-any.whl.

File metadata

Download URL: ai_ensemble_suite-0.5.1-py3-none-any.whl
Upload date: Apr 20, 2025
Size: 171.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.4

File hashes

Hashes for ai_ensemble_suite-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`83b08eeba1b5ac72066d59297dd50dfe5613da34387db022e1a91241ad1be77d`
MD5	`1b92b7089bc4492a18953cb00167dc22`
BLAKE2b-256	`cc8669dd692470e895de3a3532308eceb96c18fb71d49a5fd71c290934bbd531`

See more details on using hashes here.

ai-ensemble-suite 0.5.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AI Ensemble Suite

Table of Contents

Installation

Prerequisites

Install via pip

Optional Dependencies

Installation with GPU acceleration

Development Installation

Quick Start

Key Features

AI Ensembles Explained

Why Ensembles Work

Why This Library?

Examples

Decision Guide: Choosing Collaboration & Aggregation Patterns

Collaboration Patterns

Aggregation Strategies

Collaboration Phases

Core Phases

Structured Debate Patterns

Advanced Collaboration Methods

Aggregation Strategies

Architecture Diagram

Class Hierarchy and Interaction Diagram

Project File Structure

API Reference

Core Classes

Ensemble

ModelManager

BaseCollaborationPhase

BaseAggregator

Contributing

Guidelines

Running Tests

👉 Note: Tests are currently broken. They were originally developed early on in the development process and are now out of date. I will be updating these later.

Roadmap

License

Special Thanks

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes