
🚀 Ellora

The Automated Hyperparameter Optimization Platform for Efficient LLM Fine-Tuning.

License: MIT | Python 3.10+

Ellora is a powerful, scientific framework designed to take the guesswork out of Large Language Model (LLM) fine-tuning. By combining Bayesian optimization (via Optuna) with high-performance training engines (via Unsloth and PEFT), Ellora automatically identifies the optimal LoRA (Low-Rank Adaptation) configurations for your specific dataset and hardware constraints.


🌟 Key Features

🎯 Intelligent Hyperparameter Tuning

Stop guessing ranks and learning rates. Ellora uses Optuna to search for the best combination of the following (a search-space sketch appears after the list):

  • LoRA Rank (r) and Alpha
  • Learning Rate and Scheduler
  • Dropout Rates
  • Target Modules
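
A minimal sketch of what such a search space can look like in Optuna. The parameter names, ranges, and the dummy score are illustrative assumptions, not Ellora's actual internals:

import optuna

def objective(trial: optuna.Trial) -> float:
    # Candidate LoRA hyperparameters (names and ranges are assumptions).
    r = trial.suggest_categorical("lora_r", [8, 16, 32, 64])
    alpha = trial.suggest_categorical("lora_alpha", [16, 32, 64])
    lr = trial.suggest_float("learning_rate", 1e-5, 5e-4, log=True)
    dropout = trial.suggest_float("lora_dropout", 0.0, 0.2)
    targets = trial.suggest_categorical(
        "target_modules",
        ["q_proj,v_proj", "q_proj,k_proj,v_proj,o_proj"],
    )
    # A real objective would train an adapter with r, alpha, lr, dropout,
    # and targets, then return its validation loss; this dummy score just
    # keeps the sketch runnable.
    return lr * (1.0 + dropout) / r

study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=10)
print(study.best_params)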

⚡ Unsloth Integration

Built-in support for Unsloth provides the following (a loader sketch appears after the list):

  • 2x–5x faster training speeds.
  • 70% less VRAM usage.
  • Automatic fallback to standard PEFT if hardware is incompatible.
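
A minimal sketch of the Unsloth-first, PEFT-fallback pattern described above, written against the public APIs of unsloth, transformers, and peft; this is an assumption about the pattern, not Ellora's internal loader:

def load_lora_model(model_name: str, r: int = 16, alpha: int = 32):
    """Load a causal LM with a LoRA adapter, preferring Unsloth."""
    target_modules = ["q_proj", "k_proj", "v_proj", "o_proj"]
    try:
        from unsloth import FastLanguageModel
        model, tokenizer = FastLanguageModel.from_pretrained(
            model_name=model_name, max_seq_length=2048, load_in_4bit=True,
        )
        model = FastLanguageModel.get_peft_model(
            model, r=r, lora_alpha=alpha, target_modules=target_modules,
        )
    except ImportError:
        # Unsloth unavailable (or unsupported hardware): fall back to PEFT.
        from transformers import AutoModelForCausalLM, AutoTokenizer
        from peft import LoraConfig, get_peft_model
        model = AutoModelForCausalLM.from_pretrained(model_name)
        tokenizer = AutoTokenizer.from_pretrained(model_name)
        config = LoraConfig(
            r=r, lora_alpha=alpha, target_modules=target_modules,
            task_type="CAUSAL_LM",
        )
        model = get_peft_model(model, config)
    return model, tokenizer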

📊 Scientific Metric Suite

Move beyond simple loss curves. Ellora generates journal-grade reports that include (a scoring sketch follows the list):

  • NLP Quality: ROUGE-L, BLEU, and Semantic Similarity (via Sentence-Transformers).
  • Inference Efficiency: Tokens Per Second (TPS), Latency (ms).
  • Hardware Profile: Peak VRAM usage, System VRAM efficiency.
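
ROUGE-L and BLEU come from standard NLP tooling; the semantic-similarity score can be computed with sentence-transformers roughly as follows (the embedding model here is an assumption, not necessarily the one Ellora uses):

from sentence_transformers import SentenceTransformer, util

scorer = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model

def semantic_similarity(prediction: str, reference: str) -> float:
    # Cosine similarity between sentence embeddings, in [-1, 1].
    embeddings = scorer.encode([prediction, reference], convert_to_tensor=True)
    return util.cos_sim(embeddings[0], embeddings[1]).item()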

📈 Dynamic Visualization

Generate stunning HTML dashboards and publication-quality Matplotlib charts with a single command.


🚀 Quick Start

Installation

Standard Installation (Recommended)

pip install ellora

From Source (For Developers)

git clone https://github.com/shrey1720/ellora.git
cd ellora
pip install -e ".[dev]"

Recommended for NVIDIA GPUs

pip install unsloth xformers

🛠 Usage Guide

1. System Health Check

Ensure your GPU and VRAM are ready for training.

ellora doctor

2. Basic Training

Train with default settings and automatic tuning.

ellora train --model "meta-llama/Llama-3.2-1B" --data "my_dataset.json" --max-trials 5
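
Ellora accepts .json, .jsonl, and .txt datasets. The exact schema is not documented here, but a common instruction-tuning layout is a reasonable guess for my_dataset.json:

[
  {"instruction": "Summarize the article below.", "input": "LoRA freezes the base model and trains small low-rank matrices instead.", "output": "LoRA fine-tunes models by training small adapter matrices."},
  {"instruction": "Translate to French.", "input": "Good morning", "output": "Bonjour"}
]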

3. Using Expert Presets

Ellora comes with pre-configured settings for specific domains (an example invocation follows the list):

  • --preset chatbot: Optimized for conversational flow.
  • --preset coding: Lower learning rate, optimized for logic.
  • --preset summarization: Focuses on context retention.
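
For example, to tune a conversational model (the dataset name here is illustrative):

ellora train --model "meta-llama/Llama-3.2-1B" --data "support_chats.jsonl" --preset chatbot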

4. Safe Carry-Forward Training

Tune as usual, then continue the best trial for extra epochs from its checkpoint.

ellora train --model "meta-llama/Llama-3.2-1B" --data "my_dataset.json" --max-trials 3 --continue-best-epochs 2

5. Scientific Benchmarking

Compare your trained adapter against ground-truth answers to produce a technical profile.

ellora benchmark --run <run_id> --references test_set.json
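
The throughput number is easy to reproduce by hand. A minimal sketch of measuring tokens per second with any Hugging Face causal LM; this mirrors the metric's definition rather than Ellora's internal measurement:

import time

def tokens_per_second(model, tokenizer, prompt: str, max_new_tokens: int = 128) -> float:
    # Time a single generation and divide new tokens by elapsed seconds.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    start = time.perf_counter()
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    elapsed = time.perf_counter() - start
    new_tokens = output.shape[-1] - inputs["input_ids"].shape[-1]
    return new_tokens / elapsed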

📂 Project Architecture

The system is organized into focused modules for extensibility:

ellora/
├── tuner/        # Bayesian optimization and search spaces
├── trainer/      # LoRA/QLoRA engine (Unsloth & PEFT)
├── dataset/      # Dynamic loading and scientific validation
├── hardware/     # VRAM analysis and hardware-aware strategy
├── metrics/      # Scorer engine (NLP & Performance)
├── reports/      # HTML Exporters and Chart Generators
├── db/           # SQLite persistence for all runs/trials
└── cli/          # Typer-powered command interface

🛠 Detailed Command Reference

Command            Description
ellora init        Initialize project structure in the current directory.
ellora doctor      Run hardware and dependency health checks.
ellora train       Start HPO training (see options below).
ellora export      Export the best adapter for deployment.
ellora report      Generate an HTML performance dashboard.
ellora benchmark   Run scientific NLP and throughput evaluations.
ellora runs list   View training history and best scores.

🚀 Training Options (ellora train)

Option         Shorthand   Default   Description
--model        -m          -         HuggingFace model name (Required).
--data         -d          -         Dataset path (.json, .jsonl, .txt) (Required).
--max-trials   -t          10        Maximum tuning iterations.
--epochs       -e          3         Training epochs per trial.
--unsloth      -           True      Use Unsloth for 2x faster training.
--wandb        -           False     Enable Weights & Biases logging.
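
Putting the options together, a typical invocation might look like this (the flag values are illustrative):

ellora train -m "meta-llama/Llama-3.2-1B" -d "my_dataset.jsonl" -t 8 -e 2 --wandb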

🧩 Key Terminology

  • HPO: Hyperparameter Optimization — automatically finding the best settings.
  • LoRA: Low-Rank Adaptation — efficient fine-tuning by only training a small adapter.
  • Trial: A single training attempt with one hyperparameter set.
  • Run: A complete experiment containing multiple trials.
  • Adapter: The learned weights, typically saved as a small file (see the sketch after this list).
  • Throughput: Training speed measured in Tokens Per Second (TPS).
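
To make "Adapter" concrete: in a standard PEFT workflow, only the small adapter weights are saved and later reattached to the frozen base model. A minimal sketch, assuming the public peft API (the adapter path is hypothetical, and Ellora's own export command may differ):

from transformers import AutoModelForCausalLM
from peft import PeftModel

# After training: peft_model.save_pretrained("my_adapter")  # a few MB on disk
base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-1B")
model = PeftModel.from_pretrained(base, "my_adapter")  # reattach the adapter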

🔬 Technical Roadmap

  • Multi-GPU Support: DDP and FSDP integration.
  • DPO Tuning: Direct Preference Optimization tuning loop.
  • Custom Scoring Functions: User-defined success metrics.
  • HuggingFace Hub Integration: Direct upload of tuned adapters.

📄 License

This project is licensed under the MIT License; see the LICENSE file for details.
