
Backpropagate

Headless LLM Fine-Tuning - Making fine-tuning accessible without the complexity.


Philosophy

  • For Users: Upload data, pick a model, click train
  • For Developers: Clean Python API with smart defaults
  • For Everyone: Windows-safe, VRAM-aware, production-ready

Installation

Modular Installation (v0.1.0+)

Install only what you need:

pip install backpropagate              # Core only (minimal)
pip install backpropagate[unsloth]     # + Unsloth 2x faster training
pip install backpropagate[ui]          # + Gradio web UI
pip install backpropagate[standard]    # unsloth + ui (recommended)
pip install backpropagate[full]        # Everything

Available Extras

Extra        Description                          Dependencies
unsloth      2x faster training, 50% less VRAM    unsloth
ui           Gradio web interface                 gradio>=5.6.0
validation   Pydantic config validation           pydantic, pydantic-settings
export       GGUF export for Ollama               llama-cpp-python
monitoring   WandB + system monitoring            wandb, psutil

Requirements

  • Python 3.10+
  • CUDA-capable GPU (8GB+ VRAM recommended)
  • PyTorch 2.0+

Quick Start

Use as Library

from backpropagate import Trainer

# Dead simple
trainer = Trainer("unsloth/Qwen2.5-7B-Instruct-bnb-4bit")
trainer.train("my_data.jsonl", steps=100)
trainer.save("./my-model")

# Export to GGUF for Ollama
trainer.export("gguf", quantization="q4_k_m")

With Options

from backpropagate import Trainer

trainer = Trainer(
    model="unsloth/Llama-3.2-3B-Instruct-bnb-4bit",
    lora_r=32,
    lora_alpha=64,
    learning_rate=1e-4,
    batch_size="auto",  # Auto-detects based on VRAM
)

run = trainer.train(
    dataset="HuggingFaceH4/ultrachat_200k",
    steps=200,
    samples=2000,
)

print(f"Final loss: {run.final_loss:.4f}")
print(f"Duration: {run.duration_seconds:.1f}s")

Launch the Web UI

# CLI
backpropagate --ui

# Or from Python
from backpropagate import launch
launch(port=7862)

Feature Flags

Check which features are installed:

from backpropagate import FEATURES, list_available_features

print(FEATURES)
# {'unsloth': True, 'ui': True, 'validation': False, ...}

for name, desc in list_available_features().items():
    print(f"{name}: {desc}")

CLI Usage

# Show system info and features
backprop info

# Show current configuration
backprop config

# Train a model
backprop train \
    --data my_data.jsonl \
    --model unsloth/Qwen2.5-7B-Instruct-bnb-4bit \
    --steps 100 \
    --samples 1000

# Multi-run training (recommended for best results)
backprop multi-run \
    --data HuggingFaceH4/ultrachat_200k \
    --runs 5 \
    --steps 100 \
    --samples 1000

# Export to GGUF for Ollama
backprop export ./output/lora \
    --format gguf \
    --quantization q4_k_m \
    --ollama \
    --ollama-name my-model

# Launch UI
backpropagate --ui --port 7862

Configuration

All settings can be overridden via environment variables:

# Model settings
BACKPROPAGATE_MODEL__NAME=unsloth/Llama-3.2-3B-Instruct-bnb-4bit
BACKPROPAGATE_MODEL__MAX_SEQ_LENGTH=4096

# Training settings
BACKPROPAGATE_TRAINING__LEARNING_RATE=1e-4
BACKPROPAGATE_TRAINING__MAX_STEPS=200
BACKPROPAGATE_TRAINING__BATCH_SIZE=4

# LoRA settings
BACKPROPAGATE_LORA__R=32
BACKPROPAGATE_LORA__ALPHA=64

Or use a .env file in your project root.
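
The double-underscore names map to nested settings groups, in the style of pydantic-settings. A minimal sketch of how such a mapping works (the class and field names here are illustrative assumptions, not backpropagate's actual config module):

from pydantic import BaseModel
from pydantic_settings import BaseSettings, SettingsConfigDict

class TrainingSettings(BaseModel):
    learning_rate: float = 2e-4
    max_steps: int = 100
    batch_size: int = 2

class Settings(BaseSettings):
    model_config = SettingsConfigDict(
        env_prefix="BACKPROPAGATE_",
        env_nested_delimiter="__",
        env_file=".env",
    )
    training: TrainingSettings = TrainingSettings()

# BACKPROPAGATE_TRAINING__LEARNING_RATE=1e-4 now overrides settings.training.learning_rate
settings = Settings()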

Dataset Formats

JSONL (Recommended)

{"text": "<|im_start|>user\nWhat is Python?<|im_end|>\n<|im_start|>assistant\nPython is a programming language.<|im_end|>"}
{"text": "<|im_start|>user\nExplain ML<|im_end|>\n<|im_start|>assistant\nML is...<|im_end|>"}

CSV

text
"<|im_start|>user\nHello<|im_end|>\n<|im_start|>assistant\nHi!<|im_end|>"

HuggingFace Datasets

Any dataset with a text column works:

trainer.train(dataset="HuggingFaceH4/ultrachat_200k", samples=1000)

Advanced Features

Multi-Run Training (SLAO)

Multiple short runs with LoRA merging prevent catastrophic forgetting and improve results:

from backpropagate import Trainer

trainer = Trainer("unsloth/Qwen2.5-7B-Instruct-bnb-4bit")

# Run 5 training runs, each on fresh data
result = trainer.multi_run(
    dataset="HuggingFaceH4/ultrachat_200k",
    num_runs=5,
    steps_per_run=100,
    samples_per_run=1000,
    merge_mode="slao",  # Smart LoRA merging
)

print(f"Final loss: {result.final_loss:.4f}")
print(f"Total time: {result.total_time_seconds:.1f}s")

Or use the dedicated trainer:

from backpropagate import MultiRunTrainer, MultiRunConfig

config = MultiRunConfig(
    num_runs=5,
    steps_per_run=100,
    samples_per_run=1000,
)

trainer = MultiRunTrainer(
    model="unsloth/Qwen2.5-7B-Instruct-bnb-4bit",
    config=config,
)

result = trainer.run("my_data.jsonl")
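
SLAO's exact merge algorithm lives in slao.py and isn't documented here, but the general idea behind linear LoRA merging is a weighted average of adapter tensors. A rough, illustrative sketch (not backpropagate's implementation):

import torch

def merge_lora_states(state_dicts, weights):
    """Weighted average of LoRA state dicts with identical keys and shapes."""
    total = sum(weights)
    return {
        key: sum(w * sd[key] for w, sd in zip(weights, state_dicts)) / total
        for key in state_dicts[0]
    }

# Two toy adapters for one target module (rank 8, hidden size 512)
a = {"lora_A": torch.randn(8, 512), "lora_B": torch.randn(512, 8)}
b = {"lora_A": torch.randn(8, 512), "lora_B": torch.randn(512, 8)}
merged = merge_lora_states([a, b], weights=[1.0, 1.0])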

Dataset Loading & Filtering

Load, validate, and filter datasets with quality controls:

from backpropagate import DatasetLoader, detect_format

# Auto-detect format and load
loader = DatasetLoader("my_data.jsonl")
print(f"Format: {loader.detected_format}")
print(f"Samples: {len(loader)}")
print(f"Valid: {loader.is_valid}")

# Preview samples
for sample in loader.preview(3):
    print(sample)

# Convert to ChatML format
chatml_data = loader.to_chatml()

# Filter by quality
filtered = loader.filter(
    min_tokens=50,
    max_tokens=2048,
    min_turns=2,
    require_assistant=True,
)

# Remove duplicates
deduped = loader.deduplicate(method="exact")  # or "minhash"

Perplexity-Based Filtering

Filter samples by perplexity score to remove outliers (requires model inference):

from backpropagate import DatasetLoader, PerplexityFilter, filter_by_perplexity

# Option 1: Use DatasetLoader method
loader = DatasetLoader("my_data.jsonl")
filtered_loader, stats = loader.filter_perplexity(
    model_name="gpt2",       # Model for scoring (gpt2, gpt2-medium, etc.)
    min_percentile=5,        # Remove bottom 5% (too simple/repetitive)
    max_percentile=95,       # Remove top 5% (noisy/unusual)
)
print(stats.summary())

# Option 2: Use standalone function
samples = [{"text": "sample 1"}, {"text": "sample 2"}]
filtered, stats = filter_by_perplexity(
    samples,
    model_name="gpt2",
    min_percentile=5,
    max_percentile=95,
)

# Option 3: Use PerplexityFilter class for more control
pf = PerplexityFilter(model_name="gpt2", device="cuda", batch_size=16)
scores = pf.score(samples)  # Get raw scores
filtered = pf.filter_by_threshold(samples, scores, min_perplexity=10, max_perplexity=500)

Perplexity measures how "surprised" a language model is by text:

  • Low perplexity: Very predictable (may be too simple or repetitive)
  • Medium perplexity: Natural, typical language
  • High perplexity: Unusual (may be noisy or low-quality)
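
Under the hood, a sample's perplexity is the exponential of the model's mean token cross-entropy on that text. A self-contained scoring sketch with plain transformers (this shows the math, not PerplexityFilter's internals):

import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

enc = tok("Python is a programming language.", return_tensors="pt")
with torch.no_grad():
    # labels=input_ids gives the mean next-token cross-entropy over the sample
    loss = model(**enc, labels=enc["input_ids"]).loss
print(f"perplexity = {torch.exp(loss).item():.1f}")  # ppl = exp(mean CE)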

Export & Ollama Integration

Export trained models to various formats:

from backpropagate import (
    export_lora,
    export_merged,
    export_gguf,
    create_modelfile,
    register_with_ollama,
)

# Export LoRA adapter
result = export_lora(model, output_dir="./lora")

# Export merged model (base + adapter)
result = export_merged(model, tokenizer, output_dir="./merged")

# Export to GGUF for Ollama/llama.cpp
result = export_gguf(
    model,
    tokenizer,
    output_dir="./gguf",
    quantization="q4_k_m",  # f16, q8_0, q5_k_m, q4_k_m, q4_0, q2_k
)

print(result.summary())
# Export Complete
#   Format: gguf
#   Path: ./gguf/model-q4_k_m.gguf
#   Size: 4096.0 MB
#   Quantization: q4_k_m
#   Time: 120.5s

# Create Ollama Modelfile
create_modelfile(
    "./gguf/model-q4_k_m.gguf",
    system_prompt="You are a helpful assistant.",
    temperature=0.7,
)

# Register with Ollama
register_with_ollama("./gguf/model-q4_k_m.gguf", "my-model")
# Now run: ollama run my-model

GPU Safety Monitoring

Monitor GPU health during training:

from backpropagate import (
    check_gpu_safe,
    get_gpu_status,
    wait_for_safe_gpu,
    GPUMonitor,
)

# Quick safety check
if check_gpu_safe():
    print("GPU is ready for training")

# Get detailed status
status = get_gpu_status()
print(f"GPU: {status.device_name}")
print(f"Temperature: {status.temperature_c}C")
print(f"VRAM: {status.vram_used_gb:.1f}/{status.vram_total_gb:.1f} GB")
print(f"Condition: {status.condition}")  # SAFE, WARNING, CRITICAL, EMERGENCY

# Wait for GPU to cool down
wait_for_safe_gpu(max_wait=300)  # Wait up to 5 minutes

# Continuous monitoring during training
with GPUMonitor(check_interval=30) as monitor:
    trainer.train(dataset, steps=1000)

Windows Support

Backpropagate is designed to work on Windows out of the box:

  • Pre-tokenization to avoid multiprocessing crashes
  • Automatic xformers disable for RTX 40/50 series
  • Safe dataloader settings
  • Tested on RTX 5080 (16GB VRAM)

Windows fixes are applied automatically when os.name == "nt".
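
The shape of these guards is simple; a hedged sketch of the kind of dataloader settings involved (the exact values backpropagate uses are internal):

import os

IS_WINDOWS = os.name == "nt"

# Conservative dataloader settings avoid Windows multiprocessing crashes.
dataloader_kwargs = dict(
    num_workers=0 if IS_WINDOWS else 4,
    persistent_workers=not IS_WINDOWS,
    pin_memory=True,
)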

Model Presets

Preset         VRAM     Speed     Quality
Qwen 2.5 7B    ~12GB    Medium    Best
Qwen 2.5 3B    ~8GB     Fast      Good
Llama 3.2 3B   ~8GB     Fast      Good
Llama 3.2 1B   ~6GB     Fastest   Basic
Mistral 7B     ~12GB    Medium    Good

Architecture

backpropagate/
├── __init__.py          # Package exports, lazy loading
├── __main__.py          # CLI entry point
├── cli.py               # Command-line interface
├── trainer.py           # Core Trainer class
├── multi_run.py         # Multi-run SLAO training
├── slao.py              # SLAO LoRA merging algorithm
├── datasets.py          # Dataset loading & filtering
├── export.py            # GGUF/Ollama export
├── config.py            # Pydantic settings
├── feature_flags.py     # Optional dependency detection
├── gpu_safety.py        # GPU monitoring & safety
├── theme.py             # Ocean Mist Gradio theme
└── ui.py                # Gradio interface

Key Design Principles

  1. Modular by default - Install only what you need
  2. Smart defaults - Works out of the box
  3. Windows-first - No multiprocessing nightmares
  4. Fail gracefully - Helpful error messages
  5. Type-safe - Full type hints

API Reference

Trainer

class Trainer:
    def __init__(
        self,
        model: str | None = None,    # Model name/path
        lora_r: int = 16,            # LoRA rank
        lora_alpha: int = 32,        # LoRA alpha
        learning_rate: float = 2e-4, # Learning rate
        batch_size: int | str = "auto",  # Batch size or "auto"
        output_dir: str = "./output",    # Output directory
    )

    def train(
        self,
        dataset: str | Dataset,  # Dataset path or HF name
        steps: int = 100,        # Training steps
        samples: int = 1000,     # Max samples
    ) -> TrainingRun

    def save(self, path: str = None) -> str
    def export(self, format: str, quantization: str = "q4_k_m") -> str

TrainingRun

@dataclass
class TrainingRun:
    run_id: str
    steps: int
    final_loss: float
    loss_history: List[float]
    duration_seconds: float
    samples_seen: int
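
The loss_history field makes it easy to inspect a run after the fact, for example by plotting the curve (matplotlib is an assumption here, not a backpropagate dependency):

import matplotlib.pyplot as plt

run = trainer.train("my_data.jsonl", steps=100)
plt.plot(run.loss_history)
plt.xlabel("step")
plt.ylabel("loss")
plt.title(f"run {run.run_id}: final loss {run.final_loss:.3f}")
plt.savefig("loss.png")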

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

# Development setup
git clone https://github.com/mikeyfrilot/backpropagate
cd backpropagate
pip install -e ".[dev]"

# Run tests
pytest

# Type checking
mypy backpropagate

# Linting
ruff check backpropagate

License

MIT License - see LICENSE for details.
