
Fine-tune LFM 2.5 1.2B for coding tasks on Kaggle multi-GPU with auto-publish to Hugging Face


LFM Trainer

Fine-tune Liquid LFM 2.5 1.2B for coding tasks on Kaggle multi-GPU — with automatic checkpoint publishing to Hugging Face on errors, and post-training GGUF + MLX quantization.

Features

  • 🚀 Multi-GPU training via HuggingFace Accelerate / DDP (Kaggle P100 / 2×T4)
  • 🧠 LoRA / PEFT for memory-efficient fine-tuning
  • 📦 Structured dataset loading — CSV, Parquet, JSONL, HuggingFace Hub IDs, or direct pd.DataFrame objects
  • 🔍 Auto-format detection — Alpaca, prompt/response, conversational/chat (DataClaw), single text column
  • 🛠️ Tool calling support — LFM 2.5 native <|tool_call_start|> / <|tool_call_end|> tokens; handles OpenAI and DataClaw tool call formats
  • 🛡️ Error-resilient training — auto-publishes versioned checkpoints on OOM, SIGTERM (Kaggle timeout), KeyboardInterrupt, or any exception
  • 🔑 Flexible HF auth — CLI arg, HF_TOKEN env var, or Kaggle Secrets
  • 📐 GGUF export — Q4_K_M, Q6_K, Q8_0 via llama.cpp
  • 🍎 MLX export — 4-bit, 6-bit, 8-bit via mlx-lm
  • 🏷️ Shared versioning — base model + all quantized variants tagged with the same version
  • 📊 Auto-benchmarking — HumanEval + MBPP with before/after comparison
  • 🧹 Data quality filters — auto-remove duplicates, empty rows, and length outliers
  • 📈 Eval split — hold out a % for validation loss tracking during training
  • 📝 Auto model card — generates a HuggingFace README.md with config, benchmarks, and hardware
  • 📉 W&B / TensorBoard — optional training metric logging
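
The quality-filter pass above (duplicates, empty rows, length outliers) can be sketched roughly as follows. This is an illustrative stand-in, not the library's actual implementation; the function name, the z-score threshold, and the character-length heuristic are assumptions.

```python
import statistics

def quality_filter(records, text_key="text", z_max=3.0):
    """Illustrative data-quality pass: drop empty rows, exact duplicates,
    and length outliers beyond z_max standard deviations."""
    # Drop empty / whitespace-only rows.
    rows = [r for r in records if r.get(text_key, "").strip()]
    # Drop exact duplicates, keeping the first occurrence.
    seen, unique = set(), []
    for r in rows:
        if r[text_key] not in seen:
            seen.add(r[text_key])
            unique.append(r)
    # Drop length outliers via a z-score on character length.
    lengths = [len(r[text_key]) for r in unique]
    if len(lengths) < 2:
        return unique
    mean, stdev = statistics.mean(lengths), statistics.pstdev(lengths)
    if stdev == 0:
        return unique
    return [r for r in unique if abs(len(r[text_key]) - mean) / stdev <= z_max]
```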

Installation

pip install lfm-trainer

Quick Start (Kaggle Notebook)

# Cell 1: Install
!pip install lfm-trainer

# Cell 2: Train on a coding dataset with GGUF export
!lfm-train \
    --dataset peteromallet/dataclaw-peteromallet \
    --hub-repo your-username/lfm-code \
    --export-gguf

# Train on multiple datasets
!lfm-train \
    --dataset code_data.csv \
    --dataset more_code.parquet \
    --dataset sahil2801/CodeAlpaca-20k \
    --hub-repo your-username/lfm-code \
    --epochs 3 \
    --batch-size 2 \
    --export-gguf

The HF token is automatically picked up from Kaggle Secrets (key: HF_TOKEN).
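
The resolution order (CLI arg, then HF_TOKEN env var, then Kaggle Secrets) could be implemented along these lines. A sketch only: the function name is hypothetical, though `kaggle_secrets.UserSecretsClient` is the real Kaggle notebook API.

```python
import os

def resolve_hf_token(cli_token=None):
    """Resolve a HuggingFace token: CLI arg first, then the HF_TOKEN
    environment variable, then Kaggle Secrets (notebooks only)."""
    if cli_token:
        return cli_token
    env_token = os.environ.get("HF_TOKEN")
    if env_token:
        return env_token
    try:
        # Import succeeds only inside a Kaggle notebook environment.
        from kaggle_secrets import UserSecretsClient
        return UserSecretsClient().get_secret("HF_TOKEN")
    except Exception:
        return None
```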

Python API

import pandas as pd
from lfm_trainer import run_training, load_datasets
from lfm_trainer.config import TrainingConfig

# Load multiple sources including DataFrames
df = pd.read_csv("my_code_data.csv")
dataset = load_datasets([
    df,                                    # Direct DataFrame
    "code_data.parquet",                   # Local file
    "peteromallet/dataclaw-peteromallet",   # HuggingFace Hub (conversational)
])

# Or use the full pipeline
cfg = TrainingConfig(
    model_name="liquid/LFM2.5-1.2B-Base",
    dataset_paths=["peteromallet/dataclaw-peteromallet"],
    hub_repo_id="your-username/lfm-code",
    quality_filter=True,
    eval_split=0.1,
    run_benchmark=True,
    benchmark_before_after=True,
    export_gguf=True,
)
run_training(cfg)

CLI Reference

lfm-train --help
Flag | Default | Description
--dataset | (required) | Dataset path or Hub ID (repeatable)
--model | liquid/LFM2.5-1.2B-Base | Model to fine-tune
--resume-from | | Path or Hub ID of a prior adapter for continual training
--tool-calling-only | off | Keep only samples with tool calls
--quality-filter | off | Remove empty rows, dupes, length outliers
--eval-split | 0.0 | Hold out a fraction for eval (e.g. 0.1 = 10%)
--hf-token | auto-detect | HuggingFace token
--hub-repo | auto | Hub repo to push to
--no-push | off | Save locally only, skip Hub push
--epochs | 3 | Training epochs
--batch-size | 2 | Per-device batch size
--lr | 2e-4 | Learning rate
--max-seq-length | 2048 | Max sequence length
--lora-r | 16 | LoRA rank
--lora-alpha | 32 | LoRA alpha
--bf16 | off | Use bfloat16
--report-to | none | none, wandb, or tensorboard
--benchmark | off | Run HumanEval + MBPP after training
--benchmark-compare | off | Also benchmark base model for delta
--benchmark-max | all | Cap problems for quick testing
--no-model-card | off | Skip auto model card generation
--export-gguf | off | Export GGUF (Q4_K_M, Q6_K, Q8_0)
--export-mlx | off | Export MLX (4/6/8-bit)
--export-dir | ./lfm-exports | Export scratch directory
--simulate-error | off | Test auto-publish mechanism

Supported Dataset Formats

The data loader auto-detects column layouts:

Format | Detected by | Example
Alpaca | instruction + output columns | iamtarun/python_code_instructions_18k_alpaca
Prompt/Response | prompt + response columns | Generic Q&A datasets
Conversational | messages column (with tool calls) | peteromallet/dataclaw-peteromallet
Text | Single text column | jdaddyalbs/playwright-mcp-toolcalling
DataFrame | Direct pd.DataFrame objects | In-memory data
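
The column-layout detection in the table above could be sketched as a simple precedence check. The function name and exact precedence order are assumptions; the real loader may differ.

```python
def detect_format(columns):
    """Heuristic column-layout detection: check for Alpaca, then
    prompt/response, then conversational, then single-text layouts."""
    cols = set(columns)
    if {"instruction", "output"} <= cols:
        return "alpaca"
    if {"prompt", "response"} <= cols:
        return "prompt_response"
    if "messages" in cols:
        return "conversational"
    if "text" in cols:
        return "text"
    raise ValueError(f"Unrecognized column layout: {sorted(cols)}")
```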

Tool-Calling Training

Train exclusively on tool-calling examples using --tool-calling-only:

lfm-train \
    --dataset jdaddyalbs/playwright-mcp-toolcalling \
    --tool-calling-only \
    --hub-repo your-username/lfm-tools \
    --max-seq-length 4096

The filter keeps only samples containing tool call patterns (<|tool_call_start|>, tool_calls, function_call, etc.). Works with any dataset — pre-formatted or auto-formatted.
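
The filter predicate amounts to a substring scan over each sample's serialized text. A minimal sketch; the marker tuple below is illustrative rather than the library's exhaustive list.

```python
# Patterns that mark a tool-calling sample (illustrative subset).
TOOL_CALL_MARKERS = ("<|tool_call_start|>", "tool_calls", "function_call")

def has_tool_call(sample_text):
    """True if any known tool-call pattern appears in the sample."""
    return any(marker in sample_text for marker in TOOL_CALL_MARKERS)

def filter_tool_calls(samples):
    """Keep only samples that contain a tool-call pattern."""
    return [s for s in samples if has_tool_call(s)]
```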

Continual Training

Train iteratively across multiple datasets, saving locally between rounds:

# Round 1: Coding skills → save locally
lfm-train --dataset sahil2801/CodeAlpaca-20k --no-push

# Round 2: Add tool-calling on top → save locally
lfm-train \
    --resume-from ./lfm-checkpoints/final-adapter \
    --dataset jdaddyalbs/playwright-mcp-toolcalling \
    --tool-calling-only \
    --output-dir ./lfm-checkpoints-r2 \
    --no-push

# Round 3: Final round → push to Hub + export
lfm-train \
    --resume-from ./lfm-checkpoints-r2/final-adapter \
    --dataset peteromallet/dataclaw-peteromallet \
    --hub-repo your-username/lfm-final \
    --export-gguf

Each --resume-from loads the prior adapter and continues learning from where it left off.

How Auto-Publish Works

Training is wrapped in an error handler inspired by Unsloth:

  1. SIGTERM (Kaggle timeout) → saves + pushes immediately, then exits
  2. CUDA OOM → clears cache, saves + pushes
  3. KeyboardInterrupt → saves + pushes
  4. Any Exception → saves + pushes, then re-raises

Each checkpoint is tagged with a UTC timestamp (e.g., v20260302-153000) so versions never collide.
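
The handler's shape can be sketched with the standard library alone. Function names and the exact exit behavior are assumptions; `publish_fn` stands in for the save-and-push step, and the tag format matches the vYYYYMMDD-HHMMSS example above.

```python
import signal
import sys
from datetime import datetime, timezone

def version_tag():
    """UTC timestamp tag in the vYYYYMMDD-HHMMSS shape, e.g. v20260302-153000."""
    return datetime.now(timezone.utc).strftime("v%Y%m%d-%H%M%S")

def run_with_autopublish(train_fn, publish_fn):
    """Run train_fn, publishing a versioned checkpoint on SIGTERM,
    KeyboardInterrupt, or any other exception (which is re-raised)."""
    def on_sigterm(signum, frame):
        publish_fn(version_tag())
        sys.exit(0)

    signal.signal(signal.SIGTERM, on_sigterm)
    try:
        train_fn()
    except KeyboardInterrupt:
        publish_fn(version_tag())
    except Exception:
        publish_fn(version_tag())
        raise
```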

Post-Training Export

When --export-gguf or --export-mlx is enabled:

  1. LoRA adapters are merged into the base model
  2. GGUF: Converted via llama.cpp → Q4_K_M, Q6_K, Q8_0 → pushed to {repo}-GGUF
  3. MLX: Converted via mlx-lm → 4-bit, 6-bit, 8-bit → pushed to {repo}-MLX-{N}bit
  4. All repos (base + quants) share the same version tag

Note: MLX export requires Apple Silicon. Use --export-gguf on Kaggle (Linux), and --export-mlx locally on Mac.
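
The GGUF step boils down to one f16 conversion plus one quantization pass per level. The sketch below only builds the command lines, assuming a stock llama.cpp checkout (its `convert_hf_to_gguf.py` script and `llama-quantize` binary); the trainer drives the equivalent for you, and its internals may differ.

```python
def gguf_commands(merged_dir, out_prefix, quants=("Q4_K_M", "Q6_K", "Q8_0"),
                  llama_cpp="./llama.cpp"):
    """Build llama.cpp command lines: convert the merged model to an f16
    GGUF, then quantize it once per requested level."""
    f16 = f"{out_prefix}-f16.gguf"
    cmds = [["python", f"{llama_cpp}/convert_hf_to_gguf.py", merged_dir,
             "--outfile", f16]]
    for q in quants:
        cmds.append([f"{llama_cpp}/llama-quantize", f16,
                     f"{out_prefix}-{q}.gguf", q])
    return cmds
```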

Examples

See the examples/ directory for ready-to-run scripts:

Example | Description
basic_training.py | Simple Alpaca coding fine-tune
tool_calling_training.py | Tool-calling-only with playwright MCP
multi_dataset_training.py | Combining Hub + local + DataFrame sources
continual_training.py | Multi-round training with local saves
benchmark_training.py | Train + HumanEval/MBPP benchmark + auto-upload
full_benchmark_suite.py | All 5 benchmarks with before/after comparison
benchmark_only.py | Evaluate any model without training
export_only.py | Standalone GGUF/MLX quantization
kaggle_notebook.py | Copy-paste Kaggle cells

License

MIT

Download files

Download the file for your platform.

Source Distribution

lfm_trainer-0.5.0.tar.gz (190.8 kB)

Uploaded Source

Built Distribution


lfm_trainer-0.5.0-py3-none-any.whl (28.3 kB)

Uploaded Python 3

File details

Details for the file lfm_trainer-0.5.0.tar.gz.

File metadata

  • Download URL: lfm_trainer-0.5.0.tar.gz
  • Upload date:
  • Size: 190.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.24

File hashes

Hashes for lfm_trainer-0.5.0.tar.gz
Algorithm | Hash digest
SHA256 | 1c358157c0a0fe7bb199c85bbefa52c1fee3b0af6c0e63caf347a5fd6f8feb16
MD5 | 7c386d81e6e9f040ead2fc04eaa7dffb
BLAKE2b-256 | fd4eeb77aa3879def21ecb96f523661d84e36626cec3957c35d49b320432bb3d


File details

Details for the file lfm_trainer-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: lfm_trainer-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 28.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.24

File hashes

Hashes for lfm_trainer-0.5.0-py3-none-any.whl
Algorithm | Hash digest
SHA256 | 786f6c3bb487a6f5fda5fb04f1636a61facc2502a4de95f8877d8722e0aab02d
MD5 | 927b9ccf9033b49b78231d5b6ee04c83
BLAKE2b-256 | 902f72c6a3359b588e2989c2bce43224457eaf9d904c95533a7daef895d30e8a

