Fine-tune, experiment with, and run LLMs locally on your Mac

MLX Forge

Fine-tune LLMs on your Mac with MLX. No cloud, no CUDA required.

MLX Forge is a complete LLM fine-tuning toolkit that runs entirely on your Mac. Pick a model, upload your data, and start training — all from a browser-based UI. Supports LoRA, QLoRA, DPO, 18+ models, and 20+ curated datasets out of the box.

pip install mlx-forge
mlx-forge studio

[Screenshot: MLX Forge Studio, New Training]

Why MLX Forge?

One command to start — pip install mlx-forge && mlx-forge studio.
Browser-based Studio UI — Guided training wizard, real-time loss charts, model library with memory estimates, interactive playground.
Runs on Apple Silicon — Built on MLX. Your data stays on your machine.
Production training features — QLoRA (67% memory reduction), sequence packing (2-5x speedup), gradient checkpointing, DPO alignment, compiled training loop.
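
To illustrate what sequence packing means here: several short tokenized examples are concatenated into one row of at most `max_seq_length` tokens, so less of each batch is padding. The following is a minimal sketch of a greedy first-fit packer, not MLX Forge's actual implementation; `pack_sequences` and the sample data are hypothetical names chosen for illustration.

```python
from typing import List


def pack_sequences(seqs: List[List[int]], max_len: int) -> List[List[int]]:
    """Greedily pack token sequences into rows of at most max_len tokens."""
    rows: List[List[int]] = []
    # Placing the longest sequences first tends to leave fewer half-empty rows.
    for seq in sorted(seqs, key=len, reverse=True):
        seq = seq[:max_len]  # truncate anything longer than a single row
        for row in rows:
            if len(row) + len(seq) <= max_len:
                row.extend(seq)  # first row with enough free space wins
                break
        else:
            rows.append(list(seq))  # no row had space, so start a new one
    return rows


# Five short examples (token IDs are placeholders) packed into rows of 8.
examples = [[1] * 5, [2] * 3, [3] * 4, [4] * 2, [5] * 6]
packed = pack_sequences(examples, max_len=8)
print(len(examples), "sequences ->", len(packed), "packed rows")
```

A real packer also has to keep examples from attending to each other within a row (for instance through attention masks or position resets); this sketch leaves that out.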

Quick Start

Studio UI (recommended)

mlx-forge studio
# Opens at http://127.0.0.1:8741

Pick a recipe, choose a model, upload your data, and start training — all from the browser.

CLI

# Browse and download a dataset
mlx-forge data catalog
mlx-forge data download alpaca-cleaned --max-samples 5000

# Train from a config file (format shown under Configuration)
mlx-forge train --config train.yaml

Models are downloaded from Hugging Face on first use and cached locally, so later runs with the same model work offline.

Studio UI

[Screenshot: MLX Forge Studio, Model Library]

New Training — Guided wizard: pick a recipe (chat, instruction, DPO, writing style), choose a model, configure, and launch
Model Library — Browse 18+ models with memory estimates for your hardware
Experiments — Compare runs, view loss curves in real time
Datasets — Manage your training data
Playground — Chat with your fine-tuned models interactively

Supported Models

18 curated models in the Studio library, all tested on Apple Silicon:

Architecture	Models	Sizes
Qwen	Qwen 2.5, Qwen 3, Qwen 3.5	0.5B - 8B
Gemma	Gemma 2, Gemma 3	1B - 9B
Llama	Llama 3.1	8B
Phi	Phi-3 Mini, Phi-4 Mini	3.8B
DeepSeek	DeepSeek-R1-Distill (Qwen-based)	1.5B - 7B
Mistral	Mistral (uses Llama architecture)	7B

Any Hugging Face model that uses a supported architecture will work; the table above lists the curated models that have pre-computed memory estimates in Studio.
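
For a rough sense of where such memory estimates come from, the sketch below computes the memory taken by model weights alone. It is not how Studio computes its numbers. It assumes 16-bit weights for the unquantized case and, for 4-bit quantization with a group size of 64, one 16-bit scale and one 16-bit bias per group; the function names are hypothetical.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


def quantized_bits_per_weight(bits: int, group_size: int, overhead_bits: int = 32) -> float:
    """Effective bits per weight when every group stores extra metadata.

    overhead_bits=32 is an assumption: one 16-bit scale plus one 16-bit bias
    per group of `group_size` weights.
    """
    return bits + overhead_bits / group_size


for name, params in [("0.6B", 0.6e9), ("8B", 8e9)]:
    fp16 = weight_memory_gb(params, 16)
    q4 = weight_memory_gb(params, quantized_bits_per_weight(4, 64))
    print(f"{name}: 16-bit ~ {fp16:.1f} GB, 4-bit ~ {q4:.1f} GB")
```

Training needs more than this: activations, gradients, and optimizer state for the adapter all add to the total, so treat the result as a lower bound.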

Features

Training

LoRA and QLoRA (4-bit) fine-tuning with 67% memory reduction
DPO (Direct Preference Optimization) for alignment
Sequence packing for 2-5x speedup on short sequences
Gradient checkpointing for 40-60% memory savings
Compiled training loop with gradient accumulation
Cosine, linear, step, and exponential LR schedules with warmup
Resume from any checkpoint
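
As an illustration of one of the schedules listed above, here is a minimal sketch of linear warmup followed by cosine decay. It shows the general technique, not MLX Forge's implementation; `lr_at`, `warmup_steps`, and `min_lr` are hypothetical names, and the warmup length of 100 steps is an arbitrary choice.

```python
import math


def lr_at(step: int, base_lr: float, warmup_steps: int, total_steps: int,
          min_lr: float = 0.0) -> float:
    """Learning rate at `step`: linear warmup, then cosine decay to min_lr."""
    if step < warmup_steps:
        # Ramp up linearly so the last warmup step reaches base_lr.
        return base_lr * (step + 1) / warmup_steps
    # Fraction of the decay phase completed, clamped to [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(progress, 1.0)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# Print a few points of a 1000-step schedule with a peak rate of 1e-5.
for step in (0, 50, 100, 550, 1000):
    print(step, f"{lr_at(step, 1.0e-5, 100, 1000):.2e}")
```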

Data

20+ curated datasets across 7 categories (general, code, math, conversation, reasoning, safety, domain)
Auto-detection of chat, completions, text, and preference formats
Multi-dataset mixing with weighted sampling
Data validation with train/val overlap detection
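
Multi-dataset mixing with weighted sampling can be pictured as follows: for each training example, first pick a dataset with probability proportional to its weight, then pick a record from it. This is a minimal sketch under that assumption, not MLX Forge's implementation; `mix_datasets`, the dataset names, and the weights are hypothetical.

```python
import random
from typing import Dict, Iterator, List


def mix_datasets(datasets: Dict[str, List[dict]], weights: Dict[str, float],
                 num_samples: int, seed: int = 42) -> Iterator[dict]:
    """Yield records drawn from several datasets in proportion to their weights."""
    rng = random.Random(seed)  # a fixed seed keeps the mix reproducible
    names = list(datasets)
    probs = [weights[name] for name in names]
    for _ in range(num_samples):
        name = rng.choices(names, weights=probs, k=1)[0]  # pick a dataset
        yield rng.choice(datasets[name])                  # pick a record from it


# Two tiny placeholder datasets mixed 3:1.
data = {
    "general": [{"text": "general example"}],
    "code": [{"text": "code example"}],
}
sample = list(mix_datasets(data, {"general": 3.0, "code": 1.0}, num_samples=8))
print([record["text"] for record in sample])
```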

CLI Reference

Command	Description
`mlx-forge studio`	Launch the Studio UI
`mlx-forge train --config FILE`	Run LoRA/QLoRA/DPO training
`mlx-forge generate --model MODEL`	Generate text or interactive chat
`mlx-forge prepare --data FILE --model MODEL`	Pre-tokenize a dataset
`mlx-forge data catalog`	Browse 20+ curated datasets
`mlx-forge data download DATASET`	Download a dataset from the catalog
`mlx-forge data import FILE --name NAME`	Import a local JSONL file
`mlx-forge data validate FILE`	Validate JSONL data
`mlx-forge data inspect NAME`	Preview dataset samples
`mlx-forge data stats NAME`	Show dataset statistics

Configuration

schema_version: 1

model:
  path: "Qwen/Qwen3-0.6B"         # HF model ID or local path
  quantization:                     # Optional: QLoRA (67% memory savings)
    bits: 4
    group_size: 64

adapter:
  preset: "attention-qv"           # attention-qv | attention-all | mlp | all-linear
  rank: 16
  scale: 32.0

data:
  train: "./train.jsonl"
  valid: "./val.jsonl"
  packing: false                    # Sequence packing (2-5x speedup)
  max_seq_length: 2048

training:
  optimizer: adamw                  # adam | adamw | sgd | adafactor
  learning_rate: 1.0e-5
  num_iters: 1000
  batch_size: 4
  gradient_checkpointing: false     # 40-60% memory savings
  # training_type: dpo              # For DPO training
  # dpo_beta: 0.1

runtime:
  seed: 42
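
To give a feel for what `rank` controls: LoRA adds two small matrices to each adapted linear layer, one of shape rank x d_in and one of shape d_out x rank, so the number of trainable parameters grows linearly with the rank. The sketch below counts them for a hypothetical model in which only the query and value projections are adapted (an assumption about what the `attention-qv` preset means, based on its name). The layer shapes are illustrative and are not read from any real model.

```python
def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds to one d_in x d_out linear layer."""
    return rank * (d_in + d_out)


# Illustrative shapes only; not taken from any real model config.
hidden = 1024
q_out, v_out = 1024, 512
layers = 24
rank = 16

per_layer = lora_params(hidden, q_out, rank) + lora_params(hidden, v_out, rank)
total = per_layer * layers
print(f"{total:,} trainable LoRA parameters")
```

Doubling the rank doubles this count, which is why a higher rank costs more memory and compute in exchange for more adapter capacity.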

Data Formats

MLX Forge auto-detects four JSONL formats:

Chat — Multi-turn conversations (loss on assistant turns only):

{"messages": [{"role": "user", "content": "Hello"}, {"role": "assistant", "content": "Hi!"}]}

Completions — Prompt-completion pairs:

{"prompt": "Translate to French: Hello", "completion": "Bonjour"}

Text — Raw text for continued pretraining:

{"text": "The quick brown fox jumps over the lazy dog."}

Preference — For DPO alignment training:

{"chosen": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "good"}], "rejected": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "bad"}]}
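
The four formats above can be told apart by their top-level keys. The following is a minimal sketch of that idea, not MLX Forge's actual detection logic; `detect_format` and `detect_file_format` are hypothetical helpers.

```python
import json
from typing import Iterable


def detect_format(record: dict) -> str:
    """Classify one JSONL record by its top-level keys."""
    if "chosen" in record and "rejected" in record:
        return "preference"
    if "messages" in record:
        return "chat"
    if "prompt" in record and "completion" in record:
        return "completions"
    if "text" in record:
        return "text"
    raise ValueError(f"unrecognized record keys: {sorted(record)}")


def detect_file_format(lines: Iterable[str]) -> str:
    """Return the single format shared by all non-empty JSONL lines."""
    formats = {detect_format(json.loads(line)) for line in lines if line.strip()}
    if len(formats) != 1:
        raise ValueError(f"expected exactly one format, found: {sorted(formats)}")
    return formats.pop()


sample_lines = [
    '{"prompt": "Translate to French: Hello", "completion": "Bonjour"}',
    '{"prompt": "Translate to French: Cat", "completion": "Chat"}',
]
print(detect_file_format(sample_lines))
```

Checking for `chosen` and `rejected` first matters in this sketch because a preference record could carry other keys as well.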

Library API

All CLI commands are backed by Python functions:

from mlx_forge import prepare, train
from mlx_forge.config import TrainingConfig

# Train from a config file
config = TrainingConfig.from_yaml("train.yaml")
result = train(config=config)
print(f"Best val loss: {result.best_val_loss:.4f}")

from mlx_forge import generate

# Generate text with a fine-tuned adapter
generate(
    model="Qwen/Qwen3-0.6B",
    adapter="~/.mlxforge/runs/my-run/checkpoints/best",
    prompt="Explain quantum computing in simple terms.",
)

Contributing

See CONTRIBUTING.md for development setup, coding standards, and how to submit changes.

License

MIT

Release history

Version	Released
0.8.0	Apr 14, 2026
0.7.1	Apr 1, 2026
0.7.0	Mar 31, 2026
0.6.3	Mar 20, 2026
0.6.2	Mar 20, 2026
0.6.1	Mar 19, 2026
0.6.0	Mar 18, 2026
0.5.0	Mar 18, 2026
0.2.11	Mar 10, 2026
0.2.10	Mar 10, 2026
0.2.9	Mar 10, 2026
0.2.8	Mar 10, 2026
0.2.7	Mar 10, 2026
0.2.6	Mar 10, 2026
0.2.5	Mar 10, 2026
0.2.4	Mar 10, 2026
0.2.3	Mar 10, 2026
0.2.2 (this version)	Mar 10, 2026
0.2.1	Mar 10, 2026
0.2.0	Mar 10, 2026

Download files

Source Distribution

mlx_forge-0.2.2.tar.gz (365.0 kB)

Uploaded Mar 10, 2026 Source

Built Distribution

mlx_forge-0.2.2-py3-none-any.whl (344.4 kB)

Uploaded Mar 10, 2026 Python 3

File details

Details for the file mlx_forge-0.2.2.tar.gz.

File metadata

Download URL: mlx_forge-0.2.2.tar.gz
Upload date: Mar 10, 2026
Size: 365.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for mlx_forge-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`1c691b82b2178d380a31ac8011f9a4ddd99bc6c623595171a8f14463b59e0ebd`
MD5	`5899c0c2f3c23189950de992e2a2eb6a`
BLAKE2b-256	`2c6619bab116d55f22e3f78dcecd37cf3e62774f3d5393d8293149e633f131fa`

File details

Details for the file mlx_forge-0.2.2-py3-none-any.whl.

File metadata

Download URL: mlx_forge-0.2.2-py3-none-any.whl
Upload date: Mar 10, 2026
Size: 344.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for mlx_forge-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c46793fbc573fd0383fa1a545fafa624a4315ee878dc3183f6a9eda491776e7d`
MD5	`0a02f49d88f4e483523c14763de1b1b2`
BLAKE2b-256	`6481787e7f093cc0c081f3bee2e30eca5af24b02840526b7436c731303e26b97`
