FableForge-14B: Four-stage training pipeline for a 14B coding agent model

These details have not been verified by PyPI

Project description

FableForge-14B

Four-stage training pipeline for a 14B parameter coding agent model, trained on Fable5 datasets.

Overview

FableForge-14B builds a production-quality coding agent through four carefully designed training stages:

Stage 1: Behavior Shaping — Learn tool-use patterns from v-Fable 100K examples
Stage 2: Skill Distillation — Master coding excellence from 100K curated code samples
Stage 3: Error Recovery — Debug like an expert using Glint + armand0e 18K error patterns
Stage 4: DPO Alignment — Align agent behavior with expert preferences

Installation

pip install fableforge-14b

Training Pipeline

Stage 1: Behavior Shaping

Teaches the model when and how to use coding tools (read, edit, bash, grep, glob).

from fableforge_14b.training.stage1_behavior_shaping import run_stage1, Stage1Config

config = Stage1Config(
    base_model="Qwen/Qwen2.5-14B",
    dataset_path="data/vfable_100k.jsonl",
    LoRA_r=64,
    LoRA_alpha=128,
)
result = run_stage1(config)

Stage 2: Skill Distillation

Trains on 100K high-quality code generation examples across 6 skill categories.

from fableforge_14b.training.stage2_skill_distillation import run_stage2, Stage2Config

config = Stage2Config(stage1_adapter="output/stage1")
result = run_stage2(config)

Stage 3: Error Recovery

Trains on 18K real error patterns to diagnose and fix bugs expertly.

from fableforge_14b.training.stage3_error_recovery import run_stage3, Stage3Config

config = Stage3Config(stage2_adapter="output/stage2")
result = run_stage3(config)

Stage 4: DPO Alignment

Direct Preference Optimization to align with expert agent behavior.

from fableforge_14b.training.stage4_dpo_alignment import run_stage4, Stage4Config

config = Stage4Config(stage3_adapter="output/stage3")
result = run_stage4(config)

Model Merging

Merge LoRA adapters into the base model:

from fableforge_14b.model.merge_lora import merge_lora_adapters, MergeConfig

config = MergeConfig(
    base_model="Qwen/Qwen2.5-14B",
    adapters=["output/stage1", "output/stage2", "output/stage3", "output/stage4"],
)
result = merge_lora_adapters(config)

Quantization

Export in GGUF, AWQ, or GPTQ formats:

from fableforge_14b.model.quantize import quantize, QuantizeConfig

config = QuantizeConfig(
    model_path="output/merged",
    formats=["gguf", "awq", "gptq"],
)
result = quantize(config)

Inference Server

Start a vLLM inference server with tool calling support:

from fableforge_14b.inference.server import InferenceServer, ServerConfig

server = InferenceServer(ServerConfig(
    model_path="output/merged",
    port=8000,
    enable_tool_calling=True,
))
result = server.start()

Evaluation

Run the BenchAgent evaluation benchmark:

from fableforge_14b.evaluation.bench_agent import BenchAgent

bench = BenchAgent(model_path="output/merged")
bench.load_tasks()  # loads default tasks
report = bench.evaluate(model_name="fableforge-14b")
bench.save_report(report, "output/bench_results.json")

Training Configs

Each stage has a YAML config in src/fableforge_14b/training/configs/:

stage1.yaml — Behavior shaping (LoRA r=64, 3 epochs, 100K examples)
stage2.yaml — Skill distillation (LoRA r=64, 2 epochs, 100K examples)
stage3.yaml — Error recovery (LoRA r=32, 2 epochs, 18K examples)
stage4.yaml — DPO alignment (LoRA r=16, 1 epoch, preference pairs)

Datasets

Stage	Dataset	Size	Focus
1	v-Fable	100K	Tool-use patterns
2	Coding Excellence	100K	Code generation quality
3	Glint + armand0e	18K	Error diagnosis & recovery
4	DPO Preferences	50K pairs	Behavior alignment

License

MIT

Quick Start

fableforge-14b run "your task here"

Ecosystem

Part of the FableForge ecosystem — 21 open-source projects built from 210K real agent traces:

Project	Description
Anvil	Self-verified coding agent
VerifyLoop	Plan→Execute→Verify→Recover framework
ErrorRecovery	Self-healing middleware (3,725 error patterns)
FableForge-14B	The fine-tuned 14B model (4-stage training)
ShellWhisperer	1.5B edge agent (phone/RPi, 50ms)
ReasonCritic	Verification model (130 benchmark tasks)
TraceCompiler	Compile traces → LoRA skills
AgentRuntime	Persistent agent daemon (systemd for AI)
AgentSwarm	Multi-agent from real trace transitions
AgentTelemetry	Datadog for agents (token tracking, costs)
BenchAgent	HumanEval for tool-use (107 tasks)
AgentDev	VSCode extension with verification
TraceViz	Trace replay visualizer (Next.js)
AgentSkills	npm for agent behaviors
AgentCurriculum	5-stage progressive training
AgentFuzzer	Adversarial testing for agents
AgentConstitution	Safety guardrails from traces
CostOptimizer	Token cost reduction (50-80%)
AgentProfiler	Behavioral fingerprinting
TrajectoryDistiller	Trace→training data pipeline
Fable5-Dataset	HuggingFace dataset release

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Jun 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fableforge_14b-0.1.0.tar.gz (46.9 kB view details)

Uploaded Jun 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fableforge_14b-0.1.0-py3-none-any.whl (32.4 kB view details)

Uploaded Jun 14, 2026 Python 3

File details

Details for the file fableforge_14b-0.1.0.tar.gz.

File metadata

Download URL: fableforge_14b-0.1.0.tar.gz
Upload date: Jun 14, 2026
Size: 46.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for fableforge_14b-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`f52f1b2e7eec7d99a8e67adca79ee3e47100b3bff4f35adae941d096bd639e92`
MD5	`ec66921697da93abcad22ec01cd7fdf0`
BLAKE2b-256	`1fae6871283b1fa47d3442c2cf5ff4b428222174610815bdbcbbc0b4835ccff1`

See more details on using hashes here.

File details

Details for the file fableforge_14b-0.1.0-py3-none-any.whl.

File metadata

Download URL: fableforge_14b-0.1.0-py3-none-any.whl
Upload date: Jun 14, 2026
Size: 32.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for fableforge_14b-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`745c78e9342252e6026fbab30ab83ffdfa41284ffb6d52d46745736d0884e064`
MD5	`6c1aa6ca95ca79e80c701651a657a1da`
BLAKE2b-256	`80d9791088d8817cab1070b7298f441a940652ad29e4f4416773d293b129eb12`

See more details on using hashes here.

fableforge-14b 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

FableForge-14B

Overview

Installation

Training Pipeline

Stage 1: Behavior Shaping

Stage 2: Skill Distillation

Stage 3: Error Recovery

Stage 4: DPO Alignment

Model Merging

Quantization

Inference Server

Evaluation

Training Configs

Datasets

License

Quick Start

Ecosystem

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes