locotrainer

Lightweight code agent that explores codebases and generates analysis reports via OpenAI-compatible APIs

Project description

LocoTrainer-4B is a 4B-parameter MS-SWIFT domain expert agent distilled from Qwen3-Coder-Next. Designed to analyze MS-SWIFT codebases and generate comprehensive markdown reports — combining tool-calling capabilities with deep framework knowledge.

📋 Table of Contents

📰 News & Updates
📝 Introduction
✨ Key Features
🏗️ Architecture
📈 Performance
🚀 Quick Start
🔧 Usage Examples
📁 Project Structure
⚠️ Known Limitations
📄 License
🙏 Acknowledgments

📰 News & Updates

[2026-03-13] 🚀 LocoTrainer-4B and LocoTrainer framework released.

📝 Introduction

LocoTrainer-4B is a specialized code analysis agent trained via knowledge distillation from Qwen3-Coder-Next. Unlike general-purpose code agents, it combines multi-turn tool-calling capabilities with deep MS-SWIFT framework knowledge, enabling it to generate comprehensive codebase analysis reports without requiring a separate reasoning model.

	LocoTrainer-4B
Base Model	Qwen3-4B-Instruct-2507
Teacher Model	Qwen3-Coder-Next
Training Method	Full-parameter SFT (distillation)
Training Data	361,830 samples (agent trajectory + MS-SWIFT knowledge + project paths)
Max Sequence Length	32,768 tokens
Training Hardware	8x NVIDIA H100 80GB
Training Time	~25 hours
Framework	MS-SWIFT

✨ Key Features

🎯 MS-SWIFT Domain Expert: Trained on MS-SWIFT documentation, CLI parameters, and project structure — answers framework questions accurately without hallucination
🔧 Tool-Calling Agent: Generates structured <tool_call> JSON for Read, Grep, Glob, Bash, and Write tools
📊 End-to-End Reports: From a single question to a complete, well-structured markdown analysis report
🏠 Local Deployment: GGUF quantized, runs on Mac Studio via llama.cpp at zero API cost
📏 Long Context: 32K training covers 90% of long-context analysis scenarios
🔄 Auto Clone: Automatically clones ms-swift on first run — no manual setup needed

🏗️ Architecture

LocoTrainer consists of two components: the agent framework (this repo) and the LocoTrainer-4B model.

User Query
    │
    ▼
LocoTrainer Framework
    ├── build_user_query()     # injects absolute paths
    ├── get_system_reminder()  # simulates Claude Code environment
    └── Agent Loop
            │
            ▼
    LocoTrainer-4B (or any OpenAI-compatible model)
            │
            ├── <tool_call> Read / Grep / Glob / Bash
            │       │
            │       ▼
            │   Real Filesystem (ms-swift codebase)
            │       │
            │       ▼
            └── <tool_response> → next turn
                    │
                    ▼
            output/output.md   (final markdown report)
            output/trajectory.json  (full conversation log)

The framework simulates a Claude Code-style agent environment, which is exactly what LocoTrainer-4B was trained on — ensuring maximum compatibility between the model and the runtime.

📈 Performance

Evaluated on MS-SWIFT codebase analysis tasks across 3 test iterations.

Agent Loop Reliability

Metric	Test 1 (Relative Paths)	Test 2 (Absolute Paths)	Test 3 (Absolute + Tolerant Args)
Read success rate	0%	100%	100%
Write success rate	—	0%	100%
Turns used	15 (limit)	15 (limit)	9
Tool calls	34	27	13
output.md generated	✗	✗	✓ (225 lines)

Key Design Insight

Absolute paths in user content + tolerant tool argument parsing = reliable agent behavior.

This mirrors Claude Code's own design: the system always provides full absolute paths so the model never has to guess.

🚀 Quick Start

Prerequisites

uv — curl -LsSf https://astral.sh/uv/install.sh | sh
An OpenAI-compatible API key (DashScope, OpenRouter, or local llama.cpp)

Install

git clone https://github.com/LocoreMind/LocoTrainer.git
cd LocoTrainer
cp .env.example .env
# Edit .env with your API key
uv sync

Run (auto-clones ms-swift on first use)

uv run locotrainer run -q "What are the default LoRA settings in ms-swift?"
# → output/output.md

With LocoTrainer-4B via llama.cpp

# Start local server
./llama-server -m LocoTrainer-4B.gguf --ctx-size 51200 --port 8080

# Point LocoTrainer at it
LOCOTRAINER_BASE_URL=http://localhost:8080/v1
LOCOTRAINER_MODEL=LocoTrainer-4B
LOCOTRAINER_API_KEY=local

uv run locotrainer run -q "How does ms-swift implement GRPO training?"

With DashScope (Qwen3-Coder-Next)

LOCOTRAINER_API_KEY=sk-your-dashscope-key
LOCOTRAINER_BASE_URL=https://dashscope-intl.aliyuncs.com/compatible-mode/v1
LOCOTRAINER_MODEL=qwen3-coder-next
LOCOTRAINER_ENABLE_THINKING=true

🔧 Usage Examples

Analyze a specific topic

locotrainer run -q "What are all supported training methods in ms-swift and their differences?"

Analyze a different codebase

locotrainer run \
  -q "How does the authentication system work?" \
  -c /path/to/other/project \
  -o ./reports

Full CLI options

locotrainer run \
  -q "Your question here" \
  -c /path/to/codebase \    # optional, default: auto-clone ms-swift
  -o ./output \             # optional, default: ./output
  -m model-name \           # optional, overrides .env
  --max-turns 20 \          # optional, default: 20
  --quiet                   # suppress verbose output

📁 Project Structure

LocoTrainer/
├── pyproject.toml              # uv project config, pip installable
├── .env.example                # Configuration template
├── .env                        # User config (gitignored)
└── src/locotrainer/
    ├── __init__.py             # Package version
    ├── prompts.py              # SYSTEM_PROMPT + get_system_reminder()
    ├── tools.py                # ToolExecutor (Read/Grep/Glob/Write/Bash)
    ├── agent.py                # Agent loop
    ├── config.py               # Config dataclass, .env loading
    ├── repo.py                 # Auto-clone ms-swift logic
    └── cli.py                  # Click CLI: `locotrainer run`

⚙️ Configuration

Env Variable	Default	Description
`LOCOTRAINER_API_KEY`	(required)	API key (falls back to `OPENAI_API_KEY`)
`LOCOTRAINER_BASE_URL`	`https://api.openai.com/v1`	OpenAI-compatible endpoint
`LOCOTRAINER_MODEL`	`gpt-4o`	Model name
`LOCOTRAINER_MAX_TURNS`	`20`	Max agent loop turns
`LOCOTRAINER_MAX_TOKENS`	`8192`	Max tokens per response
`LOCOTRAINER_ENABLE_THINKING`	`false`	Enable thinking mode (Qwen3)
`LOCOTRAINER_CODEBASE`	`.`	Codebase path (triggers auto-clone if default)
`LOCOTRAINER_OUTPUT_DIR`	`./output`	Output directory

Training Details

📋 Click to expand full training configuration

Parameter	Value
Base model	Qwen3-4B-Instruct-2507
Teacher model	Qwen3-Coder-Next
Method	Full-parameter SFT
Training data	361,830 samples
Data composition	Agent trajectory + MS-SWIFT knowledge + project structure paths
Hardware	8x NVIDIA H100 80GB
DeepSpeed	ZeRO-2
Precision	BF16
Epochs	1
Max sequence length	32,768 tokens
Attention	Flash Attention 2
Kernel optimization	Liger Kernel
Learning rate	1e-5, warmup ratio 0.05
Batch size	1/GPU, gradient accumulation 4 (effective batch 32)
Template	qwen3_nothinking
Framework	MS-SWIFT
Training time	~25 hours

⚠️ Known Limitations

Specialized for MS-SWIFT; performance on unrelated codebases is untested
4B parameters — complex multi-hop reasoning may require a larger model
MS-SWIFT project structure knowledge reflects the training data snapshot; may drift as the framework evolves

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

🤖 Qwen Team for the Qwen3-4B-Instruct-2507 base model
🛠️ MS-SWIFT for the training framework and the codebase this model specializes in
🦙 llama.cpp for efficient local inference
🤖 Anthropic for the Claude Code agent loop design that inspired this work

Project details

Release history Release notifications | RSS feed

0.1.1

Mar 13, 2026

This version

0.1.0

Mar 13, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

locotrainer-0.1.0.tar.gz (11.8 kB view details)

Uploaded Mar 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

locotrainer-0.1.0-py3-none-any.whl (14.8 kB view details)

Uploaded Mar 13, 2026 Python 3

File details

Details for the file locotrainer-0.1.0.tar.gz.

File metadata

Download URL: locotrainer-0.1.0.tar.gz
Upload date: Mar 13, 2026
Size: 11.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.8

File hashes

Hashes for locotrainer-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`6bde4880feb49b555d7e4661b538562a07d85515443a5f9f6d684ee3be9a286d`
MD5	`2c9c7403b7bb8aec28a9a232b65fb413`
BLAKE2b-256	`33e2b831835212abed4477b3546c37f7d15b2a231a354a27ffff0098b3c07f9f`

See more details on using hashes here.

File details

Details for the file locotrainer-0.1.0-py3-none-any.whl.

File metadata

Download URL: locotrainer-0.1.0-py3-none-any.whl
Upload date: Mar 13, 2026
Size: 14.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.8

File hashes

Hashes for locotrainer-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3b468181ce3ede5cd1c2570eac21b6c94b9b9b86056af557cadca6fe3eac15b4`
MD5	`3c029754ca31e75f2f30213f79026bed`
BLAKE2b-256	`1716fc56350c08eb3bb74dfc5c644bfd45a9490b66cdc764487ba31f52597da6`

See more details on using hashes here.

locotrainer 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

📋 Table of Contents

📰 News & Updates

📝 Introduction

✨ Key Features

🏗️ Architecture

📈 Performance

Agent Loop Reliability

Key Design Insight

🚀 Quick Start

Prerequisites

Install

Run (auto-clones ms-swift on first use)

With LocoTrainer-4B via llama.cpp

With DashScope (Qwen3-Coder-Next)

🔧 Usage Examples

Analyze a specific topic

Analyze a different codebase

Full CLI options

📁 Project Structure

⚙️ Configuration

Training Details

⚠️ Known Limitations

📄 License

🙏 Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes