A comprehensive library for AI-assisted theorem proving in Lean

These details have not been verified by PyPI

Project links

Project description

LeanDojo-v2

LeanDojo-v2 is an end-to-end framework for training, evaluating, and deploying AI-assisted theorem provers for Lean 4. It combines repository tracing, lifelong dataset management, retrieval-augmented agents, Hugging Face fine-tuning, and external inference APIs into one toolkit.

Overview
Key Features
Repository Layout
Requirements
Installation
Environment Setup
Quick Start
Working with Agents and Trainers
Tracing and Dataset Generation
External APIs and LeanCopilot
Testing
Troubleshooting & Tips
Contributing
License

Overview

LeanDojo-v2 extends the original LeanDojo stack with the LeanAgent lifelong learning pipeline. It automates the entire loop of:

Cloning Lean repositories (GitHub or local) and tracing them with Lean instrumentation.
Storing structured theorem information in a dynamic database.
Training agent policies with supervised fine-tuning (SFT), GRPO-style RL, or retrieval objectives.
Driving Pantograph-based provers to fill in sorrys or verify solutions.
Using HuggingFace API for large model inference.

The codebase is modular: you can reuse the tracing pipeline without the agents, swap in custom trainers, or stand up your own inference service via the external API layer.

Key Features

Unified Agent Abstractions: BaseAgent orchestrates repository setup, training, and proving. Concrete implementations (HFAgent, LeanAgent, and ExternalAgent) tailor the workflow to Hugging Face models, retrieval-based provers, or REST-backed models.
Powerful Trainers: SFTTrainer, GRPOTrainer, and RetrievalTrainer cover LoRA-enabled supervised fine-tuning, group-relative policy optimization, and retriever-only curriculum learning.
Multi-Modal Provers: HFProver, RetrievalProver, and ExternalProver run on top of Pantograph’s Lean RPC server to search for tactics, generate whole proofs, or delegate to custom models.
Lean Tracing Pipeline: lean_dojo includes the Lean 4 instrumentation (ExtractData.lean) and Python utilities to trace commits, normalize ASTs, and cache proof states.
Dynamic Repository Database: database tracks repositories, theorems, curriculum difficulty, and sorry status, enabling lifelong training schedules.
External API: The external_api folder exposes HTTP endpoints (FastAPI + uvicorn) and Lean frontend snippets so you can query LLMs from Lean editors.

Repository Layout

Path	Description
`lean_dojo_v2/agent/`	Base class plus `HFAgent`, `LeanAgent`, and helpers to manage repositories and provers.
`lean_dojo_v2/trainer/`	SFT, GRPO, and retrieval trainers with Hugging Face + DeepSpeed integration.
`lean_dojo_v2/prover/`	Pantograph-based prover implementations (HF, retrieval, external).
`lean_dojo_v2/lean_dojo/`	Lean tracing, dataset generation, caching, and AST utilities.
`lean_dojo_v2/lean_agent/`	Lifelong learning pipeline (configs, database, retrieval stack, generator).
`lean_dojo_v2/external_api/`	LeanCopilot code (Lean + Python server) to query external models.
`lean_dojo_v2/utils/`	Shared helpers for Git, filesystem operations, and constants.
`lean_dojo_v2/tests/`	Pytest regression suite.

For deeper documentation on the lifelong learning component, see lean_dojo_v2/lean_agent/README.md.

Requirements

Python ≥ 3.11.
CUDA-capable GPU for training and inference (tested with CUDA 12.6).
Git ≥ 2.25 and wget.
elan Lean toolchain to trace repositories locally.
Adequate disk space for the raid/ working directory (datasets, checkpoints, traces).

Python dependencies are declared in pyproject.toml and include PyTorch, PyTorch Lightning, Transformers, DeepSpeed, TRL, PEFT, and more.

Installation

Option 1: From PyPI

# Install the core package
pip install lean-dojo-v2

# Pantograph is required for Lean RPC
pip install git+https://github.com/stanford-centaur/PyPantograph

# Install a CUDA-enabled torch build (adjust the index URL for your CUDA version)
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu126

Option 2: From Source (development)

git clone https://github.com/lean-dojo/LeanDojo-v2.git
cd LeanDojo-v2
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -e ".[dev]"
pip install git+https://github.com/stanford-centaur/PyPantograph
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu126

Tip: You can use uv (uv pip install lean-dojo-v2) as an alternative Python package manager.

Environment Setup

GitHub Access Token (required)
The tracing pipeline calls the GitHub API extensively. Create a personal access token and export it before running any agent:
```
export GITHUB_ACCESS_TOKEN=<your-token>
```
Hugging Face Token (optional but needed for gated models)
```
export HF_TOKEN=<your-hf-token>
```
Working directories
By default all datasets, caches, and checkpoints live under <repo>/raid. Change the layout by editing lean_dojo_v2/utils/constants.py or by pointing RAID_DIR to faster storage.
Lean toolchains
Ensure elan is configured and Lean 4 (e.g., leanprover/lean4:nightly) is available on your $PATH. The tracing scripts look under ~/.elan/toolchains/.

Quick Start

from lean_dojo_v2.agent.hf_agent import HFAgent
from lean_dojo_v2.trainer.sft_trainer import SFTTrainer

url = "https://github.com/durant42040/lean4-example"
commit = "005de00d03f1aaa32cb2923d5e3cbaf0b954a192"

trainer = SFTTrainer(
    model_name="deepseek-ai/DeepSeek-Prover-V2-7B",
    output_dir="outputs-deepseek",
    epochs_per_repo=1,
    batch_size=2,
    lr=2e-5,
)

agent = HFAgent(trainer=trainer)
agent.setup_github_repository(url=url, commit=commit)
agent.train()
agent.prove()

This example:

Downloads and traces the target Lean repository + commit.
Builds a supervised dataset from sorry theorems.
Fine-tunes the specified Hugging Face model (optionally with LoRA).
Launches an HFProver backed by Pantograph to search for proofs.

Working with Agents and Trainers

Supervised Fine-Tuning (`SFTTrainer`)

Accepts any Hugging Face causal LM identifier.
Supports LoRA by passing a peft.LoraConfig.
Key arguments: epochs_per_repo, batch_size, max_seq_len, lr, warmup_steps, gradient_checkpointing.
Produces checkpoints under output_dir that the HFProver consumes.

GRPO Trainer (`GRPOTrainer`)

Implements Group Relative Policy Optimization for reinforcement-style refinement.
Accepts reference_model, reward_weights, and kl_beta settings.
Useful for improving search policies on curated theorem batches.

Retrieval Trainer & LeanAgent

RetrievalTrainer trains the dense retriever that scores prior proofs.
LeanAgent wraps the trainer, maintains repository curricula, and couples it with RetrievalProver.

Each agent inherits BaseAgent, so you can implement your own by overriding _get_build_deps() and _setup_prover() to register new trainer/prover pairs.

Tracing and Dataset Generation

The lean_dojo_v2/lean_dojo/data_extraction package powers repository tracing:

lean.py clones repositories (GitHub, remote, or local), validates Lean versions, and normalizes URLs.
trace.py drives Lean with the custom ExtractData.lean instrumented module to capture theorem states.
dataset.py converts traced files to JSONL datasets ready for trainers.
cache.py memoizes repository metadata to avoid redundant downloads.
traced_data.py exposes typed wrappers for traced AST nodes and sorrys.

Typical usage:

from lean_dojo_v2.database import DynamicDatabase

url = "https://github.com/durant42040/lean4-example"
commit = "005de00d03f1aaa32cb2923d5e3cbaf0b954a192"

database = DynamicDatabase()

database.setup_github_repository(
    url=url,
    commit=commit,
    build_deps=False,
)

The generated artifacts flow into the DynamicDatabase, which keeps repositories sorted by difficulty and appends new sorrys without retracing everything.

External APIs and LeanCopilot

lean_dojo_v2/external_api contains Lean and Python code to expose models through LeanCopilot:

LeanCopilot.lean registers RPC endpoints inside Lean.
python/server.py hosts a FastAPI service with adapters for Anthropic, OpenAI, Google Generative AI, vLLM, and custom HF models.

Start the service with:

cd lean_dojo_v2/external_api/python
pip install -r requirements.txt
uvicorn server:app --port 23337

Point your Lean client to the running server to interactively request tactics, proofs, or completions from external models.

LeanProgress Step-Prediction Workflow

Generate a JSONL dataset with remaining-step targets (or replace it with your own LeanProgress export):

python -m lean_dojo_v2.lean_progress.create_sample_dataset --output raid/data/sample_leanprogress_dataset.jsonl

Fine-tune a regression head that predicts steps_remaining:

python -m lean_dojo_v2.lean_progress.train_steps_model \
  --dataset raid/data/sample_leanprogress_dataset.jsonl \
  --output-dir raid/checkpoints/leanprogress_steps \
  --model-name bert-base-uncased

Tell the LeanCopilot server where to find the checkpoint by exporting:

export LEANPROGRESS_MODEL=raid/checkpoints/leanprogress_steps
uvicorn server:app --port 23337

Add use_reward=true when calling /generate. Each output now includes steps_remaining and a reward value (currently -steps_remaining) so agents can minimize proof length.

Testing

We use pytest for regression coverage.

pip install -e .[dev]          # make sure dev extras like pytest/trl are present
export GITHUB_ACCESS_TOKEN=<token>
export HF_TOKEN=<hf-token>     # only required for tests touching HF APIs
pytest -v

Troubleshooting & Tips

401 Bad Credentials / rate limits: Ensure GITHUB_ACCESS_TOKEN is exported and has repo + read:org scopes.
Lean tracing failures: Confirm that the repo’s Lean version exists locally (elan toolchain install <version>).
Missing CUDA libraries: Install the PyTorch wheel that matches your driver and CUDA version.
Dataset location: The default raid/ directory can grow large. Point it to high-throughput storage or use symlinks.
Pantograph errors: Reinstall Pantograph from source (pip install git+https://github.com/stanford-centaur/PyPantograph) whenever Lean upstream changes.

Contributing

Issues and pull requests are welcome! Please:

Open an issue describing the bug or feature.
Run formatters (black, isort) and pytest before submitting.
Mention if your change touches Lean tracing files so reviewers can re-generate artifacts.

License

LeanDojo-v2 is released under the MIT License. See LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.9

Apr 26, 2026

1.0.8

Mar 10, 2026

1.0.7

Mar 1, 2026

1.0.6

Feb 26, 2026

1.0.5

Feb 14, 2026

1.0.4

Nov 28, 2025

This version

1.0.3

Nov 24, 2025

1.0.2

Nov 23, 2025

1.0.1

Oct 17, 2025

1.0.0

Oct 17, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lean_dojo_v2-1.0.3.tar.gz (139.0 kB view details)

Uploaded Nov 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lean_dojo_v2-1.0.3-py3-none-any.whl (165.4 kB view details)

Uploaded Nov 24, 2025 Python 3

File details

Details for the file lean_dojo_v2-1.0.3.tar.gz.

File metadata

Download URL: lean_dojo_v2-1.0.3.tar.gz
Upload date: Nov 24, 2025
Size: 139.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for lean_dojo_v2-1.0.3.tar.gz
Algorithm	Hash digest
SHA256	`4bd518d904125e28d7e652ca02fa747f2c4c3d8ded7f03b5f5920b09d519f423`
MD5	`dcf4ccf0c85579ca7102ae329a4bf128`
BLAKE2b-256	`cdd806e278d7974db549db8d2d0b3c050371d0ad8f728a8960e052fcfbe856ab`

See more details on using hashes here.

File details

Details for the file lean_dojo_v2-1.0.3-py3-none-any.whl.

File metadata

Download URL: lean_dojo_v2-1.0.3-py3-none-any.whl
Upload date: Nov 24, 2025
Size: 165.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for lean_dojo_v2-1.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c02714eb57c496d00cdc16a57e6178d96a242e72c9107180421d85a0c5720318`
MD5	`764c0cff3fcfa490479bc9e6ada91d33`
BLAKE2b-256	`1d86d435c83a0755023c12c6bc30104af649ce06c78c6d5480f76fc030876f55`

See more details on using hashes here.

lean-dojo-v2 1.0.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LeanDojo-v2

Table of Contents

Overview

Key Features

Repository Layout

Requirements

Installation

Option 1: From PyPI

Option 2: From Source (development)

Environment Setup

Quick Start

Working with Agents and Trainers

Supervised Fine-Tuning (SFTTrainer)

GRPO Trainer (GRPOTrainer)

Retrieval Trainer & LeanAgent

Tracing and Dataset Generation

External APIs and LeanCopilot

LeanProgress Step-Prediction Workflow

Testing

Troubleshooting & Tips

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Supervised Fine-Tuning (`SFTTrainer`)

GRPO Trainer (`GRPOTrainer`)