Python library for running proof search using COPRA

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3

Project description

COPRA

COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages.

What's New
Setup
Simple CLI for Lean 4
Running Experiments
Latest Evaluation Results
Citation

What's New

🎯 Simple CLI for Lean 4 (NEW!)

Streamlined command-line interface for quick theorem proving - no complex YAML configuration needed! Prove theorems with a single command, use environment variables for API keys, and get real-time progress output.

Jump to Simple CLI Guide →

🤖 vLLM Support for Open Source Models (NEW!)

Run open-source LLMs locally (Llama, Mistral, DeepSeek, etc.) with GPU acceleration. OpenAI-compatible API, automatic server management, and support for any Hugging Face model compatible with vLLM.

Jump to vLLM Setup →

🚀 Parallel Theorem Execution (NEW!)

Run proof search for multiple theorems in parallel to speed up evaluations on multi-core systems. COPRA automatically uses threading on Python 3.14t and later, and multiprocessing on older versions.

Jump to Parallel Execution Setup →

🐍 Python 3.14t Free-Threading Support (NEW!)

Full support for Python 3.14+ with free-threaded (GIL-free) execution. COPRA automatically selects the best parallel execution strategy based on the Python version and GIL status.

📊 Latest Evaluation Results (NEW!)

Recent MiniF2F Test Lean 4 results with GPT-OSS-20b reached a 42.8% pass@5 success rate, including one solved IMO problem (imo_1959_p1), on a low compute budget. View logs →

Jump to Full Results →

Setup

Quick Setup for Lean 4

# 1. Install COPRA
pip install copra-theorem-prover

# 2. Set Lean version (default: 4.24.0) and install REPL
export LEAN_VERSION="4.21.0"  # Must match your project version
install-lean-repl

# 3. Build the interface (ensure $HOME/.elan/bin is in PATH)
source $HOME/.elan/env  # Optional: if lean --version fails
install-itp-interface

Note: Tested on Linux. Windows users should use WSL.
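
After the install steps, it can help to confirm that the expected executables are visible on PATH. The following is a minimal sketch and not part of COPRA: the helper name `missing_tools` is hypothetical, and the tool names checked are simply the ones mentioned in this README.

```python
import shutil


def missing_tools(names, which=shutil.which):
    """Return the subset of `names` that cannot be found on PATH."""
    return [name for name in names if which(name) is None]


if __name__ == "__main__":
    # `lean` is provided by elan; `copra-lean-prover` is the installed script.
    absent = missing_tools(["lean", "copra-lean-prover"])
    if absent:
        print("Not found on PATH:", ", ".join(absent))
    else:
        print("All expected tools are on PATH")
```

If `lean` is reported as missing, sourcing `$HOME/.elan/env` as shown above is the first thing to try.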

Python 3.14t Setup (Optional)

# Create and activate environment
conda create -n py314-ft python=3.14 python-freethreading -c conda-forge
conda activate py314-ft

# Install COPRA
pip install copra-theorem-prover  # or: pip install -e . (for development)

# Run experiments (automatically detects Python version)
python -m copra.main.run --config-name lean4_simple_experiment

Note: Python 3.14t is experimental. COPRA automatically selects the best parallel execution strategy: free-threading (GIL disabled), threading (GIL enabled), or multiprocessing (Python < 3.14).
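
The selection rule in the note above can be sketched as follows. This illustrates the stated rule only; it is not COPRA's actual implementation, and the function names `gil_enabled` and `choose_strategy` are hypothetical. It relies on `sys._is_gil_enabled()`, which exists on Python 3.13 and later; on older interpreters the GIL is assumed to be enabled.

```python
import sys


def gil_enabled() -> bool:
    """Report whether the GIL is active in the running interpreter."""
    probe = getattr(sys, "_is_gil_enabled", None)  # available on Python 3.13+
    return True if probe is None else bool(probe())


def choose_strategy(version=None, gil=None) -> str:
    """Pick a parallel strategy from a (major, minor) version and GIL status."""
    version = sys.version_info[:2] if version is None else tuple(version)[:2]
    gil = gil_enabled() if gil is None else gil
    if version < (3, 14):
        return "multiprocessing"
    return "threading" if gil else "free-threading"


if __name__ == "__main__":
    print(choose_strategy())
```

Passing the version and GIL status as arguments keeps the rule easy to check without needing several interpreters installed.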

Full Setup for Coq and Lean

# 1. Install Coq dependencies and opam (Linux only, use WSL on Windows)
sudo apt install build-essential unzip bubblewrap
sh <(curl -sL https://raw.githubusercontent.com/ocaml/opam/master/shell/install.sh)

# 2. Add to ~/.bashrc (adjust path if ~/.opam/default doesn't exist)
export PATH="/home/$USER/.opam/default/bin:$PATH"
export PATH="/home/$USER/.elan/bin:$PATH"

# 3. Setup Python environment and install Lean 4
# (Follow Quick Setup for Lean 4 steps above)

Note: See OCaml installation guide for details.

vLLM Setup for Open Source Models

# Standard vLLM models (Llama, Mistral, DeepSeek, etc.)
# The quotes keep shells such as zsh from treating the brackets as a glob
pip install "copra-theorem-prover[os_models]"

# Adjust the CUDA version (cu118, cu121, cu128) based on your GPU
pip install --pre torch --extra-index-url https://download.pytorch.org/whl/nightly/cu128

Usage:

# vLLM server starts automatically on port 48000 (override with VLLM_PORT)
python -m copra.main.eval_benchmark eval_settings=my_vllm_config benchmark=miniF2F

Supported Models: vllm:openai/gpt-oss-20b, vllm:codellama/CodeLlama-7b-Instruct-hf, vllm:meta-llama/Llama-2-7b-chat-hf, vllm:EleutherAI/llemma_7b, vllm:deepseek-ai/deepseek-math-7b-instruct, or any HuggingFace model compatible with vLLM.

Requirements: Python ≤ 3.12 and a CUDA-capable GPU. See vLLM issue #26480 for known reasoning token limitations.
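
The model names listed above share a `vllm:` prefix followed by a Hugging Face repository id. A minimal sketch of splitting such an identifier is shown below; how COPRA itself parses these strings is not documented here, so the function `split_model_id` is a hypothetical illustration only.

```python
def split_model_id(model: str):
    """Split 'backend:repo' into (backend, repo); backend is None if absent."""
    backend, sep, rest = model.partition(":")
    if not sep:
        return None, model
    return backend, rest


if __name__ == "__main__":
    for name in ["vllm:openai/gpt-oss-20b", "vllm:EleutherAI/llemma_7b"]:
        print(split_model_id(name))
```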

Simple CLI for Lean 4

🎯 NEW: A streamlined command-line interface for quick theorem proving without complex configuration!

The simple CLI provides an easy way to prove theorems with minimal setup - perfect for quick experiments, testing, or integration into other tools.

Quick Example:

# Set your API key
export OPENAI_API_KEY="sk-..."

# Prove a theorem (using installed script)
copra-lean-prover \
  --project data/test/lean4_proj \
  --file Lean4Proj/Temp.lean \
  --theorem test

# Or use as a Python module
python -m copra.simple \
  --project data/test/lean4_proj \
  --file Lean4Proj/Temp.lean \
  --theorem test

Key Features:

✅ No complex YAML configuration needed
✅ Environment variable support for API keys (12-factor app pattern)
✅ Simple command-line arguments for common settings
✅ Real-time progress output
✅ Modular architecture ready for REST API integration

Common Usage:

# Prove all theorems in a file
copra-lean-prover --project <path> --file <file> --theorem "*"

# Override timeout and temperature
copra-lean-prover \
  --project <path> --file <file> --theorem <name> \
  --timeout 300 --temperature 0.8

# Save proof to file
copra-lean-prover \
  --project <path> --file <file> --theorem <name> \
  --output proof.txt

Available Options:

--timeout - Timeout in seconds (default: 200)
--temperature - LLM sampling temperature (default: 0.7)
--proof-retries - Number of retry attempts (default: 4)
--main-prompt - Custom system prompt
--conv-prompt - Custom conversation prompt
--model - LLM model to use (default: gpt-5-mini)
--output - Save proof to file
--verbose - Detailed logging
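
For integration into other tools, the command line can be assembled programmatically. The sketch below only builds the argument list from flags documented above; the helper `build_command` is hypothetical and is not a COPRA API.

```python
import shlex


def build_command(project, file, theorem, timeout=None, temperature=None, output=None):
    """Assemble a copra-lean-prover argument list from the documented flags."""
    cmd = [
        "copra-lean-prover",
        "--project", str(project),
        "--file", str(file),
        "--theorem", str(theorem),
    ]
    if timeout is not None:
        cmd += ["--timeout", str(timeout)]
    if temperature is not None:
        cmd += ["--temperature", str(temperature)]
    if output is not None:
        cmd += ["--output", str(output)]
    return cmd


if __name__ == "__main__":
    cmd = build_command("data/test/lean4_proj", "Lean4Proj/Temp.lean", "*", timeout=300)
    # shlex.join quotes "*" so the shell does not expand it.
    print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd)`. How the CLI signals failure through its exit code is not documented here, so check the output rather than relying on the return code alone.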

📖 Full Documentation →

Running Experiments

API Setup

Option 1: Environment Variables (Recommended)

# OpenAI
export OPENAI_API_KEY="sk-..."

# Anthropic
export ANTHROPIC_API_KEY="sk-ant-..."

# AWS Bedrock
export AWS_ACCESS_KEY_ID="..."
export AWS_SECRET_ACCESS_KEY="..."
export AWS_REGION="us-east-1"
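
A quick way to see which credentials are visible to a Python process is sketched below. The variable names come from the list above; the provider labels and the function `configured_providers` are hypothetical and not part of COPRA.

```python
import os

# Environment variables each provider needs, as listed in this README.
REQUIRED_VARS = {
    "openai": ["OPENAI_API_KEY"],
    "anthropic": ["ANTHROPIC_API_KEY"],
    "bedrock": ["AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY", "AWS_REGION"],
}


def configured_providers(env=None):
    """Return providers whose required variables are all set and non-empty."""
    env = os.environ if env is None else env
    return [
        provider
        for provider, names in REQUIRED_VARS.items()
        if all(env.get(name) for name in names)
    ]


if __name__ == "__main__":
    print("Configured providers:", configured_providers() or "none")
```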

Option 2: Secrets File

Create .secrets/openai_key.json:

{"organization": "<your-org-id>", "api_key": "<your-api-key>"}
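
The secrets file can also be generated from a script. The sketch below writes the two keys shown above and restricts the file to the current user; the helper `write_openai_secret` and its `root` parameter are hypothetical, and the permission change is a precaution of this example, not something COPRA requires.

```python
import json
import os
import tempfile
from pathlib import Path


def write_openai_secret(organization, api_key, root="."):
    """Write <root>/.secrets/openai_key.json readable by the owner only."""
    secrets_dir = Path(root) / ".secrets"
    secrets_dir.mkdir(parents=True, exist_ok=True)
    path = secrets_dir / "openai_key.json"
    path.write_text(json.dumps({"organization": organization, "api_key": api_key}))
    os.chmod(path, 0o600)  # owner read/write only
    return path


if __name__ == "__main__":
    # Demonstrated in a temporary directory so no real project is touched.
    with tempfile.TemporaryDirectory() as tmp:
        written = write_openai_secret("<your-org-id>", "<your-api-key>", root=tmp)
        print(written.name, sorted(json.loads(written.read_text())))
```

To create the real file, call the function with `root` set to the directory from which COPRA is run.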

Running Experiments

# Auto-detects Python version (recommended)
python src/copra/main/run.py --config-name lean4_simple_experiment

Note: See ./src/copra/main/config/experiments.yaml for available configurations.

Running the miniF2F Benchmark

# Setup Lean 4.21.0 (required for miniF2F)
export LEAN_VERSION="4.21.0"
install-lean-repl && install-itp-interface

# Run with GPT-OSS-20b
python -m copra.main.run --config-name miniF2F_lean4_easy_to_hard

# Run with OpenAI models (requires API key)
python -m copra.main.eval_benchmark \
  benchmark=miniF2F_lean4_test_easy_to_hard \
  eval_settings=n_60_dfs_gpt4_o_no_retrieve_no_ex

Results: .log/proofs/eval_driver/dfs/miniF2F_lean4_test_easy_to_hard/<timestamp>/
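
Because each run writes to its own `<timestamp>` directory, a small helper can locate the most recent one. The sketch below picks the newest subdirectory by modification time, since the timestamp naming format is not documented here; `latest_run` is a hypothetical helper, not a COPRA function.

```python
from pathlib import Path


def latest_run(results_root):
    """Return the most recently modified subdirectory, or None if there is none."""
    root = Path(results_root)
    runs = [p for p in root.iterdir() if p.is_dir()] if root.is_dir() else []
    return max(runs, key=lambda p: p.stat().st_mtime, default=None)


if __name__ == "__main__":
    results = ".log/proofs/eval_driver/dfs/miniF2F_lean4_test_easy_to_hard"
    print(latest_run(results))
```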

Starting Required Services

Lean 4: No setup needed (COPRA manages REPL automatically)
Isabelle: COPRA auto-manages PISA service (default port: 17000, override with PISA_PORT)
Coq: Ensure correct version is active (coqc --version) and project is built (make)
vLLM/Llama: COPRA auto-starts services (logs in .log/evals/benchmark/<name>/<timestamp>/)

Important: ITP projects must be built before running COPRA. For Coq, ensure the correct opam switch is active.
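
When a service fails to start, it is useful to know which port it is expected on and whether anything is already listening there. The sketch below uses the defaults and override variables named in this README (VLLM_PORT, 48000; PISA_PORT, 17000). The helpers `service_port` and `is_listening` are hypothetical, and the host `127.0.0.1` is an assumption.

```python
import os
import socket

# Defaults as stated in this README.
DEFAULT_PORTS = {"VLLM_PORT": 48000, "PISA_PORT": 17000}


def service_port(var, env=None):
    """Resolve a service port from its override variable or the default."""
    env = os.environ if env is None else env
    raw = env.get(var, "").strip()
    port = int(raw) if raw else DEFAULT_PORTS[var]
    if not 0 < port < 65536:
        raise ValueError(f"{var} is out of range: {port}")
    return port


def is_listening(port, host="127.0.0.1", timeout=0.5):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    for var in DEFAULT_PORTS:
        port = service_port(var)
        print(var, port, "in use" if is_listening(port) else "free")
```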

Parallel Theorem Execution (NEW!)

Quick Start:

# Enable with auto-detected workers (CPU cores / 2)
export ENABLE_PARALLEL_THEOREMS=True
python src/copra/main/run.py --config-name lean4_simple_experiment

# Custom worker count
export MAX_PARALLEL_WORKERS=4

How it works: Automatically uses threading (Python 3.14t+) or multiprocessing (Python < 3.14). Disabled by default.

Configuration:

ENABLE_PARALLEL_THEOREMS: Enable/disable (default: False)
MAX_PARALLEL_WORKERS: Worker count (default: auto-detected)

Tips: Match workers to CPU cores, ensure services handle concurrent requests, use sequential mode for <4 theorems.

Troubleshooting: Reduce workers if you encounter memory/service errors.
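
The configuration above can be read from Python as sketched below. This mirrors the documented defaults (disabled, and CPU cores / 2 workers) but is not COPRA's own parsing code: the function `parallel_settings` is hypothetical, and the set of accepted truthy spellings is an assumption.

```python
import os


def parallel_settings(env=None, cpu_count=None):
    """Return (enabled, workers) from the two documented environment variables."""
    env = os.environ if env is None else env
    cpu_count = (os.cpu_count() or 1) if cpu_count is None else cpu_count
    flag = env.get("ENABLE_PARALLEL_THEOREMS", "False").strip().lower()
    enabled = flag in {"1", "true", "yes"}  # assumed spellings
    raw = env.get("MAX_PARALLEL_WORKERS", "").strip()
    workers = int(raw) if raw else cpu_count // 2
    return enabled, max(1, workers)  # always keep at least one worker


if __name__ == "__main__":
    print(parallel_settings())
```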

Latest Evaluation Results (NEW!)

You can find selected recent evaluation results here:

MiniF2F Test Lean 4 (v4.21.0) - GPT-OSS-20b (low reasoning tokens) (no retrieval):
- 42.798% overall success rate at pass@5
- Solves 1 IMO problem (imo_1959_p1)
- Decent performance for low compute budget
- See logs: execution_logs
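
This README does not state how the pass@5 figure is computed. One common empirical reading is the fraction of theorems with at least one successful proof among the first k attempts, sketched below. The function `pass_at_k` and the toy outcomes are hypothetical illustrations, not COPRA's evaluation code or real results.

```python
def pass_at_k(attempts, k):
    """Fraction of theorems with a success among their first k attempts.

    `attempts` maps a theorem name to a list of booleans, one per attempt.
    """
    if not attempts:
        raise ValueError("no theorems to score")
    solved = sum(1 for outcomes in attempts.values() if any(outcomes[:k]))
    return solved / len(attempts)


if __name__ == "__main__":
    # Made-up outcomes for four made-up theorems.
    toy = {
        "thm_a": [False, True],
        "thm_b": [False, False],
        "thm_c": [True],
        "thm_d": [False, False, False, False, False, True],
    }
    print(f"pass@5 = {pass_at_k(toy, 5):.3f}")
```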

Important Notes

The ITP projects must be built before running COPRA
For Coq projects, ensure the correct switch/version is active
Services (PISA, Llama) are automatically managed by COPRA
Check logs in .log/evals/benchmark/<name>/<timestamp>/ for debugging

Citation

You can cite our paper:

@inproceedings{thakur2024context,
  title={An in-context learning agent for formal theorem-proving},
  author={Thakur, Amitayush and Tsoukalas, George and Wen, Yeming and Xin, Jimmy and Chaudhuri, Swarat},
  booktitle={First Conference on Language Modeling},
  year={2024}
}

Our paper is available on OpenReview and arXiv.

Release history

- 1.11.0 (this version): Jun 22, 2026
- 1.10.0: Jun 22, 2026
- 1.9.0: Jun 14, 2026
- 1.7.0: May 31, 2026
- 1.6.0: Jan 22, 2026
- 1.5.0: Jan 10, 2026
- 1.4.0: Nov 22, 2025
- 1.3.0: Nov 21, 2025
- 1.2.0: Nov 13, 2025
- 1.1.17: Nov 5, 2025
- 1.1.16: Oct 24, 2025
- 1.1.15: Oct 19, 2025
- 1.1.14: Oct 18, 2025
- 1.1.13: Oct 16, 2025
- 1.1.12: Oct 16, 2025
- 1.1.11: Oct 4, 2025
- 1.1.10: Sep 28, 2025
- 1.1.9: May 14, 2025
- 1.1.8: May 13, 2025
- 1.1.7: May 13, 2025
- 1.1.6: May 8, 2025
- 1.1.5: May 8, 2025
- 1.1.4: May 6, 2025
- 1.1.3: May 5, 2025
- 1.1.2: May 3, 2025
- 1.1.1: Apr 27, 2025
- 1.0.0: Apr 21, 2025

Download files

Source Distribution

copra_theorem_prover-1.11.0.tar.gz (5.2 MB), uploaded Jun 22, 2026

- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
- SHA256: `884c75bdd89c58efe59d5bfdab889ec225fc03aac2c6c54bc44f59e19a0b4427`
- MD5: `eac553b67d54545412182de35048ca45`
- BLAKE2b-256: `7cebc7db1e928c2a035bcdcd7cb84a5b951deb0bddea6c699ddb1351cdef8bcc`

Built Distribution

copra_theorem_prover-1.11.0-py3-none-any.whl (229.1 kB), uploaded Jun 22, 2026

- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
- SHA256: `ad4e5e919448b90dca40d779985342057e1973b995fa407da49e7e6425c2125a`
- MD5: `ac2cd454cf35e37cfacf1e0193a41c63`
- BLAKE2b-256: `d6198ef5731d4f5facd7faae8d1dbb33c1c81178419f1d888f9fc64964679c87`
