Gödel's Poetry
A recursive, reflective POETRY algorithm variant using Goedel-Prover-V2
Gödel's Poetry is an automated theorem proving system that combines Large Language Models (LLMs) with formal verification in Lean 4. The system takes mathematical theorems, written either in informal natural language or in formal Lean syntax, and automatically generates verified proofs through a multi-agent architecture.
- Github repository: https://github.com/KellyJDavis/goedels-poetry/
- Documentation: https://KellyJDavis.github.io/goedels-poetry/
What Does Gödel's Poetry Do?
Gödel's Poetry is an AI-powered theorem proving system that bridges the gap between informal mathematical reasoning and formal verification. The system:
- Accepts theorems in multiple formats:
  - Informal natural language (e.g., "Prove that the square root of 2 is irrational")
  - Formal Lean 4 syntax (e.g., theorem sqrt_two_irrational : Irrational (√2) := by sorry)
- Automatically generates verified proofs through a multi-agent workflow:
  - Formalization: Converts informal statements into formal Lean 4 theorems
  - Semantic Checking: Validates that formalizations preserve the original meaning
  - Proof Generation: Creates proofs using specialized LLMs trained on Lean 4
  - Proof Sketching: Decomposes difficult theorems into manageable subgoals
  - Verification: Validates all proofs using the Lean 4 proof assistant
  - Recursive Refinement: Iteratively improves proofs until they are complete and verified
- Leverages state-of-the-art technology:
  - Custom fine-tuned models (Goedel-Prover-V2, Goedel-Formalizer-V2)
  - Integration with frontier LLMs (GPT-5, Qwen3)
  - The Kimina Lean Server for high-performance Lean 4 verification
  - LangGraph for orchestrating complex multi-agent workflows
The system is designed for researchers, mathematicians, and AI practitioners interested in automated theorem proving, formal verification, and the intersection of natural and formal languages.
Quick Start
Prerequisites
Before installing Gödel's Poetry, ensure you have:
- Python 3.9 or higher (tested on Python 3.9-3.13)
- pip (comes with Python)
- Lean 4 for the Kimina server (installation covered below)
For development:
- uv - Fast Python package installer (optional, but recommended for development)
curl -LsSf https://astral.sh/uv/install.sh | sh
- Git for cloning the repository
Installation
Option 1: Install from PyPI (Recommended)
# Install using pip
pip install goedels-poetry
# Verify installation
goedels_poetry --help
Option 2: Install from Source (For Development)
# Clone the repository
git clone https://github.com/KellyJDavis/goedels-poetry.git
cd goedels-poetry
# Install with uv (recommended) or pip
uv sync
# The command line tool is now available via:
uv run goedels_poetry --help
Running the Kimina Lean Server
The Kimina Lean Server is required for Gödel's Poetry to verify Lean 4 proofs. It provides high-performance parallel proof checking.
Setup Steps:
- Clone the Kimina Lean Server (separate repository):
  git clone https://github.com/KellyJDavis/kimina-lean-server.git
  cd kimina-lean-server
- Run the setup script (installs Lean 4, mathlib4, and dependencies):
  bash setup.sh
  This will:
  - Install Elan (the Lean version manager)
  - Install Lean 4 (default version v4.15.0)
  - Clone and build the Lean REPL
  - Clone and build the AST export tool
  - Clone and build mathlib4 (Lean's math library)
  ⚠️ Note: This process can take 15-30 minutes depending on your system.
- Install server dependencies:
  pip install -r requirements.txt
  pip install .
  prisma generate
- Start the server:
  python -m server
  The server will start on http://0.0.0.0:8000 by default.
- Verify the server is running (in a new terminal):
  curl --request POST \
    --url http://localhost:8000/verify \
    --header 'Content-Type: application/json' \
    --data '{
      "codes": [{"custom_id": "test", "proof": "#check Nat"}],
      "infotree_type": "original"
    }' | jq
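The same check can be issued from Python. The sketch below uses only the standard library; the endpoint and payload are taken from the curl command, while the helper names build_verify_request and verify are illustrative and not part of Gödel's Poetry or the Kimina server.

```python
import json
import urllib.request


def build_verify_request(base_url="http://localhost:8000"):
    """Build the POST /verify request from the curl check (hypothetical helper)."""
    payload = {
        "codes": [{"custom_id": "test", "proof": "#check Nat"}],
        "infotree_type": "original",
    }
    return urllib.request.Request(
        url=f"{base_url}/verify",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def verify(base_url="http://localhost:8000", timeout=60):
    """Send the request and return the decoded JSON response."""
    request = build_verify_request(base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, calling verify() should return the parsed JSON that the curl command pipes to jq; the shape of that response is not documented here, so inspect it before relying on specific fields.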
Alternative: Docker (Production)
For production deployments, you can use Docker:
cd kimina-lean-server
docker compose up
See the Kimina Server README for more deployment options.
Setting Up Your API Keys
Gödel's Poetry can use either OpenAI or Google Generative AI for the remote LLM calls configured in the DECOMPOSER_AGENT_LLM section (proof sketching and decomposition). Configure one of the two providers:
Option 1: OpenAI (Default)
- Get an API key from OpenAI's platform
- Set the environment variable:
On Linux/macOS:
export OPENAI_API_KEY='your-api-key-here'
On Windows (Command Prompt):
set OPENAI_API_KEY=your-api-key-here
On Windows (PowerShell):
$env:OPENAI_API_KEY='your-api-key-here'
Option 2: Google Generative AI
- Get an API key from Google AI Studio
- Set the environment variable:
On Linux/macOS:
export GOOGLE_API_KEY='your-api-key-here'
On Windows (Command Prompt):
set GOOGLE_API_KEY=your-api-key-here
On Windows (PowerShell):
$env:GOOGLE_API_KEY='your-api-key-here'
Provider Selection
The system automatically selects the provider based on available API keys:
- If both keys are set, OpenAI takes priority
- If only one key is set, that provider is used
- If no keys are set, the system falls back to OpenAI with a warning
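The priority rules above amount to a few lines of logic. A minimal sketch follows, assuming the keys are read from the environment; select_provider is a hypothetical name, and the project's actual implementation may differ.

```python
import os
import warnings


def select_provider(env=None):
    """Return "openai" or "google" following the documented priority rules.

    Hypothetical helper for illustration only.
    """
    env = os.environ if env is None else env
    has_openai = bool(env.get("OPENAI_API_KEY"))
    has_google = bool(env.get("GOOGLE_API_KEY"))

    if has_openai:
        # OpenAI wins whenever its key is present, even if Google's is too.
        return "openai"
    if has_google:
        return "google"
    # No key at all: fall back to OpenAI, but tell the user.
    warnings.warn("No API key found; falling back to OpenAI.")
    return "openai"
```

Passing a plain dictionary instead of os.environ makes the rules easy to check without touching the real environment.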
- Make it permanent (optional):
  Add the export command to your shell configuration file:
  - Bash: ~/.bashrc or ~/.bash_profile
  - Zsh: ~/.zshrc
  - Fish: ~/.config/fish/config.fish
Using the Command Line Tool
Once installed, you can use the goedels_poetry command to prove theorems:
Prove a Single Formal Theorem
goedels_poetry --formal-theorem "theorem theorem_54_43 : 1 + 1 = 2 := by sorry"
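For orientation, the input above is a Lean 4 statement whose proof is the placeholder sorry. A completed proof that Lean accepts for this particular statement can be very short. The block below is a hand-written illustration, not output captured from the tool, and the primed name is used only to avoid a duplicate declaration.

```lean
-- Input: the statement with a placeholder proof.
theorem theorem_54_43 : 1 + 1 = 2 := by sorry

-- One possible completed proof: both sides evaluate to the same numeral.
theorem theorem_54_43' : 1 + 1 = 2 := by rfl
```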
Prove a Single Informal Theorem
goedels_poetry --informal-theorem "Prove that the sum of two even numbers is even"
Batch Process Multiple Theorems
Process all .lean files in a directory:
goedels_poetry --formal-theorems ./my-theorems/
Process all .txt files containing informal theorems:
goedels_poetry --informal-theorems ./informal-theorems/
For batch processing, the tool will:
- Read each theorem from its file
- Attempt to generate and verify a proof
- Save results to .proof files alongside the originals
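The batch loop can be pictured as follows. This is a sketch under stated assumptions: batch_prove and its prove argument are hypothetical stand-ins for the real pipeline, and the output naming (same base name, .proof extension) is inferred from the description above.

```python
from pathlib import Path


def batch_prove(directory, prove, pattern="*.lean"):
    """Run `prove` on every matching file and write a sibling .proof file.

    `prove` is a caller-supplied function (theorem text -> proof text);
    it stands in for the real proving pipeline.
    """
    written = []
    for source in sorted(Path(directory).glob(pattern)):
        theorem = source.read_text(encoding="utf-8").strip()
        result = prove(theorem)
        target = source.with_suffix(".proof")  # test1.lean -> test1.proof
        target.write_text(result + "\n", encoding="utf-8")
        written.append(target)
    return written
```

For informal theorems the same loop would apply with pattern="*.txt".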
Get Help
goedels_poetry --help
Enable Debug Mode
To see detailed LLM and Kimina server responses during execution, set the GOEDELS_POETRY_DEBUG environment variable:
On Linux/macOS:
export GOEDELS_POETRY_DEBUG=1
goedels_poetry --formal-theorem "theorem theorem_54_43 : 1 + 1 = 2 := by sorry"
On Windows (Command Prompt):
set GOEDELS_POETRY_DEBUG=1
goedels_poetry --formal-theorem "theorem theorem_54_43 : 1 + 1 = 2 := by sorry"
On Windows (PowerShell):
$env:GOEDELS_POETRY_DEBUG=1
goedels_poetry --formal-theorem "theorem theorem_54_43 : 1 + 1 = 2 := by sorry"
When debug mode is enabled, all responses from:
- FORMALIZER_AGENT_LLM - Formalization responses
- PROVER_AGENT_LLM - Proof generation responses
- SEMANTICS_AGENT_LLM - Semantic checking responses
- DECOMPOSER_AGENT_LLM - Proof sketching/decomposition responses
- KIMINA_SERVER - Lean 4 verification and AST parsing responses
will be printed to the console with rich formatting for easy debugging and inspection.
Examples
Example 1: Simple Arithmetic
goedels_poetry --formal-theorem \
"theorem add_comm_example : 3 + 5 = 5 + 3 := by sorry"
Example 2: Informal Theorem
goedels_poetry --informal-theorem \
"Prove that for any natural numbers a and b, a + b = b + a"
Example 3: Batch Processing
Create a directory with theorem files:
mkdir theorems
echo "theorem test1 : 2 + 2 = 4 := by sorry" > theorems/test1.lean
echo "theorem test2 : 5 * 5 = 25 := by sorry" > theorems/test2.lean
goedels_poetry --formal-theorems ./theorems/
Results will be saved as test1.proof and test2.proof.
How It Works
Gödel's Poetry uses a sophisticated multi-agent architecture coordinated by a supervisor agent. The workflow adapts based on the input:
For Informal Theorems:
- Formalizer Agent - Converts natural language to Lean 4 syntax
- Syntax Checker Agent - Validates the formal theorem syntax
- Semantics Agent - Ensures the formalization preserves meaning
- Prover Agent - Generates the proof
- Proof Checker Agent - Verifies the proof in Lean 4
- Parser Agent - Extracts the AST structure
For Complex Theorems (Recursive Decomposition):
When direct proving fails, the system activates proof sketching:
- Proof Sketcher Agent - Creates a high-level proof outline
- Sketch Checker Agent - Validates the sketch syntax
- Decomposition Agent - Extracts sub-theorems from the sketch
- Recursive Proving - Each sub-theorem is proved independently
- Proof Reconstruction - Combines verified sub-proofs into the final proof
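The recursive strategy above can be sketched in a few lines. The functions prove and decompose are hypothetical placeholders for the prover and sketching agents, proofs are represented as plain strings, and joining them with newlines is only a stand-in for real proof reconstruction. The default depth limit mirrors the max_depth = 20 setting in the default configuration.

```python
def prove_recursively(theorem, prove, decompose, depth=0, max_depth=20):
    """Try a direct proof; on failure, split into sub-theorems and recurse.

    `prove(theorem)` returns a proof string or None, and `decompose(theorem)`
    returns a list of sub-theorems. Both are hypothetical stand-ins.
    """
    proof = prove(theorem)
    if proof is not None:
        return proof
    if depth >= max_depth:
        return None  # recursion budget exhausted

    sub_proofs = []
    for sub in decompose(theorem):
        sub_proof = prove_recursively(sub, prove, decompose, depth + 1, max_depth)
        if sub_proof is None:
            return None  # a fuller system could backtrack to another sketch here
        sub_proofs.append(sub_proof)
    if not sub_proofs:
        return None  # the sketch produced nothing to prove
    # Reconstruction: combine the verified sub-proofs into one proof.
    return "\n".join(sub_proofs)
```

This sketch gives up as soon as one sub-theorem fails; the backtracking described under Key Features would instead request an alternative decomposition at that point.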
Key Features:
- Automatic Correction: Agents iteratively fix syntax and logical errors
- Backtracking: When a decomposition approach fails, the system tries alternatives
- State Management: Complete provenance tracking for reproducibility
- Parallel Processing: Batch theorem proving with efficient resource usage
Developer Guide
Development Setup
- Clone and install with development dependencies:
  git clone https://github.com/KellyJDavis/goedels-poetry.git
  cd goedels-poetry
  make install
  This will:
  - Create a virtual environment using uv
  - Install all dependencies
  - Set up pre-commit hooks for code quality
- Activate the environment (if needed):
  source .venv/bin/activate    # Linux/macOS
  .venv\Scripts\activate       # Windows
Testing
The project includes comprehensive unit and integration tests.
Unit Tests Only (Fast)
make test
This runs all tests except the integration tests, which require a Lean installation and a running Kimina Lean server.
Integration Tests (Requires Lean Server)
Integration tests verify the Kimina Lean Server integration. These tests require a running Kimina Lean server.
First-time setup:
# Install integration test dependencies
uv sync
# Clone the Kimina Lean Server (if not already cloned)
cd .. && git clone https://github.com/KellyJDavis/kimina-lean-server.git
cd kimina-lean-server
# Install Lean and build dependencies (takes 15-30 minutes)
bash setup.sh
# Install server dependencies
pip install -r requirements.txt
pip install .
prisma generate
Run integration tests:
# Terminal 1: Start the Kimina server
cd ../kimina-lean-server
python -m server
# Terminal 2: Run the tests
cd ../goedels-poetry
make test-integration
The tests will automatically connect to http://localhost:8000. To use a different URL:
export KIMINA_SERVER_URL=http://localhost:9000
make test-integration
Note: Integration tests require Python 3.10+ and a running Lean server with proper REPL configuration.
All Tests
make test-all
This runs both unit and integration tests sequentially.
Makefile Targets
The repository provides several convenient Make targets:
| Target | Description |
|---|---|
| make install | Install the virtual environment and pre-commit hooks |
| make check | Run all code quality checks (linting, type checking, dependency audit) |
| make test | Run unit tests with coverage (excludes integration tests) |
| make test-integration | Run integration tests (requires Lean installation) |
| make test-all | Run all tests (unit + integration) |
| make build | Build wheel distribution package |
| make clean-build | Remove build artifacts |
| make publish | Publish to PyPI (requires credentials) |
| make docs | Build and serve documentation locally |
| make docs-test | Test documentation build without serving |
| make help | Display all available targets with descriptions |
Code Quality Tools
The make check target runs:
- uv lock - Ensures lock file consistency
- pre-commit - Runs linting and formatting (Ruff)
- mypy - Static type checking
- deptry - Checks for unused, missing, and transitive dependencies
Configuration
Default Configuration Parameters
Configuration is stored in goedels_poetry/data/config.ini:
[FORMALIZER_AGENT_LLM]
model = kdavis/goedel-formalizer-v2:32b
num_ctx = 40960
max_retries = 10
[PROVER_AGENT_LLM]
model = kdavis/Goedel-Prover-V2:32b
num_ctx = 40960
max_self_corrections = 3
max_depth = 20
[SEMANTICS_AGENT_LLM]
model = qwen3:30b
num_ctx = 262144
[DECOMPOSER_AGENT_LLM]
# Provider selection (openai, google, auto)
provider = auto
# OpenAI-specific settings
openai_model = gpt-5-2025-08-07
openai_max_completion_tokens = 50000
openai_max_remote_retries = 5
openai_max_self_corrections = 6
# Google-specific settings
google_model = gemini-2.5-flash
google_max_output_tokens = 50000
google_max_self_corrections = 6
[KIMINA_LEAN_SERVER]
url = http://0.0.0.0:8000
max_retries = 5
Configuration Parameters Explained
Formalizer Agent:
- model: The LLM used to convert informal theorems to Lean 4
- num_ctx: Context window size (tokens)
- max_retries: Maximum attempts to formalize a theorem
Prover Agent:
- model: The LLM used to generate proofs
- num_ctx: Context window size (tokens)
- max_self_corrections: Maximum proof generation self-correction attempts
- max_depth: Maximum recursion depth for proof decomposition
Semantics Agent:
- model: The LLM used to validate semantic equivalence
- num_ctx: Context window size (tokens)
Decomposer Agent:
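- provider: Which remote provider to use (openai, google, or auto)
- openai_model / google_model: The LLM used for proof sketching and decomposition
- openai_max_completion_tokens / google_max_output_tokens: Maximum tokens in the generated response
- openai_max_remote_retries: Retry attempts for API calls
- openai_max_self_corrections / google_max_self_corrections: Maximum decomposition self-correction attempts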
Kimina Lean Server:
- url: Server endpoint for Lean verification
- max_retries: Maximum retry attempts for server requests
Overriding Configuration with Environment Variables
The recommended way to customize configuration is using environment variables. This approach doesn't require modifying files and works great for different environments (development, testing, production):
Format: SECTION__OPTION (double underscore separator, uppercase)
Examples:
# Use a different prover model
export PROVER_AGENT_LLM__MODEL="custom-model:latest"
# Change the Kimina server URL
export KIMINA_LEAN_SERVER__URL="http://localhost:9000"
# Use a smaller context window for faster testing
export PROVER_AGENT_LLM__NUM_CTX="8192"
# Run with custom configuration
goedels_poetry --formal-theorem "theorem theorem_54_43 : 1 + 1 = 2 := by sorry"
Multiple overrides:
export PROVER_AGENT_LLM__MODEL="kdavis/Goedel-Prover-V2:70b"
export DECOMPOSER_AGENT_LLM__OPENAI_MODEL="gpt-5-pro"
export KIMINA_LEAN_SERVER__MAX_RETRIES="10"
goedels_poetry --formal-theorem "..."
Using Google Generative AI:
export GOOGLE_API_KEY="your-google-api-key"
export DECOMPOSER_AGENT_LLM__GOOGLE_MODEL="gemini-2.5-flash"
export DECOMPOSER_AGENT_LLM__GOOGLE_MAX_OUTPUT_TOKENS="100000"
goedels_poetry --formal-theorem "..."
Environment variables are optional - if not set, the system uses values from config.ini.
For more details and advanced configuration options, see CONFIGURATION.md.
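To make the SECTION__OPTION convention concrete, the following sketch resolves a setting by checking the environment first and falling back to the INI value. It assumes nothing about the project's internal configuration code: get_option is a hypothetical helper, and the inlined defaults are a fragment of the config.ini shown earlier.

```python
import configparser
import os


def get_option(config, section, option, env=None):
    """Return the SECTION__OPTION override if set, else the config.ini value."""
    env = os.environ if env is None else env
    env_name = f"{section}__{option}".upper()
    if env_name in env:
        return env[env_name]
    return config.get(section, option)


# A fragment of the default configuration, inlined so the sketch is self-contained.
DEFAULTS = """
[PROVER_AGENT_LLM]
model = kdavis/Goedel-Prover-V2:32b
num_ctx = 40960
"""

config = configparser.ConfigParser()
config.read_string(DEFAULTS)
```

With no override, get_option(config, "PROVER_AGENT_LLM", "num_ctx") yields the file value; after export PROVER_AGENT_LLM__NUM_CTX="8192" it yields the override. Values come back as strings in both cases, so numeric settings need an explicit conversion.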
Alternative: Modifying config.ini Directly
If you prefer, you can still modify the configuration file directly:
# Find the installation path
uv run python -c "import goedels_poetry; print(goedels_poetry.__file__)"
# Edit the config.ini in the installation directory
# Typically: .venv/lib/python3.x/site-packages/goedels_poetry/data/config.ini
Note: Direct file changes persist until you reinstall or update the package, while environment variables are more flexible and don't require reinstallation.
Contributing
We welcome contributions! Please see CONTRIBUTING.md for detailed guidelines.
Quick contribution workflow:
- Fork the repository
- Clone your fork: git clone git@github.com:YOUR_NAME/goedels-poetry.git
- Install development environment: make install
- Create a feature branch: git checkout -b feature-name
- Make your changes and add tests
- Run quality checks: make check
- Run tests: make test
- Commit with descriptive messages
- Push and create a pull request
Code quality requirements:
- All tests must pass (make test)
- Code must pass linting and type checking (make check)
- New features should include tests and documentation
- Follow the existing code style and conventions
Project Structure
goedels-poetry/
├── goedels_poetry/ # Main package
│ ├── agents/ # Multi-agent system components
│ │ ├── formalizer_agent.py
│ │ ├── prover_agent.py
│ │ ├── proof_checker_agent.py
│ │ ├── sketch_*.py # Proof sketching agents
│ │ └── ...
│ ├── config/ # Configuration management
│ ├── data/ # Prompts and config files
│ │ ├── config.ini
│ │ └── prompts/
│ ├── parsers/ # AST parsing utilities
│ ├── cli.py # Command-line interface
│ ├── framework.py # Core orchestration logic
│ └── state.py # State management
├── tests/ # Test suite
├── Makefile # Development automation
├── pyproject.toml # Package configuration
├── CHANGELOG.md # Version history
└── README.md # This file
Note: The Kimina Lean Server is a separate repository that must be installed and run independently.
License
This project is licensed under the Apache License 2.0. See the LICENSE file for details.
Acknowledgments
- Kimina Lean Server: Built on Project Numina's excellent Lean verification server
- Lean 4: The formal verification system that powers proof checking
- LangChain & LangGraph: Frameworks for LLM orchestration
- Mathlib4: Comprehensive mathematics library for Lean
Citation
If you use Gödel's Poetry in your research, please cite:
@software{goedels_poetry,
author = {Davis, Kelly J.},
title = {Gödel's Poetry: Recursive Automated Theorem Proving},
year = {2025},
url = {https://github.com/KellyJDavis/goedels-poetry}
}
For the Kimina Lean Server:
@misc{santos2025kiminaleanservertechnical,
title={Kimina Lean Server: Technical Report},
author={Marco Dos Santos and Haiming Wang and Hugues de Saxcé and Ran Wang and Mantas Baksys and Mert Unsal and Junqi Liu and Zhengying Liu and Jia Li},
year={2025},
eprint={2504.21230},
archivePrefix={arXiv},
primaryClass={cs.LO},
url={https://arxiv.org/abs/2504.21230}
}
Support
- Issues: Report bugs or request features at GitHub Issues
- Discussions: Ask questions at GitHub Discussions
- Documentation: Visit the official docs
Ready to prove some theorems? 🚀
goedels_poetry --informal-theorem "Prove that the sum of the first n natural numbers equals n(n+1)/2"