Skip to main content

Reinforcement learning for Lean theorem proving

Project description

LeanReinforcement

Repository for LeanReinforcement (LR), a Monte Carlo Tree Search (MCTS) agent building on the ReProver system.

Gym-like

This repository builds on the gym-like LeanDojo interface for interacting with Lean. See the LeanDojo documentation for more information on how to use it.

Monte Carlo Tree Search

Relevant sources will be added here as the project progresses.

TODO

  • Import ReProver weights to Snellius
  • Set up premise retrieval code
  • Create gym environment with LeanDojo and OpenAI Gym
  • Decide PPO vs MCTS => MCTS on top of ReProver
  • Update Snellius environment
  • Import LeanDojo benchmark to Snellius
  • Look into LeanDojo step time, value functions, etc.
  • Implement runner script to run MCTS agent on LeanDojo environment
  • Implement appropriate DataLoader training and evaluation
  • Implement training loop
  • Run experiments on Snellius
  • Finish in-comment ToDo's
    • Checkpoint saving and loading
    • Subtree reusage
    • Reimplement tactic generator fine-tuning
  • Implement ValueHead Dataset creation
  • Set up wandb/tensorboard equivalent logging

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lean_reinforcement-0.1.1.tar.gz (36.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lean_reinforcement-0.1.1-py3-none-any.whl (50.9 kB view details)

Uploaded Python 3

File details

Details for the file lean_reinforcement-0.1.1.tar.gz.

File metadata

  • Download URL: lean_reinforcement-0.1.1.tar.gz
  • Upload date:
  • Size: 36.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for lean_reinforcement-0.1.1.tar.gz
Algorithm Hash digest
SHA256 b3a5b98580ab633c53733747c3d80cea8751738348415f1dbd5dca8b6beeab2b
MD5 2fcfeb691154ce9e467a7c5090a80c19
BLAKE2b-256 f30631a4afc84adb6cdafe2e84ba1a205bc614e1924506e635d89c14a8bc652c

See more details on using hashes here.

Provenance

The following attestation bundles were made for lean_reinforcement-0.1.1.tar.gz:

Publisher: python-publish.yml on GerbenKoopman/LeanReinforcement

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file lean_reinforcement-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for lean_reinforcement-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 45c48fed5863d02b7586b4e16343d9864a917071ecb5f3bd326732b623d3e29b
MD5 fa299cf9ea3450f5910edd74967f9902
BLAKE2b-256 6e401725a11a9bfb6f1eb73f9b7bd43ac4ac009394a922933fe7ffa7ad2516e0

See more details on using hashes here.

Provenance

The following attestation bundles were made for lean_reinforcement-0.1.1-py3-none-any.whl:

Publisher: python-publish.yml on GerbenKoopman/LeanReinforcement

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page