Skip to main content

Gymnasium environments for saturation provers

Project description

PyPI versionAnacondaCircleCIDocumentation StatuscodecovJOSS

gym-saturation

gym-saturation is a collection of Gymnasium environments for reinforcement learning (RL) agents striving to prove theorems. Currently, only theorems written in TPTP library formal language are supported.

There are two environments in gym-saturation following the same API: SaturationEnv: VampireEnv is a wrapper around a recent Vampire prover, and IProverEnv relies on a stable version of iProver.

In contrast to monolithic architecture of a typical Automated Theorem Prover (ATP), gym-saturation gives different agents opportunities to select clauses themselves and train from their experience. Combined with a particular agent, gym-saturation can work as an ATP.

gym-saturation can be interesting for RL practitioners willing to apply their experience to theorem proving without coding all the logic-related stuff themselves. It also can be useful for automated deduction researchers who want to create an RL-empowered ATP.

How to Install

The best way to install this package is to use pip:

pip install gym-saturation

Another option is to use conda:

conda install -c conda-forge gym-saturation

One can also run it in a Docker container (pre-packed with vampire and iproveropt binaries):

docker build -t gym-saturation https://github.com/inpefess/gym-saturation.git
docker run -it --rm -p 8888:8888 gym-saturation jupyter-lab --ip=0.0.0.0 --port=8888

How to use

One can use gym-saturation environments as any other Gymnasium environment:

import gym_saturation
import gymnasium

env = gymnasium.make("Vampire-v0")  # or "iProver-v0"
# skip this line to use the default problem
env.set_task("a-TPTP-problem-filename")
observation, info = env.reset()
terminated, truncated = False, False
while not (terminated or truncated):
    # apply policy (a valid random action here)
    action = env.action_space.sample(mask=observation["action_mask"])
    observation, reward, terminated, truncated, info = env.step(action)
env.close()

Or have a look at the basic tutorial.

For a bit more comprehensive experiments, please navigate the documentation page.

How to Contribute

Please follow the contribution guide while adhering to the code of conduct.

More documentation

More documentation can be found here.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gym_saturation-0.10.2.tar.gz (24.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gym_saturation-0.10.2-py3-none-any.whl (37.6 kB view details)

Uploaded Python 3

File details

Details for the file gym_saturation-0.10.2.tar.gz.

File metadata

  • Download URL: gym_saturation-0.10.2.tar.gz
  • Upload date:
  • Size: 24.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.11.4 Linux/6.2.0-23-generic

File hashes

Hashes for gym_saturation-0.10.2.tar.gz
Algorithm Hash digest
SHA256 67246287d34defb3854c6657ed012145202fa6d7e3050ffc44a7d1747d385925
MD5 63df3cc3218a6edd3ffea584ebc9e38e
BLAKE2b-256 de553bfb6665e7becf52b6797de1d3a70981962cdfd6a6bfe8d08a279274d486

See more details on using hashes here.

File details

Details for the file gym_saturation-0.10.2-py3-none-any.whl.

File metadata

  • Download URL: gym_saturation-0.10.2-py3-none-any.whl
  • Upload date:
  • Size: 37.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.11.4 Linux/6.2.0-23-generic

File hashes

Hashes for gym_saturation-0.10.2-py3-none-any.whl
Algorithm Hash digest
SHA256 1486b73abd51f7a3a16f16e2fb191ff081e4b87567e5ae19cbf2b151e26410e5
MD5 41472e2deba6c14cc57012d479e222b8
BLAKE2b-256 8a0fc3f42c754fc4ed96e3b780987098a8be997f9f9492068691e39e1e9bed77

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page