Skip to main content

Gymnasium environments for saturation provers

Project description

PyPI versionAnacondaCircleCIDocumentation StatuscodecovJOSS

gym-saturation

gym-saturation is a collection of Gymnasium environments for reinforcement learning (RL) agents striving to prove theorems. Currently, only theorems written in TPTP library formal language are supported.

There are two environments in gym-saturation following the same API: SaturationEnv: VampireEnv is a wrapper around a recent Vampire prover, and IProverEnv relies on a stable version of iProver.

In contrast to monolithic architecture of a typical Automated Theorem Prover (ATP), gym-saturation gives different agents opportunities to select clauses themselves and train from their experience. Combined with a particular agent, gym-saturation can work as an ATP.

gym-saturation can be interesting for RL practitioners willing to apply their experience to theorem proving without coding all the logic-related stuff themselves. It also can be useful for automated deduction researchers who want to create an RL-empowered ATP.

How to Install

The best way to install this package is to use pip:

pip install gym-saturation

Another option is to use conda:

conda install -c conda-forge gym-saturation

One can also run it in a Docker container (pre-packed with vampire and iproveropt binaries):

docker build -t gym-saturation https://github.com/inpefess/gym-saturation.git
docker run -it --rm -p 8888:8888 gym-saturation jupyter-lab --ip=0.0.0.0 --port=8888

How to use

One can use gym-saturation environments as any other Gymnasium environment:

import gym_saturation
import gymnasium

env = gymnasium.make("Vampire-v0")  # or "iProver-v0"
# skip this line to use the default problem
env.set_task("a-TPTP-problem-filename")
observation, info = env.reset()
terminated, truncated = False, False
while not (terminated or truncated):
    # apply policy (a valid random action here)
    action = env.action_space.sample(mask=observation["action_mask"])
    observation, reward, terminated, truncated, info = env.step(action)
env.close()

Or have a look at the basic tutorial.

For a bit more comprehensive experiments, please navigate the documentation page.

How to Contribute

Please follow the contribution guide while adhering to the code of conduct.

More documentation

More documentation can be found here.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gym_saturation-0.10.1.tar.gz (24.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gym_saturation-0.10.1-py3-none-any.whl (37.5 kB view details)

Uploaded Python 3

File details

Details for the file gym_saturation-0.10.1.tar.gz.

File metadata

  • Download URL: gym_saturation-0.10.1.tar.gz
  • Upload date:
  • Size: 24.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.0 CPython/3.11.2 Linux/6.2.0-20-generic

File hashes

Hashes for gym_saturation-0.10.1.tar.gz
Algorithm Hash digest
SHA256 fc6f575cb2e7693793a4c145b19f1a9e1f8797658c74901f098f809daffcafed
MD5 97d11ac5062c7c6ffb906091119f2e38
BLAKE2b-256 36e58c195f8b3201af95c3b83ffec32b62a008fe533879582ab89ec7c740a52a

See more details on using hashes here.

File details

Details for the file gym_saturation-0.10.1-py3-none-any.whl.

File metadata

  • Download URL: gym_saturation-0.10.1-py3-none-any.whl
  • Upload date:
  • Size: 37.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.0 CPython/3.11.2 Linux/6.2.0-20-generic

File hashes

Hashes for gym_saturation-0.10.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6207cdbd2b189d59f8a0e94bf23a2d42669e567aa532b6b018c1532458220995
MD5 5c2683718435cb2a396755fb7564f3aa
BLAKE2b-256 475c96c02a67b30287d5a1d201dde0dfcf57ef5d47d2be77b42171133cd907dc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page