Skip to main content

Reinforcement learning on directed graphs

Project description

PyPI version Total alerts Language grade: Python DOI DOI

graph-env

The graphenv Python library is designed to

  1. make graph search problems more readily expressible as RL problems via an extension of the OpenAI gym API while
  2. enabling their solution via scalable learning algorithms in the popular RLLib library.

RLLib provides out-of-the-box support for both parametrically-defined actions and masking of invalid actions. However, native support for action spaces where the action choices change for each state is challenging to implement in a computationally efficient fashion. The graphenv library provides utility classes that simplify the flattening and masking of action observations for choosing from a set of successor states at every node in a graph search.

Installation

Graphenv can be installed with pip:

pip install graphenv

Quick Start

graph-env allows users to create a customized graph search by subclassing the Vertex class. Basic examples are provided in the graphenv/examples folder. The following code snippet shows how to randomly sample from valid actions for a random walk down a 1D corridor:

import random
from graphenv.examples.hallway.hallway_state import HallwayState
from graphenv.graph_env import GraphEnv

state = HallwayState(corridor_length=10)
env = GraphEnv({"state": state, "max_num_children": 2})

obs = env.make_observation()
done = False
total_reward = 0

while not done:
    action = random.choice(range(len(env.state.children)))
    obs, reward, terminated, truncated, info = env.step(action)
    done = terminated or truncated
    total_reward += reward

Additional details on this example are given in the documentation

Documentation

The documentation is hosted on GitHub Pages

Contributing

We welcome bug reports, suggestions for new features, and pull requests. See our contributing guidelines for more details.

License

graph-env is licensed under the BSD 3-Clause License. Copyright (c) 2022, Alliance for Sustainable Energy, LLC

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

graphenv-0.2.6.tar.gz (1.3 MB view details)

Uploaded Source

Built Distribution

graphenv-0.2.6-py3-none-any.whl (26.4 kB view details)

Uploaded Python 3

File details

Details for the file graphenv-0.2.6.tar.gz.

File metadata

  • Download URL: graphenv-0.2.6.tar.gz
  • Upload date:
  • Size: 1.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for graphenv-0.2.6.tar.gz
Algorithm Hash digest
SHA256 18656e1e7d4563e8aa4d965ca9ba716c8f35d9134892e0cb6b604674028dc4b4
MD5 7decef9afb1e5bfab7072ab38f592dbb
BLAKE2b-256 799a754b2d5fa463454afb083b2f3e9693da3ca2c6761ef45a466375629666f8

See more details on using hashes here.

File details

Details for the file graphenv-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: graphenv-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 26.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for graphenv-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 68e801583c115c6551ed74fb689c0b476eb491616a7b3736bdce14d5161df16d
MD5 b78bf831cc0a99a8f95a63e1a8eaf925
BLAKE2b-256 3bd95e2ad4470bf9fcf6ed7a40d5f529528eb6bbc3451a75a42912c070e7421c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page