Skip to main content

RLROM is a library for testing and training reinforcement learning agent using online monitoring of signal temporal logics formulas, and more.

Project description

RLRom

This module integrates Robust Online Monitoring methods with Reinforcement Learning stuff. The motivation is first to test RL agents using interpretable monitors, then use these monitors to train models to perform complex tasks, and/or converge toward behaviors that reliably satisfy certain requirements.

Install

Make sure build tools are installed, e.g., with apt:

$ sudo apt install build-essential

Then install with pip:

pip install rlrom 

Getting Started

Command Line Interface

RLRom reads configuration files in the YAML format as inputs. Examples are provided in the examples folder. A command line interface is provided through command rlr which can be called with various arguments. For instance, the rlr test command reads a configuration file and runs tests:

$ rlr test examples/cartpole/cfg0_hug.cfg

will run a few episode of the cartpole classic environment, fetching a model on huggingface and monitor a formula on these episodes.

For training with or without STL specifications, use the rlr train command, e.g.:

$ rlr train examples/cartpole/cfg0tr_ppo_specs.cfg

More details are provided in the notebooks (see below.)

Notebook Examples

More programmatic features are demonstrated in notebooks, in particular

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rlrom-0.2.0.tar.gz (24.3 kB view details)

Uploaded Source

File details

Details for the file rlrom-0.2.0.tar.gz.

File metadata

  • Download URL: rlrom-0.2.0.tar.gz
  • Upload date:
  • Size: 24.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.22

File hashes

Hashes for rlrom-0.2.0.tar.gz
Algorithm Hash digest
SHA256 40b2e134adba4aefe84d265d8cd98c5592db154b0c5d5962161120fc24d450af
MD5 1333d4227d57610ce8e1624ff76a73e1
BLAKE2b-256 7b818da05681b802a9ed194d55785fbf12f7391cb9b70d0444e44b98b3798a68

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page