Reinforcement Learning Framework

An easy-to-read Reinforcement Learning (RL) framework. It provides standardized interfaces and implementations of various Reinforcement Learning methods and environments. It is also the main place to start your journey with Reinforcement Learning and to learn from tutorials and examples.

Main Features

Choose from a growing number of Gym environments and ML-Agents environments
Train with various Reinforcement Learning algorithms implemented in Stable-Baselines 3
Integrate or implement your own custom environments and agents through a standardized interface
Upload your models to the HuggingFace Hub

Set-Up

Activate your development environment

If you are on a UNIX-based OS: You are fine. Continue with the next step.

If you are on Windows: Make sure to use a WSL Python interpreter as your development environment, since we require a UNIX-based system underneath Python to run a lot of the environments and algorithms. For users using PyCharm, see https://www.jetbrains.com/help/pycharm/using-wsl-as-a-remote-interpreter.html for more information. For users using Visual Studio Code, see https://code.visualstudio.com/docs/remote/wsl-tutorial and https://code.visualstudio.com/docs/remote/wsl for more information.

Install all dependencies in your development environment

To set up your local development environment, please run:

poetry install

Behind the scenes, this creates a virtual environment and installs rl_framework along with its dependencies into it. Whenever you run poetry run <command>, that <command> is actually run inside the virtualenv managed by poetry.

You can now import functions and classes from the module with import rl_framework.
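To confirm that the installation worked, a quick check along the following lines can help. This is a minimal sketch using only the standard library; the helper name is_installed is hypothetical, and rl_framework is the module name mentioned above.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found by the current interpreter."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # Run this through `poetry run python <file>` so the poetry virtualenv is used.
    print("rl_framework importable:", is_installed("rl_framework"))
```

If the check reports False, the script was most likely started with an interpreter outside the virtualenv managed by poetry.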

Optional: Install FFMPEG to enable generation of videos (for upload)

Creating video replays of the agent's performance in the environment requires FFMPEG to be installed on your machine. This feature is important if you plan to upload replay videos to an experiment tracking service together with the agent itself. The ffmpeg command must be available on the command line, since it is invoked from Python through an os.system call. It is therefore important that you install this package directly on your machine.

Please follow the guide which can be found here to install the FFMPEG library on your respective machine.
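Since the command has to be reachable from the command line, it can be worth checking this before starting a long training run. The following is a small sketch with the standard library; the function name find_executable is hypothetical and not part of the framework.

```python
import shutil
from typing import Optional


def find_executable(name: str = "ffmpeg") -> Optional[str]:
    """Return the full path of `name` if it is on PATH, otherwise None."""
    return shutil.which(name)


if __name__ == "__main__":
    path = find_executable("ffmpeg")
    if path is None:
        print("ffmpeg was not found on PATH; video generation will likely fail.")
    else:
        print(f"ffmpeg found at {path}")
```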

Optional: Preparation for pushing your models to the HuggingFace Hub

Create a Hugging Face account and sign in. ➡ https://huggingface.co/join
Create a new token with the write role. ➡ https://huggingface.co/settings/tokens
Log in on your machine so that the authentication token is stored locally. ➡ huggingface-cli login
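To check whether a token is available before attempting an upload, a sketch like the one below may be useful. It relies on two assumptions about recent versions of the huggingface_hub library that are not stated in this README: that the HF_TOKEN environment variable is honored, and that the login command stores the token in a file named token inside HF_HOME (by default ~/.cache/huggingface). The function name find_hf_token is hypothetical.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def find_hf_token(env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Look for a stored token; return it, or None if nothing is found.

    Assumed lookup order: HF_TOKEN variable first, then <HF_HOME>/token.
    """
    env = os.environ if env is None else env

    token = env.get("HF_TOKEN")
    if token and token.strip():
        return token.strip()

    hf_home = env.get("HF_HOME")
    base = Path(hf_home) if hf_home else Path.home() / ".cache" / "huggingface"
    token_file = base / "token"
    if token_file.is_file():
        return token_file.read_text().strip() or None
    return None


if __name__ == "__main__":
    print("token found" if find_hf_token() else "no token found, run the login step")
```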

Optional: Preparation for using a Unity environment

To use environments based on the Unity game engine, follow the installation guide provided by Unity Technologies. In short:

Install Unity. ➡ https://unity.com/download
Create a new Unity project.
Navigate to the menu Window -> Package Manager and install the com.unity.ml-agents package in Unity. ➡ https://docs.unity3d.com/Manual/upm-ui-install.html

Getting Started

Configuring an environment

To integrate the environment you wish to train on, you need to create an Environment class representing your problem. For this, you can

use an existing Gym environment with the GymEnvironment class
use an existing ML-Agents environment with the MLAgentsEnvironment class
create a custom environment by inheriting from the base Environment class, which specifies the required interface
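The third option can be sketched roughly as follows. The interface required by the base Environment class is not shown in this README, so the example assumes a classic Gym-style reset/step interface and does not inherit from any framework class; the class CoinGuessEnvironment and all of its methods are hypothetical.

```python
import random
from typing import Optional, Tuple


class CoinGuessEnvironment:
    """Toy episodic task: guess a hidden coin flip at every step.

    Hypothetical stand-in. In the framework you would inherit from the base
    Environment class and implement whatever methods it requires.
    """

    def __init__(self, episode_length: int = 10, seed: Optional[int] = None):
        self.episode_length = episode_length
        self._rng = random.Random(seed)
        self._steps = 0
        self._coin = 0

    def reset(self) -> int:
        """Start a new episode and return the initial observation."""
        self._steps = 0
        self._coin = self._rng.randint(0, 1)
        return 0  # the observation carries no information in this toy task

    def step(self, action: int) -> Tuple[int, float, bool, dict]:
        """Apply `action` (0 or 1) and return (observation, reward, done, info)."""
        reward = 1.0 if action == self._coin else 0.0
        self._steps += 1
        self._coin = self._rng.randint(0, 1)  # flip a new coin for the next step
        done = self._steps >= self.episode_length
        return 0, reward, done, {}


if __name__ == "__main__":
    env = CoinGuessEnvironment(episode_length=5, seed=0)
    env.reset()
    done, total_reward = False, 0.0
    while not done:
        _, reward, done, _ = env.step(0)  # always guess 0
        total_reward += reward
    print("accumulated reward:", total_reward)
```

Newer Gymnasium releases return a five-element tuple from step, so the exact signature should be taken from the base class rather than from this sketch.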

Configuring an agent

To integrate the Reinforcement Learning algorithm you wish to use for training an agent on your environment, you need to create an Agent class representing your training agent. For this, you can

use an existing Reinforcement Learning algorithm implemented in the Stable-Baselines 3 framework with the StableBaselinesAgent class (see the Example section below)
create a custom Reinforcement Learning algorithm by inheriting from the base BaseAgent class, which specifies the required interface
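As an illustration of the second option, here is a minimal sketch of a custom agent. The real BaseAgent interface is not documented in this README, so the class below does not inherit from it; it only mirrors the train and evaluate call signatures used elsewhere in this document. EpsilonGreedyAgent, TwoArmEnvironment, and choose_action are hypothetical names, and the Gym-style environment interface is an assumption.

```python
import random
import statistics
from typing import List, Sequence, Tuple


class TwoArmEnvironment:
    """Tiny stand-in environment: action 1 pays 1.0, action 0 pays 0.0."""

    def __init__(self, episode_length: int = 5):
        self.episode_length = episode_length
        self._steps = 0

    def reset(self) -> int:
        self._steps = 0
        return 0

    def step(self, action: int) -> Tuple[int, float, bool, dict]:
        self._steps += 1
        return 0, float(action == 1), self._steps >= self.episode_length, {}


class EpsilonGreedyAgent:
    """Hypothetical custom agent; the real BaseAgent interface may differ."""

    def __init__(self, n_actions: int = 2, epsilon: float = 0.1, seed: int = 0):
        self.epsilon = epsilon
        self.values = [0.0] * n_actions  # running mean reward per action
        self.counts = [0] * n_actions
        self._rng = random.Random(seed)

    def choose_action(self, deterministic: bool = False) -> int:
        # Explore with probability epsilon unless a deterministic action is requested.
        if not deterministic and self._rng.random() < self.epsilon:
            return self._rng.randrange(len(self.values))
        return max(range(len(self.values)), key=self.values.__getitem__)

    def train(self, environments: Sequence[TwoArmEnvironment], total_timesteps: int) -> None:
        if not environments:
            raise ValueError("at least one environment is required")
        timestep = 0
        while timestep < total_timesteps:
            for env in environments:  # visited one after another, not in parallel
                env.reset()
                done = False
                while not done and timestep < total_timesteps:
                    action = self.choose_action()
                    _, reward, done, _ = env.step(action)
                    self.counts[action] += 1
                    # Incremental update of the mean reward for this action.
                    self.values[action] += (reward - self.values[action]) / self.counts[action]
                    timestep += 1

    def evaluate(self, evaluation_environment, n_eval_episodes: int = 100,
                 deterministic: bool = False) -> Tuple[float, float]:
        returns: List[float] = []
        for _ in range(n_eval_episodes):
            evaluation_environment.reset()
            done, episode_return = False, 0.0
            while not done:
                action = self.choose_action(deterministic)
                _, reward, done, _ = evaluation_environment.step(action)
                episode_return += reward
            returns.append(episode_return)
        return statistics.mean(returns), statistics.pstdev(returns)
```

Keeping the same method names and keyword arguments as the built-in agents is what lets training and evaluation code stay unchanged when the agent is swapped.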

Training

After configuring the environment and the agent, you can start training your agent on the environment. This can be done in one line of code:

agent.train(environments=environments, total_timesteps=100000)

Regardless of which environment and which agent you choose, the unified interface allows you to always start the training this way.

Evaluating

Once you have trained the agent, you can evaluate the agent's policy on the environment and get the average accumulated reward (and its standard deviation) as the evaluation metric. This evaluation method is implemented in the evaluate function of the agent and can be called with one line of code:

agent.evaluate(evaluation_environment=environment, n_eval_episodes=100, deterministic=False)
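To make the metric concrete, the sketch below shows how a mean and standard deviation could be computed from a list of per-episode accumulated rewards. It uses the population standard deviation, which matches the default of NumPy's np.std; whether the framework uses exactly this definition is an assumption, and the function name summarize_returns is hypothetical.

```python
import statistics
from typing import Sequence, Tuple


def summarize_returns(episode_returns: Sequence[float]) -> Tuple[float, float]:
    """Return (mean, population standard deviation) of per-episode returns."""
    if not episode_returns:
        raise ValueError("at least one episode return is required")
    mean_reward = statistics.fmean(episode_returns)
    std_reward = statistics.pstdev(episode_returns)
    return mean_reward, std_reward


if __name__ == "__main__":
    # Accumulated rewards of four hypothetical evaluation episodes.
    print(summarize_returns([10.0, 12.0, 8.0, 10.0]))
```

With deterministic=False the policy samples its actions, so the standard deviation reflects both environment randomness and policy randomness.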

Uploading and downloading models from the HuggingFace Hub

Once you have trained the agent, you can upload the agent model to the HuggingFace Hub to share it and compare your agent with others. You can also download your own or other users' agents from the HuggingFace Hub and use them for solving environments or for re-training. This functionality is provided by the HuggingFaceConnector object, which can be found in the connection collection package.

Example

In this example script you can see all of the above steps unified.

For a quick impression in this README, find a minimal training and evaluation example here:

# ENV_ID (a Gym environment ID) and PARALLEL_ENVIRONMENTS (an integer)
# must be defined before running this snippet.

# Create environment(s); multiple environments for parallel training
environments = [GymEnvironmentWrapper(ENV_ID) for _ in range(PARALLEL_ENVIRONMENTS)]

# Create new agent
agent = StableBaselinesAgent(
    algorithm=StableBaselinesAlgorithm.PPO,
    algorithm_parameters={
        "policy": "MlpPolicy"
    }
)

# Train agent
agent.train(environments=environments, total_timesteps=100000)

# Evaluate the model; returns the mean and standard deviation of the accumulated reward
mean_reward, std_reward = agent.evaluate(evaluation_environment=environments[0])

Development

Notebooks

You can use your module code (src/) in Jupyter notebooks without running into import errors by running one of the following commands:

poetry run jupyter notebook

poetry run jupyter-lab

This starts the Jupyter server inside the project's virtualenv.

Assuming you already have Jupyter installed, you can make your virtual environment available as a separate kernel by running:

poetry add ipykernel
poetry run python -m ipykernel install --user --name="reinforcement-learning-framework"

Note that we mainly use notebooks for experiments, visualizations and reports. Every piece of functionality that is meant to be reused should go into module code and be imported into notebooks.

Testing

We use pytest as our test framework. To execute the tests, please run

pytest tests

To run the tests with coverage information, please use

pytest tests --cov=src --cov-report=html --cov-report=term

and have a look at the htmlcov folder after the tests are done.
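For orientation, a test file picked up by pytest might look like the sketch below. The file name tests/test_rewards.py and the helper accumulate_rewards are hypothetical and only illustrate the naming conventions pytest uses for discovery (files and functions prefixed with test_).

```python
# tests/test_rewards.py (hypothetical file name)
from typing import Iterable


def accumulate_rewards(rewards: Iterable[float]) -> float:
    """Hypothetical helper under test: sum the rewards of one episode."""
    return float(sum(rewards))


def test_accumulate_rewards_sums_all_steps():
    assert accumulate_rewards([1.0, 0.0, 2.5]) == 3.5


def test_accumulate_rewards_empty_episode():
    assert accumulate_rewards([]) == 0.0
```

In a real test the helper would be imported from the module code instead of being defined in the test file.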

Distribution Package

To build a distribution package (wheel), please use

python setup.py bdist_wheel

This will clean up the build folder and then run the bdist_wheel command.

Contributions

Before contributing, please set up the pre-commit hooks to reduce errors and ensure consistency:

pip install -U pre-commit
pre-commit install

If you run into any issues, you can remove the hooks again with pre-commit uninstall.

License
