Skip to main content

A streamlined setup for training and evaluating reinforcement learning agents on Atari 2600 games.

Project description

Atari Reinforcement Learning with Stable-Baselines3 & RL Zoo

This project provides a streamlined setup for training and evaluating reinforcement learning agents on Atari 2600 games. It is based on the workflow demonstrated in this YouTube tutorial and uses a collection of simple shell scripts to manage common tasks like training, evaluation, and video recording.

The core of the project leverages powerful libraries like Stable-Baselines3 and RL-Baselines3-Zoo to do the heavy lifting.

https://github.com/user-attachments/assets/68943893-f0ef-4e58-8dbb-88a6ae038560

✨ Key Features

  • Cross-Platform Workflow: Simple Python scripts (train.py, enjoy.py and record-video.py) to abstract away complex commands and ensure compatibility across platforms.
  • Powered by RL Zoo: Leverages the robust framework of rl-zoo3 for training, evaluation, and hyperparameter management.
  • Easy to Customize: Scripts can be easily modified to train on different Atari games or with different RL algorithms.

🚀 Installation

The recommended way to install this tool is directly from PyPI:

pip install atari-reinforcement-learning

This will make the atari-rl command available in your environment.

[!CAUTION] Atari ROMs: This project uses ale-py to automatically download and install the necessary Atari ROMs during the dependency installation process. By proceeding with the installation, you are confirming that you have the legal right to use these ROMs.

🎮 Usage

Train a New Agent

To start training an agent from scratch, run the training script. This will save logs and the trained model in the logs/ directory.

atari-rl train

Resume Training

If a training session was interrupted, you can resume from the last saved checkpoint.

atari-rl train --resume

Watch the Agent Play

Once you have a trained model, you can watch it play the game. This script will load the best-performing model from your training history.

atari-rl enjoy

[!TIP] To watch a specific experiment, add --exp-id n, where n is the experiment number.

Record a Video

To save a video of your agent playing, use the recording script. The video will be saved in a videos/ folder inside the corresponding log directory.

atari-rl record-video

[!TIP] You can also specify the experiment ID with --exp-id n and the output format with --format <svg|mp4|all>.

🛠️ Developer Setup

If you want to contribute to the project or modify the code, follow these steps to set up a development environment.

  1. Clone the repository

    git clone https://github.com/CosmicDNA/atari-reinforcement-learning.git
    cd atari-reinforcement-learning
    
  2. Create and activate a virtual environment

    python -m venv .venv
    source .venv/bin/activate
    # On Windows, use: .venv\Scripts\activate
    
  3. Install in editable mode with development dependencies This installs the package in a way that your code changes are immediately reflected, and includes tools like ruff and pytest for development.

    • Using uv (recommended for speed):
      pip install uv
      uv pip install -e ".[dev]"
      
    • Using pip:
      pip install -e ".[dev]"
      

🔧 Customization & Configuration

This tool uses a .env file in your current working directory to manage experiment configurations. This allows you to maintain separate configurations for different experiments in different folders.

How it Works

When you run any atari-rl command for the first time in a new directory, the tool will automatically create a .env file for you with default settings.

To customize an experiment, simply edit the values in this .env file. For example, to train a DQN agent on Pong, you would modify your .env file like this:

ALGO="dqn"
ENV="ALE/Pong-v5"

🤝🏿 Acknowledgements

This project stands on the shoulders of the following giants:

  • RL Baselines3 Zoo: A Training Framework for Stable Baselines3 Reinforcement Learning Agents.
  • Gymnasium: A Python library for developing and comparing reinforcement learning algorithms.
  • PyTorch: An open-source machine learning framework that accelerates the path from research prototyping to production deployment.
  • The Arcade Learning Environment: A simple framework that allows researchers and hobbyists to develop AI agents for Atari 2600 games.
  • OpenCV: Pre-built CPU-only OpenCV packages for Python.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

atari_reinforcement_learning-1.0.7.tar.gz (13.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

atari_reinforcement_learning-1.0.7-py3-none-any.whl (15.4 kB view details)

Uploaded Python 3

File details

Details for the file atari_reinforcement_learning-1.0.7.tar.gz.

File metadata

File hashes

Hashes for atari_reinforcement_learning-1.0.7.tar.gz
Algorithm Hash digest
SHA256 ae1e113bee8b1450e883b7a923f3e7cb6751273e792c308842ae63ce1fe4056b
MD5 4d13787e0d802d1cef3b17df75e68ee8
BLAKE2b-256 1e96c76a25ed4efd43b417d026fdb3b51560869d2e687fe3faaf919d11e2fe7f

See more details on using hashes here.

Provenance

The following attestation bundles were made for atari_reinforcement_learning-1.0.7.tar.gz:

Publisher: pypi-publish.yml on CosmicDNA/atari-reinforcement-learning

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file atari_reinforcement_learning-1.0.7-py3-none-any.whl.

File metadata

File hashes

Hashes for atari_reinforcement_learning-1.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 d083f1cde63a5ef573bc207360256f05da978045171b06201b0285422566ad44
MD5 5f08bf188f843a8fce8eda4b62427850
BLAKE2b-256 4dd1f3a84dd706dcc6c50d99e6268aa1d2eaa28c63c78950dcddce8d498c3ee3

See more details on using hashes here.

Provenance

The following attestation bundles were made for atari_reinforcement_learning-1.0.7-py3-none-any.whl:

Publisher: pypi-publish.yml on CosmicDNA/atari-reinforcement-learning

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page