A reinforcement learning environment for HiFight's Footsies game

These details have not been verified by PyPI

Project links

Project description

FootsiesGym

Implementation of HiFight's Footsies game as a reinforcement learning environment. This environment serves as a benchmark for multi-agent reinforcement learning in a (relatively) complex two-player zero-sum game.

The environment is derived from the open-source Unity implementation, which has been augmented to run a gRPC server that can be controlled through a Python harness. Training is implemented using Ray's RLlib.

System Architecture

sequenceDiagram
    participant RLlib as Ray RLlib
    participant Env as FootsiesEnv
    participant gRPC as gRPC Client
    participant Server as Unity Game Server
    participant Game as Footsies Game

    Note over RLlib,Env: Python Environment
    Note over gRPC: Communication Layer
    Note over Server,Game: Unity Game

    RLlib->>Env: step(action)
    Env->>gRPC: SendAction(action)
    gRPC->>Server: gRPC Request
    Server->>Game: Update Game State
    Game->>Server: Game State
    Server->>gRPC: gRPC Response
    gRPC->>Env: Game State
    Env->>RLlib: (obs., rews., terms., truncs., infos)

    Note over RLlib,Game: Training Loop

The diagram above shows how the different components interact during training:

RLlib sends actions to the FootsiesEnv
The environment converts these actions into gRPC requests
The Unity Game Server processes the actions and updates the game state
The game state is sent back through gRPC to the environment
The environment processes the observation and returns it to RLlib

Installation

conda create -n footsiesgym python=3.10
conda activate footsiesgym
pip install -r requirements.txt

On a Mac, you may need to ensure you have cmake installed. You can install it using Homebrew:

brew install cmake

Training

Game Servers

If you are on a Linux system, run setup.sh to unpack the binaries then run skip to the training procedure. Otherwise, follow the steps below.

Before training, you'll need to launch the headless game servers. Scripts are provided to do so in scripts/start_local_{mac, linux}_servers.sh, but you must first unpack the binaries that are included into the binaries/ directory (the launch scripts assume this location). Important! If you are launching game servers manually, be sure to set launch_binaries to False in the environment configuration.

./scripts/start_local_{mac, linux}_servers.sh <num-train-servers> <num-eval-servers>

The two arguments correspond to num_env_runners and evaluation_num_env_runners, which can be specified in the experiment configuration. You must launch a corresponding number of servers for each. If you are running local debugging (see below; python -m experiments.train --debug), just launch one of each. If you're launching a full experiment, you'll need to match the number specified in the experiment configuration (defaults to 40 training and 5 evaluation env runners).

The scripts will start:

Training servers from port 50051 (incrementing for each server)
Evaluation servers from port 40051 (incrementing for each server)

Importantly, we map environment runners to a single port, which means that you can only run a single environment per environment runner.

Training Configuration

The default training utilizes the APPO algorithm (see the corresponding IMPACT paper). We also utilize a vanilla LSTM newtwork with parameters described in the respective experiment files.

Training can utilize either the new RLModule stack or old-stack in RLlib. Some functionality has yet to be implemented in the new stack (see open issues).

Old Stack

python -m experiments.train --experiment-name <experiment-name>

New Stack

python -m experiments.train_rlmodule --experiment-name <experiment-name>

Add the --debug flag to use only a single env runner (and single evaluation env runner) and local mode. This will enable breakpoint usage for local debugging.

Visualizing a Policy

To visualize gameplay:

Unpack the windowed build binaries of your choice (Mac or Linux).
Add the trained policy specification to the ModuleRepository in components/module_repository.py:

FootsiesModuleSpec(
    module_name="<policy-nickname>",
    experiment_name="<experiment-name>",
    trial_id="<trial-id>",  # specify if experiment has multiple trials
    checkpoint_number=-1,  # -1 for latest, otherwise specify checkpoint number
)

Run the game with:

./footsies_linux_windowed_021725 --port 80051

Configure policies in scripts/local_inference.py using the MODULES variable. Set "p1" to "human" to play against the AI (must install pygame).

Project Architecture

Core Components

Environment (footsies/): The main game environment implementation that interfaces with the Unity game through gRPC.
Models (models/): Neural network architectures for the RL agents
Experiments (experiments/): Training configurations and experiment management
Callbacks (callbacks/): Custom RLlib callbacks for monitoring and evaluation
Components (components/): Reusable components like the module repository for policy management
Utils (utils/): Utility functions and helper classes
Scripts (scripts/): Helper scripts for server management and visualization

Key Features

Multi-agent reinforcement learning environment
gRPC-based communication with Unity game server
Support for both headless and windowed game modes
Integration with Ray RLlib for distributed training
Custom LSTM-based policy networks
Support for self-play training
Evaluation against baseline policies (random, noop, back)
Wandb integration for experiment tracking

Development

gRPC / Protobuf Updates

If updating the proto definitions:

Generate C# files (Windows):

.\protoc\bin\protoc.exe --csharp_out=.\env\game\proto\ --grpc_out=.\env\game\proto\ --plugin=protoc-gen-grpc=.\plugins\grpc_csharp_plugin.exe .\env\game\proto\footsies_service.proto

Generate Python files:

python -m grpc_tools.protoc -I. --python_out=. --grpc_python_out=. .\env\game\proto\footsies_service.proto

Project Structure

FootsiesGym/
├── binaries/           # Game server binaries
├── callbacks/          # RLlib callbacks
├── components/         # Reusable components
├── experiments/        # Training configurations
├── footsies/          # Core environment
├── models/            # Neural network architectures
├── protoc/            # Protocol buffer tools
├── scripts/           # Helper scripts
├── testing/           # Test files
└── utils/             # Utility functions

Contributing

Install pre-commit hooks to maintain code quality
Follow the existing code style and architecture
Add tests for new features
Update documentation as needed

License

This project is based on the open-source Footsies game by HiFight. Please refer to the original game's license for more information.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.7.1

Apr 10, 2026

0.6.0

Mar 8, 2026

0.5.0

Dec 18, 2025

0.4.3

Dec 12, 2025

0.4.2

Dec 7, 2025

0.4.1

Dec 5, 2025

This version

0.4.0

Nov 24, 2025

0.3.5

Sep 30, 2025

0.3.4

Sep 30, 2025

0.3.3

Sep 23, 2025

0.3.2

Sep 22, 2025

0.3.1

Sep 18, 2025

0.3.0

Sep 18, 2025

0.2.6

Sep 18, 2025

0.2.5

Sep 2, 2025

0.2.4

Sep 2, 2025

0.2.3

Aug 28, 2025

0.2.2

Aug 27, 2025

0.2.1

Aug 25, 2025

0.2.0

Aug 24, 2025

0.1.8

Aug 26, 2025

0.1.7

Aug 23, 2025

0.1.6

Aug 23, 2025

0.1.5

Aug 22, 2025

0.1.4

Aug 22, 2025

0.1.3

Aug 22, 2025

0.1.2

Aug 22, 2025

0.1.1

Aug 22, 2025

0.1.0

Aug 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

footsies_gym-0.4.0.tar.gz (68.0 kB view details)

Uploaded Nov 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

footsies_gym-0.4.0-py3-none-any.whl (84.6 kB view details)

Uploaded Nov 24, 2025 Python 3

File details

Details for the file footsies_gym-0.4.0.tar.gz.

File metadata

Download URL: footsies_gym-0.4.0.tar.gz
Upload date: Nov 24, 2025
Size: 68.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for footsies_gym-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`6032282b8db5ee07156cb06e26201da64e38fb0f04c9232d7f1253444dfd7017`
MD5	`f48352223034c50df7d4bc1375f6f6d1`
BLAKE2b-256	`125cd4a73ce589d6c1eb38883cdfa08044a5abb6b226ad76f1828b51eefa3cae`

See more details on using hashes here.

File details

Details for the file footsies_gym-0.4.0-py3-none-any.whl.

File metadata

Download URL: footsies_gym-0.4.0-py3-none-any.whl
Upload date: Nov 24, 2025
Size: 84.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for footsies_gym-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c0efd827b0cb23541d9c7535b0b184113296277563956320ae9635c4c1740a3e`
MD5	`eb9004b9edc0ef8fd8db8f44c8a5b215`
BLAKE2b-256	`6b508b30168e5db3f85815c898e89b4f1759aa8471970316e2ecdb73a1648ce5`

See more details on using hashes here.

footsies-gym 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

FootsiesGym

System Architecture

Installation

Training

Game Servers

Training Configuration

Old Stack

New Stack

Visualizing a Policy

Project Architecture

Core Components

Key Features

Development

gRPC / Protobuf Updates

Project Structure

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes