FootsiesGym
A reinforcement learning environment for HiFight's Footsies game. This environment serves as a benchmark for multi-agent reinforcement learning in a two-player zero-sum fighting game.
The environment wraps the open-source Unity implementation, augmented with a gRPC server controlled through a Python harness. Training is implemented using Ray's RLlib.
Installation
pip install footsies-gym
Or install from source:
git clone https://github.com/chasemcd/FootsiesGym.git
cd FootsiesGym
pip install -e .
Game binaries are downloaded automatically on first use from a CDN and verified with SHA256 checksums. No manual binary setup is required.
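The checksum step can be sketched roughly like this (`verify_checksum` is a hypothetical helper for illustration, not the package's actual API):

```python
import hashlib

def verify_checksum(path: str, expected_sha256: str) -> bool:
    """Return True if the file's SHA256 hex digest matches the expected value."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large binaries don't need to fit in memory.
        for chunk in iter(lambda: f.read(8192), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256
```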
Quick Start
import footsiesgym
from footsiesgym.footsies.game import constants

# Create environment (downloads binaries automatically)
env = footsiesgym.make(platform="linux")

obs, infos = env.reset()
while True:
    actions = {agent: env.action_space[agent].sample() for agent in env.agents}
    obs, rewards, terminateds, truncateds, infos = env.step(actions)
    if terminateds["__all__"] or truncateds["__all__"]:
        obs, infos = env.reset()
Note:
`launch_binaries=True` (the default) only works on Linux. On macOS, launch the game server manually and pass `launch_binaries=False` (see Platform Support).
System Architecture
sequenceDiagram
participant RLlib as Ray RLlib
participant Env as FootsiesEnv
participant gRPC as gRPC Client
participant Server as Unity Game Server
RLlib->>Env: step(actions)
Env->>gRPC: SendAction(action)
gRPC->>Server: gRPC Request
Server->>gRPC: Game State
gRPC->>Env: Game State
Env->>RLlib: (obs, rewards, terminateds, truncateds, infos)
Configuration
Creating an Environment
Use footsiesgym.make() for a quick setup with sensible defaults:
env = footsiesgym.make(
    config={...},          # Override default config keys (see below)
    platform="linux",      # "linux" or "mac"
    launch_binaries=True,  # Auto-launch game server (Linux only)
)
Or create the environment directly for full control:
from footsiesgym import FootsiesEnv
env = FootsiesEnv(config={...})
Config Options
| Key | Type | Default | Description |
|---|---|---|---|
| `max_t` | int | 4000 | Maximum timesteps per episode |
| `frame_skip` | int | 4 | Number of game frames per environment step |
| `action_delay` | int | 8 | Action delay in frames (must be divisible by `frame_skip`) |
| `port` | int | auto | gRPC port for game server communication |
| `host` | str | `"localhost"` | Game server host address |
| `headless` | bool | True | Headless mode (True) or windowed (False) |
| `launch_binaries` | bool | False | Auto-launch game binaries (Linux only) |
| `platform` | str | `"linux"` | Target platform (`"linux"` or `"mac"`) |
| `evaluation` | bool | False | Evaluation mode flag |
| `use_special_charge_action` | bool | False | Enable the SPECIAL_CHARGE toggle action |
| `return_fight_state_in_infos` | bool | False | Include detailed fight state in infos dict |
| `win_reward_scaling_coeff` | float | 1.0 | Scales the win/loss reward magnitude |
| `guard_break_reward` | float | 0.0 | Reward given per guard break event |
| `use_reward_budget` | bool | False | Deduct guard break rewards from the win reward budget |
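As an illustration, a custom configuration overriding a few of the defaults above might look like this (the key names come from the table; the chosen values are arbitrary):

```python
# Example config dict; action_delay must be divisible by frame_skip
# (here 8 % 4 == 0, giving 2 environment steps of delay).
config = {
    "max_t": 2000,               # shorter episodes
    "frame_skip": 4,
    "action_delay": 8,
    "guard_break_reward": 0.1,   # shape rewards with guard breaks
    "use_reward_budget": True,   # keep episode totals capped
}
```

which would then be passed as `footsiesgym.make(config=config)`.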
Action Space
Each agent selects from a Discrete action space:
| Action | ID | Description |
|---|---|---|
| `NONE` | 0 | No input |
| `BACK` | 1 | Move backward |
| `FORWARD` | 2 | Move forward |
| `ATTACK` | 3 | Attack |
| `BACK_ATTACK` | 4 | Back + Attack |
| `FORWARD_ATTACK` | 5 | Forward + Attack |
| `SPECIAL_CHARGE` | 6 | Toggle special charge (only when `use_special_charge_action=True`) |
The action space is Discrete(6) by default, or Discrete(7) with use_special_charge_action=True.
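The IDs above can be mirrored with a small enum for readability (a local convenience for scripts, not the package's own constants module):

```python
from enum import IntEnum

class Action(IntEnum):
    NONE = 0
    BACK = 1
    FORWARD = 2
    ATTACK = 3
    BACK_ATTACK = 4
    FORWARD_ATTACK = 5
    SPECIAL_CHARGE = 6  # only valid when use_special_charge_action=True

def action_space_size(use_special_charge_action: bool) -> int:
    """Discrete(6) by default, Discrete(7) with the charge toggle enabled."""
    return 7 if use_special_charge_action else 6
```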
Special Charge Mechanic
When use_special_charge_action=True, agents can hold the attack button to charge a special attack (requires 60 frames / 15 steps at frame_skip=4). SPECIAL_CHARGE is a toggle: activating it holds the attack input, and all movement actions become their attack variants (e.g., FORWARD becomes FORWARD_ATTACK). Toggle again to release.
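At the default `frame_skip`, the charge duration and the toggle's remapping work out as follows (the `NONE -> ATTACK` entry is an assumption based on "holds the attack input"; the rest follows the description above):

```python
FRAME_SKIP = 4      # default frame_skip
CHARGE_FRAMES = 60  # frames of held attack needed for a full charge

charge_steps = CHARGE_FRAMES // FRAME_SKIP  # 15 environment steps

# While the toggle is active, movement actions behave as their attack variants.
CHARGE_REMAP = {
    "NONE": "ATTACK",
    "BACK": "BACK_ATTACK",
    "FORWARD": "FORWARD_ATTACK",
}
```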
Action Delay
Actions are queued and executed after action_delay // frame_skip steps. This simulates reaction time and makes the environment more realistic.
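Conceptually, the delay behaves like a fixed-length queue of pending actions (a simplified sketch; the actual implementation may differ):

```python
from collections import deque

FRAME_SKIP = 4
ACTION_DELAY = 8                           # in frames
delay_steps = ACTION_DELAY // FRAME_SKIP   # 2 environment steps

# Pre-fill with no-ops (action 0); each step, the agent's new action goes
# in one end and the action actually executed comes out the other.
NONE_ACTION = 0
pending = deque([NONE_ACTION] * delay_steps)

def delayed_step(new_action: int) -> int:
    """Queue the new action; return the action executed this step."""
    pending.append(new_action)
    return pending.popleft()
```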
Observation Space
Each agent receives a Box observation of shape (86,) containing:
| Component | Size | Description |
|---|---|---|
| Common state | 1 | Normalized distance between players |
| Self player state | 40 | Position, velocity, health, action state, and privileged features (dash readiness, special progress, previous action, charge state) |
| Opponent state | 45 | Same as self but without privileged features |
Observations are asymmetric: each agent sees its own privileged information but not the opponent's.
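The sizes in the table imply the following index layout (the assumption being that components appear in table order):

```python
OBS_SIZE = 86
COMMON = slice(0, 1)       # normalized distance between players
SELF_STATE = slice(1, 41)  # 40 features, including privileged info
OPP_STATE = slice(41, 86)  # 45 features, no privileged info

obs = [0.0] * OBS_SIZE  # placeholder observation
assert len(obs[COMMON]) + len(obs[SELF_STATE]) + len(obs[OPP_STATE]) == OBS_SIZE
```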
Rewards
Rewards are zero-sum between the two agents (rewards["p1"] + rewards["p2"] == 0).
| Signal | When | Value |
|---|---|---|
| Win/Loss | Opponent dies | +/- win_reward_scaling_coeff (minus any budget spent on guard breaks) |
| Guard break | Opponent's guard decreases | +/- guard_break_reward (up to 3 times per episode) |
When use_reward_budget=True, guard break rewards are deducted from the win reward so total reward per episode is capped at win_reward_scaling_coeff. When False, guard break rewards are additive.
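The budget arithmetic can be sketched as follows (`win_reward` is a hypothetical helper, not package API):

```python
def win_reward(win_coeff: float, guard_break_reward: float,
               guard_breaks: int, use_reward_budget: bool) -> float:
    """Reward paid to the winner at episode end."""
    breaks = min(guard_breaks, 3)  # guard break reward triggers at most 3 times
    if use_reward_budget:
        # Each guard break already paid out draws down the win reward.
        return win_coeff - breaks * guard_break_reward
    return win_coeff

# With the budget enabled, the episode total stays capped at win_coeff:
total = win_reward(1.0, 0.1, 2, True) + 2 * 0.1  # 0.8 + 0.2, capped at 1.0
```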
Platform Support
| Platform | Auto-launch | Manual launch |
|---|---|---|
| Linux | `launch_binaries=True` | Supported |
| macOS | Not supported | Supported |
| Windows | Not supported | TBD |
macOS Setup
Launch the game server manually, then create the environment:
# Extract and run the headless binary
./footsies_mac_headless_5709b6d --port 50051

Then, in Python:

env = footsiesgym.make(
    config={"port": 50051, "headless": True},
    platform="mac",
    launch_binaries=False,
)
Binary Management
Binaries are automatically downloaded from a CDN (footsiesgym.chasemcd.com) on first use, with GitHub as a fallback source. All downloads are verified with SHA256 checksums. File locking prevents race conditions when multiple processes download simultaneously.
Offline usage: the binaries must be cached locally before going offline. The simplest way to ensure this is to run the environment once while online, which downloads and caches them.
Training
Training uses Ray RLlib with the APPO algorithm.
Launching Game Servers (manual)
If not using launch_binaries=True, start servers before training:
./scripts/start_local_{mac,linux}_servers.sh <num-train-servers> <num-eval-servers>
Training servers start from port 50051, evaluation servers from port 40051.
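For example, the port layout for a given server count would look like this (sequential assignment from each base port is an assumption about the launch script's behavior):

```python
TRAIN_BASE_PORT = 50051
EVAL_BASE_PORT = 40051

def server_ports(num_train: int, num_eval: int) -> tuple[list[int], list[int]]:
    """Sequential gRPC ports for training and evaluation game servers."""
    train = [TRAIN_BASE_PORT + i for i in range(num_train)]
    evaluation = [EVAL_BASE_PORT + i for i in range(num_eval)]
    return train, evaluation
```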
Running Training
python -m experimentation.train --experiment-name <experiment-name>
# Local debug mode (single env runner)
python -m experimentation.train --experiment-name <experiment-name> --debug
Visualizing a Policy
Note: these steps assume you're working out of this repository. If you've pip-installed the package, you'll need to create your own script to load and execute the policy.
1. Launch the windowed game binary (or skip this step and use `headless=False` on Linux):

   ./footsies_linux_windowed_9c6b36f --port 80051

2. Register your trained policy in `components/module_repository.py`:

   FootsiesModuleSpec(
       module_name="<policy-nickname>",
       experiment_name="<experiment-name>",
       trial_id="<trial-id>",
       checkpoint_number=-1,  # -1 for latest
   )

3. Configure policies in `scripts/local_inference.py`. Set `"p1"` to `"human"` to play against the AI (requires `pygame`).
Project Structure
FootsiesGym/
├── footsiesgym/ # Installable package
│ ├── footsies/ # Core environment, encoder, gRPC client
│ ├── binary_manager.py # Binary download and hash verification
│ └── __init__.py # Package entry point with make()
├── experimentation/ # Training configurations and scripts
├── binaries/ # Game server binaries (downloaded automatically)
├── callbacks/ # RLlib callbacks
├── components/ # Module repository for policy management
├── models/ # Neural network architectures
├── scripts/ # Server launch and inference scripts
├── testing/ # Tests
└── utils/ # Utility functions
Development
gRPC / Protobuf Updates
If updating the proto definitions:
# Generate Python files
python -m grpc_tools.protoc -I. --python_out=. --grpc_python_out=. footsiesgym/footsies/game/proto/footsies_service.proto
Running Tests
pip install -e ".[dev]"
pytest
License
This project is based on the open-source Footsies game by HiFight. See the original game's license for details.