marlenv - A unified framework for multi-agent reinforcement learning
Documentation: https://yamoling.github.io/multi-agent-rlenv
marlenv is a strongly typed library for multi-agent and multi-objective reinforcement learning.
Install the library with:
$ pip install multi-agent-rlenv # Basics
$ pip install multi-agent-rlenv[all] # With all optional dependencies
$ pip install multi-agent-rlenv[smac,overcooked] # Only SMAC & Overcooked
marlenv aims to offer a simple, consistent interface for reinforcement learning environments through abstractions such as Observations and Episodes. It ships adapters for popular libraries such as gym and pettingzoo, along with utility wrappers that add functionality such as video recording or step limits.
Most classes are dataclasses, which makes serialization straightforward (for example with orjson).
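To illustrate why dataclasses make serialization straightforward, here is a minimal sketch using only the standard library (the `Obs` class below is a hypothetical stand-in, not marlenv's actual Observation; orjson works the same way on the resulting dicts):

```python
import json
from dataclasses import asdict, dataclass, field

# Hypothetical stand-in for an Observation-like dataclass.
@dataclass
class Obs:
    data: list[list[float]]
    available_actions: list[list[bool]]
    extras: list[list[float]] = field(default_factory=list)

obs = Obs(data=[[0.0, 1.0]], available_actions=[[True, False]])
payload = json.dumps(asdict(obs))   # dataclasses flatten to plain dicts
restored = Obs(**json.loads(payload))
assert restored == obs              # lossless round trip
```

With orjson the call is `orjson.dumps(asdict(obs))`; numpy fields would first need conversion to lists.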
Fundamentals
States & Observations
MARLEnv.reset() returns a pair of (Observation, State) and MARLEnv.step() returns a Step.
Observation contains:
- data: shape [n_agents, *observation_shape]
- available_actions: boolean mask [n_agents, n_actions]
- extras: extra features per agent (default shape (n_agents, 0))

State represents the environment state and can also carry extras. Step bundles obs, state, reward, done, truncated, and info.
Rewards are stored as np.float32 arrays. Multi-objective envs use reward vectors with reward_space.size > 1.
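The reward convention above can be sketched with plain numpy, independent of any environment: single-objective rewards are length-1 float32 vectors, while a multi-objective environment with reward_space.size == 2 returns a length-2 vector (the scalarization weights below are illustrative, not part of marlenv):

```python
import numpy as np

# Single-objective: a length-1 float32 vector.
reward = np.array([1.0], dtype=np.float32)

# Multi-objective (e.g. reward_space.size == 2): one entry per objective.
mo_reward = np.array([1.0, -0.5], dtype=np.float32)

# A common way to consume a reward vector: scalarize with a weight vector.
weights = np.array([0.7, 0.3], dtype=np.float32)
scalar = float(mo_reward @ weights)
```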
Extras
Extras are auxiliary features appended by wrappers (agent id, last action, time ratio, available actions, ...).
Wrappers that add extras must update both extras_shape and extras_meanings so downstream users can interpret them.
State extras should stay in sync with Observation extras when applicable.
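The bookkeeping a wrapper performs when adding an extra can be sketched with plain numpy (the variable names are illustrative, not marlenv internals): appending a one-hot agent id grows the extras from (n_agents, 0) to (n_agents, n_agents), and the meanings list must grow in lockstep.

```python
import numpy as np

n_agents = 3
extras = np.zeros((n_agents, 0), dtype=np.float32)  # default: no extras
extras_meanings: list[str] = []

# A wrapper adding a one-hot agent id must extend data and meanings together.
agent_id = np.eye(n_agents, dtype=np.float32)
extras = np.concatenate([extras, agent_id], axis=1)
extras_meanings += [f"agent_id_{i}" for i in range(n_agents)]

assert extras.shape == (n_agents, n_agents)
assert len(extras_meanings) == extras.shape[1]
```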
Environment catalog
marlenv.catalog exposes curated environments and lazily imports optional dependencies.
from marlenv import catalog
env1 = catalog.overcooked().from_layout("scenario4")
env2 = catalog.lle().level(6)
env3 = catalog.DeepSea(max_depth=5)
env4 = catalog.connect_n()(width=7, height=6, n=4)
Catalog entries require their corresponding extras at install time (e.g., multi-agent-rlenv[overcooked], multi-agent-rlenv[lle]).
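The lazy-import behaviour can be illustrated with a small stand-alone pattern (a hypothetical sketch, not catalog's actual implementation): the optional dependency is imported only when the entry is first called, so importing the catalog itself never fails.

```python
import importlib

def lazy_entry(module_name: str, factory_name: str):
    """Return a callable that imports `module_name` only when invoked."""
    def entry(*args, **kwargs):
        try:
            module = importlib.import_module(module_name)
        except ImportError as e:
            raise ImportError(
                f"{module_name} is an optional dependency; install the matching extra"
            ) from e
        return getattr(module, factory_name)(*args, **kwargs)
    return entry

# Referencing the entry is always safe; the import happens at call time.
sqrt = lazy_entry("math", "sqrt")   # stdlib module used purely for illustration
```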
Wrappers & builders
Wrappers are composable through RLEnvWrapper and can be chained via Builder for fluent configuration.
from marlenv import Builder
from marlenv.adapters import SMAC
env = (
    Builder(SMAC("3m"))
    .agent_id()
    .time_limit(20)
    .available_actions()
    .build()
)
Common wrappers include time limits, delayed rewards, masking available actions, and video recording.
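A time-limit wrapper of the kind listed above can be sketched generically (a simplified stand-in, not marlenv's RLEnvWrapper or its Step type): it counts steps and flags truncation once the limit is reached, without marking the episode as done.

```python
class TimeLimit:
    """Truncate episodes after `max_steps` steps (simplified sketch)."""

    def __init__(self, env, max_steps: int):
        self.env = env
        self.max_steps = max_steps
        self._t = 0

    def reset(self):
        self._t = 0
        return self.env.reset()

    def step(self, action):
        obs, reward, done, truncated = self.env.step(action)
        self._t += 1
        # Truncation is distinct from termination: the episode is cut short,
        # not solved, so `done` stays untouched.
        truncated = truncated or (not done and self._t >= self.max_steps)
        return obs, reward, done, truncated
```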
Using the library
Adapters for existing libraries
Adapters normalize external APIs into MARLEnv:
import marlenv
gym_env = marlenv.make("CartPole-v1", seed=25)
from marlenv.adapters import SMAC
smac_env = SMAC("3m", debug=True, difficulty="9")
from pettingzoo.sisl import pursuit_v4
from marlenv.adapters import PettingZoo
env = PettingZoo(pursuit_v4.parallel_env())
For deterministic behavior, seed the environment:
env.seed(123)
obs, state = env.reset()
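Determinism ultimately rests on every random draw coming from a seeded generator; a minimal reproducibility check can be sketched with plain numpy, independent of any particular environment:

```python
import numpy as np

def rollout_noise(seed: int, n: int = 5) -> list[float]:
    """Draw a short sequence from a freshly seeded generator."""
    rng = np.random.default_rng(seed)
    return rng.random(n).tolist()

# Identical seeds yield identical sequences; different seeds diverge.
assert rollout_noise(123) == rollout_noise(123)
assert rollout_noise(123) != rollout_noise(124)
```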
Designing a custom environment
Create a custom environment by inheriting from MARLEnv and implementing reset, step, get_observation, and get_state.
import numpy as np
from marlenv import MARLEnv, DiscreteSpace, MultiDiscreteSpace, Observation, State, Step

class CustomEnv(MARLEnv[MultiDiscreteSpace]):
    def __init__(self):
        super().__init__(
            n_agents=3,
            action_space=DiscreteSpace.action(5).repeat(3),
            observation_shape=(4,),
            state_shape=(2,),
        )
        self.t = 0

    def reset(self, *, seed: int | None = None):
        if seed is not None:
            self.seed(seed)
        self.t = 0
        return self.get_observation(), self.get_state()

    def step(self, action):
        self.t += 1
        return Step(self.get_observation(), self.get_state(), reward=0.0, done=False)

    def get_observation(self):
        return Observation(np.zeros((3, 4), dtype=np.float32), self.available_actions())

    def get_state(self):
        return State(np.array([self.t, 0], dtype=np.float32))
Related projects
- MARL: a collection of multi-agent reinforcement learning algorithms built on marlenv. https://github.com/yamoling/marl
- Laser Learning Environment: a multi-agent gridworld that leverages marlenv's capabilities. https://pypi.org/project/laser-learning-environment/