Multi Particle Environments Version 2

These details have not been verified by PyPI

Project description

MPE2

Multi Particle Environments (MPE) are a set of communication oriented environment where particle agents can (sometimes) move, communicate, see each other, push each other around, and interact with fixed landmarks.

MPE is currently maintained by the Farama Foundation. These environments are from OpenAI’s MPE codebase, with several minor fixes, mostly related to making the action space discrete by default, making the rewards consistent and cleaning up the observation space of certain environments.

Installation

The unique dependencies for this set of environments can be installed via:

pip install mpe2

Usage

To launch a Simple Push environment with random agents:

from mpe2 import simple_push_v3

env = simple_push_v3.env(render_mode="human")
env.reset(seed=42)

for agent in env.agent_iter():
    observation, reward, termination, truncation, info = env.last()

    if termination or truncation:
        action = None
    else:
        # this is where you would insert your policy
        action = env.action_space(agent).sample()

    env.step(action)
    
env.close()

Types of Environments

The Simple Adversary, Simple Crypto, Simple Push, Simple Tag, and Simple World Comm environments are adversarial (a "good" agent being rewarded means an "adversary" agent is punished and vice versa, though not always in a perfectly zero-sum manner). In most of these environments, there are "good" agents rendered in green and an "adversary" team rendered in red.

The Simple Reference, Simple Speaker Listener, and Simple Spread environments are cooperative in nature (agents must work together to achieve their goals, and received a mixture of rewards based on their own success and the success of the other agents).

Key Concepts

Landmarks: Landmarks are static circular features of the environment that cannot be controlled. In some environments, like Simple, they are destinations that affect the rewards of the agents depending on how close the agents are to them. In other environments, they can be obstacles that block the motion of the agents. These are described in more detail in the documentation for each environment.
Visibility: When an agent is visible to a another agent, that other agent's observation contains the first agent's relative position (and in Simple World Comm and Simple Tag, the first agent's velocity). If an agent is temporarily hidden (only possible in Simple World Comm) then the agent's position and velocity is set to zero.
Communication: Some agents in some environments can broadcast a message as a part of its action (see action space for more details) which will be transmitted to each agent that is allowed to see that message. In Simple Crypto, this message is used to signal that Bob and Eve have reconstructed the message.
Color: Since all agents are rendered as circles, the agents are only identifiable to a human by their color, so the color of the agents is described in most of the environments. The color is not observed by the agents.
Distances: The landmarks and agents typically start out uniformly randomly placed from -1 to 1 on the map. This means they are typically around 1-2 units apart. This is important to keep in mind when reasoning about the scale of the rewards (which often depend on distance) and the observation space, which contains relative and absolute positions.

Termination

The game terminates after the number of cycles specified by the max_cycles environment argument is executed. The default for all environments is 25 cycles, as in the original OpenAI source code.

Observation Space

The observation space of an agent is a vector generally composed of the agent's position and velocity, other agents' relative positions and velocities, landmarks' relative positions, landmarks' and agents' types, and communications received from other agents. The exact form of this is detailed in the environments' documentation.

If an agent cannot see or observe the communication of a second agent, then the second agent is not included in the first's observation space, resulting in different agents having different observation space sizes in certain environments.

Action Space

Note: OpenAI's MPE uses continuous action spaces by default.

Discrete action space (Default):

The action space is a discrete action space representing the combinations of movements and communications an agent can perform. Agents that can move can choose between the 4 cardinal directions or do nothing. Agents that can communicate choose between 2 and 10 environment-dependent communication options, which broadcast a message to all agents that can hear it.

Continuous action space (Set by continuous_actions=True):

The action space is a continuous action space representing the movements and communication an agent can perform. Agents that can move can input a velocity between 0.0 and 1.0 in each of the four cardinal directions, where opposing velocities e.g. left and right are summed together. Agents that can communicate can output a continuous value over each communication channel in the environment which they have access to.

Rendering

Rendering displays the scene in a window that automatically grows if agents wander beyond its border. Communication is rendered at the bottom of the scene. The render() method also returns the pixel map of the rendered area.

Citation

The MPE environments were originally described in the following work:

@article{mordatch2017emergence,
  title={Emergence of Grounded Compositional Language in Multi-Agent Populations},
  author={Mordatch, Igor and Abbeel, Pieter},
  journal={arXiv preprint arXiv:1703.04908},
  year={2017}
}

But were first released as a part of this work:

@article{lowe2017multi,
  title={Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments},
  author={Lowe, Ryan and Wu, Yi and Tamar, Aviv and Harb, Jean and Abbeel, Pieter and Mordatch, Igor},
  journal={Neural Information Processing Systems (NIPS)},
  year={2017}
}

Please cite one or both of these if you use these environments in your research.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.0.0

Apr 3, 2026

This version

1.0.0rc1 pre-release

Mar 29, 2026

0.0.1

Aug 18, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mpe2-1.0.0rc1.tar.gz (42.3 kB view details)

Uploaded Mar 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mpe2-1.0.0rc1-py3-none-any.whl (59.1 kB view details)

Uploaded Mar 29, 2026 Python 3

File details

Details for the file mpe2-1.0.0rc1.tar.gz.

File metadata

Download URL: mpe2-1.0.0rc1.tar.gz
Upload date: Mar 29, 2026
Size: 42.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mpe2-1.0.0rc1.tar.gz
Algorithm	Hash digest
SHA256	`2273adb0a0ea1d4d5efb3f27363fd5e4697bbbd757005ac34b70be1764c61495`
MD5	`42333173af9743850e6c1a9b5a13cb1a`
BLAKE2b-256	`cd9bacb93b56e62fc2baf0028048ccfc8271b541bef673b0571445c756f9d6c8`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mpe2-1.0.0rc1.tar.gz:

Publisher: build-publish.yml on Farama-Foundation/MPE2

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mpe2-1.0.0rc1.tar.gz
- Subject digest: 2273adb0a0ea1d4d5efb3f27363fd5e4697bbbd757005ac34b70be1764c61495
- Sigstore transparency entry: 1191740707
- Sigstore integration time: Mar 29, 2026
Source repository:
- Permalink: Farama-Foundation/MPE2@bfcf5510a89d8cff231cc6d1e400ccec674d83cd
- Branch / Tag: refs/tags/v1.0.0rc1
- Owner: https://github.com/Farama-Foundation
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: build-publish.yml@bfcf5510a89d8cff231cc6d1e400ccec674d83cd
- Trigger Event: release

File details

Details for the file mpe2-1.0.0rc1-py3-none-any.whl.

File metadata

Download URL: mpe2-1.0.0rc1-py3-none-any.whl
Upload date: Mar 29, 2026
Size: 59.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mpe2-1.0.0rc1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`78d6fe427a795be0d0d532ff811fb9a3815dcff62a1b44ed7f64648b70e3fb35`
MD5	`de595096c6f9ce1d9b4e03d3c8f92ccf`
BLAKE2b-256	`07c26fb93a236076d8e772d44e8bede95bbff2dde40ab5f249c51299a0e58e7b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mpe2-1.0.0rc1-py3-none-any.whl:

Publisher: build-publish.yml on Farama-Foundation/MPE2

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mpe2-1.0.0rc1-py3-none-any.whl
- Subject digest: 78d6fe427a795be0d0d532ff811fb9a3815dcff62a1b44ed7f64648b70e3fb35
- Sigstore transparency entry: 1191740712
- Sigstore integration time: Mar 29, 2026
Source repository:
- Permalink: Farama-Foundation/MPE2@bfcf5510a89d8cff231cc6d1e400ccec674d83cd
- Branch / Tag: refs/tags/v1.0.0rc1
- Owner: https://github.com/Farama-Foundation
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: build-publish.yml@bfcf5510a89d8cff231cc6d1e400ccec674d83cd
- Trigger Event: release

mpe2 1.0.0rc1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

MPE2

Installation

Usage

Types of Environments

Key Concepts

Termination

Observation Space

Action Space

Rendering

Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance