TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

These details have not been verified by PyPI

Project links

Homepage

Project description

A suite of 80+ Single-/Two-/Multi-Player texted based games for benchmarking and training of LLMs.

Play | Leaderboard | Games | Examples

Updates

14/07/2025 Announcing MindGames a NeurIPS2025 competition for training LLMs on various TextArena games that require theory of mind.
01/07/2025 Release of v0.6.9 with 100 games and simplified states, new observation wrappers for training and default wrappers for environments.
01/07/2025 Release of SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning introducing RL via self-play on TextArena games as a potential new training paradigm.
22/06/2025 Release of UnstableBaselines a light weight async online RL library for training LLMs on TextArena games.
16/04/2025 Release of the TextArena paper
14/02/2025 Release of the new, stable version for both pip and the website
31/01/2025 Initial demo release highlighted by Andrej Karpathy (crashing all our servers)

Introduction

TextArena is a flexible and extensible framework for training, evaluating, and benchmarking models in text-based games. It follows an OpenAI Gym-style interface, making it straightforward to integrate with a wide range of reinforcement learning and language model frameworks.

Getting Started

Installation

Install TextArena directly from PyPI:

pip install textarena

Offline Play

The only requirement Agents need to fulfill is having a call function that accepts string observations and returns string action. We have implemented a number of basic agents that you can find [here](TODO link). In this example, we show how you can let GPT-4o-mini play against anthropic/claude-3.5-haiku in a game of TicTacToe.

We will be using the OpenRouterAgent, so first you need to set you OpenRouter API key:

export OPENROUTER_API_KEY="YOUR_OPENROUTER_API_KEY"

Now we can build the models and let them play:

import textarena as ta

# Initialize agents
agents = {
    0: ta.agents.OpenRouterAgent(model_name="GPT-4o-mini"),
    1: ta.agents.OpenRouterAgent(model_name="anthropic/claude-3.5-haiku"),
}

# Initialize the environment
env = ta.make(env_id="TicTacToe-v0")

# wrap it for additional visualizations
env = ta.wrappers.SimpleRenderWrapper(env=env) 

env.reset(num_players=len(agents))

done = False
while not done:
    player_id, observation = env.get_observation()
    action = agents[player_id](observation)
    done, step_info = env.step(action=action)

rewards, game_info = env.close()

Citation

If you use TextArena in your research, please cite:

@misc{guertler2025textarena,
    title={TextArena}, 
    author={Leon Guertler and Bobby Cheng and Simon Yu and Bo Liu and Leshem Choshen and Cheston Tan},
    year={2025},
    eprint={2504.11442},
    archivePrefix={arXiv},
    primaryClass={cs.CL},
    url={https://arxiv.org/abs/2504.11442}, 
}

How to Contribute:

If you have any questions at all, feel free to reach out on discord. The below issues are great starting points if you want to contribute:

Transfer the 'How to Contribute' from here to individual issues
Make RushHour board generation algorithmic
extend 2048 to arbitrary board sizes (should be very straight forward)
extend Fifteenpuzzel to arbitrary sizes
Add a nice end-of-game screen to the SimpleRenderWrapper visualizations

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.7.4

Oct 16, 2025

0.7.3

Jul 31, 2025

0.7.2

Jul 22, 2025

This version

0.7.0

Jul 17, 2025

0.6.17

Jul 21, 2025

0.6.16

Jul 5, 2025

0.6.15

Jul 5, 2025

0.6.14

Jul 4, 2025

0.6.12

Jul 3, 2025

0.6.11

Jul 3, 2025

0.6.10

Jul 3, 2025

0.6.9

Jul 3, 2025

0.6.4

Apr 15, 2025

0.6.3

Apr 8, 2025

0.6.1

Mar 31, 2025

0.6.0

Mar 25, 2025

0.5.9

Mar 8, 2025

0.5.8

Mar 7, 2025

0.5.7

Mar 7, 2025

0.5.6

Mar 6, 2025

0.5.5

Mar 6, 2025

0.5.4

Mar 6, 2025

0.5.3

Mar 6, 2025

0.5.0

Feb 14, 2025

0.4.9

Feb 14, 2025

0.4.8

Feb 13, 2025

0.4.6

Feb 13, 2025

0.4.5

Feb 13, 2025

0.4.4

Feb 13, 2025

0.4.2

Feb 11, 2025

0.4.1

Feb 11, 2025

0.4.0

Feb 11, 2025

0.3.9

Feb 7, 2025

0.3.8

Feb 6, 2025

0.3.6

Feb 6, 2025

0.3.5

Feb 5, 2025

0.3.4

Feb 3, 2025

0.3.2

Jan 30, 2025

0.3.1

Jan 30, 2025

0.3.0

Jan 29, 2025

0.2.7

Jan 20, 2025

0.2.5

Jan 20, 2025

0.2.0

Dec 17, 2024

0.1.6

Nov 19, 2024

0.1.5

Nov 16, 2024

0.1.3

Nov 16, 2024

0.1.2

Nov 16, 2024

0.1.1

Nov 11, 2024

0.1.0

Nov 4, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

textarena-0.7.0.tar.gz (866.7 kB view details)

Uploaded Jul 17, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

textarena-0.7.0-py3-none-any.whl (966.7 kB view details)

Uploaded Jul 17, 2025 Python 3

File details

Details for the file textarena-0.7.0.tar.gz.

File metadata

Download URL: textarena-0.7.0.tar.gz
Upload date: Jul 17, 2025
Size: 866.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.5

File hashes

Hashes for textarena-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`11f57923c04e6aecb9e26ad2e9d9fe70c5e1e439545e4dad8240da74fc4746f5`
MD5	`dc5b445b8a6963203e9ca8c19f5ead12`
BLAKE2b-256	`db59742bb98e2e8d05d18ea42dc9364d2e1d47f406edc5a0b95f99111414ba0b`

See more details on using hashes here.

File details

Details for the file textarena-0.7.0-py3-none-any.whl.

File metadata

Download URL: textarena-0.7.0-py3-none-any.whl
Upload date: Jul 17, 2025
Size: 966.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.5

File hashes

Hashes for textarena-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`05f3a8dbcbfd743b314c4d9b00c4cb85ef96419f4882b9b0e33d3fe858e0927a`
MD5	`e877df7162f3dab977548007db430649`
BLAKE2b-256	`75adea77130c4e997a7a22c2bcbda25100cc52018810cdea99334d840d4c7337`

See more details on using hashes here.

TextArena 0.7.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Play | Leaderboard | Games | Examples

Updates

Introduction

Getting Started

Installation

Offline Play

Citation

How to Contribute:

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes