TALES: Text-Adventure Learning Environment Suite

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

TALES: Text-Adventure Learning Environment Suite

This repository contains the files needed to benchmark language agents on a curated list of text-based games from the following frameworks: Jericho, TextWorld, TextWorld-Express, ScienceWorld, ALFWorld).

[Technical Report] [Project Page]

1. Installation

It is recommended to create and activate a conda or virtual environment. tales requires Python>=3.12:

conda create -n tales python=3.12
conda activate tales

Then, install tales directly from PyPI:

pip install tale-suite

[!WARNING] The name of the Python package on PyPI is tale-suite and not tales.

Alternatively, clone the repository and install locally:

git clone https://github.com/microsoft/tale-suite
cd tale-suite
pip install -e .

[!WARNING] You will need Java 1.8+ installed to run the environments TextWorld-Express and ScienceWorld.
sudo apt update && apt install openjdk-8-jre-headless -y

Alternatively, if the above isn't working:

 sudo apt-get update && apt-get install default-jre default-jdk

Using Docker

We provide a pre-built docker image at

docker pull czcui/twb:prebuilt

Please see the following docs page for more details on how to set up a local vllm for use with the text world benchmark.

An example script can be found in the scripts folder.

2. Getting Started

Run benchmark evaluation on all the games for the specified random agent:
```
python benchmark.py --agent agents/random.py random
```

Run benchmark evaluation on a subset of the games:

python benchmark.py --agent agents/random.py random --env textworld

Run benchmark evaluation on specific games:

python benchmark.py --agent agents/random.py random --envs JerichoEnvZork1 JerichoEnvDetective

Run benchmark evaluation using as a HumanAgent:

python benchmark.py --agent agents/human.py human --envs TWCookingLevel1

Run benchmark evaluation where the ground-truth walkthrough is being followed:

python benchmark.py --agent agents/walkthrough.py walkthrough --envs JerichoEnvZork1

3. Benchmarking LLMs

In order to benchmark a given LLM acting as language agent playing text-based games, you will need to first configure it. tales is leveraging the llm library to handle communication with different LLMs.

python benchmark.py --agent agents/llm.py zero-shot --envs TWCookingLevel1

API-based LLMs

llm natively supports OpenAI models and self-hosted models that offer an OpenAI-compatible API (e.g. like vLLM does - more on this below).

Adding support to other LLMs

llm offers different plugins to include other LLMs. E.g.

llm install llm-anthropic

See the llmplugins page for more information.

Deploying a model locally using vLLM

To serve a custom HugginFace model with vLLM, one can use the vllm docker image like this:

docker run --runtime nvidia --gpus all --restart unless-stopped --name vllm-Llama-3.1-8B-Instruct --env "HUGGING_FACE_HUB_TOKEN=${HUGGING_FACE_HUB_TOKEN}" -v ~/.cache/huggingface:/root/.cache/huggingface -p 8000:8000 --ipc=host vllm/vllm-openai:latest --model meta-llama/Llama-3.1-8B-Instruct --tensor-parallel-size 4 --host 0.0.0.0

Then, add the following entrypoint in ~/.config/io.datasette.llm/extra-openai-models.yaml

- model_id: meta-llama/Llama-3.1-8B-Instruct
  model_name: meta-llama/Llama-3.1-8B-Instruct
  api_base: "http://0.0.0.0:8000/v1"

You can check that everything is working properly with this simple command:

llm -m meta-llama/Llama-3.1-8B-Instruct "Hi. What's your name?"

4. Building Custom Agents

To build a custom agent, you need to create a new file (e.g., custom.py) in the agents folder and implement the Agent class and implement the proper arguments parser.

from typing import Dict, Any
import tales

class CustomAgent(tales.Agent):

    def act(self, obs: str, reward: float, done: bool, infos: Dict[str, Any]) -> str:
        # ...
        return "help"


def build_argparser(parser=None):
    return parser or argparse.ArgumentParser()


register(
    name="my-agent",
    desc=(
        "This is a custom agent that always output 'help' as a text action."
    ),
    klass=CustomAgent,
    add_arguments=build_argparser,
)

You can then use this agent by specifying the path to the file and the class name in the --agent argument.

    python benchmark.py --agent agents/custom.py my-agent

[!NOTE] See the agents folder for more concrete examples.

Citation

@article{cui2025tales,
  title={TALES: Text-Adventure Learning Environment Suite},
  author={Christopher Cui, Xingdi Yuan, Ziang Xiao, Prithviraj Ammanabrolu, Marc-Alexandre C\^ot\'e},
  journal={arXiv preprint arXiv:2504.14128},
  year={2025},
  url={https://arxiv.org/abs/2504.14128}
}

If you use this benchmark, please consider citing the original frameworks as well.

@article{cote18textworld,
  author = {Marc-Alexandre C\^ot\'e and \'Akos K\'ad\'ar and Xingdi Yuan and Ben Kybartas and Tavian Barnes and Emery Fine and James Moore and Ruo Yu Tao and Matthew Hausknecht and Layla El Asri and Mahmoud Adada and Wendy Tay and Adam Trischler},
  title = {TextWorld: A Learning Environment for Text-based Games},
  journal = {CoRR},
  volume = {abs/1806.11532},
  year = {2018}
}
@article{jansen2022textworldexpress,
  url = {https://arxiv.org/abs/2208.01174},
  author = {Jansen, Peter A. and Côté, Marc-Alexandre},
  title = {TextWorldExpress: Simulating Text Games at One Million Steps Per Second},
  journal = {arXiv},
  year = {2022},
}
@inproceedings{hausknecht2020interactive,
  title={Interactive fiction games: A colossal adventure},
  author={Hausknecht, Matthew and Ammanabrolu, Prithviraj and C{\^o}t{\'e}, Marc-Alexandre and Yuan, Xingdi},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={34},
  number={05},
  year={2020}
}
@inproceedings{ALFWorld20,
               title ={{ALFWorld: Aligning Text and Embodied Environments for Interactive Learning}},
               author={Mohit Shridhar and Xingdi Yuan and Marc-Alexandre C\^ot\'e and Yonatan Bisk and Adam Trischler and Matthew Hausknecht},
               booktitle = {Proceedings of the International
               Conference on Learning Representations (ICLR)},
               year = {2021},
               url = {https://arxiv.org/abs/2010.03768}}
@misc{scienceworld2022,
    title={ScienceWorld: Is your Agent Smarter than a 5th Grader?},
    author={Ruoyao Wang and Peter Jansen and Marc-Alexandre C{\^o}t{\'e} and Prithviraj Ammanabrolu},
    year={2022},
    eprint={2203.07540},
    archivePrefix={arXiv},
    primaryClass={cs.CL},
    url={https://arxiv.org/abs/2203.07540}
}

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Privacy

This framework does not collect user's personal data. For more information about Microsoft's privacy policies. Please see Microsoft Privacy Statement.

Responsible AI

Please see our Responsible AI Statement.

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

1.0.0rc2 pre-release

Sep 2, 2025

1.0.0rc1 pre-release

Apr 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tale_suite-1.0.0rc2.tar.gz (34.1 kB view details)

Uploaded Sep 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tale_suite-1.0.0rc2-py3-none-any.whl (43.2 kB view details)

Uploaded Sep 2, 2025 Python 3

File details

Details for the file tale_suite-1.0.0rc2.tar.gz.

File metadata

Download URL: tale_suite-1.0.0rc2.tar.gz
Upload date: Sep 2, 2025
Size: 34.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for tale_suite-1.0.0rc2.tar.gz
Algorithm	Hash digest
SHA256	`530c891791fa58e7ab61eeab870494c14c24047b912ab58b3ff0b5648daab282`
MD5	`e545505d9f2a8066e68a98b367a872c6`
BLAKE2b-256	`a64c60cc6e4c3facdab9dbc311841afae993ec8707aabd91b1de5ced07a28971`

See more details on using hashes here.

File details

Details for the file tale_suite-1.0.0rc2-py3-none-any.whl.

File metadata

Download URL: tale_suite-1.0.0rc2-py3-none-any.whl
Upload date: Sep 2, 2025
Size: 43.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for tale_suite-1.0.0rc2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c72a87f565ea108c3b85a6439893b1659275eca1ec56635e94e5cfd9f572811b`
MD5	`17c331268ee571b33881383f7abce4b1`
BLAKE2b-256	`ca02b84872709f286b4966d4104d1a81af2f5ef78060947aad7617302c9a3340`

See more details on using hashes here.

tale-suite 1.0.0rc2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TALES: Text-Adventure Learning Environment Suite

1. Installation

Using Docker

2. Getting Started

3. Benchmarking LLMs

API-based LLMs

Adding support to other LLMs

Deploying a model locally using vLLM

4. Building Custom Agents

Citation

Contributing

Trademarks

Privacy

Responsible AI

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes