Craftax LM

A wrapper around the Craftax agent benchmark, for evaluating digital agents.

Usage

First, install the package with pip install craftaxlm. Next, import the agent-computer interface (ACI) of your choice:

from craftaxlm import CraftaxACI, CraftaxClassicACI
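As a minimal sketch of choosing between the two interfaces at run time, the helper below looks up one of the two class names from the import line above. The helper name `load_aci_class` and its `classic` flag are hypothetical and not part of craftaxlm. Constructor arguments and methods of the ACI classes are not documented here, so the sketch stops at returning the class rather than instantiating it.

```python
import importlib


def load_aci_class(classic: bool = True):
    """Return the requested ACI class, or None if craftaxlm is unavailable.

    Hypothetical helper: only the two class names come from the README.
    """
    name = "CraftaxClassicACI" if classic else "CraftaxACI"
    try:
        # Import lazily so the script still runs without the package installed.
        module = importlib.import_module("craftaxlm")
    except ImportError:
        return None
    # getattr with a default guards against a version that lacks the name.
    return getattr(module, name, None)


aci_class = load_aci_class(classic=True)
if aci_class is None:
    print("craftaxlm is not installed; run: pip install craftaxlm")
else:
    print(f"Selected interface: {aci_class.__name__}")
```

Keeping the import inside the helper means a script can report a missing install cleanly instead of failing at import time.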

This package is early in development. For implementation examples, please refer to the baseline ReAct implementation (CraftaxLM_Baselines in the leaderboard below).

Leaderboard

Craftax-Classic

LM           Algorithm  Reward (% max)  Code
gpt-4o-mini  ReAct      18.4            CraftaxLM_Baselines

Craftax-Full

LM           Algorithm  Reward (% max)  Code
gpt-4o-mini  ReAct      2.9             CraftaxLM_Baselines

Dev Instructions

pyenv virtualenv craftax_env
poetry install

When in doubt

from jax import debug
...
debug.breakpoint()
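When pausing execution is inconvenient (for example in a non-interactive run), `jax.debug.print` is a non-blocking alternative from the same `jax.debug` module. The sketch below is illustrative only: the function `scaled_sum` is a hypothetical stand-in, not code from this package.

```python
import jax
import jax.numpy as jnp


@jax.jit
def scaled_sum(x):
    total = jnp.sum(x)
    # Prints the runtime value even under jit; replacing this line with
    # jax.debug.breakpoint() would pause here for interactive inspection.
    jax.debug.print("total = {t}", t=total)
    return total * 2.0


result = scaled_sum(jnp.arange(4.0))
```

A plain Python `print` inside a jitted function would only show the tracer at trace time, which is why the `jax.debug` helpers are the usual choice here.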

📚 Citation

To learn more about Craftax, see the Craftax paper website. To cite the underlying Craftax environment, use:

@inproceedings{matthews2024craftax,
    author={Michael Matthews and Michael Beukman and Benjamin Ellis and Mikayel Samvelyan and Matthew Jackson and Samuel Coward and Jakob Foerster},
    title = {Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning},
    booktitle = {International Conference on Machine Learning ({ICML})},
    year = {2024}
}

To cite the Crafter benchmark, see:

@article{hafner2021crafter,
  title={Benchmarking the Spectrum of Agent Capabilities},
  author={Danijar Hafner},
  year={2021},
  journal={arXiv preprint arXiv:2109.06780},
}

Contributing

uv venv craftaxlm-dev
source craftaxlm-dev/bin/activate
uv run ruff format .


