Craftax LM
A wrapper around the Craftax agent benchmark, for evaluating digital agents.
Usage
First, install the package with pip install craftaxlm. Next, import the agent-computer interface of your choice via
from craftaxlm import CraftaxACI, CraftaxClassicACI
This package is early in development. For implementation examples, please refer to the baseline ReAct implementation.
Leaderboard
Craftax-Classic
LM | Algorithm | Reward (% max) | Code |
---|---|---|---|
gpt-4o-mini | ReAct | 18.4 | CraftaxLM_Baselines |
Craftax-Full
LM | Algorithm | Reward (% max) | Code |
---|---|---|---|
gpt-4o-mini | ReAct | 2.9 | CraftaxLM_Baselines |
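The leaderboard reports reward as a percentage of the maximum. This README does not spell out how that figure is computed, so the following is only a minimal sketch under one plausible reading: the mean episode reward divided by the maximum achievable reward. The function name `reward_percent_of_max`, the `max_reward` parameter, and the sample numbers are all hypothetical and are not part of the craftaxlm package.

```python
from statistics import mean


def reward_percent_of_max(episode_rewards, max_reward):
    """Mean episode reward expressed as a percentage of max_reward.

    Assumption: "% max" means mean(reward) / max_reward * 100.
    The benchmark's actual normalisation may differ.
    """
    if max_reward <= 0:
        raise ValueError("max_reward must be positive")
    if not episode_rewards:
        raise ValueError("need at least one episode reward")
    return 100.0 * mean(episode_rewards) / max_reward


# Hypothetical rewards from three evaluation episodes.
rewards = [3.0, 5.0, 4.0]
score = reward_percent_of_max(rewards, max_reward=20.0)
print(f"{score:.1f}")  # prints 20.0
```

Reporting a normalised percentage rather than a raw reward makes the Craftax-Classic and Craftax-Full rows easier to compare, since the two environments have different reward ceilings.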
Dev Instructions
pyenv virtualenv craftax_env
poetry install
When in doubt, set a breakpoint with the JAX debugger:
from jax import debug
...
debug.breakpoint()
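As a non-interactive complement to the breakpoint above, `jax.debug.print` can log intermediate values from inside jit-compiled code, where a plain `print` would only show tracers. The sketch below uses a toy function named `step`, which is hypothetical and unrelated to the Craftax environment's own step function.

```python
import jax
import jax.numpy as jnp


@jax.jit
def step(x):
    # Toy computation standing in for real environment logic (hypothetical).
    y = jnp.sin(x) * 2.0
    # Prints the runtime value of y even though the function is jit-compiled.
    jax.debug.print("y = {y}", y=y)
    return y + 1.0


result = step(jnp.array(0.0))
```

Swapping the `jax.debug.print` line for `jax.debug.breakpoint()` pauses execution at the same point and opens an interactive prompt instead.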
📚 Citation
To learn more about Craftax, check out the paper website here. To cite the underlying Craftax environment, see:
@inproceedings{matthews2024craftax,
author={Michael Matthews and Michael Beukman and Benjamin Ellis and Mikayel Samvelyan and Matthew Jackson and Samuel Coward and Jakob Foerster},
title = {Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning},
booktitle = {International Conference on Machine Learning ({ICML})},
year = {2024}
}
To cite the Crafter benchmark, see:
@article{hafner2021crafter,
title={Benchmarking the Spectrum of Agent Capabilities},
author={Danijar Hafner},
year={2021},
journal={arXiv preprint arXiv:2109.06780},
}
Contributing
uv venv craftaxlm-dev
source craftaxlm-dev/bin/activate
uv run ruff format .