Craftax LM
A wrapper around the Craftax agent benchmark, for evaluating digital agents.
Leaderboard
Craftax-Classic
LM | Algorithm | Reward (% max) | Code |
---|---|---|---|
gpt-4o-mini | ReAct | 14.2 | CraftaxLM_Baselines |
Craftax-Full
LM | Algorithm | Reward (% max) | Code |
---|---|---|---|
gpt-4o-mini | ReAct | 1.2 | CraftaxLM_Baselines |
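The "Reward (% max)" column reads as a mean episode return normalized by the maximum achievable reward. The exact evaluation protocol is not stated here, so the following is only a minimal sketch of that arithmetic. The function name `percent_of_max`, the `max_reward` parameter, and the numbers in the usage line are hypothetical and are not taken from this package or from the leaderboard.

```python
from statistics import mean
from typing import Sequence


def percent_of_max(episode_returns: Sequence[float], max_reward: float) -> float:
    """Return the mean episode return as a percentage of max_reward.

    Assumption: the score is the plain mean over episodes divided by a
    fixed maximum; the real benchmark may aggregate differently.
    """
    if not episode_returns:
        raise ValueError("need at least one episode return")
    if max_reward <= 0:
        raise ValueError("max_reward must be positive")
    return 100.0 * mean(episode_returns) / max_reward


# Hypothetical returns and a hypothetical maximum, for illustration only.
score = percent_of_max([2.0, 4.0, 6.0], max_reward=20.0)
print(f"{score:.1f}")  # prints 20.0
```

Passing `max_reward` explicitly keeps the helper usable for both Craftax-Classic and Craftax-Full, whose maximum rewards differ.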
Dev Instructions
pyenv virtualenv craftax_env
poetry install
When in doubt, set a breakpoint with the JAX debugger
from jax import debug
...
debug.breakpoint()
📚 Citation
To learn more about Craftax, see the paper website. To cite the underlying Craftax environment, use:
@inproceedings{matthews2024craftax,
author={Michael Matthews and Michael Beukman and Benjamin Ellis and Mikayel Samvelyan and Matthew Jackson and Samuel Coward and Jakob Foerster},
title = {Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning},
booktitle = {International Conference on Machine Learning ({ICML})},
year = {2024}
}
To cite the Crafter benchmark, use:
@article{hafner2021crafter,
title={Benchmarking the Spectrum of Agent Capabilities},
author={Danijar Hafner},
year={2021},
journal={arXiv preprint arXiv:2109.06780},
}