AutoArena
AutoArena helps you stack rank LLM outputs against one another using automated judge evaluation. Get started by:
pip install autoarena
python -m autoarena
Data is stored in an autoarena.duckdb file in the directory where you invoked AutoArena.
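Because the database location depends on where AutoArena was invoked, it can help to check which file a given directory would use. The following is a minimal sketch, not part of AutoArena itself: the helper names `database_path` and `list_tables` are hypothetical, and `list_tables` assumes the `duckdb` Python package is installed. It makes no assumption about which tables AutoArena creates; it only lists whatever is present.

```python
from pathlib import Path

DB_FILENAME = "autoarena.duckdb"  # file name stated in this README


def database_path(invocation_dir="."):
    """Return where the database file is expected for a given invocation directory."""
    return Path(invocation_dir).resolve() / DB_FILENAME


def list_tables(db_path):
    """List table names in the database file (assumes the duckdb package is installed)."""
    import duckdb  # imported lazily so database_path works without it

    # Open read-only so this inspection cannot modify AutoArena's data.
    con = duckdb.connect(str(db_path), read_only=True)
    try:
        return [row[0] for row in con.execute("SHOW TABLES").fetchall()]
    finally:
        con.close()


if __name__ == "__main__":
    path = database_path()
    if path.exists():
        print(f"Found {path}")
        print(list_tables(path))
    else:
        print(f"No {DB_FILENAME} in {path.parent}; has AutoArena been run here?")
```

This is best run while AutoArena is stopped, since DuckDB may refuse to open a file that another process already holds open for writing.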
Development
AutoArena uses uv to manage dependencies. To set up this repository for development, run:
uv venv && source .venv/bin/activate
uv pip install --all-extras -r pyproject.toml
uv tool run pre-commit install
uv run python3 -m autoarena --dev
To run AutoArena for development, you will need to run both the backend and frontend services:
- Backend: uv run python3 -m autoarena --dev (the --dev/-d flag enables automatic service reloading when source files change)
- Frontend: see ui/README.md
To build a release tarball in the ./dist directory:
./scripts/build.sh
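After a build, a quick way to confirm that a tarball landed in ./dist is to list its contents. This is a sketch under assumptions: the helper names `newest_tarball` and `tarball_members` are hypothetical, and it assumes the build writes a file ending in .tar.gz; it uses only the Python standard library.

```python
import tarfile
from pathlib import Path


def newest_tarball(dist_dir="dist"):
    """Return the most recently modified .tar.gz in dist_dir, or None if there is none."""
    candidates = sorted(Path(dist_dir).glob("*.tar.gz"), key=lambda p: p.stat().st_mtime)
    return candidates[-1] if candidates else None


def tarball_members(path):
    """Return the member names stored in a gzip-compressed tarball."""
    with tarfile.open(path, "r:gz") as archive:
        return archive.getnames()


if __name__ == "__main__":
    tarball = newest_tarball()
    if tarball is None:
        print("No .tar.gz found in ./dist; run the build script first.")
    else:
        print(f"{tarball} ({tarball.stat().st_size} bytes)")
        for name in tarball_members(tarball):
            print(" ", name)
```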