No project description provided
Project description
AutoArena
AutoArena helps you stack rank LLM outputs against one another using automated judge evaluation. Run with:
python3 -m autoarena
Data is stored in an autoarena.duckdb
file on your local machine.
Development
To set up this repository for development, run:
poetry update && poetry install
poerty run pre-commit install
poetry run python3 -m autoarena
To run AutoArena for development, you will need to run both the backend and frontend service:
- Backend:
poetry run python3 -m autoarena --dev
(the--dev
/-d
flag enables automatic service reloading when source files change) - Frontend: see
ui/README.md
To build a release tarball in the ./dist
directory:
./scripts/build.sh
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
autoarena-0.1.0b3.tar.gz
(1.1 MB
view details)
File details
Details for the file autoarena-0.1.0b3.tar.gz
.
File metadata
- Download URL: autoarena-0.1.0b3.tar.gz
- Upload date:
- Size: 1.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 179738f60f2a77fa8bda215d223496b4b7a6f85085b6e30565321fbaa5b7b24c |
|
MD5 | b4f7b3383645b09759b3f18b3457f8b9 |
|
BLAKE2b-256 | 4cde866960df066f33c1bdff4448b7b560ace1f531e2622183f1871e981f63c0 |