No project description provided
Project description
AutoArena
AutoArena helps you stack rank LLM outputs against one another using automated judge evaluation. Run with:
python3 -m autoarena
Data is stored in an autoarena.duckdb
file on your local machine.
Development
To set up this repository for development, run:
poetry update && poetry install
poerty run pre-commit install
poetry run python3 -m autoarena
To run AutoArena for development, you will need to run both the backend and frontend service:
- Backend:
poetry run python3 -m autoarena --dev
(the--dev
/-d
flag enables automatic service reloading when source files change) - Frontend: see
ui/README.md
To build a release tarball in the ./dist
directory:
./scripts/build.sh
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
autoarena-0.1.0b3.tar.gz
(1.1 MB
view hashes)