Skip to main content

No project description provided

Project description

AutoArena

AutoArena helps you stack rank LLM outputs against one another using automated judge evaluation. Run with:

python3 -m autoarena

Data is stored in an autoarena.duckdb file on your local machine.

Development

To set up this repository for development, run:

poetry update && poetry install
poerty run pre-commit install
poetry run python3 -m autoarena

To run AutoArena for development, you will need to run both the backend and frontend service:

  • Backend: poetry run python3 -m autoarena --dev (the --dev/-d flag enables automatic service reloading when source files change)
  • Frontend: see ui/README.md

To build a release tarball in the ./dist directory:

./scripts/build.sh

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoarena-0.1.0b3.tar.gz (1.1 MB view details)

Uploaded Source

File details

Details for the file autoarena-0.1.0b3.tar.gz.

File metadata

  • Download URL: autoarena-0.1.0b3.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for autoarena-0.1.0b3.tar.gz
Algorithm Hash digest
SHA256 179738f60f2a77fa8bda215d223496b4b7a6f85085b6e30565321fbaa5b7b24c
MD5 b4f7b3383645b09759b3f18b3457f8b9
BLAKE2b-256 4cde866960df066f33c1bdff4448b7b560ace1f531e2622183f1871e981f63c0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page