Skip to main content

No project description provided

Project description

AutoArena

AutoArena helps you stack rank LLM outputs against one another using automated judge evaluation. Get started by:

pip install autoarena
python -m autoarena

Data is stored in an autoarena.duckdb file in the directory where you invoked AutoArena.

Development

AutoArena uses uv to manage dependencies. To set up this repository for development, run:

uv venv && source .venv/bin/activate
uv pip install --all-extras -r pyproject.toml
uv tool run pre-commit install
uv run python3 -m autoarena --dev

To run AutoArena for development, you will need to run both the backend and frontend service:

  • Backend: uv run python3 -m autoarena --dev (the --dev/-d flag enables automatic service reloading when source files change)
  • Frontend: see ui/README.md

To build a release tarball in the ./dist directory:

./scripts/build.sh

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoarena-0.1.0b4.tar.gz (1.2 MB view details)

Uploaded Source

File details

Details for the file autoarena-0.1.0b4.tar.gz.

File metadata

  • Download URL: autoarena-0.1.0b4.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for autoarena-0.1.0b4.tar.gz
Algorithm Hash digest
SHA256 f4a9c76e5c8a1ba9e9895e3ef6874e27be607af4389ec4140d7431adf45e46a0
MD5 fdef67fbc7801e6ef750d0395d8e4b67
BLAKE2b-256 9df843a5328888136abf230899a538e46fff54d609941dfbf3ad3712bebdf1ac

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page