Skip to main content

Python SDK to configure and run evaluations for your LLM-based application

Project description

Overview

Athina is an Observability and Experimentation platform for AI teams.

This SDK is an open-source repository of 50+ preset evals. You can also use custom evals.

This SDK also serves as a companion to Athina IDE where you can prototype pipelines, run experiments and evaluations, and compare datasets.


Quick Start

Follow this notebook for a quick start guide.

To get an Athina API key, sign up at https://app.athina.ai


Run Evals

These evals can be run programmatically, or via the UI on Athina IDE.

image

Compare datasets side-by-side (Docs)

Once a dataset is logged to Athina IDE, you can also compare it against another dataset.

image

Once you run evals using Athina, they will be visible in Athina IDE where you can run experiments, evals, and compare datasets side-by-side.


Preset Evals

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

athina-1.6.2.tar.gz (126.6 kB view hashes)

Uploaded Source

Built Distribution

athina-1.6.2-py3-none-any.whl (192.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page