Skip to main content

Thabit: evaluate multiple LLMs on your data

Project description

Thabit

Evaluate multiple LLM models with the same data to determine which one is better for your use case.

How to run

pip3 install thabit

Test

pytest tests

Build

pip3 install -e .

Contribute

Docs

TODO:

  • Validate the input dataset.

  • UI for adding/editing config.

  • Visulaise Output (using UI).

  • Run eval per dataset (add folders for dataset and for evals). This is to simplify visualising results later using the UI.

    root
    ├── datasets
    │ └── a
    └── evals
      └── a
    

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thabit-0.2.4.tar.gz (60.1 kB view hashes)

Uploaded Source

Built Distribution

thabit-0.2.4-py3-none-any.whl (63.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page