
Lightweight evaluation platform for LLM experiments


Themis

Benchmark-first orchestration for reproducible LLM evaluation.

Python 3.12+ · MIT License

Themis now documents and supports one public authoring flow:

  • ProjectSpec for shared storage and execution policy
  • BenchmarkSpec for benchmark slices, prompt variants, parse pipelines, scores, and agent-style prompt flows
  • PluginRegistry for engines, parsers, metrics, judges, and hooks
  • Orchestrator for planning, execution, handoffs, and imports
  • BenchmarkResult for aggregation, paired comparison, artifact bundles, and timelines
  • generate_config_report(...) for reproducibility snapshots
  • themis-quickcheck for fast SQLite inspection by slice and benchmark dimension

Why Themis

  • Benchmark-native authoring instead of experiment-matrix bookkeeping
  • Query-aware dataset providers for subset, filter, and pushdown sampling
  • Explicit prompt variants and parse pipelines instead of payload hacks
  • Bootstrap prompt sequences, scripted follow-up turns, and first-class tool passing for agent-capable engines
  • OpenAI-hosted MCP server support for remote tools during evaluation runs
  • Projection-backed results with slice_id, prompt_variant_id, and semantic dimensions
  • Local-first storage and deterministic reuse of completed work
  • Seed-aware planning and per-candidate deterministic execution defaults

Installation

uv add themis-eval

Add extras only when needed:

  • stats for paired comparisons and richer report tooling
  • compression for compressed artifact storage
  • extractors for additional built-in parsing helpers
  • math for math-equivalence scoring via math-verify
  • datasets for dataset integrations
  • providers-openai, providers-litellm, providers-vllm for provider SDKs
  • telemetry for external observability callbacks
  • storage-postgres for Postgres-backed storage
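
Extras use the standard extras syntax and can be combined in a single install. For example, to pull in the paired-comparison tooling and math-equivalence scoring listed above (the extra names are as documented; pick whichever you need):

```shell
# Core package plus the stats and math extras
uv add "themis-eval[stats,math]"

# Equivalent with pip
pip install "themis-eval[stats,math]"
```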

Quick Start

Start with a zero-friction smoke evaluation:

themis quick-eval inline \
  --model demo-model \
  --provider demo \
  --input "2 + 2" \
  --expected "4" \
  --format json

That writes a SQLite store under:

.cache/themis/quick-eval/inline-demo-model-exact-match/themis.sqlite3
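
themis-quickcheck is the supported inspection tool, but the store is a plain SQLite file, so you can also peek at it directly. A minimal sketch, assuming only the path shown above (the store's schema is not documented here, so this just lists the tables present):

```python
import sqlite3
from pathlib import Path


def list_tables(db_path):
    """Return the names of all tables in a SQLite database file."""
    with sqlite3.connect(db_path) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]


store = Path(".cache/themis/quick-eval/inline-demo-model-exact-match/themis.sqlite3")
if store.exists():
    print(list_tables(store))
```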

Initialize a real project scaffold when you want editable code and project files:

themis init starter-eval

Or start from a built-in benchmark definition:

themis quick-eval benchmark \
  --benchmark mmlu_pro \
  --model demo-model \
  --provider demo \
  --preview \
  --format json
themis init starter-mmlu --benchmark mmlu_pro

Math benchmarks are available as built-ins too:

themis quick-eval benchmark \
  --benchmark aime_2026 \
  --model demo-model \
  --provider demo \
  --preview \
  --format json

When you want the smallest code-first example, run the shipped hello-world script:

uv run python examples/01_hello_world.py

Expected output:

{'model_id': 'demo-model', 'slice_id': 'arithmetic', 'metric_id': 'exact_match', 'source': 'synthetic', 'prompt_variant_id': 'qa-default', 'mean': 1.0, 'count': 1}

That script shows the full benchmark-first loop:

  • define a DatasetProvider.scan(slice_spec, query)
  • register one engine and one metric
  • build a BenchmarkSpec
  • run orchestrator.run_benchmark(...)
  • inspect the returned BenchmarkResult

The complete script is embedded in docs/quick-start/index.md.
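
The shape of that loop can be sketched without the real API. The following is a self-contained illustration of the benchmark-first pattern only; the function names and toy engine are stand-ins, not themis identifiers:

```python
# Illustrative stand-ins for the benchmark-first loop; NOT the themis API.


def scan(slice_id):
    """Dataset provider: yield (input, expected) pairs for one slice."""
    if slice_id == "arithmetic":
        yield ("2 + 2", "4")


def engine(prompt):
    """Toy engine that answers arithmetic prompts; a real engine calls a model."""
    return str(eval(prompt))  # demo only


def exact_match(prediction, expected):
    """One metric: 1.0 on an exact string match, else 0.0."""
    return 1.0 if prediction == expected else 0.0


def run_benchmark(slice_id):
    """Generate, score, and aggregate one metric over one slice."""
    scores = [exact_match(engine(x), y) for x, y in scan(slice_id)]
    return {
        "slice_id": slice_id,
        "metric_id": "exact_match",
        "mean": sum(scores) / len(scores),
        "count": len(scores),
    }


print(run_benchmark("arithmetic"))
# {'slice_id': 'arithmetic', 'metric_id': 'exact_match', 'mean': 1.0, 'count': 1}
```

The real orchestrator adds planning, storage, and resume on top of this loop, but the data flow — scan, generate, parse, score, aggregate — is the same.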

Examples

Runnable examples live in examples/:

  • 01_hello_world.py
  • 02_project_file.py
  • 03_custom_extractor_metric.py
  • 04_compare_models.py
  • 05_resume_run.py
  • 06_hooks_and_timeline.py
  • 07_judge_metric.py
  • 08_external_stage_handoff.py
  • 09_experiment_evolution.py
  • 10_agent_eval.py
  • 11_quick_benchmark.py
  • 12_iter_and_estimate.py
  • 13_catalog_builtin_benchmark.py
  • 14_mcp_openai.py

10_agent_eval.py is the canonical advanced example for bootstrap prompts, follow-up turns, tool declaration and selection, and returned agent traces.

13_catalog_builtin_benchmark.py is the catalog-specific example for running a shipped builtin benchmark through themis.catalog.build_catalog_benchmark_project(...) with a local fixture dataset loader.

14_mcp_openai.py shows the OpenAI-first MCP path for exposing a remote MCP server to a benchmark run without using local ToolSpec handlers.

To list all shipped built-in benchmark IDs from Python:

from themis.catalog import list_catalog_benchmarks

print(list_catalog_benchmarks())

The canonical benchmark list and Python usage notes live in docs/guides/builtin-benchmarks.md.

examples/medical_reasoning_eval is intentionally left untouched as a handoff reference. It is not the recommended public authoring pattern after the benchmark-first redesign.

Documentation

Guides live under docs/ in this repository (for example docs/quick-start/index.md and docs/guides/builtin-benchmarks.md) and are built with mkdocs.
Development

uv sync --all-extras --dev
uv run pytest
uv run mkdocs build --strict
uv run ruff check

Contributing

See CONTRIBUTING.md.

Citation

If you use Themis in research, cite via CITATION.cff.

License

MIT. See LICENSE.

Download files

Download the file for your platform.

Source Distribution

themis_eval-3.1.0.tar.gz (235.8 kB)

Built Distribution


themis_eval-3.1.0-py3-none-any.whl (299.9 kB)

File details

Details for the file themis_eval-3.1.0.tar.gz.

File metadata

  • Download URL: themis_eval-3.1.0.tar.gz
  • Size: 235.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for themis_eval-3.1.0.tar.gz:

  • SHA256: 1dc0cd2801b029e820a0bb8ce3cd6aedb6062d6839b5f8bdbf6a5b8e60ef883d
  • MD5: a3483bac7c6d5e96d34ba64aeec9b312
  • BLAKE2b-256: 23ef3b75e5ebee396a9d438d6a9748348bde64d9ee59d08fc453dec10af95ce2


Provenance

The following attestation bundles were made for themis_eval-3.1.0.tar.gz:

Publisher: pypi.yaml on Pittawat2542/themis

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file themis_eval-3.1.0-py3-none-any.whl.

File metadata

  • Download URL: themis_eval-3.1.0-py3-none-any.whl
  • Size: 299.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for themis_eval-3.1.0-py3-none-any.whl:

  • SHA256: bb8fceccf5142ab0f68a634c4d4d572fc7576aa40f3c29c8bc14b674d15adead
  • MD5: d786a040713905f8749d1d9509dbd104
  • BLAKE2b-256: f4185fb3bf48c76611595c8f813dae06833fabcbde770e869bb2aa7234e831bb


Provenance

The following attestation bundles were made for themis_eval-3.1.0-py3-none-any.whl:

Publisher: pypi.yaml on Pittawat2542/themis

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
