LLM-based summary quality evaluation

Project description

assert-eval

LLM-based summary quality evaluation.

Scores a summary against source text for coverage, factual accuracy, alignment, and topic preservation. No PyTorch, no BERT, no heavy dependencies.

Installation

pip install assert-eval

Quick Start

from assert_eval import evaluate_summary, LLMConfig

config = LLMConfig(
    provider="bedrock",
    model_id="us.amazon.nova-pro-v1:0",
    region="us-east-1",
)

results = evaluate_summary(
    full_text="Original long text goes here...",
    summary="Summary to evaluate goes here...",
    metrics=["coverage", "factual_consistency", "factual_alignment", "topic_preservation"],
    llm_config=config,
)

print(results)
# {'coverage': 0.85, 'factual_consistency': 0.92, 'factual_alignment': 0.88, 'topic_preservation': 0.90}

Available Metrics

Metric	Description
`coverage`	What % of source document claims appear in the summary (recall/completeness)
`factual_consistency`	What % of summary claims are supported by the source (precision/accuracy)
`factual_alignment`	F1 score combining coverage and factual_consistency
`topic_preservation`	How well the main topics from the source are preserved in the summary

Custom Evaluation Instructions

Tailor the LLM's evaluation criteria for your domain:

results = evaluate_summary(
    full_text=text,
    summary=summary,
    metrics=["coverage", "factual_consistency"],
    llm_config=config,
    custom_prompt_instructions={
        "coverage": "Apply strict standards. Only mark a claim as covered if it is clearly and explicitly represented.",
        "factual_consistency": "Flag any claim that adds detail not present in the original text.",
    },
)

Verbose Output

Pass verbose=True to include per-claim LLM reasoning in the results:

results = evaluate_summary(
    full_text=text,
    summary=summary,
    metrics=["coverage", "factual_consistency"],
    llm_config=config,
    verbose=True,
)

PII Masking

Pass mask_pii=True to detect and mask personally identifiable information before any text is sent to the LLM:

results = evaluate_summary(
    full_text=text,
    summary=summary,
    metrics=["coverage"],
    llm_config=config,
    mask_pii=True,
)

mask_pii=False is the default. For production use with real client data, set mask_pii=True.

LLM Configuration

from assert_eval import LLMConfig

# AWS Bedrock (uses ~/.aws credentials by default)
config = LLMConfig(
    provider="bedrock",
    model_id="us.amazon.nova-pro-v1:0",
    region="us-east-1",
)

# AWS Bedrock with explicit credentials
config = LLMConfig(
    provider="bedrock",
    model_id="us.amazon.nova-pro-v1:0",
    region="us-east-1",
    api_key="your-aws-access-key-id",
    api_secret="your-aws-secret-access-key",
    aws_session_token="your-session-token",  # optional
)

# OpenAI
config = LLMConfig(
    provider="openai",
    model_id="gpt-4o",
    api_key="your-openai-api-key",
)

Supported Bedrock Model Families

Model Family	Example Model IDs
Amazon Nova	`us.amazon.nova-pro-v1:0`, `amazon.nova-lite-v1:0`
Anthropic Claude	`anthropic.claude-3-sonnet-20240229-v1:0`
Meta Llama	`meta.llama3-70b-instruct-v1:0`
Mistral AI	`mistral.mistral-large-2402-v1:0`
Cohere Command	`cohere.command-r-plus-v1:0`
AI21 Labs	`ai21.jamba-1-5-large-v1:0`

Proxy Configuration

# Single proxy
config = LLMConfig(
    provider="bedrock", model_id="us.amazon.nova-pro-v1:0", region="us-east-1",
    proxy_url="http://proxy.example.com:8080",
)

# Protocol-specific proxies
config = LLMConfig(
    provider="bedrock", model_id="us.amazon.nova-pro-v1:0", region="us-east-1",
    http_proxy="http://proxy.example.com:8080",
    https_proxy="http://proxy.example.com:8443",
)

# Authenticated proxy
config = LLMConfig(
    provider="bedrock", model_id="us.amazon.nova-pro-v1:0", region="us-east-1",
    proxy_url="http://username:password@proxy.example.com:8080",
)

Standard HTTP_PROXY / HTTPS_PROXY environment variables are also respected.

Dependencies

assert-core — shared LLM provider layer (AWS Bedrock, OpenAI)

Migrating from assert_llm_tools

assert-eval replaces the summary evaluation functionality of assert_llm_tools, which is now deprecated. The API is largely the same — swap the import:

# Before
from assert_llm_tools import evaluate_summary, LLMConfig

# After
from assert_eval import evaluate_summary, LLMConfig

License

MIT

Project details

Release history Release notifications | RSS feed

0.1.4

Feb 27, 2026

0.1.3

Feb 27, 2026

This version

0.1.2

Feb 20, 2026

0.1.1

Feb 20, 2026

0.1.0

Feb 20, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

assert_eval-0.1.2.tar.gz (10.8 kB view details)

Uploaded Feb 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

assert_eval-0.1.2-py3-none-any.whl (11.9 kB view details)

Uploaded Feb 20, 2026 Python 3

File details

Details for the file assert_eval-0.1.2.tar.gz.

File metadata

Download URL: assert_eval-0.1.2.tar.gz
Upload date: Feb 20, 2026
Size: 10.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for assert_eval-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`fd142ebc7a1523b39d6f349f306dd68e67cc72b866265ab0f50a28491951df59`
MD5	`6cbeb8a5188ab2a953a624b321b2a78c`
BLAKE2b-256	`3630d9ebe07e04157f12654b527e67b04ac41a9b411af2d9e9f3c14bf7eee91c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for assert_eval-0.1.2.tar.gz:

Publisher: publish-assert-eval.yml on charliedouglas/assert_llm_tools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: assert_eval-0.1.2.tar.gz
- Subject digest: fd142ebc7a1523b39d6f349f306dd68e67cc72b866265ab0f50a28491951df59
- Sigstore transparency entry: 973260093
- Sigstore integration time: Feb 20, 2026
Source repository:
- Permalink: charliedouglas/assert_llm_tools@87942e26cb7ecb105766804c9466a74ebb512dc9
- Branch / Tag: refs/tags/assert-eval-v0.1.2
- Owner: https://github.com/charliedouglas
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-assert-eval.yml@87942e26cb7ecb105766804c9466a74ebb512dc9
- Trigger Event: release

File details

Details for the file assert_eval-0.1.2-py3-none-any.whl.

File metadata

Download URL: assert_eval-0.1.2-py3-none-any.whl
Upload date: Feb 20, 2026
Size: 11.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for assert_eval-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9c054296ac3a5ad661ebe20aaffaf7cc66ac16c6578d171bc8403db23c02b7ae`
MD5	`bccf6728588e49ffa7d13774c7cab730`
BLAKE2b-256	`73e346fc6265ac3ec497424ae0ab821c7253f225a0c07e7f70ccaad416587a5c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for assert_eval-0.1.2-py3-none-any.whl:

Publisher: publish-assert-eval.yml on charliedouglas/assert_llm_tools

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: assert_eval-0.1.2-py3-none-any.whl
- Subject digest: 9c054296ac3a5ad661ebe20aaffaf7cc66ac16c6578d171bc8403db23c02b7ae
- Sigstore transparency entry: 973260096
- Sigstore integration time: Feb 20, 2026
Source repository:
- Permalink: charliedouglas/assert_llm_tools@87942e26cb7ecb105766804c9466a74ebb512dc9
- Branch / Tag: refs/tags/assert-eval-v0.1.2
- Owner: https://github.com/charliedouglas
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-assert-eval.yml@87942e26cb7ecb105766804c9466a74ebb512dc9
- Trigger Event: release

assert-eval 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

assert-eval

Installation

Quick Start

Available Metrics

Custom Evaluation Instructions

Verbose Output

PII Masking

LLM Configuration

Supported Bedrock Model Families

Proxy Configuration

Dependencies

Migrating from assert_llm_tools

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance