Quality Control (QC) laboratory of the CoReason platform

coreason-assay

The Scientific Testing Engine for AI Agents.

coreason-assay is the Quality Control (QC) laboratory of the CoReason platform. It provides a rigorous framework for evaluating the performance, safety, and alignment of AI agents before they are deployed to production.

Features

  • Benchmark Evaluation Corpus (BEC) Management: Easily ingest test cases from CSV, JSONL, or ZIP archives.
  • Simulation: Run agents in a controlled sandbox with mocked tools and injected context.
  • Glass Box Grading: Evaluate not just the answer, but the reasoning process (Faithfulness, Alignment, Tone).
  • Report Cards: Generate detailed reports with drift detection and pass/fail metrics.
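
As an illustration of the ingestion feature listed above, here is a minimal sketch of loading test cases from CSV, JSONL, or ZIP inputs using only the Python standard library. It is not the coreason-assay API: the function names `parse_cases` and `load_corpus` are hypothetical, and the sketch assumes each CSV row or JSONL line is one test case.

```python
import csv
import io
import json
import zipfile
from pathlib import Path


def parse_cases(name: str, data: bytes) -> list[dict]:
    """Parse one CSV or JSONL payload into a list of test-case dicts (hypothetical helper)."""
    text = data.decode("utf-8")
    suffix = Path(name).suffix.lower()
    if suffix == ".csv":
        # Each row becomes a dict keyed by the header line.
        return [dict(row) for row in csv.DictReader(io.StringIO(text))]
    if suffix == ".jsonl":
        # One JSON object per non-empty line.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    raise ValueError(f"unsupported corpus file: {name}")


def load_corpus(path: str) -> list[dict]:
    """Load test cases from a CSV file, a JSONL file, or a ZIP archive of such files."""
    p = Path(path)
    if p.suffix.lower() == ".zip":
        cases: list[dict] = []
        with zipfile.ZipFile(p) as archive:
            # Sort member names so the resulting order is deterministic.
            for member in sorted(archive.namelist()):
                if member.lower().endswith((".csv", ".jsonl")):
                    cases.extend(parse_cases(member, archive.read(member)))
        return cases
    return parse_cases(p.name, p.read_bytes())
```

Calling `load_corpus("path/to/bec_archive.zip")` would return the combined cases from every CSV and JSONL member of the archive; other members are skipped.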

Quick Start

Installation

poetry install

Usage

Run the CLI to upload a test corpus:

poetry run coreason-assay upload path/to/bec_archive.zip

Server Mode (Service C)

coreason-assay can also be run as a containerized microservice exposing a REST API.

# Build the image and start the server (requires Docker)
docker build -t coreason-assay:0.4.0 .
docker run -p 8000:8000 coreason-assay:0.4.0

The service exposes endpoints for uploading corpora (POST /upload) and running assays (POST /run). See docs/usage.md for details.
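
A rough sketch of how a client might prepare requests for these two endpoints, using only the Python standard library. Only the paths (POST /upload, POST /run) and port 8000 come from this README. The multipart field name `file`, the JSON body for /run, and the helper names are assumptions for illustration; docs/usage.md is the authority on the actual request shapes.

```python
import json
import uuid
from urllib.request import Request

# Port 8000 matches the `docker run -p 8000:8000` command above.
BASE_URL = "http://localhost:8000"


def build_upload_request(filename: str, payload: bytes, field: str = "file") -> Request:
    """Build a multipart/form-data POST to /upload.

    The form field name ("file") is an assumption, not taken from the service docs.
    """
    boundary = uuid.uuid4().hex
    body = b"\r\n".join([
        f"--{boundary}".encode(),
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"'.encode(),
        b"Content-Type: application/octet-stream",
        b"",
        payload,
        f"--{boundary}--".encode(),
        b"",
    ])
    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    return Request(f"{BASE_URL}/upload", data=body, headers=headers, method="POST")


def build_run_request(params: dict) -> Request:
    """Build a JSON POST to /run. The body schema is an assumption."""
    body = json.dumps(params).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    return Request(f"{BASE_URL}/run", data=body, headers=headers, method="POST")
```

With the container running, either request could be sent with `urllib.request.urlopen(request)`. Building the request separately from sending it makes the assumed shapes easy to inspect and adjust.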

Documentation

For full documentation, including architecture details, usage guides, and examples, please visit the docs folder.

License

This software is proprietary and dual-licensed. See LICENSE for details.

Download files

Source Distribution

coreason_assay-0.4.0.tar.gz (25.0 kB view details)

Uploaded Source

Built Distribution

coreason_assay-0.4.0-py3-none-any.whl (34.5 kB view details)

Uploaded Python 3

File details

Details for the file coreason_assay-0.4.0.tar.gz.

File metadata

  • Download URL: coreason_assay-0.4.0.tar.gz
  • Upload date:
  • Size: 25.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for coreason_assay-0.4.0.tar.gz
Algorithm Hash digest
SHA256 769cc5b10b3d39cc1e9aa3d53de33512c567b73538d1d39f009c393682432d1f
MD5 4b0ff3dc754d5ecdb3e689620f1e3b9b
BLAKE2b-256 e8b37e4c82bd48dd260793629bac503637df3cd9c75e084c555b1676f298929a
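
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `sha256_of` and `matches_published_hash` are illustrative, and the local file path is assumed to be wherever the archive was saved.

```python
import hashlib
import hmac

# SHA256 digest listed above for coreason_assay-0.4.0.tar.gz.
PUBLISHED_SHA256 = "769cc5b10b3d39cc1e9aa3d53de33512c567b73538d1d39f009c393682432d1f"


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str = PUBLISHED_SHA256) -> bool:
    """Compare a file's SHA256 digest with the expected hex string."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

For example, `matches_published_hash("coreason_assay-0.4.0.tar.gz")` returns `True` only if the local file's digest equals the one listed above.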

Provenance

The following attestation bundles were made for coreason_assay-0.4.0.tar.gz:

Publisher: publish.yml on CoReason-AI/coreason-assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file coreason_assay-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: coreason_assay-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 34.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for coreason_assay-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 74c7fedeb10290dc346579de0cc9944ab1e7eed16d790abe65ea8f03c319be79
MD5 fdf6fba0649f567af67756e50b2e735e
BLAKE2b-256 9c4094d305aab2420fb26904323e4d8b6cc092c9d2bf5c0a3582f908a80e7684

Provenance

The following attestation bundles were made for coreason_assay-0.4.0-py3-none-any.whl:

Publisher: publish.yml on CoReason-AI/coreason-assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
