coreason-assay

The Scientific Testing Engine for AI Agents.

coreason-assay is the Quality Control (QC) laboratory of the CoReason platform. It provides a rigorous framework for evaluating the performance, safety, and alignment of AI agents before they are deployed to production.

Features

  • Benchmark Evaluation Corpus (BEC) Management: Easily ingest test cases from CSV, JSONL, or ZIP archives.
  • Simulation: Run agents in a controlled sandbox with mocked tools and injected context.
  • Glass Box Grading: Evaluate not just the answer, but the reasoning process (Faithfulness, Alignment, Tone).
  • Report Cards: Generate detailed reports with drift detection and pass/fail metrics.
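As an illustration of the corpus-ingestion feature, the following is a minimal sketch of how test cases might be read from a CSV file, a JSONL file, or a ZIP archive containing either. It is not the coreason-assay implementation: the function names parse_cases and load_corpus are hypothetical, and the assumption that each test case is a flat record (one CSV row or one JSON object per line) is ours, not the project's.

```python
import csv
import io
import json
import zipfile
from pathlib import Path

SUPPORTED = {".csv", ".jsonl"}


def parse_cases(name: str, text: str) -> list[dict]:
    """Parse one CSV or JSONL payload into a list of test-case dicts."""
    suffix = Path(name).suffix.lower()
    if suffix == ".csv":
        # Each CSV row becomes one test case keyed by the header row.
        return [dict(row) for row in csv.DictReader(io.StringIO(text))]
    if suffix == ".jsonl":
        # One JSON object per non-empty line.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    return []


def load_corpus(path: str) -> list[dict]:
    """Load test cases from a CSV file, a JSONL file, or a ZIP of either."""
    p = Path(path)
    if p.suffix.lower() != ".zip":
        return parse_cases(p.name, p.read_text(encoding="utf-8"))
    cases: list[dict] = []
    with zipfile.ZipFile(p) as archive:
        # Sort member names so the resulting order is deterministic.
        for member in sorted(archive.namelist()):
            if Path(member).suffix.lower() in SUPPORTED:
                text = archive.read(member).decode("utf-8")
                cases.extend(parse_cases(member, text))
    return cases
```

Archive members with other extensions are skipped silently in this sketch; a real ingester would probably report them.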

Quick Start

Installation

poetry install

Usage

Run the CLI to upload a test corpus:

poetry run coreason-assay upload path/to/bec_archive.zip

Server Mode (Service C)

coreason-assay can also be run as a containerized microservice exposing a REST API.

# Build the image, then start the server (requires Docker)
docker build -t coreason-assay:0.3.0 .
docker run -p 8000:8000 coreason-assay:0.3.0

The service exposes endpoints for uploading corpora (POST /upload) and running assays (POST /run). See docs/usage.md for details.
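A minimal sketch of a client for POST /upload, using only the Python standard library. The port comes from the docker run command above. Everything else is an assumption: the multipart form field name "file", the application/zip content type, and the helper name build_upload_request are hypothetical, so check docs/usage.md for the actual request format. The payload for POST /run is not guessed here.

```python
import urllib.request
import uuid
from pathlib import Path


def build_upload_request(
    base_url: str, archive_path: str, field: str = "file"
) -> urllib.request.Request:
    """Build (but do not send) a multipart POST /upload request.

    The form field name is an assumption, not confirmed by the project docs.
    """
    path = Path(archive_path)
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; '
        f'filename="{path.name}"\r\n'
        "Content-Type: application/zip\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/upload",
        data=head + path.read_bytes() + tail,
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )
```

With the container running, the request could be sent with urllib.request.urlopen(build_upload_request("http://localhost:8000", "path/to/bec_archive.zip")).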

Documentation

For full documentation, including architecture details, usage guides, and examples, please visit the docs folder.

License

This software is proprietary and dual-licensed. See LICENSE for details.
