Quality Control (QC) laboratory of the CoReason platform

coreason-assay

The Scientific Testing Engine for AI Agents.

coreason-assay is the Quality Control (QC) laboratory of the CoReason platform. It provides a rigorous framework for evaluating the performance, safety, and alignment of AI agents before they are deployed to production.

Features

  • Benchmark Evaluation Corpus (BEC) Management: Easily ingest test cases from CSV, JSONL, or ZIP archives.
  • Simulation: Run agents in a controlled sandbox with mocked tools and injected context.
  • Glass Box Grading: Evaluate not just the answer, but the reasoning process (Faithfulness, Alignment, Tone).
  • Report Cards: Generate detailed reports with drift detection and pass/fail metrics.
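As a rough illustration of the report-card idea, the sketch below computes a pass rate from graded cases and flags drift against a baseline. It is not the coreason-assay API: GradedCase, report_card, the 0.7 pass threshold, and the 0.05 drift tolerance are all hypothetical names and values chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class GradedCase:
    """Hypothetical record: one test case and the score a grader gave it."""
    case_id: str
    score: float  # assumed to lie in [0, 1]


def report_card(cases, baseline_pass_rate, pass_threshold=0.7, drift_tolerance=0.05):
    """Summarize graded cases and compare the pass rate to a baseline.

    The thresholds are illustrative defaults, not values used by coreason-assay.
    """
    if not cases:
        raise ValueError("no graded cases to summarize")
    # A case passes when its score meets the threshold.
    passed = sum(1 for case in cases if case.score >= pass_threshold)
    pass_rate = passed / len(cases)
    # Drift is the signed change in pass rate relative to the baseline run.
    drift = pass_rate - baseline_pass_rate
    return {
        "total": len(cases),
        "passed": passed,
        "pass_rate": pass_rate,
        "drift": drift,
        "drift_detected": abs(drift) > drift_tolerance,
    }
```

Treating drift as a signed difference keeps regressions (negative values) distinguishable from improvements, while the tolerance avoids flagging run-to-run noise.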

Quick Start

Installation

From a clone of the repository, install the project and its dependencies with Poetry:

poetry install

Usage

Run the CLI to upload a test corpus:

poetry run coreason-assay upload path/to/bec_archive.zip
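To try the upload command, you need an archive to point it at. The sketch below packs a few test cases into a ZIP file containing one JSONL member, using only the Python standard library. The layout coreason-assay expects inside the archive is not described here, so the member name cases.jsonl and whatever fields you put in each case are assumptions; check the docs folder for the real schema.

```python
import json
import zipfile
from pathlib import Path


def build_archive(cases, archive_path, member_name="cases.jsonl"):
    """Write `cases` (a list of dicts) as JSONL inside a new ZIP archive.

    `member_name` is a hypothetical file name, not a documented requirement.
    """
    # JSONL: one JSON object per line, newline-terminated.
    lines = "\n".join(json.dumps(case) for case in cases) + "\n"
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        zf.writestr(member_name, lines)
    return Path(archive_path)
```

The resulting file can then be passed to the upload command in place of path/to/bec_archive.zip.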

Documentation

For full documentation, including architecture details, usage guides, and examples, see the docs folder in the repository.

License

This software is proprietary and dual-licensed. See LICENSE for details.

Download files

Source Distribution

coreason_assay-0.2.0.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

coreason_assay-0.2.0-py3-none-any.whl (31.9 kB view details)

Uploaded Python 3

File details

Details for the file coreason_assay-0.2.0.tar.gz.

File metadata

  • Download URL: coreason_assay-0.2.0.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for coreason_assay-0.2.0.tar.gz
Algorithm Hash digest
SHA256 3ecc90873e2b2595a4f07253283e67f5f0a073296cc64d8a28c06f3da53a3f7f
MD5 45458d0553a8e09b24a75582e2f792e9
BLAKE2b-256 c517cf099342c50c31bd43f979d11643eefb345776a9c91139e502a53e39e8b3

Provenance

The following attestation bundles were made for coreason_assay-0.2.0.tar.gz:

Publisher: publish.yml on CoReason-AI/coreason-assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file coreason_assay-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: coreason_assay-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 31.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for coreason_assay-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ccfd3d8957817a8c42ddab3f8bfadb9de9522c065c48c6e75ae0cbd2d79bacea
MD5 f1e90ad36cdfab8bf1dbe2f1e8a4e1fa
BLAKE2b-256 9f7d700b66b4ac0df3610e108e08b9714a5dc36547dd655d727f5c04209d8576

Provenance

The following attestation bundles were made for coreason_assay-0.2.0-py3-none-any.whl:

Publisher: publish.yml on CoReason-AI/coreason-assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
