Skip to main content

Quality Control (QC) laboratory of the CoReason platform

Project description

coreason-assay

The Scientific Testing Engine for AI Agents.

CI/CD Docker codecov PyPI version Python versions License Ruff Checked with mypy pre-commit Poetry

coreason-assay is the Quality Control (QC) laboratory of the CoReason platform. It provides a rigorous framework for evaluating the performance, safety, and alignment of AI agents before they are deployed to production.

Features

  • Benchmark Evaluation Corpus (BEC) Management: Easily ingest test cases from CSV, JSONL, or ZIP archives.
  • Simulation: Run agents in a controlled sandbox with mocked tools and injected context.
  • Glass Box Grading: Evaluate not just the answer, but the reasoning process (Faithfulness, Alignment, Tone).
  • Report Cards: Generate detailed reports with drift detection and pass/fail metrics.

Quick Start

Installation

poetry install

Usage

Run the CLI to upload a test corpus:

poetry run coreason-assay upload path/to/bec_archive.zip

Documentation

For full documentation, including architecture details, usage guides, and examples, please visit the docs folder.

License

This software is proprietary and dual-licensed. See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

coreason_assay-0.1.0.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

coreason_assay-0.1.0-py3-none-any.whl (31.9 kB view details)

Uploaded Python 3

File details

Details for the file coreason_assay-0.1.0.tar.gz.

File metadata

  • Download URL: coreason_assay-0.1.0.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for coreason_assay-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3310e0cb7f8aedbf009ea944c54c7ec5ba4bf7a5e6abef6523e1bea679a62be8
MD5 f07ffa1b9e810896f40c16c55f376a61
BLAKE2b-256 19906cf930bdd9406f2ec3b18697b4a782d25695d40a53ab89e0515b8b3f0092

See more details on using hashes here.

Provenance

The following attestation bundles were made for coreason_assay-0.1.0.tar.gz:

Publisher: publish.yml on CoReason-AI/coreason-assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file coreason_assay-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: coreason_assay-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 31.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for coreason_assay-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ff61406a976090a355b6aba85aa946911e7cddbae8d0b7402cbb8b8ace17b02f
MD5 5ccd8f63985130f7b072477c3882f29e
BLAKE2b-256 61bc8a937677264442af5ee7a3ca1a6de785ff2566902415806c555ed0bf2f16

See more details on using hashes here.

Provenance

The following attestation bundles were made for coreason_assay-0.1.0-py3-none-any.whl:

Publisher: publish.yml on CoReason-AI/coreason-assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page