High-performance evaluation framework for LLM agents

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Rul1an

These details have not been verified by PyPI

Project links

Homepage

Project description

Assay Python SDK

Record deterministic traces from your Python agents for regression gating.

🚀 Golden Quickstart

The fastest way to regression test your AI agent.

1. Installation

pip install assay-it

2. Record (`record.py`)

Run your agent through the SDK to capture a trace. Pass your tool functions to tool_executors so Assay can record their inputs and outputs.

import os
import openai
from assay_sdk import TraceWriter, record_chat_completions_with_tools

# 1. Setup Client & Tools
client = openai.OpenAI(api_key=os.environ.get("OPENAI_API_KEY", "mock"))
TOOLS = [{
    "type": "function",
    "function": {
        "name": "GetWeather",
        "parameters": {"type": "object", "properties": {"location": {"type": "string"}}}
    }
}]

# 2. Define Execution Logic (The "Real" Code)
def get_weather(args):
    return {"temp": 22, "location": args.get("location")}

# 3. Record the Loop
writer = TraceWriter("traces/quickstart.jsonl")
result = record_chat_completions_with_tools(
    writer=writer,
    client=client,
    model="gpt-4o",
    messages=[{"role": "user", "content": "Weather in Tokyo?"}],
    tools=TOOLS,
    tool_executors={"GetWeather": get_weather}, # Link schema -> function
    episode_id="weather_demo",
    test_id="weather_check"
)
print(f"Agent Final Answer: {result['content']}")
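To sanity-check what was recorded, the trace can be read back line by line. The sketch below assumes only that the file is JSON Lines, as the `.jsonl` extension suggests. The record schema is not documented here, so `summarize_trace` is a hypothetical helper that counts records and collects whatever top-level keys it finds.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_trace(path):
    """Count records and top-level keys in a JSON Lines file.

    Assumption: each non-empty line is one JSON object. Nothing about
    the actual Assay record schema is assumed.
    """
    key_counts = Counter()
    records = 0
    with Path(path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            records += 1
            if isinstance(record, dict):
                key_counts.update(record.keys())
    return records, dict(key_counts)
```

Calling `summarize_trace("traces/quickstart.jsonl")` after a recording run returns the record count and a key histogram, which is a quick way to confirm the file is not empty before moving on to configuration.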

3. Configure (`assay.yaml`)

Tell Assay what to check.

version: 1
model: "trace"
tests:
  - id: weather_check
    input:
      prompt: "Weather in Tokyo?" # Matches the recorded prompt
    expected:
      type: regex_match
      pattern: ".*" # Pass if any content returned (baseline check)
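The `.*` pattern accepts any output, including an empty string, so it only confirms that the replay produces something. The sketch below uses Python's `re` module to illustrate the difference a tighter pattern makes. It assumes Assay's regex semantics are close to Python's for these simple patterns, which this page does not confirm, and `STRICTER` is a hypothetical example rather than a recommended value.

```python
import re

BASELINE = r".*"
# Hypothetical tighter pattern: the answer must mention the city.
STRICTER = r"Tokyo"


def passes(pattern, text):
    """Return True if the pattern matches anywhere in the text."""
    return re.search(pattern, text) is not None


def compare(answers):
    """Return (answer, baseline_result, stricter_result) for each answer."""
    return [(a, passes(BASELINE, a), passes(STRICTER, a)) for a in answers]
```

With `compare(["", "It is 22 degrees in Tokyo."])`, the baseline passes for both answers while the stricter pattern passes only for the second.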

4. Verify

Run the regression gate. This replays your trace against the recorded tool outputs to ensure determinism.

# Verify strictly (fails if any tool call arg changed even slightly)
assay ci --config assay.yaml --trace-file traces/quickstart.jsonl --replay-strict --db :memory:
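In a CI script it can be convenient to launch the same command from Python. This is a minimal sketch using only the flags shown above. The helper names are hypothetical, and it assumes that a non-zero exit code signals a failed gate, which this page does not state.

```python
import subprocess


def build_assay_ci_command(config="assay.yaml", trace_file="traces/quickstart.jsonl"):
    """Assemble the argv list for the `assay ci` invocation shown above."""
    return [
        "assay", "ci",
        "--config", config,
        "--trace-file", trace_file,
        "--replay-strict",
        "--db", ":memory:",
    ]


def run_gate(**kwargs):
    """Run the gate and return its exit code.

    Assumption: a non-zero exit code means the gate failed.
    Requires the `assay` executable to be on PATH.
    """
    completed = subprocess.run(build_assay_ci_command(**kwargs), check=False)
    return completed.returncode
```

Passing the arguments as a list avoids shell quoting issues with paths that contain spaces.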

🌊 Advanced: Streaming support

Capture streaming responses while maintaining tool call execution.

from assay_sdk import record_chat_completions_stream_with_tools

# ... setup client & writer ...

result = record_chat_completions_stream_with_tools(
    writer=writer,
    # ... args ...
    stream=True,  # SDK handles chunk aggregation automatically
    # tool_executors={...},  # Required if tools are used
)

Note: The hybrid wrapper (record_chat_completions_stream_with_tools) streams the thinking tokens to the user, executes tools, and then performs a standard follow-up call.
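For readers unfamiliar with chunk aggregation, the idea can be sketched independently of the SDK. The function below is not Assay's implementation. It is an illustrative stand-in that merges plain dicts shaped like the `delta` field of an OpenAI chat-completion chunk, joining text content and stitching tool-call argument fragments together by `index`.

```python
import json


def aggregate_chunks(deltas):
    """Merge streamed deltas into one message dict.

    Each delta is a plain dict with optional "content" text and optional
    "tool_calls" fragments keyed by "index" (illustrative shape only).
    """
    content_parts = []
    calls = {}  # index -> {"id", "name", "arguments"}
    for delta in deltas:
        if delta.get("content"):
            content_parts.append(delta["content"])
        for frag in delta.get("tool_calls") or []:
            call = calls.setdefault(
                frag["index"], {"id": None, "name": "", "arguments": ""}
            )
            if frag.get("id"):
                call["id"] = frag["id"]
            fn = frag.get("function") or {}
            call["name"] += fn.get("name") or ""
            call["arguments"] += fn.get("arguments") or ""
    tool_calls = [calls[i] for i in sorted(calls)]
    for call in tool_calls:
        # Argument fragments only form valid JSON once fully joined.
        call["arguments"] = json.loads(call["arguments"]) if call["arguments"] else {}
    return {"content": "".join(content_parts), "tool_calls": tool_calls}
```

The arguments are parsed only after all fragments are joined, because an individual fragment such as `{"loc` is not valid JSON on its own.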

🛡️ Advanced: Privacy & Redaction

Protect sensitive data (PII, API keys) from ever hitting the trace file.

from assay_sdk import TraceWriter, make_redactor

# Create a redactor that scrubs keys and regex patterns
redactor = make_redactor(
    key_denylist={"authorization", "password", "api_key"},
    patterns=[r"sk-[a-zA-Z0-9]{20,}"] # Mask OpenAI keys
)

# Attach to writer - happens automatically on write
writer = TraceWriter("traces/secure.jsonl", redact_fn=redactor)
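To make the two redaction rules concrete, here is a stand-alone sketch of what a redactor of this kind might do. It is not the implementation of `make_redactor`: the name `make_simple_redactor`, the `[REDACTED]` mask, and the case-insensitive key comparison are all assumptions made for illustration.

```python
import re

MASK = "[REDACTED]"


def make_simple_redactor(key_denylist, patterns):
    """Return a function that masks denylisted keys and regex matches.

    Illustrative stand-in only; not assay_sdk.make_redactor.
    """
    denied = {k.lower() for k in key_denylist}
    compiled = [re.compile(p) for p in patterns]

    def redact(value):
        if isinstance(value, dict):
            # Mask the whole value of a denylisted key; recurse otherwise.
            return {
                k: MASK if str(k).lower() in denied else redact(v)
                for k, v in value.items()
            }
        if isinstance(value, list):
            return [redact(v) for v in value]
        if isinstance(value, str):
            for rx in compiled:
                value = rx.sub(MASK, value)
        return value

    return redact
```

The sketch builds new containers rather than editing in place, so the object the agent is still using is left untouched while the copy headed for disk is scrubbed.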

⚡ Async Support

Native async support for high-throughput applications (FastAPI, etc.) is available via the assay_sdk.async_openai submodule. It provides full parity with the sync API, including the tool loop and streaming.
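The async function names are not listed on this page, so the sketch below does not call the SDK. It only shows a common asyncio pattern for running several recordings concurrently with a cap on parallelism. `record_episode` is a hypothetical stand-in for whatever async recording call you use.

```python
import asyncio


async def record_episode(episode_id):
    """Stand-in for one async recording call (hypothetical)."""
    await asyncio.sleep(0)  # yield to the event loop, as a network call would
    return f"recorded:{episode_id}"


async def record_all(episode_ids, max_concurrency=4):
    """Run recordings concurrently, at most `max_concurrency` at a time."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def bounded(episode_id):
        async with semaphore:
            return await record_episode(episode_id)

    # gather returns results in the same order as the inputs
    return await asyncio.gather(*(bounded(e) for e in episode_ids))
```

From synchronous code this can be driven with `asyncio.run(record_all(["a", "b", "c"]))`.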

❓ Troubleshooting

`E_TRACE_EPISODE_MISSING`

Cause: The test_id or episode_id in your trace doesn't match what assay ci expects from its config (or its implicit default).

Fix: Ensure the test IDs in assay.yaml match the test_id passed to record_chat_completions....
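A mismatch like this can be spotted before running the gate. The sketch below is a hypothetical helper, not part of Assay. It assumes that each trace line is a JSON object carrying a `test_id` key, which this page does not confirm, and that `config` is the parsed assay.yaml (for example via PyYAML's `yaml.safe_load`).

```python
import json


def find_missing_test_ids(config, trace_lines, id_field="test_id"):
    """Return config test IDs that never appear in the trace.

    `config` is the parsed assay.yaml as a dict; `trace_lines` is an
    iterable of JSON Lines strings. `id_field` is an assumed record key.
    """
    expected = {test["id"] for test in config.get("tests", [])}
    seen = set()
    for line in trace_lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        if isinstance(record, dict) and id_field in record:
            seen.add(record[id_field])
    return sorted(expected - seen)
```

An empty result means every test ID in the config has at least one matching record under that assumed key.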

"Duplicate prompt in strict replay"

Cause: You ran record.py twice without clearing the trace file, so it contains two identical episodes, and assay ci in strict mode cannot tell which one to replay.

Fix: Do one of the following:

- Truncate the file before recording: trace_path = "traces/my_trace.jsonl"; open(trace_path, 'w').close()
- Use a unique episode_id (e.g. a UUID) for every run.
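Both fixes fit in a couple of small helpers placed at the top of record.py. This is a minimal sketch: the helper names and the `prefix-uuid` naming scheme are illustrative choices, not SDK conventions.

```python
import uuid
from pathlib import Path


def start_fresh_trace(trace_path):
    """Create parent directories and truncate the trace file."""
    path = Path(trace_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text("", encoding="utf-8")  # truncate to zero bytes
    return path


def new_episode_id(prefix="weather_demo"):
    """Return an episode ID that is unique per run."""
    return f"{prefix}-{uuid.uuid4().hex}"
```

Call `start_fresh_trace("traces/my_trace.jsonl")` before constructing the writer, or pass `new_episode_id()` as the `episode_id` argument when recording.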

Release history

3.9.2

May 4, 2026

3.9.1

Apr 29, 2026

3.9.0

Apr 29, 2026

3.8.0

Apr 29, 2026

3.7.0

Apr 29, 2026

3.6.0

Apr 27, 2026

3.5.1

Apr 6, 2026

3.5.0

Mar 30, 2026

3.4.0

Mar 29, 2026

3.3.0

Mar 24, 2026

3.2.3

Mar 23, 2026

3.2.2

Mar 17, 2026

3.2.1

Mar 17, 2026

3.1.0

Mar 15, 2026

3.0.0

Mar 6, 2026

2.18.0

Feb 11, 2026

2.17.0

Feb 7, 2026

2.15.0

Feb 4, 2026

2.14.0

Feb 2, 2026

2.13.0

Feb 1, 2026

2.12.0

Jan 29, 2026

2.10.1

Jan 28, 2026

2.10.0

Jan 28, 2026

2.9.0

Jan 28, 2026

2.8.0

Jan 28, 2026

2.7.0

Jan 27, 2026

2.6.0

Jan 27, 2026

2.4.0

Jan 26, 2026

2.3.1

Jan 25, 2026

2.2.2

Jan 24, 2026

2.2.1

Jan 24, 2026

2.2.0

Jan 23, 2026

2.1.21

Jan 17, 2026

2.1.20

Jan 17, 2026

2.1.19

Jan 17, 2026

2.1.18

Jan 17, 2026

2.1.17

Jan 17, 2026

2.1.16

Jan 17, 2026

2.1.15

Jan 17, 2026

2.1.14

Jan 17, 2026

2.1.13

Jan 17, 2026

2.1.12

Jan 17, 2026

2.1.11

Jan 17, 2026

2.1.10

Jan 17, 2026

2.1.9

Jan 16, 2026

2.1.8

Jan 16, 2026

2.1.7

Jan 16, 2026

2.1.6

Jan 16, 2026

2.1.5

Jan 16, 2026

2.1.4

Jan 16, 2026

2.1.3

Jan 16, 2026

2.1.2

Jan 16, 2026

2.1.1

Jan 15, 2026

2.1.0

Jan 14, 2026

2.0.0

Jan 12, 2026

1.9.0

Jan 11, 2026

1.8.0

Jan 11, 2026

1.7.0

Jan 9, 2026

1.6.0

Jan 8, 2026

1.5.1

Jan 7, 2026

1.5.0

Jan 6, 2026

1.4.1

Jan 6, 2026

1.4.0

Jan 6, 2026

1.3.0

Jan 6, 2026

1.2.11

Jan 5, 2026

1.2.10

Jan 5, 2026

1.2.9

Jan 5, 2026

1.2.8

Jan 5, 2026

1.2.7

Jan 5, 2026

1.2.6

Jan 5, 2026

1.2.5

Jan 5, 2026

1.0.0

Dec 28, 2025

1.0.0rc3 pre-release

Dec 28, 2025

0.9.0

Dec 27, 2025

0.8.0 (this version)

Dec 27, 2025

0.8.0rc1 pre-release

Dec 27, 2025

Download files


Source Distribution

assay_it-0.8.0.tar.gz (28.0 kB)

Uploaded Dec 27, 2025 Source

Built Distribution


assay_it-0.8.0-py3-none-any.whl (38.5 kB)

Uploaded Dec 27, 2025 Python 3

File details

Details for the file assay_it-0.8.0.tar.gz.

File metadata

Download URL: assay_it-0.8.0.tar.gz
Upload date: Dec 27, 2025
Size: 28.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for assay_it-0.8.0.tar.gz
Algorithm	Hash digest
SHA256	`73b212dc77dcf5e3dd5215d49b75b45dcbff2169a76ac3babce0d65e0115f669`
MD5	`ff546c80b62c73b9ebb788dee042f970`
BLAKE2b-256	`c6ee629fc844bc0480499e5c44053085589a458992d35f8f247aa872fa27bad4`
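A downloaded file can be checked against the SHA256 digest in the table with the standard library. The sketch below is a generic verification helper. The function names are illustrative, and the constant is copied from the table above.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's digest with the published one, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# SHA256 listed on this page for assay_it-0.8.0.tar.gz
EXPECTED_SDIST_SHA256 = "73b212dc77dcf5e3dd5215d49b75b45dcbff2169a76ac3babce0d65e0115f669"
```

After downloading the source distribution, `matches_published_hash("assay_it-0.8.0.tar.gz", EXPECTED_SDIST_SHA256)` should return `True` for an unmodified file.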


Provenance

The following attestation bundles were made for assay_it-0.8.0.tar.gz:

Publisher: publish.yml on Rul1an/assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: assay_it-0.8.0.tar.gz
- Subject digest: 73b212dc77dcf5e3dd5215d49b75b45dcbff2169a76ac3babce0d65e0115f669
- Sigstore transparency entry: 780368810
- Sigstore integration time: Dec 27, 2025
Source repository:
- Permalink: Rul1an/assay@2713b4351dbfc11d69e1b4702e60f91bd02e6c02
- Branch / Tag: refs/tags/v0.8.0
- Owner: https://github.com/Rul1an
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@2713b4351dbfc11d69e1b4702e60f91bd02e6c02
- Trigger Event: push

File details

Details for the file assay_it-0.8.0-py3-none-any.whl.

File metadata

Download URL: assay_it-0.8.0-py3-none-any.whl
Upload date: Dec 27, 2025
Size: 38.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for assay_it-0.8.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fe2ea5e77f20c007205868d27cc4900c4514884be7bda09e5cab4c8a96db574b`
MD5	`1f06f64e5ffba393a9cb4b2a90385eea`
BLAKE2b-256	`4c83443f641f6a4c38b689d548aa3a233498aa2cfd0ca131b642e58f3fe924fd`


Provenance

The following attestation bundles were made for assay_it-0.8.0-py3-none-any.whl:

Publisher: publish.yml on Rul1an/assay

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: assay_it-0.8.0-py3-none-any.whl
- Subject digest: fe2ea5e77f20c007205868d27cc4900c4514884be7bda09e5cab4c8a96db574b
- Sigstore transparency entry: 780368812
- Sigstore integration time: Dec 27, 2025
Source repository:
- Permalink: Rul1an/assay@2713b4351dbfc11d69e1b4702e60f91bd02e6c02
- Branch / Tag: refs/tags/v0.8.0
- Owner: https://github.com/Rul1an
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@2713b4351dbfc11d69e1b4702e60f91bd02e6c02
- Trigger Event: push

assay-it 0.8.0
