Skip to main content

Testing library for AI Agents by InvariantLabs

Reason this release was yanked:

0.0.1 was broken

Project description

Invariant testing

Helps you build better AI agents through debuggable unit testing

Documentation


Invariant testing is a lightweight library to write and run AI agent tests. It provides helpers and assertions that enable you to write robust tests for your agentic applications.

Using localized assertions, testing always points you to the exact part of the agent's behavior that caused a test to fail, making it easy to debug and resolve issues (think: stacktraces for agents).



Installation

pip install invariant

A quick example

# content of tests/test_weather.py
from invariant.testing import Trace, assert_equals

def test_weather():
    # create a Trace object from your agent trajectory
    trace = Trace(
        trace=[
            {"role": "user", "content": "What is the weather like in Paris?"},
            {"role": "agent", "content": "The weather in London is 75°F and sunny."},
        ]
    )

    # make assertions about the agent's behavior
    with trace.as_context():
        # extract the locations mentioned in the agent's response
        locations = trace.messages()[-1]["content"].extract("locations")

        # assert that the agent responded about Paris and only Paris
        assert_equals(1, locations.len(), 
            "The agent should respond about one location only")

        assert_equals("Paris", locations[0], "The agent should respond about Paris")

Execute it on the command line:

$ invariant test
________________________________ test_weather _________________________________
ERROR: 1 hard assertions failed:

 
    # assert that the agent responded about Paris and only Paris
    assert_equals(1, locations.len(), 
        "The agent should respond about one location only")
    
>   assert_equals("Paris", locations[0], "The agent should respond about Paris")
________________________________________________________________________________

ASSERTION FAILED: The agent should respond about Paris (expected: 'Paris', actual: 'London')
________________________________________________________________________________

#       role:  "user"
#       content:  "What is the weather like in Paris?"
#     },
#     {
#       role:  "agent"
        content:   "The weather in London is 75°F and sunny."
#     },
#  ]

The test result precisely localizes the failure in the provided agent trace.

Visual Test Viewer (Explorer):

As an alternative to the command line, you can also visualize test results on the Invariant Explorer:

$ invariant test --push

image

Like the terminal output, the Explorer highlights the relevant ranges, but does so even more precisely, marking the exact words that caused the assertion to fail.

Features

  • Comprehensive Trace API for easily navigating and checking agent traces.
  • Assertions library to check agent behavior, including fuzzy checkers such as Levenshtein distance, semantic similarity and LLM-as-a-judge pipelines.
  • Full pytest compatibility for easy integration with existing test and CI/CD pipelines.
  • Parameterized tests for testing multiple scenarios with a single test function.
  • Visual test viewer for exploring large traces and debugging test failures in Explorer

To learn more read the documentation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

invariant_ai-0.0.1.tar.gz (40.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

invariant_ai-0.0.1-py3-none-any.whl (55.0 kB view details)

Uploaded Python 3

File details

Details for the file invariant_ai-0.0.1.tar.gz.

File metadata

  • Download URL: invariant_ai-0.0.1.tar.gz
  • Upload date:
  • Size: 40.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.5 CPython/3.12.2 Linux/6.5.0-1025-azure

File hashes

Hashes for invariant_ai-0.0.1.tar.gz
Algorithm Hash digest
SHA256 bb54359ca4c06ae7e238dede196040cfe24d4b4ecc9d9489e2f248d50bacaf9e
MD5 d614268a9cc87a72abfb6836d5cb1010
BLAKE2b-256 3221cd98fcd1ca8099716016f41f2b743146cbf47bb97e47ea150295838f41db

See more details on using hashes here.

File details

Details for the file invariant_ai-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: invariant_ai-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 55.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.5 CPython/3.12.2 Linux/6.5.0-1025-azure

File hashes

Hashes for invariant_ai-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 20f29c80586ef72387b75e3ba5a24f3b72e94c139980b58a0052976a7af05b4e
MD5 4bf694348fbe3450606d7f2601ace37c
BLAKE2b-256 10e4522a4790ce8f58e105712dfd520c8b6d52c71ac9c3e187d71c6af0cc90f4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page