Skip to main content

Behavioural verification of evolving systems under controlled variation.

Project description

Vigil

PyPi CI

Vigil is a small framework for behavioural verification under controlled variation.

It executes a system across declared variations, evaluates the resulting behaviour with checks, and records everything in a single structured report.
Vigil is system-agnostic and focuses on behavioural properties rather than performance benchmarks.

Running Vigil

Run Vigil on a project directory:

vigil [project_dir] [--trace]

Vigil opens an interactive menu to:

  • select a specification
  • run all variations and checks
  • inspect previous reports

Reports are written next to the specification file.

Project layout

A project directory contains:

  • exactly one backend
  • one or more specifications
  • optional local checks and variations
project/
  llm_backend.py
  language.yml
  prompts.yml
  checks/
    entity_types_agree.py
  variations/
    differ_language.py

Files ending in *.report.yml are treated as results, not specs.

Specification

A specification defines what behaviour is tested.

At minimum it contains:

  • hypothesis
  • inputs
  • variations
  • checks

Optional metadata:

  • title
title: Behavioural verification of DatsLlm with respect to language
hypothesis: Annotation behaviour remains stable across language routing

inputs:
  - text: "Hello world"

variations:
  - type: update_input_keys
    input:
      language: de

checks:
  - matches_baseline

Backend

A backend wraps the system under evaluation.

It combines:

  • function configuration (what is executed)
  • environment configuration (how and where it runs)

Backends implement:

update_environment(environment)
compute(input, function) -> output

The framework manages execution, isolation, and cleanup automatically.

Variations

Variations introduce controlled changes.

Each variation targets exactly one domain:

  • input
  • function
  • environment

Variations:

  • transform inputs or patch configuration
  • are applied explicitly and sequentially
  • never inspect outputs

Baseline execution is represented explicitly with none.

Vigil also supports a small convenience expansion:

variations: [{ type: repeat, times: N, do: [...] }]

This expands into a flat list by repeating the entries in do N times.

Syntactic sugar

The specification format provides light syntactic sugar for common patterns. This sugar expands into ordinary variations and does not add new semantics.

Supported forms include:

  • repetition of a variation block

These constructs exist purely to reduce duplication in specs. After parsing, the engine operates only on plain variations.

Checks

Checks evaluate observed behaviour.

Three intents exist:

  • unary
  • reference
  • group

Checks may be:

  • assertive (produce PASS / WARN / ERROR)
  • diagnostic (INFO only)

Checks operate only on recorded slices and never trigger execution.

Reports

Each run produces a single structured report containing:

  • metadata (title, hypothesis, timestamps)
  • backend configuration
  • all inputs and variations
  • all check results

Reports are YAML by default and meant for inspection, comparison, and archiving.

Vigil is intentionally small, explicit, and extensible.
New backends, variations, and checks can be added without modifying the core.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vigil_bv-0.1.1.tar.gz (21.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vigil_bv-0.1.1-py3-none-any.whl (22.5 kB view details)

Uploaded Python 3

File details

Details for the file vigil_bv-0.1.1.tar.gz.

File metadata

  • Download URL: vigil_bv-0.1.1.tar.gz
  • Upload date:
  • Size: 21.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vigil_bv-0.1.1.tar.gz
Algorithm Hash digest
SHA256 0f4cb2257d1ec1d7faa1c9f89841c86f72a6f7f4c3f3f1da1d43cee8d262bb4f
MD5 ae04722b6a8444e6618c735bde56eef7
BLAKE2b-256 90b85757d97d018734f9a194fafd3695b6e6708ee70dae9680b010c20adb1662

See more details on using hashes here.

Provenance

The following attestation bundles were made for vigil_bv-0.1.1.tar.gz:

Publisher: publish.yml on leo-lem/vigil

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vigil_bv-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: vigil_bv-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 22.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vigil_bv-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 273a2f6c5817a2a371ea9147f0ea54ad3349866b8188cdff0f6746cebc079f6d
MD5 19862522e75c9244dd982c64deb22467
BLAKE2b-256 51a9e22a58ab4d6a5282eb4faebc9fad77d9570e4fe2e0e73d190a53b8ee268d

See more details on using hashes here.

Provenance

The following attestation bundles were made for vigil_bv-0.1.1-py3-none-any.whl:

Publisher: publish.yml on leo-lem/vigil

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page