Vals AI CLI and SDK Tool

Install

pip install valsai

Overview

This Python package contains two components: a CLI tool for using Vals from the command line, and an SDK for using Vals from Python code.

All code is contained within vals/, separated by CLI and SDK. Note that many of the CLI functions are just thin wrappers around the SDK.

Examples

We've provided a set of examples in examples/. They are organized as follows:

  • example_suites/ - These are example JSON files, of the sort that you may use for vals suite create <example.json>.
  • example_scripts/ - These are example Python scripts that leverage the SDK.
  • example_run_configs/ - When you start a run, you can optionally provide more parameters (more details below). These are examples of parameter files.

Authentication

Make an account at https://platform.vals.ai and confirm your email.

Then, go to https://platform.vals.ai/auth and create an API key.

If you are using the command line, you should set the following:

export VALS_API_KEY=<YOUR API KEY>

If you're using the EU instance of Vals, you should also set export VALS_REGION=europe. Add these lines to your .zshrc / .bashrc to make them permanent.

See the documentation for passing in your API key directly via Python code for SDK usage.
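Before calling the CLI or SDK from a script, it can help to confirm that the environment variables described above are actually set. The following is a minimal sketch using only the standard library; the helper name vals_env_summary is hypothetical and is not part of the valsai package.

```python
import os


def vals_env_summary(environ=None):
    """Summarize the Vals-related environment variables.

    `environ` defaults to os.environ; passing a plain dict makes the
    helper easy to test. This helper is illustrative, not part of valsai.
    """
    env = os.environ if environ is None else environ
    return {
        # True only when VALS_API_KEY is present and non-empty.
        "api_key_set": bool(env.get("VALS_API_KEY")),
        # None when VALS_REGION is unset (i.e. no region override).
        "region": env.get("VALS_REGION"),
    }
```

A script could call vals_env_summary() at startup and stop with a clear message when "api_key_set" is False, rather than failing later on the first request.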

CLI Usage

The CLI is run as follows:

$ vals

Use the --help flag at the top and subcommand level for guidance.

Commands must be run from the pip environment the CLI was installed in. Commands are split up into subcommands. Currently there are two subcommands:

  • vals suite: relating to creating / updating tests and suites
  • vals run: relating to creating and querying runs and run results.

Create Test Suite

Create a test suite interactively from the command line:

vals suite create --interactive

Create a test suite from a JSON file:

vals suite create ./example_suite.json

The vals suite create commands will produce a link to the created test suite.
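If you prefer to generate the JSON file from Python rather than write it by hand, a sketch along these lines may be useful. It assumes the JSON file uses the same field names as the SDK example in this README (title, description, tests, input_under_test, checks); the files in examples/example_suites/ remain the authoritative reference for the format. The helper write_suite_file is hypothetical.

```python
import json
from pathlib import Path

# Field names are assumed to match the SDK example in this README.
suite = {
    "title": "Tax Questions",
    "description": "Questions relating to tax incentives",
    "tests": [
        {
            "input_under_test": "What is QSBS?",
            "checks": [{"operator": "includes", "criteria": "C Corporation"}],
        }
    ],
}


def write_suite_file(suite, path):
    """Serialize a suite dict to a JSON file and return its path."""
    path = Path(path)
    path.write_text(json.dumps(suite, indent=2), encoding="utf-8")
    return path
```

After calling write_suite_file(suite, "example_suite.json"), the resulting file can be passed to vals suite create ./example_suite.json.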

Run

The test suite id embedded at the end of the URL is used in run requests.

Start a new run:

vals run start

Or specify the suite id (the last part of the suite URL):

vals run start -s <suite id>
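To avoid copying the suite id by hand, it can be pulled out of the suite link with the standard library. This sketch assumes the link has the form shown in the SDK section of this README, with the id carried in a test_suite_id query parameter; both helper names are hypothetical.

```python
from urllib.parse import parse_qs, urlparse


def suite_id_from_url(url):
    """Extract the test_suite_id query parameter from a suite link."""
    query = parse_qs(urlparse(url).query)
    values = query.get("test_suite_id")
    if not values:
        raise ValueError(f"no test_suite_id parameter in {url!r}")
    return values[0]


def run_start_command(suite_id):
    """Build the argument list for `vals run start -s <suite id>`."""
    return ["vals", "run", "start", "-s", suite_id]
```

The list returned by run_start_command can be handed to subprocess.run, provided the vals CLI is installed in the active environment.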

Full details of the CLI usage can be found in our documentation at https://www.platform.vals.ai/docs/index.html#/cli

SDK Usage

All of the functionality in the CLI can also be accessed by importing the SDK in Python. For example, to create a suite from Python, you could do the following:

from vals.sdk.suite import create_suite

create_suite(
  {
    "title": "Tax Questions",
    "description": "Questions relating to tax incentives",
    "tests": [
      {
        "input_under_test": "What is QSBS?",
        # "checks" is a list of check objects (a set of dicts is not valid Python)
        "checks": [
          {
            "operator": "includes",
            "criteria": "C Corporation"
          }
        ]
      }
    ]
  }
)
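For suites with many tests, writing the nested dictionary by hand gets repetitive. The sketch below builds the same structure with two small helpers; make_test and make_suite are hypothetical names introduced here for illustration, and the sketch assumes that checks is a list of operator/criteria objects.

```python
def make_test(input_under_test, checks):
    """Build one test entry from an input and (operator, criteria) pairs."""
    return {
        "input_under_test": input_under_test,
        "checks": [
            {"operator": operator, "criteria": criteria}
            for operator, criteria in checks
        ],
    }


def make_suite(title, description, tests):
    """Assemble the top-level suite dictionary."""
    return {"title": title, "description": description, "tests": list(tests)}


suite = make_suite(
    "Tax Questions",
    "Questions relating to tax incentives",
    [make_test("What is QSBS?", [("includes", "C Corporation")])],
)
```

The resulting suite dictionary has the same shape as the literal passed to create_suite above, so it could be passed in its place.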

Passing Functions to SDK

The SDK also includes features the CLI does not, such as the ability to test any arbitrary Python function rather than just the base model. Even if you have very complicated internal logic for parsing, chunking, prompt-chaining, etc., it can still be evaluated with the Vals platform.

First, define a function with the following signature:

def my_model(input_under_test: str) -> str:
  llm_output = "..."
  return llm_output

Then, create a suite in the platform and run it like the following:

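# run_evaluations must be imported from the Vals SDK first; see the SDK docs for the import path.
run_id = run_evaluations(
    # Replace with the link to your suite
    "https://platform.vals.ai/view?test_suite_id=xxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxx",
    # Pass the function defined above
    my_model,
)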

There is a full, working example in examples/example_scripts/sdk_example.py, and additional documentation in our docs: https://www.platform.vals.ai/docs/index.html#/sdk
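To illustrate how internal logic such as chunking and prompt-chaining can sit behind the required str -> str signature, here is a minimal sketch. The call_llm function is a hypothetical stand-in that only echoes its prompt; in practice it would be replaced by your own model call. Nothing here is part of the valsai package.

```python
def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real model call; it only echoes the prompt."""
    return f"[model output for: {prompt}]"


def chunk(text: str, size: int = 200) -> list[str]:
    """Split text into pieces of at most `size` characters."""
    pieces = [text[i:i + size] for i in range(0, len(text), size)]
    # Always return at least one piece so empty input still reaches the model.
    return pieces or [""]


def my_model(input_under_test: str) -> str:
    # Step 1: process each chunk of the input separately.
    notes = [call_llm(f"Summarize: {piece}") for piece in chunk(input_under_test)]
    # Step 2: chain the intermediate outputs into a final prompt.
    return call_llm("Answer using these notes: " + " ".join(notes))
```

Only my_model needs to match the signature shown above; the helpers it calls can be arbitrarily complex.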

Development

Local installation

Run the following command to install the package locally. The -e flag is optional but recommended, as it lets you change the code without reinstalling. The [dev] extra is required to install the dev-only dependencies.

pip install -e ".[dev]"

Running Codegen

Add additional .graphql files to vals/graphql, then run the following command:

ariadne-codegen --config codegen-config.toml

NOTE: This will overwrite anything in the vals/graphql_client directory.
