A helper library to interact with Arize AI APIs

These details have not been verified by PyPI

Project links

Project description

Overview
Key Features
Installation
- Optional Dependencies
- Migrating from Version 7
Usage
SDK Configuration
Community

Overview

A helper package to interact with Arize AI APIs.

Arize is an AI engineering platform. It helps engineers develop, evaluate, and observe AI applications and agents.

Arize has both Enterprise and OSS products to support this goal:

Arize AX — an enterprise AI engineering platform from development to production, with an embedded AI Copilot
Phoenix — a lightweight, open-source project for tracing, prompt engineering, and evaluation
OpenInference — an open-source instrumentation package to trace LLM applications across models and frameworks

We log over 1 trillion inferences and spans, 10 million evaluation runs, and 2 million OSS downloads every month.

Key Features

Tracing - Trace your LLM application's runtime using OpenTelemetry-based instrumentation.
Evaluation - Leverage LLMs to benchmark your application's performance using response and retrieval evals.
Datasets - Create versioned datasets of examples for experimentation, evaluation, and fine-tuning.
Experiments - Track and evaluate changes to prompts, LLMs, and retrieval.
Playground- Optimize prompts, compare models, adjust parameters, and replay traced LLM calls.
Prompt Management- Manage and test prompt changes systematically using version control, tagging, and experimentation.

Installation

Install the base package:

pip install arize

Optional Dependencies

The following optional extras provide specialized functionality:

Note: The otel extra installs the arize-otel package, which is also available as a standalone package. If you only need auto-instrumentation without the full SDK, install arize-otel directly.

Extra	Install Command	What It Provides
otel	`pip install arize[otel]`	OpenTelemetry auto-instrumentation package (arize-otel) for automatic tracing
embeddings	`pip install arize[embeddings]`	Automatic embedding generation for NLP, CV, and structured data (Pillow, datasets, tokenizers, torch, transformers)
mimic	`pip install arize[mimic]`	MIMIC explainer for model interpretability

Install multiple extras:

pip install arize[otel,embeddings,mimic]

Migrating from Version 7

If you're upgrading from version 7, please refer to the Migration Guide for detailed migration steps and breaking changes.

Usage

Space Resolution

Throughout the SDK, any parameter named space accepts either a space ID (e.g. "spc_abc123") or a space name (e.g. "my-space"). The SDK resolves the identifier automatically:

If the value looks like a base64-encoded resource ID, it is used as a space ID directly.
Otherwise, it is treated as a case-insensitive substring filter on space names.

# Both of these are equivalent if your space is named "my-space"
client.datasets.list(space="U3BhY2U6OTA1MDoxSmtS")
client.datasets.list(space="my-space")

Instrumentation

See arize-otel in PyPI:

from arize.otel import register
from openinference.instrumentation.openai import OpenAIInstrumentor

# Setup OpenTelemetry via our convenience function
tracer_provider = register(
    space_id=SPACE_ID,
    api_key=API_KEY,
    project_name="agents-cookbook",
)

# Start instrumentation
OpenAIInstrumentor().instrument(tracer_provider=tracer_provider)

Operations on Spans

Use arize.spans to interact with spans: log spans into Arize, update the span's evaluations, annotations and metadata in bulk.

Logging spans

from arize import ArizeClient

client = ArizeClient(api_key=API_KEY)
SPACE_ID = "<your-space-id>"
PROJECT_NAME = "<your-project-name>"

client.spans.log(
    space_id=SPACE_ID,
    project_name=PROJECT_NAME,
    dataframe=spans_df,
    # evals_df=evals_df, # Optionally pass the evaluations together with the spans
)

Update spans Evaluations, Annotations, and Metadata

from arize import ArizeClient

client = ArizeClient(api_key=API_KEY)
SPACE_ID = "<your-space-id>"
PROJECT_NAME = "<your-project-name>"

client.spans.update_evaluations(
    space_id=SPACE_ID,
    project_name=PROJECT_NAME,
    dataframe=evals_df,
    # force_http=... # Optionally pass force_http to update evaluations via HTTP instead of gRPC, defaults to False
)

client.spans.update_annotations(
    space_id=SPACE_ID,
    project_name=PROJECT_NAME,
    dataframe=annotations_df,
)

client.spans.update_metadata(
    space_id=SPACE_ID,
    project_name=PROJECT_NAME,
    dataframe=metadata_df,
)

Exporting spans

Use the export_to_df or export_to_parquet to export large amounts of spans from Arize.

from arize import ArizeClient
from datetime import datetime

FMT  = "%Y-%m-%d"
start_time = datetime.strptime("2024-01-01",FMT)
end_time = datetime.strptime("2026-01-01",FMT)

client = ArizeClient(api_key=API_KEY)
SPACE_ID = "<your-space-id>"
PROJECT_NAME = "<your-project-name>"

df = client.spans.export_to_df(
    space_id=SPACE_ID,
    project_name=PROJECT_NAME,
    start_time=start_time,
    end_time=end_time,
)

Operations on ML Models

Use arize.ml to interact with ML models: log ML data (training, validation, production) into Arize, either streaming or in batches.

Stream log ML Data for a Classification use-case

from arize import ArizeClient
from arize.ml.types import ModelTypes, Environments

client = ArizeClient(api_key=API_KEY)
SPACE_ID = "<your-space-id>"
MODEL_NAME = "<your-model-name>"

features=...
embedding_features=...

response = client.ml.log_stream(
    space_id=SPACE_ID,
    model_name=MODEL_NAME,
    model_type=ModelTypes.SCORE_CATEGORICAL,
    environment=Environments.PRODUCTION,
    prediction_label=("not fraud",0.3),
    actual_label=("fraud",1.0),
    features=features,
    embedding_features=embedding_features,
)

Log a batch of ML Data for a Object Detection use-case

from arize import ArizeClient
from arize.ml.types import ModelTypes, Environments

client = ArizeClient(api_key=API_KEY)
SPACE_ID = "<your-space-id>"
MODEL_NAME = "<your-model-name>"
MODEL_VERSION = "1.0"

from arize.ml.types import Schema, EmbeddingColumnNames, ObjectDetectionColumnNames, ModelTypes, Environments

tags = ["drift_type"]
embedding_feature_column_names = {
    "image_embedding": EmbeddingColumnNames(
        vector_column_name="image_vector", link_to_data_column_name="url"
    )
}
object_detection_prediction_column_names = ObjectDetectionColumnNames(
    bounding_boxes_coordinates_column_name="prediction_bboxes",
    categories_column_name="prediction_categories",
    scores_column_name="prediction_scores",
)
object_detection_actual_column_names = ObjectDetectionColumnNames(
    bounding_boxes_coordinates_column_name="actual_bboxes",
    categories_column_name="actual_categories",
)

# Define a Schema() object for Arize to pick up data from the correct columns for logging
schema = Schema(
    prediction_id_column_name="prediction_id",
    timestamp_column_name="prediction_ts",
    tag_column_names=tags,
    embedding_feature_column_names=embedding_feature_column_names,
    object_detection_prediction_column_names=object_detection_prediction_column_names,
    object_detection_actual_column_names=object_detection_actual_column_names,
)

# Logging Production DataFrame
response = client.ml.log_batch(
    space_id=SPACE_ID,
    model_name=MODEL_NAME,
    model_type=ModelTypes.OBJECT_DETECTION,
    dataframe=prod_df,
    schema=schema,
    environment=Environments.PRODUCTION,
    model_version = MODEL_VERSION, # Optionally pass a model version
)

Exporting ML Data

Use the export_to_df or export_to_parquet to export large amounts of spans from Arize.

from arize import ArizeClient
from datetime import datetime

FMT  = "%Y-%m-%d"
start_time = datetime.strptime("2024-01-01",FMT)
end_time = datetime.strptime("2026-01-01",FMT)

client = ArizeClient(api_key=API_KEY)
SPACE_ID = "<your-space-id>"
MODEL_NAME = "<your-model-name>"
MODEL_VERSION = "1.0"

df = client.ml.export_to_df(
    space_id=SPACE_ID,
    model_name=MODEL_NAME,
    environment=Environments.TRAINING,
    model_version=MODEL_VERSION,
    start_time=start_time,
    end_time=end_time,
)

Generate embeddings for your data

import pandas as pd
from arize.embeddings import EmbeddingGenerator, UseCases

# You can check available models
print(EmbeddingGenerator.list_pretrained_models())

# Example dataframe
df = pd.DataFrame(
    {
        "text": [
            "Hello world.",
            "Artificial Intelligence is the future.",
            "Spain won the FIFA World Cup on 2010.",
        ],
    }
)
# Instantiate the generator for your usecase, selecting the base model
generator = EmbeddingGenerator.from_use_case(
    use_case=UseCases.NLP.SEQUENCE_CLASSIFICATION,
    model_name="distilbert-base-uncased",
    tokenizer_max_length=512,
    batch_size=100,
)

# Generate embeddings
df["text_vector"] = generator.generate_embeddings(text_col=df["text"])

Operations on Datasets

List Datasets

You can list all datasets that the user has access to using client.datasets.list(). You can use the limit parameter to specify the maximum number of datasets desired in the response and you can specify space to filter by space. The space parameter accepts either a space ID (e.g. "U3BhY2U6OTA1MDoxSmtS") or a space name for substring filtering. You can also filter by dataset name.

resp = client.datasets.list(
    name=... # Optional, case-insensitive substring filter on dataset name
    space=... # Optional, space ID (e.g. "U3BhY2U6OTA1MDoxSmtS") or space name (e.g. "my-space")
    limit=... # Optional, defaults to 100
)

The response is an object of type DatasetsList200Response, and you can access the list of datasets via its datasets attribute. In addition, you can transform the response object to a dictionary, to JSON format, or a pandas dataframe.

# Get the list of datasets from the response
dataset_list = resp.datasets
# Get the response as a dictionary
resp_dict = resp.to_dict()
# Get the response in JSON format
resp_json = resp.to_json()
# Get the response as a pandas dataframe
resp_df = resp.to_df()

Create a Dataset

You can create a dataset using client.datasets.create(). You must pass examples — we currently don't support creating an empty dataset. Examples can be provided as a list of dictionaries or a pandas dataframe.

examples = [
    {
        "eval.Correctness Basic.explanation": "The query indicates that the user is having trouble accessing their account on their laptop, while access on their phone is still working. This suggests a potential issue with the login process on the laptop, which aligns with the 'Login Issues' queue. The mention of a possible change in the account could relate to login credentials or settings affecting the laptop specifically, but it still falls under the broader category of login issues.",
        "eval.Correctness Basic.label": "correct",
        "eval.Correctness Basic.score": 1,
        "llm output": "Login Issues",
        "query": "I can't get in on my laptop anymore, but my phone still works fine — could this be because I changed something in my account?"
    },
    {
        "eval.Correctness Basic.explanation": "The query is about a user who signed up but is unable to log in because the system says no account is found. This issue is related to the login process, as the user is trying to access their account and is facing a problem with the login system recognizing their account. Therefore, assigning this query to the 'Login Issues' queue is appropriate.",
        "eval.Correctness Basic.label": "correct",
        "eval.Correctness Basic.score": 1,
        "llm output": "Login Issues",
        "query": "Signed up ages ago but never got around to logging in — now it says no account found. Do I start over?"
    }
]

If the number of examples is too large, the client SDK will try to send the data via Arrow Flight via gRPC for better performance. If you want to force the data transfer to HTTP you can use the force_http flag. The response is a Dataset object.

created_dataset = client.datasets.create(
    space="<space-id-or-name>",
    name="<your-dataset-name>", # Name must be unique within a space
    examples=..., # List of dictionaries or pandas dataframe
    # force_http=... # Optionally pass force_http to create datasets via HTTP instead of gRPC, defaults to False
)

The Dataset object also has convenience methods:

# Get the response as a dictionary
dataset_dict = created_dataset.to_dict()
# Get the response in JSON format
dataset_json = created_dataset.to_json()

Get a Dataset

To get a dataset by its ID or name use client.datasets.get(). You can optionally pass space for disambiguation when looking up by name. The returned type is Dataset.

dataset = client.datasets.get(
    dataset=... # The dataset ID or name
    space=... # Optional, space ID or name (required when looking up by dataset name)
)

Delete a Dataset

To delete a dataset by its ID or name use client.datasets.delete(). The call returns None if successful deletion took place, error otherwise.

client.datasets.delete(
    dataset=... # The dataset ID or name
    space=... # Optional, space ID or name (required when looking up by dataset name)
)

List Dataset Examples

You can list the examples of a given dataset using client.datasets.list_examples(). You can specify the number of examples desired using the limit parameter. If you want all examples, use all=True, which exports data using Arrow Flight via gRPC for increased performance.

resp = client.datasets.list_examples(
    dataset="<your-dataset-id-or-name>",
    space=..., # Optional, space ID or name
    dataset_version_id=..., # Optional, defaults to latest version
    limit=... # number of desired examples. Defaults to 100
    all=... # Whether or not to export all of the examples. Defaults to False
)

The response is an object of type DatasetsExamplesList200Response, and you can access the list of examples via its examples attribute. In addition, you can transform the response object to a dictionary, to JSON format, or a pandas dataframe.

# Get the list of examples from the response
examples_list = resp.examples
# Get the response as a dictionary
resp_dict = resp.to_dict()
# Get the response in JSON format
resp_json = resp.to_json()
# Get the response as a pandas dataframe
resp_df = resp.to_df()

Append Dataset Examples

You can append examples to an existing dataset version using client.datasets.append_examples(). This creates a new dataset version with the appended data and returns the updated Dataset object.

updated_dataset = client.datasets.append_examples(
    dataset="<your-dataset-id-or-name>",
    space=..., # Optional, space ID or name
    dataset_version_id="<version-id-to-append-to>",
    examples=..., # List of dictionaries or pandas dataframe
)

Update Dataset Examples

You can update the content of existing dataset examples using client.datasets.update_examples(). Examples are matched by their existing id; an id not found in the targeted version is ignored (no error, no insert — use append_examples to add new examples). Pass new_version to snapshot the update as a new dataset version instead of updating in place.

updated_version = client.datasets.update_examples(
    dataset="<your-dataset-id-or-name>",
    space=..., # Optional, space ID or name
    dataset_version_id="<version-id-to-update>", # Optional, defaults to latest version
    examples=[
        {"id": "<existing-example-id>", "question": "...", "answer": "..."},
    ],
    new_version=..., # Optional, name for a new version; omit to update in place
)

Delete Dataset Examples

You can delete a batch of examples from a dataset version using client.datasets.delete_examples(). Examples are removed in place from the given dataset_version_id (no new version is created). The delete is partial-tolerant and idempotent, so re-submitting already-deleted IDs is safe. Up to 1000 example IDs may be deleted per request, and they must not contain duplicate or empty values.

resp = client.datasets.delete_examples(
    dataset="<your-dataset-id-or-name>",
    space=..., # Optional, space ID or name
    dataset_version_id="<version-id-to-delete-from>",
    examples=..., # List of example IDs to delete (1-1000, no duplicates or empty values)
)

The response is an object of type DeleteDatasetExamplesResponse. Use completed to check whether the operation finished (retry the full request if False), deleted_example_ids for the IDs confirmed deleted, and not_deleted_example_ids for requested IDs that were not deleted (not found in the selected version, or not completed).

# Whether the operation finished and no retry is needed
completed = resp.completed
# IDs confirmed deleted in this request
deleted = resp.deleted_example_ids
# Requested IDs that were not deleted
not_deleted = resp.not_deleted_example_ids

Operations on Experiments

List Experiments

You can list all experiments that the user has access to using client.experiments.list(). You can use the limit parameter to specify the maximum number of experiments desired in the response and you can specify the dataset to target the list operation to a particular dataset.

resp = client.experiments.list(
    dataset=... # Optional, dataset ID or name
    space=... # Optional, space ID or name
    limit=... # Optional, defaults to 100
)

The response is an object of type ExperimentsList200Response, and you can access the list of experiments via its experiments attribute. In addition, you can transform the response object to a dictionary, to JSON format, or a pandas dataframe.

# Get the list of experiments from the response
experiment_list = resp.experiments
# Get the response as a dictionary
resp_dict = resp.to_dict()
# Get the response in JSON format
resp_json = resp.to_json()
# Get the response as a pandas dataframe
resp_df = resp.to_df()

Run an Experiment

You can run an experiment on a dataset using client.experiments.run() by defining a task, evaluators (optional), and passing the dataset id or name of the dataset you want to use, together with a name for the experiment. The function will download the entire dataset from Arize (unless cached, see caching section under "SDK Configuration"), execute the task to obtain an output, and perform evaluations (if evaluators were passed). The experiments will also be traced, and these traces will be visible in Arize. The experiment will be created and the data logged into Arize automatically. You can avoid logging to Arize by making dry_run=True. The function will return the Experiment object (or None if dry_run=True) together with the dataframe with the experiment data.

from arize.experiments import ExperimentMetadata

experiment, experiment_df = client.experiments.run(
    name="<name-your-experiment>",
    dataset="<your-dataset-id-or-name>",
    space=..., # Optional, space ID or name
    task=..., # The task to be performed in the experiment.
    evaluators=..., # Optional: The evaluators to use in the experiment.
    dry_run=..., # If True, the experiment result will not be uploaded to Arize. Defaults to False
    dry_run_count=..., # Number of examples of the dataset to use in the dry run. Defaults to 10
    concurrency=..., # The number of concurrent tasks to run. Defaults to 3.
    set_global_tracer_provider=..., # If True, sets the global tracer provider for the experiment. Defaults to False
    exit_on_error=..., # If True, the experiment will stop running on first occurrence of an error. Defaults to False
    metadata=ExperimentMetadata(user_id="...", user_email="..."), # Optional: metadata attached to every experiment span
)

metadata accepts an ExperimentMetadata typed dict (for IDE autocomplete on known fields like user_id, user_name, user_email) or any plain dict[str, str] for custom fields. Dataset fields (dataset_id, dataset_name, dataset_version_id) are always populated automatically. Both can be combined via dict unpacking:

metadata={
    **ExperimentMetadata(user_email="alice@example.com"),
    "team": "ml-platform",
    "git_sha": "abc123",
}

The Experiment object also has convenience methods:

# Get the response as a dictionary
experiment_dict = experiment.to_dict()
# Get the response in JSON format
experiment_json = experiment.to_json()

Create an Experiment

It is possible that you have run the experiment yourself without the above function, and hence you already have experiment data that you want to send to Arize. In this case, use the client.experiments.create() method by passing the runs data as a list of dictionaries or pandas dataframe.

NOTE: If you don't have experiment data and want to run an experiment, see the client.experiments.run() section above.

In addition, you must specify which columns are the example_id and the result using ExperimentTaskFieldNames. If you have evaluation data, indicate the evaluation columns using EvaluationResultFieldNames.

If the number of runs is too large, the client SDK will try to send the data via Arrow Flight via gRPC for better performance. If you want to force the data transfer to HTTP you can use the force_http flag. The response is an Experiment object.

from arize.experiments.types import ExperimentTaskFieldNames, EvaluationResultFieldNames

created_experiment = client.experiments.create(
    name="<your-experiment-name>", # Name must be unique within a dataset
    dataset="<your-dataset-id-or-name>",
    space=..., # Optional, space ID or name
    experiment_runs=..., # List of dictionaries or pandas dataframe
    task_fields=ExperimentTaskFieldNames(...),
    evaluator_columns=... # Optional
    # force_http=... # Optionally pass force_http to create experiments via HTTP instead of gRPC, defaults to False
)

Get an Experiment

To get an experiment by its ID or name use client.experiments.get(). The returned type is Experiment (does not include experiment runs — use list_runs() for those).

experiment = client.experiments.get(
    experiment=... # The experiment ID or name
    dataset=... # Optional, dataset ID or name (required when looking up by experiment name)
    space=... # Optional, space ID or name
)

Delete an Experiment

To delete an experiment by its ID or name use client.experiments.delete(). The call returns None if successful deletion took place, error otherwise.

client.experiments.delete(
    experiment=... # The experiment ID or name
    dataset=... # Optional, dataset ID or name
    space=... # Optional, space ID or name
)

List Experiment Runs

You can list the runs of a given experiment using client.experiments.list_runs() and passing the experiment ID or name. You can specify the number of runs desired using the limit parameter. If you want all runs, consider using the all=True parameter, which will make it so the SDK exports the data using Arrow Flight via gRPC, for increased performance.

resp = client.experiments.list_runs(
    experiment="<your-experiment-id-or-name>",
    dataset=..., # Optional, dataset ID or name
    space=..., # Optional, space ID or name
    limit=... # number of desired runs. Defaults to 100
    all=... # Whether or not to export all of the runs. Defaults to False
)

The response is an object of type ExperimentsRunsList200Response, and you can access the list of runs via its experiment_runs attribute. In addition, you can transform the response object to a dictionary, to JSON format, or a pandas dataframe.

# Get the list of runs from the response
run_list = resp.experiment_runs
# Get the response as a dictionary
resp_dict = resp.to_dict()
# Get the response in JSON format
resp_json = resp.to_json()
# Get the response as a pandas dataframe
resp_df = resp.to_df()

Append Experiment Runs

Append between 1 and 1000 new runs to an existing experiment using client.experiments.append_runs(). Each run must include example_id (the ID of an example from the experiment's dataset) and output. The response includes the updated experiment and the generated run IDs in input order (run_ids).

result = client.experiments.append_runs(
    experiment="<your-experiment-id-or-name>",
    dataset=...,  # Optional, dataset ID or name (required when experiment is a name)
    space=...,    # Optional, space ID or name (required when dataset is a name)
    experiment_runs=[
        {"example_id": "ex-1", "output": "The answer is 42"},
        {"example_id": "ex-2", "output": "Paris"},
    ],
    # experiment_runs also accepts a pandas DataFrame
)
print(result.run_ids)  # IDs of the appended runs, in input order

Operations on Prompts

Use client.prompts to manage prompts and their versions in Arize's Prompt Hub.

List Prompts

resp = client.prompts.list(
    name=..., # Optional, case-insensitive substring filter on prompt name
    space=..., # Optional, space ID or name
    limit=..., # Optional, defaults to 100
)
prompt_list = resp.prompts

Create a Prompt

Creating a prompt also creates its first version. You must specify the space, a name, and the prompt content.

from arize.prompts.types import InputVariableFormat, LlmProvider, LLMMessage, InvocationParams

prompt = client.prompts.create(
    space="<space-id-or-name>",
    name="<your-prompt-name>",
    commit_message="Initial version",
    input_variable_format=InputVariableFormat.F_STRING, # or MUSTACHE
    provider=LlmProvider.OPEN_AI,
    messages=[
        LLMMessage(role="SYSTEM", content="You are a helpful assistant."),
        LLMMessage(role="USER", content="Answer this question: {question}"),
    ],
    description=..., # Optional
    model=..., # Optional model name
    invocation_params=..., # Optional
)

Get a Prompt

To get a prompt by its ID or name use client.prompts.get(). You can optionally request a specific version by version_id or by label. Defaults to the latest version. The returned type is PromptWithVersion.

prompt = client.prompts.get(
    prompt="<your-prompt-id-or-name>",
    space=..., # Optional, space ID or name
    version_id=..., # Optional, get a specific version
    label=..., # Optional, get the version with this label (e.g. "production")
)

Update a Prompt

updated_prompt = client.prompts.update(
    prompt="<prompt-id-or-name>",
    space=..., # Optional
    description="Updated description",
)

Delete a Prompt

client.prompts.delete(
    prompt="<prompt-id-or-name>",
    space=..., # Optional
)

Prompt Versions

You can list all versions of a prompt, create new versions, and retrieve a specific version.

# List versions
versions = client.prompts.list_versions(
    prompt="<prompt-id-or-name>",
    space=..., # Optional
    limit=..., # Optional, defaults to 100
)

# Get a specific version
version = client.prompts.get_version(version_id="<version-id>")

# Create a new version
new_version = client.prompts.create_version(
    prompt="<prompt-id-or-name>",
    space=..., # Optional
    commit_message="Update system prompt",
    input_variable_format=InputVariableFormat.F_STRING,
    provider=LlmProvider.OPEN_AI,
    messages=[...],
    model=..., # Optional
    invocation_params=..., # Optional
)

Prompt Labels

Labels allow you to tag a specific prompt version with a named alias (e.g. "production", "staging").

# Get the version currently tagged with a label
version = client.prompts.get_version_by_label(
    prompt="<prompt-id-or-name>",
    space=..., # Optional
    label_name="production",
)

# Set labels on a version (replaces all existing labels)
client.prompts.set_labels(
    version_id="<version-id>",
    labels=["production", "v2"],
)

# Delete a single label from a version
client.prompts.delete_label(
    version_id="<version-id>",
    label_name="staging",
)

Operations on Evaluators

Use client.evaluators to manage LLM evaluators and their versions.

List Evaluators

resp = client.evaluators.list(
    name=..., # Optional, case-insensitive substring filter on evaluator name
    space=..., # Optional, space ID or name
    limit=..., # Optional, defaults to 100
)
evaluator_list = resp.evaluators

Create an Evaluator

from arize.evaluators.types import TemplateConfig

evaluator = client.evaluators.create(
    name="<your-evaluator-name>",
    space="<space-id-or-name>",
    evaluator_type="template",
    commit_message="Initial version",
    template_config=TemplateConfig(...),
    description=..., # Optional
)

Get an Evaluator

evaluator = client.evaluators.get(
    evaluator="<evaluator-id-or-name>",
    space=..., # Optional
    version_id=..., # Optional, defaults to latest version
)

Update an Evaluator

updated = client.evaluators.update(
    evaluator="<evaluator-id-or-name>",
    space=..., # Optional
    name=..., # Optional
    description=..., # Optional
)

Delete an Evaluator

client.evaluators.delete(
    evaluator="<evaluator-id-or-name>",
    space=..., # Optional
)

Evaluator Versions

# List versions
versions = client.evaluators.list_versions(
    evaluator="<evaluator-id-or-name>",
    space=..., # Optional
    limit=..., # Optional, defaults to 100
)

# Get a specific version
version = client.evaluators.get_version(version_id="<version-id>")

# Create a new version
new_version = client.evaluators.create_version(
    evaluator="<evaluator-id-or-name>",
    space=..., # Optional
    commit_message="Updated template",
    template_config=TemplateConfig(...),
)

Operations on Tasks

Use client.tasks to manage online evaluation tasks that run continuously or on-demand against your production data.

List Tasks

resp = client.tasks.list(
    name=..., # Optional, case-insensitive substring filter on task name
    space=..., # Optional, space ID or name
    project=..., # Optional, filter by project (name or ID)
    dataset=..., # Optional, filter by dataset (name or ID)
    task_type=..., # Optional, "TEMPLATE_EVALUATION" or "CODE_EVALUATION"
    limit=..., # Optional, defaults to 100
)
task_list = resp.tasks

Create a Task

There are two task types for running evaluators, and one for server-side LLM experiments. Use the appropriate typed helper:

Template or code evaluation task — runs evaluators against spans from a project or experiments from a dataset.

from arize.tasks.types import TaskEvaluatorInput

# Project-based, continuous template evaluation task
task = client.tasks.create_evaluation_task(
    name="Production Hallucination Check",
    task_type="TEMPLATE_EVALUATION",
    evaluators=[TaskEvaluatorInput(
        evaluator_id="<evaluator-id>",   # Required
        column_mappings={"input": "attributes.input.value"},  # Optional
    )],
    project="<project-id-or-name>",  # Required if not using dataset
    space=...,            # Optional, space ID or name
    sampling_rate=1.0,    # Optional, fraction of data to evaluate (0.0–1.0)
    is_continuous=True,   # Optional, run continuously on new data
    query_filter=...,     # Optional, filter expression for spans
)

# Dataset-based code evaluation task
task = client.tasks.create_evaluation_task(
    name="My Code Evaluator Task",
    task_type="CODE_EVALUATION",
    evaluators=[TaskEvaluatorInput(evaluator_id="<evaluator-id>")],
    dataset="<dataset-id-or-name>",   # Required if not using project
    experiment_ids=["<experiment-id>"],  # Required when using dataset
)

Run experiment task — the server drives LLM calls using the configured AI integration. No local callable is required.

from arize.tasks.types import LlmGenerationRunConfig, RunConfiguration

task = client.tasks.create_run_experiment_task(
    name="GPT-4o Baseline Task",
    dataset="<dataset-id-or-name>",
    run_configuration=RunConfiguration(
        actual_instance=LlmGenerationRunConfig(
            experiment_type="LLM_GENERATION",
            ai_integration_id="<ai-integration-id>",
            model_name="gpt-4o",
            input_variable_format="F_STRING",
            messages=[
                {"role": "SYSTEM", "content": "You are a helpful assistant."},
                {"role": "USER", "content": "Answer: {question}"},
            ],
        )
    ),
    space=...,  # Optional, space ID or name
)

Get a Task

task = client.tasks.get(
    task="<task-id-or-name>",
    space=..., # Optional
)

Trigger a Task Run

You can trigger an on-demand run of a task. The returned TaskRun will initially have "pending" status. The method automatically dispatches based on the task type.

# Evaluation task (template_evaluation / code_evaluation)
run = client.tasks.trigger_run(
    task="<task-id-or-name>",
    space=...,                  # Optional
    data_start_time=...,        # Optional, start of data window
    data_end_time=...,          # Optional, end of data window
    max_spans=...,              # Optional, maximum spans to evaluate
    override_evaluations=...,   # Optional, re-evaluate already-evaluated spans
)

# Run experiment task — experiment_name is required
run = client.tasks.trigger_run(
    task="<task-id-or-name>",
    space=...,                  # Optional
    experiment_name="GPT-4o Baseline v2",  # Required for run_experiment tasks
    max_examples=50,            # Optional, limit number of dataset examples
    tracing_metadata={"source": "api"},  # Optional, metadata for traces
)

Monitor Task Runs

# List runs for a task
runs_resp = client.tasks.list_runs(
    task="<task-id-or-name>",
    space=..., # Optional
    status=..., # Optional, filter by "pending", "running", "completed", "failed", or "cancelled"
    limit=..., # Optional, defaults to 100
)

# Get a specific run
run = client.tasks.get_run(run_id="<run-id>")

# Cancel a pending or running run
cancelled_run = client.tasks.cancel_run(run_id="<run-id>")

# Wait for a run to reach a terminal state (completed, failed, or cancelled)
finished_run = client.tasks.wait_for_run(
    run_id="<run-id>",
    poll_interval=5.0, # Optional, seconds between polls. Defaults to 5.0
    timeout=600.0, # Optional, seconds before TimeoutError. Defaults to 600.0
)

Operations on Projects

Use client.projects to manage projects, which are namespaces for organizing tracing data.

List Projects

resp = client.projects.list(
    name=..., # Optional, case-insensitive substring filter on project name
    space=..., # Optional, space ID or name
    limit=..., # Optional, defaults to 100
)
project_list = resp.projects

Create a Project

project = client.projects.create(
    name="<your-project-name>", # Must be unique within the space
    space="<space-id-or-name>",
)

Get a Project

project = client.projects.get(
    project="<project-id-or-name>",
    space=..., # Optional
)

Update a Project

project = client.projects.update(
    project="<project-id-or-name>",
    space=..., # Optional, space ID or name (required when looking up by project name)
    name="<new-project-name>", # Must be unique within the space
)

Delete a Project

client.projects.delete(
    project="<project-id-or-name>",
    space=..., # Optional
)

Operations on Organizations

Use client.organizations to manage organizations in the Arize platform.

List Organizations

resp = client.organizations.list(
    name=...,   # Optional, case-insensitive substring filter on organization name
    limit=...,  # Optional, defaults to 50 (max 100)
    cursor=..., # Optional, pagination cursor from a previous response
)
org_list = resp.organizations

Get an Organization

org = client.organizations.get(
    organization="<organization-id-or-name>",
)

Create an Organization

org = client.organizations.create(
    name="<your-org-name>",  # Must be unique within the account
    description=...,         # Optional
)

Update an Organization

org = client.organizations.update(
    organization="<organization-id-or-name>",
    name=...,        # Optional updated name
    description=..., # Optional updated description (pass "" to clear)
)

Delete an Organization

Warning: This operation is irreversible. It permanently deletes the organization and all resources that belong to it, including all spaces and their contents.

client.organizations.delete(
    organization="<organization-id-or-name>",
)

Add a User to an Organization

Add a user to an organization (or update their role if already a member). The user must already exist in the account.

from arize.organizations.types import CustomOrgRole, PredefinedOrgRole

# Predefined role (ADMIN, MEMBER, READ_ONLY, or ANNOTATOR)
membership = client.organizations.add_user(
    organization="<organization-id-or-name>",
    user_id="<user-id>",
    role=PredefinedOrgRole(name="MEMBER"),
)

# Custom RBAC role
membership = client.organizations.add_user(
    organization="<organization-id-or-name>",
    user_id="<user-id>",
    role=CustomOrgRole(id="<role-id>"),
)

Remove a User from an Organization

Removes the user from the organization and all its child spaces.

client.organizations.remove_user(
    organization="<organization-id-or-name>",
    user_id="<user-id>",
)

Operations on Spaces

Use client.spaces to manage space memberships.

Add a User to a Space

Add a user to a space (or update their role if already a member). The user must already be a member of the space's parent organization.

from arize.spaces.types import CustomSpaceRole, PredefinedSpaceRole

# Predefined role (ADMIN, MEMBER, READ_ONLY, or ANNOTATOR)
membership = client.spaces.add_user(
    space="<space-id-or-name>",
    user_id="<user-id>",
    role=PredefinedSpaceRole(name="MEMBER"),
)

# Custom RBAC role
membership = client.spaces.add_user(
    space="<space-id-or-name>",
    user_id="<user-id>",
    role=CustomSpaceRole(id="<role-id>"),
)

Remove a User from a Space

client.spaces.remove_user(
    space="<space-id-or-name>",
    user_id="<user-id>",
)

Operations on Users

Use client.users to manage users in the Arize platform.

Note: Unlike organizations, users are identified by opaque ID only — not by name. User display names are not unique within an account, so all methods require the user's ID.

List Users

resp = client.users.list(
    email=...,   # Optional, case-insensitive partial match on email
    status=...,  # Optional, list of statuses: "ACTIVE", "INVITED", "EXPIRED"
    limit=...,   # Optional, defaults to 50 (max 100)
    cursor=...,  # Optional, pagination cursor from a previous response
)
user_list = resp.users

Get a User

Accepts either a user ID or an email address. When an email is provided, it is resolved to the matching user (returns None if no match is found).

# by ID
user = client.users.get(user="<user-id>")

# by email
user = client.users.get(user="jane.smith@example.com")

Create a User

from arize.users.types import PredefinedUserRole

user = client.users.create(
    name="Jane Smith",
    email="jane.smith@example.com",
    role=PredefinedUserRole(name="MEMBER"),  # ADMIN, MEMBER, ANNOTATOR
    invite_mode="EMAIL_LINK",  # NONE, EMAIL_LINK, TEMPORARY_PASSWORD
)

Update a User

user = client.users.update(
    user_id="<user-id>",
    name=...,         # Optional updated display name
    is_developer=..., # Optional, grant or revoke developer permissions
)

Delete a User

Warning: This operation soft-deletes the user and cascades to organization memberships, space memberships, API keys, and role bindings.

client.users.delete(
    user_id="<user-id>",
)

Bulk Delete Users

Note: This is a beta endpoint. Enable it via the pre-release opt-in.

Delete multiple users in a single call by ID, by email, or both. Returns a list of BulkUserDeletionResult objects recording the outcome of each attempt.

from arize import ArizeClient, BulkUserDeletionResult

client = ArizeClient(api_key=API_KEY)

results = client.users.bulk_delete(
    user_ids=["usr_abc123", "usr_def456"],   # Optional
    emails=["alice@example.com"],             # Optional — resolved to IDs automatically
)

for result in results:
    print(result.id, result.status, result.error)

Each BulkUserDeletionResult has:

Field	Type	Description
`id`	`str`	User ID, or the email if the user could not be resolved
`status`	`DeletionStatus`	`"deleted"`, `"not_found"`, or `"failed"`
`error`	`str \| None`	Error message when `status` is `"not_found"` or `"failed"`

Resend a User Invitation

client.users.resend_invitation(
    user_id="<user-id>",  # Must be in "INVITED" status
)

Reset a User's Password

Triggers a password-reset email. The user must authenticate via password (not SSO/SAML) and must have already verified their account.

client.users.reset_password(
    user_id="<user-id>",
)

Operations on API Keys

Use client.api_keys to manage API keys in the Arize platform. Two key types are supported: user keys (authenticate as the creating user) and service keys (scoped to a specific space).

Note: The raw key value is only returned at creation time. Store it securely.

List API Keys

resp = client.api_keys.list(
    key_type=...,  # Optional, "USER" or "SERVICE"
    status=...,    # Optional, "ACTIVE" or "REVOKED"
    space=...,     # Optional, space ID or name (filters to service keys for that space)
    user_id=...,   # Optional, filter by user (admin only for user keys)
    limit=...,     # Optional, defaults to 50 (max 100)
    cursor=...,    # Optional, pagination cursor from a previous response
)
api_key_list = resp.api_keys

Create an API Key

Creates a user-type key that authenticates as the creating user. For space-scoped service keys, use create_service_key() instead.

created = client.api_keys.create(
    name="my-api-key",
    description=...,  # Optional
    expires_at=...,   # Optional, must be a future timestamp
)
raw_key = created.key  # Store this securely — only returned once

Create a Service Key

Creates a service-type key scoped to a specific space, backed by a dedicated bot user. Role assignments default to space_role="MEMBER", org_role="READ_ONLY", account_role="MEMBER" when omitted. All roles must be at or below the caller's own privilege level.

created = client.api_keys.create_service_key(
    name="my-service-key",
    space="<space-id-or-name>",
    description=...,  # Optional
    expires_at=...,   # Optional
    space_role=...,   # Optional, "ADMIN" | "MEMBER" | "READ_ONLY"
    org_role=...,     # Optional, "ADMIN" | "MEMBER" | "READ_ONLY"
    account_role=..., # Optional, "ADMIN" | "MEMBER"
)
raw_key = created.key  # Store this securely — only returned once

Revoke an API Key

Sets the key's status to revoked and deactivates it immediately. Revoking an already-revoked key is a no-op and still succeeds. Returns None on success.

client.api_keys.revoke(
    api_key_id="<api-key-id>",
)

Refresh an API Key

Atomically revokes the old key and issues a replacement with the same name, description, type, and scope.

Use grace_period_seconds to keep the old key valid briefly while your services rotate to the new key.

refreshed = client.api_keys.refresh(
    api_key_id="<api-key-id>",
    expires_at=...,  # Optional, new expiration for the replacement key
    grace_period_seconds=...,  # Optional, old key remains valid for N seconds
)
new_raw_key = refreshed.key  # Store securely — only returned once

Common patterns:

# Immediate invalidation of old key (default)
refreshed = client.api_keys.refresh(api_key_id="<api-key-id>")

# Graceful rotation window (old key valid for 5 minutes)
refreshed = client.api_keys.refresh(
    api_key_id="<api-key-id>",
    grace_period_seconds=300,
)

# Graceful rotation + expiry on the replacement key
refreshed = client.api_keys.refresh(
    api_key_id="<api-key-id>",
    expires_at=...,
    grace_period_seconds=300,
)

Notes:

expires_at applies to the replacement key.
grace_period_seconds applies to the old key.
Refreshing a deleted or already expired key returns an error.

Operations on Annotation Configs

Use client.annotation_configs to manage annotation configurations that define scoring schemas for human feedback.

List Annotation Configs

resp = client.annotation_configs.list(
    name=..., # Optional, case-insensitive substring filter
    space=..., # Optional, space ID or name
    limit=..., # Optional, defaults to 100
)
config_list = resp.annotation_configs

Create an Annotation Config

Three config types are supported: continuous (a numeric score in a range), categorical (a fixed set of labeled values), and freeform (open-ended text with no scale). Use whichever create_* method matches the config type you want.

from arize.annotation_configs.types import CategoricalAnnotationValue, OptimizationDirection

# Continuous (numeric) annotation config
config = client.annotation_configs.create_continuous(
    name="quality-score",
    space="<space-id-or-name>",
    minimum_score=0.0,
    maximum_score=1.0,
    optimization_direction=OptimizationDirection.MAXIMIZE,
)

# Categorical annotation config
config = client.annotation_configs.create_categorical(
    name="correctness",
    space="<space-id-or-name>",
    values=[
        CategoricalAnnotationValue(label="correct", score=1),
        CategoricalAnnotationValue(label="incorrect", score=0),
    ],
    optimization_direction=OptimizationDirection.MAXIMIZE,
)

# Freeform (open-ended text) annotation config
config = client.annotation_configs.create_freeform(
    name="reviewer-notes",
    space="<space-id-or-name>",
)

Get an Annotation Config

config = client.annotation_configs.get(
    annotation_config="<config-id-or-name>",
    space=..., # Optional
)

Update an Annotation Config

Only the fields you pass are changed; omitted fields are left unchanged. Use the method matching the stored config's type, since the config type is immutable.

from arize.annotation_configs.types import CategoricalAnnotationValue, OptimizationDirection

# Rename a freeform config
config = client.annotation_configs.update_freeform(
    annotation_config="<config-id-or-name>",
    space=..., # Optional, required when passing a name
    name="renamed-config",
)

# Replace the label set of a categorical config
config = client.annotation_configs.update_categorical(
    annotation_config="<config-id-or-name>",
    values=[
        CategoricalAnnotationValue(label="good", score=1),
        CategoricalAnnotationValue(label="bad", score=0),
    ],
    optimization_direction=OptimizationDirection.MAXIMIZE,
)

# Adjust the score bounds of a continuous config
config = client.annotation_configs.update_continuous(
    annotation_config="<config-id-or-name>",
    minimum_score=0.0,
    maximum_score=10.0,
)

Delete an Annotation Config

client.annotation_configs.delete(
    annotation_config="<config-id-or-name>",
    space=..., # Optional
)

Operations on Resource Restrictions

Use client.resource_restrictions to manage resource restrictions. Restricting a resource prevents roles bound at higher hierarchy levels (space, org, account) from granting access to it — access must be granted directly on the resource. Only space admins or users with the PROJECT_RESTRICT permission can manage restrictions. Currently only PROJECT resources are supported.

Note: Resource restrictions are a beta API and may change without notice.

List Resource Restrictions

You can list all resource restrictions that you are permitted to manage using client.resource_restrictions.list(). You can use the limit parameter to specify the maximum number of restrictions desired per page and you can specify resource_type to filter by resource type (currently only PROJECT is supported). You can use cursor for pagination, pass pagination.next_cursor from a previous response. Results are authorization-filtered after each page is fetched, so a page may contain fewer items than limit (or none) while pagination.has_more is still True; keep paging until pagination.has_more is False.

from arize.resource_restrictions.types import ResourceRestrictionType

resp = client.resource_restrictions.list(
    resource_type=ResourceRestrictionType.PROJECT, # Optional filter
    limit=..., # Optional, defaults to 50
    cursor=..., # Optional, pagination cursor from a previous response
)

Restrict a Resource

Idempotent. Restricting an already-restricted resource returns the existing restriction without error.

restriction = client.resource_restrictions.restrict(
    resource_id="<project-id>",
)

Unrestrict a Resource

client.resource_restrictions.unrestrict(
    resource_id="<project-id>",
)

Operations on AI Integrations

Use client.ai_integrations to manage external LLM provider integrations used in the Arize Playground and online evaluations.

List AI Integrations

resp = client.ai_integrations.list(
    name=..., # Optional, case-insensitive substring filter
    space=..., # Optional, space ID or name
    limit=..., # Optional, defaults to 100
)
integration_list = resp.ai_integrations

Create an AI Integration

from arize.ai_integrations.types import AiIntegrationProvider, AiIntegrationAuthType

integration = client.ai_integrations.create(
    name="my-openai-integration",
    provider=AiIntegrationProvider.OPEN_AI,
    api_key="<your-provider-api-key>",
    base_url=..., # Optional, custom base URL
    model_names=..., # Optional, list of enabled model names
    enable_default_models=True, # Optional
)

Get an AI Integration

integration = client.ai_integrations.get(
    integration="<integration-id-or-name>",
    space=..., # Optional, space ID or name (used for disambiguation)
)

Update an AI Integration

updated = client.ai_integrations.update(
    integration="<integration-id-or-name>",
    space=..., # Optional
    api_key="<new-api-key>",
    model_names=["gpt-4o", "gpt-4o-mini"],
)

Delete an AI Integration

client.ai_integrations.delete(
    integration="<integration-id-or-name>",
    space=..., # Optional
)

SDK Configuration

Logging

In Code

You can use configure_logging to set up the logging behavior of the Arize package to your needs.

from arize.logging import configure_logging

configure_logging(
    level=..., # Defaults to logging.INFO
    structured=..., # if True, emit JSON logs. Defaults to False
)

Via Environment Variables

Configure the same options as the section above, via:

import os

# Whether or not you want to disable logging altogether
os.environ["ARIZE_LOG_ENABLE"] = "true"
# Set up the logging level
os.environ["ARIZE_LOG_LEVEL"] = "debug"
# Whether or not you want structured JSON logs
os.environ["ARIZE_LOG_STRUCTURED"] = "false"

The default behavior of Arize's logs is: enabled, INFO level, and not structured.

Caching

When downloading big segments of data from Arize, such as a Dataset with all of its examples, the SDK will cache the file in parquet format under ~/.arize/cache/datasets/dataset_<updated_at_timestamp>.parquet.

In Code

You can disable caching via the enable_caching parameter when instantiating the client, and also edit the "arize directory":

client = ArizeClient(
    enable_caching=False, # Optional parameter, defaults to True
    arize_directory="my-desired-directory", # Optional parameter, defaults to ~/.arize
)

Via Environment Variables

You can also configure the above via:

import os

# Whether or not you want to disable caching
os.environ["ARIZE_ENABLE_CACHING"] = "true"
# Where you want the SDK to store the files
os.environ["ARIZE_DIRECTORY"] = "~/.arize"

Clean the cache

To clean the cache you can directly rm the files or directory. In addition, the client has the option to help with that as well using client.clear_cache(), which will delete the cache/ directory inside the arize directory (defaults to ~/.arize).

Custom Endpoints, Proxies, and On-Prem

Use these parameters when connecting through a proxy, a custom on-prem gateway, or a Private Connect setup where all traffic is routed through a single host/port.

Single host and port

single_host and single_port override all endpoint hosts and ports at once — REST API, OTLP, and Arrow Flight. This is the typical pattern for an on-prem proxy that terminates connections on one address:

from arize import ArizeClient

client = ArizeClient(
    api_key=API_KEY,
    single_host="my-proxy.internal",
    single_port=4040,
    api_scheme="http",   # set to "http" if the proxy uses plain HTTP
    otlp_scheme="http",
    request_verify=False,  # disable SSL verification for self-signed certs
)

For localhost (common during development):

client = ArizeClient(
    api_key=API_KEY,
    single_host="localhost",
    single_port=4040,
    api_scheme="http",
    otlp_scheme="http",
)

Per-endpoint ports

If each protocol needs a different port, set them individually:

client = ArizeClient(
    api_key=API_KEY,
    api_host="api.internal",
    api_port=8080,         # REST API port
    otlp_host="otel.internal",
    otlp_port=4318,        # OTLP port
    flight_host="flight.internal",
    flight_port=9443,      # Arrow Flight port
)

SSL certificate verification

Set request_verify=False to disable TLS certificate verification (e.g. for self-signed certificates in dev). For production on-prem deployments where a reverse proxy presents its own corporate CA certificate, supply the CA bundle path instead of disabling verification entirely:

# Disable verification (dev only)
client = ArizeClient(api_key=API_KEY, request_verify=False)

# Provide a custom CA bundle (recommended for on-prem)
client = ArizeClient(
    api_key=API_KEY,
    single_host="my-proxy.internal",
    single_port=4040,
    ssl_ca_cert="/etc/ssl/certs/corp-ca-bundle.crt",
)

ssl_ca_cert applies to all three transports: REST API, OTLP (gRPC), and Apache Arrow Flight. The CA bundle is loaded once at connection time for each transport.

The ssl_ca_cert field reads the first set environment variable from ARIZE_SSL_CA_CERT → REQUESTS_CA_BUNDLE → SSL_CERT_FILE, so existing CA bundle configuration in your environment is picked up automatically:

# Works out of the box — no code change needed
export REQUESTS_CA_BUNDLE=/etc/ssl/certs/corp-ca-bundle.crt

Egress / CONNECT proxy

For environments where outbound traffic is routed through an HTTP CONNECT proxy, set proxy_url:

client = ArizeClient(
    api_key=API_KEY,
    proxy_url="http://proxy.corp.example.com:8080",
)

The proxy_url field reads from ARIZE_PROXY_URL. It applies to the REST API transport only — Arrow Flight (gRPC) does not support HTTP CONNECT proxies. For system-wide proxy configuration (including NO_PROXY exclusions), set the standard HTTPS_PROXY / HTTP_PROXY environment variables instead; Python's urllib3 honours those automatically for REST requests without requiring proxy_url.

# System-wide proxy (recommended — also respects NO_PROXY)
export HTTPS_PROXY=http://proxy.corp.example.com:8080

Note: proxy_url configures an egress CONNECT proxy (traffic passes through the proxy to reach Arize). This is different from single_host/single_port, which route traffic to a reverse proxy that terminates the connection and forwards it internally.

Via Environment Variables

All endpoint and connection settings can be configured via environment variables:

Variable	Description	Default
`ARIZE_SINGLE_HOST`	Override all endpoint hosts	—
`ARIZE_SINGLE_PORT`	Override all endpoint ports (REST, OTLP, Flight)	`0` (unset)
`ARIZE_API_HOST`	REST API host	`api.arize.com`
`ARIZE_API_SCHEME`	REST API scheme (`http`/`https`)	`https`
`ARIZE_API_PORT`	REST API port (`0` = scheme default)	`0`
`ARIZE_OTLP_HOST`	OTLP host	`otlp.arize.com`
`ARIZE_OTLP_SCHEME`	OTLP scheme (`http`/`https`)	`https`
`ARIZE_OTLP_PORT`	OTLP port (`0` = scheme default)	`0`
`ARIZE_FLIGHT_HOST`	Arrow Flight host	`flight.arize.com`
`ARIZE_FLIGHT_PORT`	Arrow Flight port	`443`
`ARIZE_REQUEST_VERIFY`	SSL certificate verification (`true`/`false`)	`true`
`ARIZE_SSL_CA_CERT`	Path to CA bundle file (falls back to `REQUESTS_CA_BUNDLE`, then `SSL_CERT_FILE`)	—
`ARIZE_PROXY_URL`	Egress proxy URL (falls back to `HTTPS_PROXY`, then `HTTP_PROXY`)	—

import os

os.environ["ARIZE_SINGLE_HOST"] = "my-proxy.internal"
os.environ["ARIZE_SINGLE_PORT"] = "4040"
os.environ["ARIZE_API_SCHEME"] = "http"
os.environ["ARIZE_OTLP_SCHEME"] = "http"
os.environ["ARIZE_REQUEST_VERIFY"] = "false"

Community

Join our community to connect with thousands of AI builders.

🌍 Join our Slack community.
📚 Read our documentation.
💡 Ask questions and provide feedback in the #arize-support channel.
𝕏 Follow us on 𝕏.
🧑‍🏫 Deep dive into everything Agents and LLM Evaluations on Arize's Learning Hubs.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

8.41.0

Jul 22, 2026

8.40.0

Jul 17, 2026

8.39.0

Jul 10, 2026

8.38.0

Jul 2, 2026

8.37.1

Jun 25, 2026

8.37.0

Jun 23, 2026

8.36.0

Jun 23, 2026

8.35.0

Jun 11, 2026

8.34.0

Jun 10, 2026

8.33.0

Jun 8, 2026

8.32.1

Jun 5, 2026

8.32.0

Jun 2, 2026

8.31.0

Jun 2, 2026

8.30.1

May 30, 2026

8.30.0

May 28, 2026

8.29.1

May 27, 2026

8.29.0

May 27, 2026

8.28.3

May 21, 2026

8.28.2

May 21, 2026

8.28.1

May 21, 2026

8.28.0

May 20, 2026

8.27.0

May 16, 2026

8.26.0

May 14, 2026

8.25.0

May 13, 2026

8.24.0

May 12, 2026

8.23.0

May 12, 2026

8.22.4

May 8, 2026

8.22.3

May 7, 2026

8.22.2

May 7, 2026

8.22.1

May 1, 2026

8.22.0

Apr 29, 2026

8.21.0

Apr 29, 2026

8.20.0

Apr 28, 2026

8.19.0

Apr 26, 2026

8.18.0

Apr 23, 2026

8.17.0

Apr 22, 2026

8.16.0

Apr 17, 2026

8.15.0

Apr 17, 2026

8.14.0

Apr 16, 2026

8.13.0

Apr 15, 2026

8.12.0

Apr 15, 2026

8.11.0

Apr 2, 2026

8.10.0

Apr 1, 2026

8.9.2

Mar 30, 2026

8.9.1

Mar 30, 2026

8.9.0

Mar 28, 2026

8.8.1

Mar 23, 2026

8.8.0

Mar 21, 2026

8.7.0

Mar 19, 2026

8.6.0

Mar 17, 2026

8.5.0

Mar 9, 2026

8.4.0

Feb 20, 2026

8.3.0

Feb 18, 2026

8.2.1

Feb 13, 2026

8.2.0

Feb 11, 2026

8.1.0

Feb 9, 2026

8.0.2

Feb 4, 2026

8.0.1

Feb 4, 2026

8.0.0

Feb 4, 2026

8.0.0b4 pre-release

Jan 30, 2026

8.0.0b2 pre-release

Jan 26, 2026

8.0.0b1 pre-release

Jan 23, 2026

8.0.0b0 pre-release

Jan 21, 2026

8.0.0a23 pre-release

Jan 14, 2026

8.0.0a22 pre-release

Oct 25, 2025

8.0.0a21 pre-release

Oct 24, 2025

8.0.0a20 pre-release

Oct 24, 2025

8.0.0a19 pre-release

Oct 24, 2025

8.0.0a18 pre-release

Oct 23, 2025

8.0.0a17 pre-release

Oct 23, 2025

8.0.0a16 pre-release

Oct 22, 2025

8.0.0a15 pre-release

Oct 7, 2025

8.0.0a14 pre-release

Oct 6, 2025

8.0.0a13 pre-release

Oct 4, 2025

8.0.0a12 pre-release

Oct 2, 2025

8.0.0a11 pre-release

Oct 2, 2025

8.0.0a10 pre-release

Oct 2, 2025

8.0.0a9 pre-release

Oct 1, 2025

8.0.0a8 pre-release

Oct 1, 2025

8.0.0a7 pre-release

Oct 1, 2025

8.0.0a6 pre-release

Oct 1, 2025

8.0.0a5 pre-release

Oct 1, 2025

8.0.0a4 pre-release

Oct 1, 2025

8.0.0a3 pre-release

Oct 1, 2025

8.0.0a2 pre-release

Sep 29, 2025

8.0.0a1 pre-release

Sep 29, 2025

8.0.0a0 pre-release

Sep 29, 2025

7.54.0

Apr 3, 2026

7.53.0

Mar 12, 2026

7.52.0

Jan 21, 2026

7.51.2

Dec 4, 2025

7.51.1

Oct 22, 2025

7.51.0

Sep 18, 2025

7.50.0

Sep 8, 2025

7.49.3

Aug 27, 2025

7.49.2

Aug 26, 2025

7.49.1

Aug 21, 2025

7.49.0

Aug 19, 2025

7.48.3

Aug 5, 2025

7.48.2

Jul 30, 2025

7.48.1

Jul 15, 2025

7.48.0

Jul 14, 2025

7.47.0

Jul 8, 2025

7.46.0

Jul 2, 2025

7.45.0

Jun 16, 2025

7.44.0

Jun 13, 2025

7.43.1

Jun 2, 2025

7.43.0

May 27, 2025

7.42.0

May 20, 2025

7.41.1

May 15, 2025

7.41.0

May 14, 2025

7.40.1

Apr 29, 2025

7.40.0

Apr 25, 2025

7.39.0

Apr 24, 2025

7.38.1

Apr 14, 2025

7.38.0

Apr 8, 2025

7.37.0

Apr 7, 2025

7.36.1

Apr 4, 2025

7.36.0

Mar 6, 2025

7.35.4

Mar 4, 2025

7.35.3

Mar 1, 2025

7.35.2

Feb 27, 2025

7.35.1

Feb 21, 2025

7.34.0

Feb 11, 2025

7.33.2

Feb 5, 2025

7.33.1

Feb 3, 2025

7.33.0

Jan 30, 2025

7.32.0

Jan 23, 2025

7.31.1

Jan 8, 2025

7.31.0

Jan 7, 2025

7.30.0

Dec 17, 2024

7.29.4

Dec 13, 2024

7.29.3

Dec 7, 2024

7.29.2

Dec 4, 2024

7.29.1

Nov 27, 2024

7.29.0

Nov 26, 2024

7.28.0

Nov 25, 2024

7.27.2

Nov 22, 2024

7.27.1

Nov 22, 2024

7.27.0

Nov 22, 2024

7.26.0

Nov 18, 2024

7.25.7

Nov 12, 2024

7.25.6

Nov 12, 2024

7.25.3

Nov 2, 2024

7.25.2

Nov 1, 2024

7.25.1

Nov 1, 2024

7.25.0

Nov 1, 2024

7.24.3

Nov 1, 2024

7.24.2

Nov 1, 2024

7.24.1

Nov 1, 2024

7.24.0

Oct 31, 2024

7.23.0

Oct 25, 2024

7.22.6

Oct 9, 2024

7.22.5

Oct 8, 2024

7.22.3

Oct 2, 2024

7.22.2

Oct 2, 2024

7.22.1

Oct 2, 2024

7.22.0

Sep 30, 2024

7.21.0

Sep 13, 2024

7.21.0rc3 pre-release

Sep 4, 2024

7.21.0rc2 pre-release

Aug 30, 2024

7.21.0rc1 pre-release

Aug 29, 2024

7.20.1

Aug 19, 2024

7.20.0

Aug 16, 2024

7.19.0

Aug 7, 2024

7.19.0rc6 pre-release

Aug 7, 2024

7.19.0rc5 pre-release

Aug 7, 2024

7.19.0rc4 pre-release

Aug 7, 2024

7.19.0rc3 pre-release

Jul 24, 2024

7.19.0rc2 pre-release

Jul 11, 2024

7.18.1

May 22, 2024

7.18.0

May 15, 2024

7.17.1

May 10, 2024

7.17.0

May 7, 2024

7.16.1

Apr 29, 2024

7.16.0 yanked

Apr 26, 2024

Reason this release was yanked:

Missing tracing validation module

7.15.0

Apr 17, 2024

7.14.1

Apr 3, 2024

7.14.0

Mar 29, 2024

7.13.0

Mar 28, 2024

7.12.1

Mar 26, 2024

7.12.0

Mar 23, 2024

7.12.0rc3 pre-release

Mar 22, 2024

7.12.0rc2 pre-release

Mar 22, 2024

7.12.0rc1 pre-release

Mar 21, 2024

7.12.0rc0 pre-release

Mar 21, 2024

7.11.1

Mar 6, 2024

7.11.0

Feb 24, 2024

7.11.0rc2 pre-release

Feb 21, 2024

7.11.0rc1 pre-release

Feb 21, 2024

7.11.0rc0 pre-release

Feb 15, 2024

7.10.2

Feb 14, 2024

7.10.1 yanked

Feb 6, 2024

Reason this release was yanked:

Backwards incompatible for on-prem users with old deployments. Saas users do not need to worry about this.

7.10.0 yanked

Feb 1, 2024

Reason this release was yanked:

Backwards incompatible for on-prem users with old deployments. Saas users do not need to worry about this.

7.10.0rc1 pre-release

Feb 1, 2024

7.10.0rc0 pre-release

Feb 1, 2024

7.10.0b0 pre-release

Jan 22, 2024

7.9.0

Dec 28, 2023

7.8.1

Dec 19, 2023

7.8.0

Dec 14, 2023

7.7.2

Nov 10, 2023

7.7.1

Nov 8, 2023

7.7.0

Nov 2, 2023

7.7.0rc0 pre-release

Oct 27, 2023

7.6.1

Oct 24, 2023

7.6.0

Oct 12, 2023

7.5.1

Oct 5, 2023

7.5.0

Sep 2, 2023

7.5.0rc1 pre-release yanked

Aug 25, 2023

7.4.0

Aug 15, 2023

7.3.0

Aug 1, 2023

7.2.0

Jul 21, 2023

7.1.0

Jun 26, 2023

7.0.6

Jun 23, 2023

7.0.5

Jun 23, 2023

7.0.5rc0 pre-release

Jun 13, 2023

7.0.4

Jun 13, 2023

7.0.3

Jun 1, 2023

7.0.3rc0 pre-release

May 15, 2023

7.0.2 yanked

May 11, 2023

Reason this release was yanked:

Exporter with wrong field model_name

7.0.2rc0 pre-release

May 4, 2023

7.0.1 yanked

Apr 25, 2023

Reason this release was yanked:

Exporter with wrong field model_name

7.0.1rc0 pre-release

Apr 21, 2023

7.0.0 yanked

Apr 13, 2023

Reason this release was yanked:

Exporter with wrong field model_name

7.0.0rc6 pre-release

Apr 13, 2023

7.0.0rc5 pre-release

Apr 11, 2023

7.0.0rc4 pre-release

Apr 7, 2023

7.0.0rc3 pre-release

Apr 5, 2023

7.0.0rc2 pre-release

Apr 1, 2023

7.0.0rc1 pre-release

Mar 30, 2023

7.0.0rc0 pre-release

Mar 29, 2023

6.1.3

Mar 30, 2023

6.1.2

Mar 16, 2023

6.1.2rc0 pre-release

Mar 16, 2023

6.1.0

Mar 14, 2023

6.1.0b0 pre-release

Mar 14, 2023

6.0.3

Mar 9, 2023

6.0.3rc0 pre-release

Mar 9, 2023

6.0.2

Jan 31, 2023

6.0.1

Jan 24, 2023

6.0.0

Jan 23, 2023

6.0.0rc23 pre-release

Jan 21, 2023

6.0.0rc22 pre-release

Jan 20, 2023

6.0.0b0 pre-release

Jan 20, 2023

5.5.0

Jan 3, 2023

5.4.2

Dec 8, 2022

5.3.2

Nov 30, 2022

5.3.1

Nov 22, 2022

5.3.0

Nov 7, 2022

5.2.0

Oct 25, 2022

5.0.3

Sep 16, 2022

5.0.3rc0 pre-release

Sep 16, 2022

5.0.2

Aug 12, 2022

5.0.2rc0 pre-release

Aug 12, 2022

5.0.0rc1 pre-release

Jul 27, 2022

5.0.0rc0 pre-release

Jul 26, 2022

4.2.2

Jul 19, 2022

4.2.1

Jul 11, 2022

4.2.0

Jun 7, 2022

4.1.1

May 19, 2022

4.1.0rc2 pre-release

May 9, 2022

4.1.0rc1 pre-release

May 4, 2022

4.1.0rc0 pre-release

Apr 19, 2022

4.0.1

Apr 13, 2022

4.0.1rc1 pre-release

Apr 12, 2022

4.0.1rc0 pre-release

Apr 12, 2022

4.0.0

Mar 18, 2022

3.4.0

Mar 2, 2022

3.4.0rc0 pre-release

Mar 2, 2022

3.3.3

Feb 25, 2022

3.3.3rc4 pre-release

Feb 25, 2022

3.3.3rc3 pre-release

Feb 25, 2022

3.3.3rc2 pre-release

Feb 24, 2022

3.3.3rc1 pre-release

Feb 17, 2022

3.3.3rc0 pre-release

Feb 17, 2022

3.3.2

Feb 14, 2022

3.3.1

Feb 8, 2022

3.3.0 yanked

Feb 7, 2022

3.2.1

Nov 17, 2021

3.2.1rc6 pre-release

Feb 22, 2022

3.2.1rc5 pre-release

Feb 15, 2022

3.2.1rc4 pre-release

Feb 9, 2022

3.2.1rc3 pre-release

Feb 9, 2022

3.2.1rc2 pre-release

Feb 4, 2022

3.2.1rc1 pre-release

Feb 4, 2022

3.2.1rc0 pre-release

Jan 26, 2022

3.2.0

Nov 17, 2021

3.1.3

Nov 1, 2021

3.1.2

Oct 12, 2021

3.1.1

Oct 6, 2021

3.1.1rc2 pre-release

Feb 23, 2022

3.1.1rc1 pre-release

Oct 6, 2021

3.1.0

Oct 6, 2021

3.0.0

Sep 23, 2021

2.2.1

Aug 27, 2021

2.2.0

Aug 25, 2021

2.2.0rc4 pre-release

Aug 25, 2021

2.2.0rc3 pre-release

Aug 23, 2021

2.2.0rc2 pre-release

Aug 18, 2021

2.2.0rc0 pre-release

Aug 18, 2021

2.1.11

Jul 23, 2021

2.1.10

May 14, 2021

2.1.9

May 13, 2021

2.1.8

May 10, 2021

2.1.7

Apr 30, 2021

2.1.6

Apr 29, 2021

2.1.5

Apr 27, 2021

2.1.4

Apr 21, 2021

2.1.3

Apr 15, 2021

2.1.2

Apr 15, 2021

2.1.2rc0 pre-release

Mar 25, 2021

2.1.1

Mar 11, 2021

2.1.1rc1 pre-release

Mar 11, 2021

2.1.1rc0 pre-release

Mar 10, 2021

2.1.0

Mar 10, 2021

2.1.0rc0 pre-release

Mar 3, 2021

2.0.0

Mar 2, 2021

2.0.0rc0 pre-release

Mar 1, 2021

1.3.1rc0 pre-release yanked

Mar 1, 2021

1.3.0 yanked

Feb 26, 2021

1.2.1

Feb 3, 2021

1.2.0

Feb 2, 2021

1.1.3

Jan 15, 2021

1.1.2

Jan 15, 2021

1.1.1

Jan 14, 2021

1.1.0

Jan 13, 2021

1.0.5

Dec 10, 2020

1.0.4

Oct 29, 2020

1.0.3

Oct 1, 2020

1.0.2

Sep 30, 2020

1.0.1

Sep 25, 2020

1.0.0

Jul 31, 2020

0.0.20

Jul 24, 2020

0.0.19

Jul 23, 2020

0.0.18

Jul 20, 2020

0.0.17

Jul 2, 2020

0.0.16

Jul 1, 2020

0.0.15

Jun 23, 2020

0.0.14

Jun 17, 2020

0.0.13

Jun 15, 2020

0.0.11

Jun 4, 2020

0.0.10

May 20, 2020

0.0.9

May 12, 2020

0.0.8

May 1, 2020

0.0.7

Apr 28, 2020

0.0.6

Apr 27, 2020

0.0.5

Apr 11, 2020

0.0.4

Apr 10, 2020

0.0.3

Apr 7, 2020

0.0.2

Mar 25, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arize-8.41.0.tar.gz (567.1 kB view details)

Uploaded Jul 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

arize-8.41.0-py3-none-any.whl (1.4 MB view details)

Uploaded Jul 22, 2026 Python 3

File details

Details for the file arize-8.41.0.tar.gz.

File metadata

Download URL: arize-8.41.0.tar.gz
Upload date: Jul 22, 2026
Size: 567.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for arize-8.41.0.tar.gz
Algorithm	Hash digest
SHA256	`7c99899be487864bae4dfd21cf4d9c0f3281e298bb6e989ed0882d17fe9f3912`
MD5	`0cf65abf087937685a9a28ec7488ceb5`
BLAKE2b-256	`48dcd4f53bdbe7c26708d797772089e2e964e55a08b7e57932e304bed7edeaca`

See more details on using hashes here.

Provenance

The following attestation bundles were made for arize-8.41.0.tar.gz:

Publisher: release-sdks.yml on Arize-ai/arize

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: arize-8.41.0.tar.gz
- Subject digest: 7c99899be487864bae4dfd21cf4d9c0f3281e298bb6e989ed0882d17fe9f3912
- Sigstore transparency entry: 2218492455
- Sigstore integration time: Jul 22, 2026
Source repository:
- Permalink: Arize-ai/arize@b0dde7af5cebf84fe99cc1ecc95a706ccd56ca1f
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Arize-ai
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release-sdks.yml@b0dde7af5cebf84fe99cc1ecc95a706ccd56ca1f
- Trigger Event: push

File details

Details for the file arize-8.41.0-py3-none-any.whl.

File metadata

Download URL: arize-8.41.0-py3-none-any.whl
Upload date: Jul 22, 2026
Size: 1.4 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for arize-8.41.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`89010a0bbf0693760c847e17966b78e0d0e7948263bea709928c0ac2677a6450`
MD5	`d4d1df6bdcd9e270e1077621cc3eb5a5`
BLAKE2b-256	`c3e7c7733051faea183297cad4fe7f648bf9615c3c8d937acb56b22453227a91`

See more details on using hashes here.

Provenance

The following attestation bundles were made for arize-8.41.0-py3-none-any.whl:

Publisher: release-sdks.yml on Arize-ai/arize

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: arize-8.41.0-py3-none-any.whl
- Subject digest: 89010a0bbf0693760c847e17966b78e0d0e7948263bea709928c0ac2677a6450
- Sigstore transparency entry: 2218492508
- Sigstore integration time: Jul 22, 2026
Source repository:
- Permalink: Arize-ai/arize@b0dde7af5cebf84fe99cc1ecc95a706ccd56ca1f
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Arize-ai
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release-sdks.yml@b0dde7af5cebf84fe99cc1ecc95a706ccd56ca1f
- Trigger Event: push

arize 8.41.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

Table of Contents

Overview

Key Features

Installation

Optional Dependencies

Migrating from Version 7

Usage

Space Resolution

Instrumentation

Operations on Spans

Logging spans

Update spans Evaluations, Annotations, and Metadata

Exporting spans

Operations on ML Models

Stream log ML Data for a Classification use-case

Log a batch of ML Data for a Object Detection use-case

Exporting ML Data

Generate embeddings for your data

Operations on Datasets

List Datasets

Create a Dataset

Get a Dataset

Delete a Dataset

List Dataset Examples

Append Dataset Examples

Update Dataset Examples

Delete Dataset Examples

Operations on Experiments

List Experiments

Run an Experiment

Create an Experiment

Get an Experiment

Delete an Experiment

List Experiment Runs

Append Experiment Runs

Operations on Prompts

List Prompts

Create a Prompt

Get a Prompt

Update a Prompt

Delete a Prompt

Prompt Versions

Prompt Labels

Operations on Evaluators

List Evaluators

Create an Evaluator

Get an Evaluator

Update an Evaluator

Delete an Evaluator

Evaluator Versions

Operations on Tasks

List Tasks

Create a Task

Get a Task

Trigger a Task Run

Monitor Task Runs

Operations on Projects

List Projects

Create a Project

Get a Project

Update a Project

Delete a Project

Operations on Organizations

List Organizations

Get an Organization

Create an Organization

Update an Organization

Delete an Organization

Add a User to an Organization

Remove a User from an Organization