Airflow provider for Arize AX: operators and hooks for datasets, experiments, projects, spans, and ML.

Arize AX Airflow Provider

The official Apache Airflow provider for Arize AX — schedule, automate, and orchestrate your LLMOps workflows directly from Airflow.

Build DAGs that evaluate prompts continuously, compare experiments before deploying, detect drift, curate datasets from production traces, and gate releases on evaluation scores — all using purpose-built operators that wrap the Arize AX platform.

Features

97 operators across 12 domains: datasets, experiments, projects, spans, evaluators, prompts, tasks, annotations, AI integrations, API keys, spaces, ML
8 sensors for waiting on dataset readiness, experiment completion, span ingestion, evaluation scores, task runs, and more
Built-in gates: fail_on_regression=True, fail_on_drift=True, and min_score thresholds raise AirflowException directly, so no glue code is needed
Idempotent operations: if_exists="skip" on creates, ignore_if_missing=True on deletes
Clean XCom values: operators return scalar IDs by default; full results are available via named XCom keys
Continuous evaluation tasks with Eval Hub for live production monitoring
Human-in-the-loop annotations through annotation queues
19 example DAGs covering CI/CD gates, drift detection, prompt lifecycle, RAG evaluation, and fine-tuning data pipelines

Installation

pip install arize-ax-airflow-provider

Requires Python 3.10+, Apache Airflow 2.4+, and Arize SDK 8.27.0. The provider pins arize==8.27.0 exactly, so installs only use the version it is tested against; the pin is bumped alongside a provider release when a newer SDK is adopted.
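Because the SDK pin is exact, it can be useful to confirm what is actually installed on a worker. The following is a minimal sketch using only the standard library; the helper name check_arize_pin is hypothetical and not part of the provider.

```python
from importlib.metadata import PackageNotFoundError, version

# Pin stated in the requirements above; update it when the provider bumps it.
EXPECTED_ARIZE_VERSION = "8.27.0"


def check_arize_pin(expected: str = EXPECTED_ARIZE_VERSION) -> str:
    """Return a short status string describing the installed arize SDK."""
    try:
        installed = version("arize")
    except PackageNotFoundError:
        return "arize is not installed"
    if installed != expected:
        return f"arize {installed} installed, provider expects {expected}"
    return f"arize {installed} matches the provider pin"


print(check_arize_pin())
```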

Setup

In the Airflow UI, go to Admin → Connections → Add
Set Connection Id to arize_ax_default
Set Connection Type to arize_ax
Set Password to your Arize API key
(Optional) Set Extra to {"space_id": "your-space-id"} to use it as the default space
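Airflow can also read connections from environment variables named AIRFLOW_CONN_<CONN_ID>, with a JSON value supported since Airflow 2.3. The sketch below builds that variable for the fields listed above. It assumes the provider reads the API key from the password field and space_id from Extra, as the UI steps describe; the helper name conn_env_var is hypothetical.

```python
import json
from typing import Dict, Optional

CONN_ID = "arize_ax_default"


def conn_env_var(api_key: str, space_id: Optional[str] = None) -> Dict[str, str]:
    """Build the AIRFLOW_CONN_* variable for the connection described above."""
    conn = {"conn_type": "arize_ax", "password": api_key}
    if space_id is not None:
        # Mirrors the optional Extra field: {"space_id": "your-space-id"}
        conn["extra"] = {"space_id": space_id}
    return {f"AIRFLOW_CONN_{CONN_ID.upper()}": json.dumps(conn)}


# Placeholder values; print the name/value pair to export in the worker environment.
for name, value in conn_env_var("your-api-key", "your-space-id").items():
    print(f"{name}={value}")
```

In practice the API key would come from a secrets manager rather than a literal in source.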

Quick start

from datetime import datetime
from airflow import DAG
from airflow.providers.arize_ax.operators.experiments import (
    ArizeAxRunExperimentOperator,
    ArizeAxCompareExperimentsOperator,
)
from airflow.providers.arize_ax.operators.prompts import (
    ArizeAxPromotePromptOperator,
)

with DAG(
    dag_id="llm_cicd_gate",
    start_date=datetime(2025, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    run = ArizeAxRunExperimentOperator(
        task_id="run_candidate",
        dataset_id="{{ var.value.eval_dataset_id }}",
        name="candidate-{{ ds_nodash }}",
    )

    compare = ArizeAxCompareExperimentsOperator(
        task_id="compare_to_baseline",
        baseline_experiment_id="{{ var.value.baseline_id }}",
        candidate_experiment_id="{{ ti.xcom_pull(task_ids='run_candidate') }}",
        fail_on_regression=True,   # raises if candidate scores worse
    )

    promote = ArizeAxPromotePromptOperator(
        task_id="promote",
        prompt_id="{{ var.value.prompt_id }}",
        label="production",
    )

    run >> compare >> promote

That's a complete CI/CD gate. With fail_on_regression=True, compare raises AirflowException when the candidate underperforms the baseline, so promote never runs. No ShortCircuitOperator, no PythonOperator glue.
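The gate works because a task that raises is marked failed, and downstream tasks with Airflow's default all_success trigger rule are then skipped as upstream_failed. The sketch below illustrates the idea in plain Python. It is not the operator's implementation: the function name, the "higher is better" scoring, and the GateFailed stand-in for AirflowException are assumptions.

```python
class GateFailed(Exception):
    """Stand-in for airflow.exceptions.AirflowException in this sketch."""


def regression_gate(baseline: float, candidate: float, fail_on_regression: bool = True) -> dict:
    """Compare two scores, assuming higher is better; raise if the candidate is worse."""
    regressed = candidate < baseline
    if regressed and fail_on_regression:
        # Raising here is what stops downstream tasks from running.
        raise GateFailed(f"candidate {candidate} < baseline {baseline}")
    return {"regressed": regressed, "delta": candidate - baseline}


result = regression_gate(baseline=0.80, candidate=0.85)
print(result["regressed"])
```

With fail_on_regression=False the same comparison only reports the regression, leaving the decision to a later task.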

Operator domains

Domain	Operators
Datasets	List, create, get, delete, list/append/annotate examples, export to file, health check, smart refresh
Experiments	List, create, get, delete, run, list/annotate runs, get score, compare, detect drift, calibration, behavioral regression, budget allocator
Projects	List, create, get, delete
Spans	List, log, update evaluations/annotations/metadata, export to DataFrame/Parquet, get metrics, curate to dataset, curate feedback dataset (self-learning agents), export annotated, export to fine-tuning, adaptive sampling
Evaluators	List, create (template or code), get, update, delete, list/get/add versions (template or code)
Prompts	List, create, get, delete, compare, promote, optimize (meta-prompt via Prompt Learning SDK)
Tasks	List, create (evaluation or run-experiment), get, update, delete, list runs, get run, trigger run, cancel run
Annotations	List/create/delete configs, list/get/create/update/delete queues, list/add/delete records, annotate, assign
AI Integrations	List, get, create, update, delete
API Keys	List, create, delete, refresh
Spaces	List, get, create, update, delete
ML	Log batch/stream, export to DataFrame/Parquet

Sensors

Sensor	Purpose
`ArizeAxExperimentCompleteSensor`	Wait until an experiment reaches a terminal state
`ArizeAxDatasetReadySensor`	Wait until a dataset has at least N examples
`ArizeAxSpanCountSensor`	Wait until span count in a project exceeds a threshold
`ArizeAxEvaluationScoreSensor`	Wait until evaluation score crosses a threshold
`ArizeAxExperimentRunCountSensor`	Wait until experiment has N runs
`ArizeAxSpanIngestionSensor`	Wait until span ingestion stabilizes
`ArizeAxAnnotationQueueSensor`	Wait until annotation queue is configured
`ArizeAxTaskRunSensor`	Wait until a task run reaches a terminal state

Design patterns

The provider follows established Airflow operator conventions so DAGs read naturally and stay maintainable:

Idempotent creates — Set if_exists="skip" to handle 409 conflicts by resolving the existing resource by name
Idempotent deletes — All Delete operators accept ignore_if_missing=True (default), logging on 404 instead of raising
Built-in gates — Comparison operators (CompareExperiments, DetectEvalDrift, EvaluatorCalibration, BehavioralRegression, GetExperimentScore) accept fail_on_* / min_score params that raise AirflowException on failure
Param validation — Operators validate required space_id / project_id in execute() with clear error messages
Convenience XCom keys — List operators push first_id and first_name for direct chaining via Jinja templates
Override evaluations — ArizeAxTriggerTaskRunOperator(override_evaluations=True) re-evaluates spans that already have labels
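The idempotent create and delete patterns above can be sketched as follows. This is an illustration against an in-memory stand-in, not the provider's code: the FakeDatasets client, the Conflict and NotFound exceptions, and the "fail" default for if_exists are all assumptions.

```python
class Conflict(Exception):
    """Stand-in for an HTTP 409 response."""


class NotFound(Exception):
    """Stand-in for an HTTP 404 response."""


class FakeDatasets:
    """In-memory store standing in for the remote API (hypothetical)."""

    def __init__(self):
        self._by_name = {}

    def create(self, name):
        if name in self._by_name:
            raise Conflict(name)
        self._by_name[name] = f"ds-{len(self._by_name) + 1}"
        return self._by_name[name]

    def get_by_name(self, name):
        return self._by_name[name]

    def delete(self, name):
        if name not in self._by_name:
            raise NotFound(name)
        del self._by_name[name]


def create_dataset(client, name, if_exists="fail"):
    """Create a dataset; on conflict with if_exists='skip', return the existing ID."""
    try:
        return client.create(name)
    except Conflict:
        if if_exists == "skip":
            return client.get_by_name(name)
        raise


def delete_dataset(client, name, ignore_if_missing=True):
    """Delete a dataset; on a missing resource, log and return False instead of raising."""
    try:
        client.delete(name)
        return True
    except NotFound:
        if ignore_if_missing:
            print(f"dataset {name!r} not found, nothing to delete")
            return False
        raise
```

Running either helper twice with the same name gives the same end state, which is what makes task retries and DAG reruns safe.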

Example DAGs

Bundled in provider_pkg/example_dags/:

Pattern	DAG
Self-contained smoke test	`example_arize_ax_e2e_dag.py`
LLM CI/CD gate	`example_arize_ax_llm_cicd_gate_dag.py`
Prompt lifecycle (staging → production)	`example_arize_ax_prompt_lifecycle_dag.py`
Prompt A/B testing	`example_arize_ax_prompt_ab_test_dag.py`
Drift detection with auto-rollback	`example_arize_ax_drift_detection_dag.py`
Behavioral regression detection	`example_arize_ax_behavioral_regression_dag.py`
Evaluator calibration vs human labels	`example_arize_ax_evaluator_calibration_dag.py`
RAG evaluation pipeline	`example_arize_ax_rag_evaluation_dag.py`
Production span curation into datasets	`example_arize_ax_dataset_curation_dag.py`
Fine-tuning data pipeline	`example_arize_ax_finetune_data_pipeline_dag.py`
Continuous evaluation tasks	`example_arize_ax_tasks_dag.py`
Annotation queues for HITL	`example_arize_ax_annotation_queues_dag.py`
Multi-model experiment matrix	`example_arize_ax_llm_experiments_dag.py`
Self-learning agent (multi-prompt optimization from production feedback)	`example_arize_ax_prompt_optimization_with_feedback_dag.py`

Plus dataset, experiment, evaluator, span, project, ML, and admin demos for individual domain walkthroughs.

The self-learning agent demo requires the prompt-learning-enhanced SDK, which is available only as a git source. PyPI does not accept direct-URL dependencies, so it is not declared as an optional extra here. Install it separately on workers that run the optimization DAG:
pip install 'arize-phoenix-evals>=2.0,<3.0' \
            'prompt-learning-enhanced @ git+https://github.com/Arize-ai/prompt-learning.git'
The arize-phoenix-evals<3.0 pin is required because upstream prompt-learning-enhanced imports phoenix.evals.models, which was removed in 3.0.0. ArizeAxOptimizePromptOperator raises a clear AirflowException with this exact install line when the SDK is missing.
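A worker can be checked for these optional dependencies before the optimization DAG is scheduled on it. The sketch below probes for phoenix.evals.models with the standard library. The helper name and the printed message are hypothetical and are not the operator's own error text.

```python
import importlib.util

# Install line quoted from the instructions above.
INSTALL_HINT = (
    "pip install 'arize-phoenix-evals>=2.0,<3.0' "
    "'prompt-learning-enhanced @ git+https://github.com/Arize-ai/prompt-learning.git'"
)


def module_available(dotted_name: str) -> bool:
    """Return True if the module can be located.

    find_spec imports parent packages but not the target module itself.
    """
    try:
        return importlib.util.find_spec(dotted_name) is not None
    except ImportError:
        # Raised when a parent package (e.g. "phoenix") is missing or fails to import.
        return False


if not module_available("phoenix.evals.models"):
    print(f"Optimization DAG dependencies missing. Install with:\n{INSTALL_HINT}")
```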

Documentation

Provider guide: Arize AX Airflow Provider — install, setup, design patterns, example DAGs
Operator reference: Operators, sensors, and hooks — full reference for all 97 operators
Arize AX docs: arize.com/docs/ax
Arize SDK reference: Arize Python SDK v8

Support

Help & support: arize.com/support
Slack: Arize Community Slack
Twitter/X: @ArizeAI

License

Apache 2.0

Release history

Version	Release date
1.3.0	May 19, 2026
1.2.0 (this version)	May 18, 2026

Download files

Source Distribution

arize_ax_airflow_provider-1.2.0.tar.gz (151.3 kB)

Uploaded May 18, 2026 Source

Built Distribution

arize_ax_airflow_provider-1.2.0-py3-none-any.whl (166.9 kB)

Uploaded May 18, 2026 Python 3

File details

Details for the file arize_ax_airflow_provider-1.2.0.tar.gz.

File metadata

Download URL: arize_ax_airflow_provider-1.2.0.tar.gz
Upload date: May 18, 2026
Size: 151.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for arize_ax_airflow_provider-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`62b517e093b5fe7cdb58caa83c10d2ac3e5fcbe2d5c9d7e2049b2e90626397a1`
MD5	`352dc246383aa012bb21f08a7e7fecf1`
BLAKE2b-256	`1f9ab68e472e1a6e51a75a01e0edea201b04bcf49c1d1b6fac9c3ab8cd0cd7f0`
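A downloaded file can be checked against the SHA256 digest listed above with the standard library. This is a minimal sketch; the helper names are hypothetical and the local file path is whatever the download was saved as.

```python
import hashlib

# SHA256 digest listed above for arize_ax_airflow_provider-1.2.0.tar.gz.
EXPECTED_SHA256 = "62b517e093b5fe7cdb58caa83c10d2ac3e5fcbe2d5c9d7e2049b2e90626397a1"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Compare the file's digest with the expected one, ignoring hex letter case."""
    return sha256_of(path) == expected.lower()
```

For example, matches("arize_ax_airflow_provider-1.2.0.tar.gz") would be expected to return True for an intact download of the source distribution.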

Provenance

The following attestation bundles were made for arize_ax_airflow_provider-1.2.0.tar.gz:

Publisher: publish.yml on Arize-ai/arize-ax-airflow

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: arize_ax_airflow_provider-1.2.0.tar.gz
- Subject digest: 62b517e093b5fe7cdb58caa83c10d2ac3e5fcbe2d5c9d7e2049b2e90626397a1
- Sigstore transparency entry: 1569279918
- Sigstore integration time: May 18, 2026
Source repository:
- Permalink: Arize-ai/arize-ax-airflow@91328b88e5e14a2783a30f4b54d1d62b081d2ca1
- Branch / Tag: refs/tags/v1.2.0
- Owner: https://github.com/Arize-ai
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@91328b88e5e14a2783a30f4b54d1d62b081d2ca1
- Trigger Event: workflow_dispatch

File details

Details for the file arize_ax_airflow_provider-1.2.0-py3-none-any.whl.

File metadata

Download URL: arize_ax_airflow_provider-1.2.0-py3-none-any.whl
Upload date: May 18, 2026
Size: 166.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for arize_ax_airflow_provider-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e698b24daee8b8cdc0b7494f465b13065cd37180c74dcf22b04222964c50d214`
MD5	`848ee7d53c5c880e5eb89a120b3cd540`
BLAKE2b-256	`0dd015914b79cf95318ca8e6f54b22800aea1310999e3f3296aaa670be9ccf64`

Provenance

The following attestation bundles were made for arize_ax_airflow_provider-1.2.0-py3-none-any.whl:

Publisher: publish.yml on Arize-ai/arize-ax-airflow

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: arize_ax_airflow_provider-1.2.0-py3-none-any.whl
- Subject digest: e698b24daee8b8cdc0b7494f465b13065cd37180c74dcf22b04222964c50d214
- Sigstore transparency entry: 1569279980
- Sigstore integration time: May 18, 2026
Source repository:
- Permalink: Arize-ai/arize-ax-airflow@91328b88e5e14a2783a30f4b54d1d62b081d2ca1
- Branch / Tag: refs/tags/v1.2.0
- Owner: https://github.com/Arize-ai
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@91328b88e5e14a2783a30f4b54d1d62b081d2ca1
- Trigger Event: workflow_dispatch
