Apache Airflow provider for AI-Lake Format — hook, operators, and snapshot sensor

apache-airflow-providers-ailake

Apache Airflow provider for AI-Lake Format — an Iceberg-compatible file format that unifies tabular data, embeddings, and HNSW vector indexes in a single Parquet file.

Installation

pip install apache-airflow-providers-ailake

Components

Hook

AilakeHook — connects to an AI-Lake table on object storage (S3/GCS/Azure).

from airflow_providers_ailake.hooks.ailake import AilakeHook

hook = AilakeHook(conn_id="ailake_default")
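The hook resolves its settings from the Airflow connection named by conn_id. One standard way to define a connection in Airflow is an AIRFLOW_CONN_<CONN_ID> environment variable holding a JSON document. The sketch below builds such a value with the standard library only. The connection type "ailake", the bucket URI, and the extra fields are assumptions made for illustration; the fields this provider actually reads are not documented here.

```python
import json
import os


def build_connection_env(conn_id: str, conn_type: str, host: str, extra: dict) -> tuple[str, str]:
    """Return the (variable name, JSON value) pair that Airflow reads for a connection."""
    name = f"AIRFLOW_CONN_{conn_id.upper()}"
    value = json.dumps({"conn_type": conn_type, "host": host, "extra": extra})
    return name, value


# Hypothetical values: the provider's real connection type and fields may differ.
env_name, env_value = build_connection_env(
    conn_id="ailake_default",
    conn_type="ailake",
    host="s3://example-bucket/warehouse",
    extra={"region": "us-east-1"},
)

# Must be set in the environment of the scheduler and workers before tasks run.
os.environ[env_name] = env_value
```

Setting the variable in the worker environment keeps credentials out of DAG files; the Airflow UI or a secrets backend would work equally well.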

Operators

AilakeWriteOperator — writes a batch of rows (with embeddings) to an AI-Lake table.

AilakeSearchOperator — runs a vector similarity search and pushes results to XCom.
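To make "vector similarity search" concrete, here is a minimal sketch of the computation in plain Python: an exact, brute-force cosine-similarity ranking. It is not the operator's implementation (AI-Lake uses HNSW indexes, which are approximate and far faster on large tables), and the row layout and result shape are hypothetical. The result is a list of plain dicts, which is the kind of JSON-serializable value that fits in XCom.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query: list[float], rows: list[dict], k: int) -> list[dict]:
    """Score every row against the query and return the k best matches."""
    scored = [
        {"id": row["id"], "score": cosine_similarity(query, row["embedding"])}
        for row in rows
    ]
    scored.sort(key=lambda item: item["score"], reverse=True)
    return scored[:k]


# Hypothetical rows: an id plus a 2-dimensional embedding.
rows = [
    {"id": "a", "embedding": [1.0, 0.0]},
    {"id": "b", "embedding": [0.0, 1.0]},
    {"id": "c", "embedding": [1.0, 1.0]},
]
results = top_k([1.0, 0.1], rows, k=2)
```

Here results ranks row "a" first and row "c" second, since their embeddings point closest to the query direction.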

Sensor

AilakeSnapshotSensor — waits until a new Iceberg snapshot appears on the table (useful for triggering downstream DAGs after a write).
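The waiting logic behind a snapshot sensor can be sketched without Airflow: poll the table's current snapshot ID and stop once it differs from a known baseline. The function get_snapshot_id below is a hypothetical stand-in for whatever call reads the table metadata, and the parameter names are illustrative; this is not the sensor's actual code.

```python
import time
from typing import Callable, Optional


def wait_for_new_snapshot(
    get_snapshot_id: Callable[[], Optional[int]],
    baseline: Optional[int],
    poke_interval: float = 1.0,
    timeout: float = 60.0,
) -> int:
    """Poll until the snapshot ID differs from the baseline, then return the new ID."""
    deadline = time.monotonic() + timeout
    while True:
        current = get_snapshot_id()
        # A table with no snapshots yet reports None; keep waiting in that case.
        if current is not None and current != baseline:
            return current
        if time.monotonic() >= deadline:
            raise TimeoutError("no new snapshot appeared before the timeout")
        time.sleep(poke_interval)


# Stub metadata reader: the table stays at snapshot 100 for two polls, then moves to 101.
snapshots = iter([100, 100, 101])
new_id = wait_for_new_snapshot(lambda: next(snapshots), baseline=100, poke_interval=0.0)
```

In an Airflow sensor, the comparison inside the loop would typically be the return value of poke(), with the scheduler handling the interval and timeout.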

Requirements

  • Apache Airflow >= 2.6
  • Python >= 3.9
  • AI-Lake SDK (ailake Python package) installed in the Airflow worker environment
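A quick way to check a worker environment against the list above is to read the installed versions with the standard library. This is a sketch: it assumes the SDK is published under the distribution name "ailake", and it only reports versions rather than enforcing the Airflow minimum.

```python
import sys
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_environment() -> dict:
    """Collect the facts needed to compare this interpreter with the requirements."""
    return {
        "python_ok": sys.version_info >= (3, 9),
        "airflow_version": installed_version("apache-airflow"),
        # Assumed distribution name for the AI-Lake SDK.
        "ailake_version": installed_version("ailake"),
    }


report = check_environment()
print(report)
```

A None entry means the package is missing from the environment where the script ran, which should be the same environment the Airflow workers use.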


Source Distribution

apache_airflow_providers_ailake-0.0.10.tar.gz (8.2 kB)

Uploaded Source

Built Distribution

File details

Details for the file apache_airflow_providers_ailake-0.0.10.tar.gz.

File hashes

Hashes for apache_airflow_providers_ailake-0.0.10.tar.gz
Algorithm Hash digest
SHA256 2599248ec22816499862bc9162ab988b78eddbe427cb4adb277692ce62f94726
MD5 c7499bba0441616e27f83e9696e7c1f2
BLAKE2b-256 121e884c67f3ab7ff7a0228ce1d75287246beb944a21316ad2271833a7ded287
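A downloaded file can be checked against the published SHA256 digest with the standard library. The sketch below hashes a file in chunks and compares the result with an expected hex string; the function names are illustrative.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Hash a file in fixed-size chunks so large files are not loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path: Path, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For the source distribution, the call would be matches_published(Path("apache_airflow_providers_ailake-0.0.10.tar.gz"), "2599248ec22816499862bc9162ab988b78eddbe427cb4adb277692ce62f94726"), run in the directory that holds the download.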

File details

Details for the file apache_airflow_providers_ailake-0.0.10-py3-none-any.whl.

File hashes

Hashes for apache_airflow_providers_ailake-0.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 6a41b18cc026509e8882056e69d09d8709b0a4e30a0b310e185b692d95c66ba2
MD5 000038607d3408c7222aab0aff98b317
BLAKE2b-256 3edb251e3724f639b97d55a465c9825301545ef28acb808dd6dce0530f59e041
